# LSTM-RNN-VAD
**Repository Path**: sanghy6/LSTM-RNN-VAD
## Basic Information
- **Project Name**: LSTM-RNN-VAD
- **Description**: Voice Activity Detection LSTM-RNN learning model
- **Primary Language**: Unknown
- **License**: Not specified
- **Default Branch**: master
- **Homepage**: None
- **GVP Project**: No
## Statistics
- **Stars**: 0
- **Forks**: 0
- **Created**: 2020-08-31
- **Last Updated**: 2020-12-19
## Categories & Tags
**Categories**: Uncategorized
**Tags**: None
## README
# LSTM-RNN Voice Activity Detection
##### REQUIRED PACKAGES
numpy, tensorflow, libROSA, matplotlib
##### FILES
\- dataset_utils.py
Dataset related utilities: One-hot encoding, wav file normalisation, TRS to CSV conversion, JSON to CSV conversion, Youtube wav download for the AudioSet Google corpus, Liblinear library data transformations
\- metrics_utils.py
(NOT FINALISED) Metrics' related utilities for the baseline VAD methods
\- feature_extractor.py
Feature extraction class to extract MFCC, deltas, double deltas, RSE
\- VAD_model.py
LSTM-RNN tensorflow learning model
\- \__main__.py
The program's main entry point
\- /checkpoint
Tensorflow checkpoint directory for saving and restoring learning models
\- /parameter
LSTM-RNN learning model hyper-parameters, training parameters, and log/checkpoint directories names
\- /notebook
Jupyter notebooks to test initial VAD prototypes