# LSTM-RNN-VAD **Repository Path**: sanghy6/LSTM-RNN-VAD ## Basic Information - **Project Name**: LSTM-RNN-VAD - **Description**: Voice Activity Detection LSTM-RNN learning model - **Primary Language**: Unknown - **License**: Not specified - **Default Branch**: master - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2020-08-31 - **Last Updated**: 2020-12-19 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # LSTM-RNN Voice Activity Detection ##### REQUIRED PACKAGES
numpy, tensorflow, libROSA, matplotlib
##### FILES
\- dataset_utils.py
Dataset related utilities: One-hot encoding, wav file normalisation, TRS to CSV conversion, JSON to CSV conversion, Youtube wav download for the AudioSet Google corpus, Liblinear library data transformations \- metrics_utils.py
(NOT FINALISED) Metrics' related utilities for the baseline VAD methods \- feature_extractor.py
Feature extraction class to extract MFCC, deltas, double deltas, RSE \- VAD_model.py
LSTM-RNN tensorflow learning model \- \__main__.py
The program's main entry point \- /checkpoint
Tensorflow checkpoint directory for saving and restoring learning models \- /parameter
LSTM-RNN learning model hyper-parameters, training parameters, and log/checkpoint directories names \- /notebook
Jupyter notebooks to test initial VAD prototypes