# rnn-transducer **Repository Path**: tinydeer/rnn-transducer ## Basic Information - **Project Name**: rnn-transducer - **Description**: No description available - **Primary Language**: Unknown - **License**: Not specified - **Default Branch**: master - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 1 - **Created**: 2024-05-13 - **Last Updated**: 2024-05-13 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # RNN-Transducer A Pytorch Implementation of Transducer Model for End-to-End Speech Recognition # Environment - pytorch >= 0.4 - warp-transducer ## Train ```python python train.py -config config/aishell.yaml ``` ## Eval ``` python eval.py -config config/aishell.yaml ``` ## Experiments The details of our RNN-Transducer are as follows. ```yaml model: enc: type: lstm hidden_size: 320 n_layers: 4 bidirectional: True dec: type: lstm hidden_size: 512 n_layers: 1 embedding_dim: 512 vocab_size: 4232 dropout: 0.2 ``` All experiments are conducted on AISHELL-1. During decoding, we use beam search with width of 5 for all the experiments. A character-level 5-gram language model from training text, is integrated into beam searching by shallow fusion. | MODEL | DEV(CER) | TEST(CER) | |:---: | :---:|:---: | | RNNT+pretrain+LM | 10.13 | 11.82 | ## Acknowledge Thanks to [warp-transducer](https://github.com/HawkAaron/warp-transducer).