data/librispeech/README.md
The Librispeech corpus consists of about 1000 hours of read English speech. The Librispeech corpus can be downloaded for free from here
| DEV-CLEAN WER % / LER % | DEV-OTHER WER % / LER % | TEST-CLEAN WER % / LER % | TEST-OTHER WER % / LER % | MODEL | PAPER |
|---|---|---|---|---|---|
| - | 3.52 / - | - | 4.11 / - | SOTA 2019 | End-to-end ASR: from Supervised to Semi-Supervised Learning with Modern Architectures |
| - | 7.27 / - | - | 7.95 / - | streaming_convnets | Scaling Up Online Speech Recognition Using ConvNets |
| 3.01 / - | 8.86 / - | 3.28 / - | 9.84 / - | seq2seq_tds | Sequence-to-Sequence Speech Recognition with Time-Depth Separable Convolutions |
| 4.6 / 2.3 | 13.8 / 9.0 | 4.8 / - | 14.5 / - | conv_glu | Wav2Letter: an End-to-End ConvNet-based Speech Recognition System, Letter-Based Speech Recognition with Gated ConvNets |
To download and prepare the audio and text for training/evaluation run (replace [...] with a path where the data should be stored)
python3 prepare.py --dst [...]
The following structure will be generated
tree -L 3
.
├── audio
│ ├── dev-clean.tar
│ ├── dev-other.tar
│ ├── LibriSpeech
│ │ ├── BOOKS.TXT
│ │ ├── CHAPTERS.TXT
│ │ ├── dev-clean
│ │ ├── dev-other
│ │ ├── LICENSE.TXT
│ │ ├── README.TXT
│ │ ├── SPEAKERS.TXT
│ │ ├── test-clean
│ │ ├── test-other
│ │ ├── train-clean-100
│ │ ├── train-clean-360
│ │ └── train-other-500
│ ├── test-clean.tar
│ ├── test-other.tar
│ ├── train-clean-100.tar
│ ├── train-clean-360.tar
│ └── train-other-500.tar
├── lists
│ ├── dev-clean.lst
│ ├── dev-other.lst
│ ├── test-clean.lst
│ ├── test-other.lst
│ ├── train-clean-100.lst
│ ├── train-clean-360.lst
│ └── train-other-500.lst
└── text
├── dev-clean.txt
├── dev-other.txt
├── librispeech-lm-norm.txt
├── librispeech-lm-norm.txt.lower.shuffle
├── test-clean.txt
├── test-other.txt
├── train-clean-100.txt
├── train-clean-360.txt
└── train-other-500.txt