data/timit/README.md
The TIMIT corpus can be purchased from the LDC LDC93S1.
The data is mapped from 61 to 39 phonemes for training and testing. The map used here is taken from the Kaldi TIMIT recipe.
timit_LDC93S1.tar.tar -xf timit_LDC93S1.tar
sph2pipe see https://www.ldc.upenn.edu/language-resources/tools/sphere-conversion-tools:wget https://www.ldc.upenn.edu/sites/www.ldc.upenn.edu/files/ctools/sph2pipe_v2.5.tar.gz
tar -xzf sph2pipe_v2.5.tar.gz && cd sph2pipe_v2.5
gcc -o sph2pipe *.c -lm
To prepare the audio and text data for training/evaluation run (set necessary paths instead of [...])
python3 prepare.py --src [...]/timit --dst [...] --sph2pipe [...]/sph2pipe_v2.5/sph2pipe
The following structure will be generated
tree -L 2
.
├── audio
│ ├── test
│ ├── train
│ └── valid
├── lists
│ ├── test.lst
│ ├── test.phn.lst
│ ├── train.lst
│ ├── train.phn.lst
│ ├── valid.lst
│ └── valid.phn.lst
├── text
│ ├── test.phn.txt
│ ├── test.txt
│ ├── train.phn.txt
│ ├── train.txt
│ ├── valid.phn.txt
│ └── valid.txt
├── timit
│ ├── CONVERT
│ ├── README.DOC
│ ├── SPHERE
│ └── TIMIT
└── timit_LDC93S1.tar