Digital Speech Processing : Automatic Speech Recognition with LibriSpeech dataset

How to train

# Set up the environment
0. A virtual environment (e.g., conda) is recommended.
1. ' pip install -r requirements.txt ' or install manually: numpy, pandas, python-Levenshtein, librosa, numba, matplotlib
2. Install PyTorch for your CUDA version: https://pytorch.org/get-started/locally/
3. ' unzip Libri_data.zip -d ./DSP_project/ ' # unzip the dataset
4. Set the PATHs in train.py and test.py (and set the data paths in the ./Libri_data/*.csv files)

# Train and test
5. ' python DSP_ASR_LibriSpeech/train.py '
6. ' python DSP_ASR_LibriSpeech/test.py '

Recommended Environment

Ubuntu 20.04.3 LTS
CUDA Version: 11.5
Python 3.8

Dataset

: Part of the LibriSpeech clean 360 data (label length is under 150)
- Download it online

Base Model

: Location-based attention
- CER 36.318%, WER 73.181% at 160 epochs
- CER 31.254%, WER 66.985% at 200 epochs

Performance metric

: Word Error Rate = (S + I + D) / N
- S : # of substitutions
- I : # of insertions
- D : # of deletions
- N : # of words in the target label
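The metric can be sketched in plain Python as follows. This is a minimal illustration, not the scoring code used in test.py: the function names `edit_counts`, `wer`, and `cer` are hypothetical, and the sketch assumes whitespace tokenization for words and a non-empty target label. It counts substitutions (S), insertions (I), and deletions (D) along a minimum-edit alignment, then divides by N. CER is the same computation over characters instead of words.

```python
def edit_counts(ref, hyp):
    """Return (S, I, D) for a minimum-edit alignment of hyp against ref.

    ref and hyp are sequences of tokens (words or characters).
    """
    # dp[i][j] = (total_edits, S, I, D) for ref[:i] versus hyp[:j]
    dp = [[None] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    dp[0][0] = (0, 0, 0, 0)
    for i in range(1, len(ref) + 1):
        dp[i][0] = (i, 0, 0, i)  # every reference token deleted
    for j in range(1, len(hyp) + 1):
        dp[0][j] = (j, 0, j, 0)  # every hypothesis token inserted

    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            t, s, ins, d = dp[i - 1][j - 1]
            if ref[i - 1] == hyp[j - 1]:
                diagonal = (t, s, ins, d)          # match, no cost
            else:
                diagonal = (t + 1, s + 1, ins, d)  # substitution
            t, s, ins, d = dp[i - 1][j]
            deletion = (t + 1, s, ins, d + 1)
            t, s, ins, d = dp[i][j - 1]
            insertion = (t + 1, s, ins + 1, d)
            dp[i][j] = min(diagonal, deletion, insertion, key=lambda c: c[0])

    _, s, ins, d = dp[len(ref)][len(hyp)]
    return s, ins, d


def error_rate(ref_tokens, hyp_tokens):
    """(S + I + D) / N, where N is the number of reference tokens."""
    if not ref_tokens:
        raise ValueError("the target label must not be empty")
    s, ins, d = edit_counts(ref_tokens, hyp_tokens)
    return (s + ins + d) / len(ref_tokens)


def wer(ref, hyp):
    """Word Error Rate, assuming whitespace-separated words."""
    return error_rate(ref.split(), hyp.split())


def cer(ref, hyp):
    """Character Error Rate over the raw characters of each string."""
    return error_rate(list(ref), list(hyp))


if __name__ == "__main__":
    target = "the cat sat on the mat"
    predicted = "the cat sit on mat"
    print(f"WER: {wer(target, predicted):.3f}")
    print(f"CER: {cer(target, predicted):.3f}")
```

In the example, "sat" is replaced by "sit" (one substitution) and the second "the" is missing (one deletion), so the word-level error is 2 edits over 6 target words. WER can exceed 100% when the prediction contains many insertions.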

Reference

SpecAugment code : https://github.com/SeanNaren/deepspeech.pytorch
Model (CLOVA Call) : https://github.com/clovaai/ClovaCall

For Beginners

Keywords worth studying

Listen, Attend and Spell
SpecAugment
Learning rate scheduler
Teacher forcing

Good reference

Papers with Code for LibriSpeech : https://paperswithcode.com/sota/speech-recognition-on-librispeech-test-clean

For students taking DSP class

Submit a zip file including:

All code files you used for training
One best trained model (.pth) and the test result
Report as a PDF (details will be announced)
