Skip to content

ethan-alderson/SpokenDigitRecognition

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

39 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SpokenDigitRecognition

A Project to recognize spoken digits via a convolutionary neural network

Converts labeled audio files to spectrograms via short time fourier transforms, which are then used to train a convolutionary neural network.

Libraries: PyTorch, NumPy, Pandas, Librosa, Matplotlib

Training: Running main.py will initiate a 25 epoch training cycle utilizing 80% of the data, followed by an accuracy test on the remaining 20% of the data. This test has shown that the model reaches well over 90% accuracy with dropout layers used in training and a 0.001 learning rate.

About

Recognizes spoken digits via a trained neural network

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages