Speech Recognition
Speech Recognition
Speech Recognition
BATCH 10
Team
Team members:
Technlogies used:
Python
• PyAudio provides Python bindings for PortAudio, the cross-platform audio I/O library.
• With PyAudio, you can easily use Python to play and record audio on a variety of platforms.
H
Author
asan U. Zaman
Date of Conference: 26-28 Oct. 2018
Conclusion 4
In this work an end to end speech to text conversion model using neural networks is
implemented. Techniques such as max pooling and batch normalization are used to
further optimize the model and boost its accuracy. The process of porting the trained
model to a Raspberry pi is explained. The usage of these kind of neural network models
is confined to the labels used in the dataset. Better datasets with more labels and
inclusion of various accents improve the application efficiency