Text - To - Speech Converter: Bachelor of Engineering IN Computer Science & Engineering
Text - To - Speech Converter: Bachelor of Engineering IN Computer Science & Engineering
Text - To - Speech Converter: Bachelor of Engineering IN Computer Science & Engineering
BACHELOR OF ENGINEERING
IN
Submitted to:
Mr. GurjeetpalBawa
Submitted By:
16BCS1313
Nitesh Kumar
16BCS1253
I am highly indebted to Ms. Manpreet Kaur for her guidance and constant supervision as well
as for providing necessary information regarding the project & also for her support in
completing the project.
I would like to express my special gratitude and thanks to my classmates and teachers for
giving me such attention and time.
My thanks and appreciations also go to my mentor in developing the project and people who
have willingly helped me out with their abilities.
Nitesh Kumar
Text - to - speech conversion software project is windows based application that reads a text
file to the user. The software reads a text file and associated pronunciations in its temporary
database. The program then reads an entire word to the user. The software can be effectively
used to help read the text document for the user so that the user does not constantly need to
look at the screen and read the entire document.
Text to speech converter is a recent software project that allows even the visually challenged
to read and understand various documents. The blinds cannot read a document, so this
software can be an assistant to them who would read out those documents for them. It can
also be a great help for those who cannot speak. The person can simply type what he/she
wants to say and the software would give a voice to them by speaking what they wanted to
say. So, this software is not just an advancement towards the future development but also a
boon for those who cannot speak and see.
A text-to-speech system (or "engine") is composed of two parts:[3] a front-end and a back-
end. The front-end has two major tasks. First, it converts raw text containing symbols like
numbers and abbreviations into the equivalent of written-out words. This process is often
called text normalization, pre-processing,or tokenization. The front-end then assigns phonetic
transcriptions to each word, and divides and marks the text into prosodic units,
like phrases, clauses, and sentences. The process of assigning phonetic transcriptions to
words is called text-to-phoneme or grapheme-to-phoneme conversion. Phonetic transcriptions
and prosody information together make up the symbolic linguistic representation that is
output by the front-end. The back-end—often referred to as the synthesizer—then converts
the symbolic linguistic representation into sound. In certain systems, this part includes the
computation of the target prosody (pitch contour, phoneme durations), which is then imposed
on the output speech.
List of Figures
. . .
. . .
. . .
1 Introduction
Text - to - speech conversion software project is windows based application that reads a text
file to the user. The software reads a text file or entered text or the image selected and
associated pronunciations in its temporary database. The program then reads an entire word
to the user. The softwarecan be effectively used to help read the text, pdf documents or image
text or entered text for the user so that the user does not constantly need to look at the screen
and read the entire document or image or text.
Text to speech converter is a recent software project that allows even the visually challenged
to read and understand various documents. The blinds cannot read a document, so this
software can be an assistant to them who would read out those documents for them. It can
also be a great help for those who cannot speak. The person can simply type what he/she
wants to say and the software would give a voice to them by speaking what they wanted to
say. The user just have to select the Interactive mode and then write what he wants to say in
the textarea and then he can easily express what he wanted to say by simply clicking the
convert button. So, this software is not just an advancement towards the future development
but also a boon for those who cannot speak and see. This technology can also be utilized for
various purposes, e.g. car navigation, announcements in railway stations, response services in
telecommunications, and e-mail reading. Thus, if we think more innovatively, we can easily
get more applications out of it.
TTS works with nearly every personal digital device, including computers, smartphones and
tablets. All kinds of text files can be read aloud, including Word and Pages documents. Even
online web pages can be read aloud. The voice in TTS is computer-generated, and reading
speed can usually be sped up or slowed down. This software can has a quality in which the
voice quality varies, but some voices sound human. This feature is specifically designed to
give a real feel to the voice. There are even computer-generated voices that sound like
children speaking. The software designed uses the computerized female voice. Many TTS
tools highlight words as they are read aloud. This allows kids to see text and hear it at the
same time. Some TTS tools also have a technology called optical character
recognition (OCR). OCR allows TTS tools to read text aloud from images. For example, your
child could take a photo of a street sign and have the words on the sign turned into audio. The
designed software actually allows the accomplishment of this feature we have given the
option of converting image text to speech. Different files can also be converted using this
software. Text, document, or pdf files can easily be read using the software.
A text-to-speech system (or "engine") is composed of two parts:[3] a front-end and a back-
end. The front-end has two major tasks. First, it converts raw text containing symbols like
numbers and abbreviations into the equivalent of written-out words. This process is often
called text normalization, pre-processing, or tokenization. The front-end then
assigns phonetic transcriptions to each word, and divides and marks the text into prosodic
units, like phrases, clauses, and sentences. The process of assigning phonetic transcriptions to
words is called text-to-phoneme or grapheme-to-phoneme conversion. Phonetic transcriptions
and prosody information together make up the symbolic linguistic representation that is
output by the front-end. The back-end—often referred to as the synthesizer—then converts
the symbolic linguistic representation into sound. In certain systems, this part includes the
computation of the target prosody (pitch contour, phoneme durations),[4] which is then
imposed on the output speech.
1.2About Language
The language used for the project text – to speech conversion is python. Python is a high-
level, interpreted, interactive and object-oriented scripting language. Python is designed to
be highly readable. It uses English keywords frequently where as other languages use
punctuation, and it has fewer syntactical constructions than other languages.
History of Python
Python was developed by Guido van Rossum in the late eighties and early nineties at the
National Research Institute for Mathematics and Computer Science in the Netherlands.
Python is derived from many other languages, including ABC, Modula-3, C, C++, Algol-68,
SmallTalk, and Unix shell and other scripting languages.
Python is copyrighted. Like Perl, Python source code is now available under the GNU
General Public License (GPL).
Python is now maintained by a core development team at the institute, although Guido van
Rossum still holds a vital role in directing its progress.
Currently, PSF supports two versions, Python 2.x & Python 3.x. Python 2.0 was released in
October 2000 and includes a large number of features. PSF continues to support version
Python 2 because a large body of existing code could not be forward ported to Python 3. So,
they will support Python 2 until 2020.
Python 3.0 was released on December 3rd, 2008. It was designed to rectify certain flaws in
earlier version. This version is not completely backward-compatible with previous versions.
However, many of its major features have since been back-ported to the Python 2.6.x and
2.7.x version series. Releases of Python 3 include 2 to 3 utilities to facilitate the automation
of translation of Python 2 code to Python 3.
Python Features
Easy-to-learn − Python has few keywords, simple structure, and a clearly defined
syntax. This allows the student to pick up the language quickly.
Easy-to-read − Python code is more clearly defined and visible to the eyes.
Easy-to-maintain − Python's source code is fairly easy-to-maintain.
A broad standard library − Python's bulk of the library is very portable and cross-
platform compatible on UNIX, Windows, and Macintosh.
Interactive Mode − Python has support for an interactive mode which allows
interactive testing and debugging of snippets of code.
Portable − Python can run on a wide variety of hardware platforms and has the same
interface on all platforms.
Extendable − You can add low-level modules to the Python interpreter. These
modules enable programmers to add to or customize their tools to be more efficient.
Databases − Python provides interfaces to all major commercial databases.
GUI Programming − Python supports GUI applications that can be created and
ported to many system calls, libraries and windows systems, such as Windows MFC,
Macintosh, and the X Window system of Unix.
Scalable − Python provides a better structure and support for large programs than
shell scripting.
Apart from the above-mentioned features, Python has a big list of good features, few are
listed below −
Python Advantages
Python provides enhanced readability. For that purpose, uniform indents are used to
delimit blocks of statements instead of curly brackets, like in many languages such as
C, C++ and Java.
Python is free and distributed as open-source software. A large programming
community is actively involved in the development and support of Python libraries for
various applications such as web frameworks, mathematical computing and data
science.
Python is a cross-platform language. It works equally on different OS platforms like
Windows, Linux, Mac OSX etc. Hence Python applications can be easily ported
across OS platforms.
Python supports multiple programming paradigms including imperative, procedural,
object-oriented and functional programming styles.
Python is an extensible language. Additional functionality (other than what is
provided in the core language) can be made available through modules and packages
written in other languages (C, C++, Java etc)
A standard DB-API for database connectivity has been defined in Python. It can be
enabled using any data source (Oracle, MySQL, SQLite etc.) as a backend to the
Python program for storage, retrieval and processing of data.
Standard distribution of Python contains the Tkinter GUI toolkit, which is the
implementation of popular GUI library called Tcl/Tk. An attractive GUI can be
constructed using Tkinter. Many other GUI libraries like Qt, GTK, WxWidgets etc.
are also ported to Python.
Python can be integrated with other popular programming technologies like C, C++,
Java, ActiveX and CORBA.
Today Python has become the language of choice for data scientists. Python libraries
like NumPy, Pandas and Matplotlib are extensively used in the process of data
analysis, including the collection, processing and cleansing of data sets, applying
mathematical algorithms and generating visualizations for the benefit of users.
Commercial and community Python distributions by third-parties such
as Anaconda and ActiveState provide all the essential libraries required for data
science.
Machine Learning - This is another key application area of Python. Python libraries
such as Scikit-learn, Tensorflowand NLTK are widely used for the prediction of
trends like customer satisfaction, projected values of stocks etc. Some of the real-
world applications of machine learning include medical diagnosis, statistical
arbitrage, basket analysis, sales prediction etc.
Web Development - This is another application area in which Python is becoming
popular. Web application framework libraries like django, Pyramid, Flask etc. make it
very easy to develop and deploy simple as well as complex web applications. These
frameworks are used extensively by various IT companies. Dropbox for example uses
django as a backend to store and, synchronize local folders.
Most of the web servers today are compatible with WSGI (Web Server Gateway
Interface) - a specification for the universal interface between Python web
frameworks and web servers. All leading web servers such as Apache, IIS, Nginxetc
can now host Python web applications. Google's App Engine hosts web applications
built with almost all Python web frameworks.
Image Processing - The OpenCV library is commonly used for face detection and
gesture recognition. OpenCV is a C++ library, but has been ported to Python. Because
of the rapid development of this feature, Python is a very popular choice from image
processing.
Game Development - Python is a popular choice for game developers.
The PyGame library is extensively used for building games for desktop as well as for
mobile platforms. PyGame applications can be installed on Android too.
Embedded Systems and IOT - Another important area of Python application is in
embedded systems. Raspberry Pi is a very popular yet a low-cost single-board
computer. It is being extensively used in automation products, robotics, IoT, and
kiosk applications. Popular microcontrollers like Arduino are used in many IoT
products and are being programmed with Python. A lightweight version of Python
called Micropython has been developed especially for microcontrollers. A special
Micropython-compatible controller called PyBoard has also been developed.
Android Apps -Although Android apps are predominantly developed using Android
SDK, which is similar to Java, Python can also be used to develop Android apps.
Python's Kivy library has all the functionalities required for a mobile application.
Automated Jobs - Python is extremely useful and widely used for automating CRON
(Command Run ON) jobs. Certain tasks like backups, defined in Python scripts can
be scheduled to be invoked automatically by the operating system scheduler to be
executed at predefined times.
Running Python
There are three different ways to start Python −
Interactive Interpreter
You can start Python from Unix, DOS, or any other system that provides you a command-
line interpreter or shell window.
$python # Unix/Linux
or
Python% # Unix/Linux
or
C: >python # Windows/DOS
1 -d
2 -O
3 -S
4 -v
5 -X
disable class-based built-in exceptions (just use strings); obsolete starting with
version 1.6.
6 -c cmd
7 file
or
or
Windows − PythonWin is the first Windows interface for Python and is an IDE with
a GUI.
Macintosh − The Macintosh version of Python along with the IDLE IDE is available
from the main website, downloadable as either MacBinary or BinHex'd files.
If you are not able to set up the environment properly, then you can take help from your
system admin. Make sure the Python environment is properly set up and working perfectly
fine.
1.4Feasibility Study
Economic Feasibility
The text-to-speech conversion software is a very affordable software which requires
the use of python only. There is no special need of any recorder or any other gadget or
equipment that can act as an overhead in the cost of this software.
Technical Feasibility
This software only requires the use of python which is already widespread and used.
Youtube which is used by billions of users has some parts of it implemented using
python. So the only technology required in this project is already available and
familiar. Hence, this software is technologically feasible
Operational Feasibility
The software would be very easy to use and is designed only to help the people
especially, the ones who cannot speak and also those who are visually challenged.
This software would be a great help to them to live a normal life.
2 SRS
2.1 Introduction
2.1.1 Purpose
Among the many definition that could be given of text – to – speech, that
describes it as a way of having computer audibly communicate information to
the user is probably the most relevant within the context of this statement. In
situations where visual feedback is inadequate or even impossible, audible
feedback may be an essential feature; in many situations it may just add extra
value to a product. Generally, text – to – speech provides a very valuable and
flexible alternative for digital – audio recordings where :
Recordings are too expensive.
Disk storage is insufficient to store the recordings.
The application does not know ahead of time what it will need to
speak.
The information varies too much to record and store all the
alternatives.
2.1.2 Document Conventions
To prepare this SRS we had used the lettering style of Times New Roman and
font size for sub headings is 14 with bold. The matter which is mentioned in
this SRS is 12 with a lettering style Times New Roman. Then headings are of
lettering style Times New Roman with Font Size 16 with bold letters. Then the
important points are mentioned in italics.
2.1.3 Intended Audience and Reading Suggestions
This SRS can be read by all the developers. The rest of the part of SRS
mentions the benefits of our project, how to use the project, how the project
was developed, what are the major things we have taken into consideration.
2.1.4 Project Scope
The term “Text – to - Speech” or TTS for short, refers to the process by which
plain text is converted into digital audio and then “spoken”. This speaking can
be in the form of actually sending the audio through a computer’s speakers (or
other capable device), or simply saving the computer audio for later playback.
For the most part, all TTS conversion engine can be broken out into three
methods used to convert phonemes (the smallest phonetic unit in a language
that is capable of conveying distinction in meaning, such as the m of mat and
the b of bat in English) into audible sound. The supplied Microsoft Speech
engines used the second method. The three methods are described in the
following paragraphs.
In this module user has to enter some text and it can listen the speech by clicking the
Convert button present at the bottom. User can listen selected text or entered text. In
this module we have to design GUI which provides text area to enter text. This
Module opens up when we click on Interactive Mode in the main menu.
In this module, user can input text file as input for converting text into speech. In this
module functionalities are:
In this module, user can input text file as input for converting text into speech. In this
module functionalities are:
Getting path of input file.
Open the file.
Reading file.
Read text passed to speech module.
This Module would first take an image as an input which would involve selecting
or browsing an image and then the Convert button pass it text acquired from the
image to the speech module and we get the audio form of the text of the image.
The software development lifecycle used (SDLC) used for this project was the iterative
waterfall model. In a practical software development project, the classical waterfall model is
hard to use. So, Iterative waterfall model was thought of as incorporating the necessary
changes to the classical waterfall model to make it usable in practical software development
projects. It is almost same as the classical waterfall model except some changes are made to
increase the efficiency of the software development. This gave us the required flexibility of
changes.
The iterative waterfall model provides feedback paths from every phase to its preceding
phases, which is the main difference from the classical waterfall model.
Feedback paths introduced by the iterative waterfall model are shown in the figure below.
When errors are detected at some later phase, these feedback paths allow correcting errors
committed by programmers during some phase. The feedback paths allowed us to rework on
the phase in which errors were committed and these changes were reflected in the later
phases. But, there is no feedback path to the stage – feasibility study, because once a project
has been taken, does not give up the project easily.
It is good to detect errors in the same phase in which they are committed. It reduces the effort
and time required to correct the errors.
Coding and Unit Testing − With the coding of the application complete, the testing
of the written code now comes into scene. Testing checks if there are any flaws in
the designed software and if the software has been designed as per the listed
specifications. A proper execution of this stage ensures that the client interested in
the created software, will be satisfied with the finished product. If there are any
flaws, the software development process must step back to the design phase. In the
design phase, changes are implemented and then the succeeding stages of coding and
testing are again carried out.With inputs from the system design, the system is first
developed in small programs called units, which are integrated in the next phase.
Each unit is developed and tested for its functionality, which is referred to as Unit
Testing.
Integration and System Testing − All the units developed in the implementation
phase are integrated into a system after testing of each unit. This is an important part
as the errors may not be seen in individual units but while integration of these units
there can be some errors which needs to be corrected. Post integration the entire
system is tested for any faults and failures.
Deployment of system − Once the functional and non-functional testing is done; the
product is deployed in the customer environment or released into the market.
Maintenance − There are some issues which come up in the client environment. To
fix those issues, patches are released. Also to enhance the product some better
versions are released. Maintenance is done to deliver these changes in the customer
environment. It is a never ending phase. Once the system is running in production
environment, problems come up. The issues that are related to the system are solved
only after deployment of the system. The problems arise from time to time and need
to be solved; hence this phase is referred as maintenance.
Phase Containment of Errors: The principle of detecting errors as close to their points of
commitment as possible is known as Phase containment of errors.
The choice for a software development lifecycle is generally made on the basis of its
advantages and disadvantages. For the text of speech converter the SDLC (Software
Development Lifecycle) chosen was Iterative Waterfall Model was chosen as the
disadvantages were overpowered by the advantages of the model making it more suitable for
this project.
Advantages of Iterative Waterfall Model
Feedback Path: In the classical waterfall model, there are no feedback paths, so there
is no mechanism for error correction. But in iterative waterfall model feedback path
from one phase to its preceding phase allows correcting the errors that are committed
and these changes are reflected in the later phases.
Simple: Iterative waterfall model is very simple to understand and use. That’s why it is
one of the most widely used software development models.
Drawbacks of Iterative Waterfall Model
Difficult to incorporate change requests: The major drawback of the iterative
waterfall model is that all the requirements must be clearly stated before starting of the
development phase. Customer may change requirements after some time but the
iterative waterfall model does not leave any scope to incorporate change requests that
are made after development phase starts.
Incremental delivery not supported: In the iterative waterfall model, the full software
is completely developed and tested before delivery to the customer. There is no scope
for any intermediate delivery. So, customers have to wait long for getting the software.
Overlapping of phases not supported: Iterative waterfall model assumes that one
phase can start after completion of the previous phase, But in real projects, phases may
overlap to reduce the effort and time needed to complete the project.
Risk handling not supported: Projects may suffer from various types of risks. But,
Iterative waterfall model has no mechanism for risk handling.
Limited customer interactions: Customer interaction occurs at the start of the project
at the time of requirement gathering and at project completion at the time of software
delivery. These fewer interactions with the customers may lead to many problems as the
finally developed software may differ from the customers’ actual requirements.
Tkinter - Tkinter is a Python binding to the Tk GUI toolkit. It is the standard Python
interface to the Tk GUI toolkit,and is Python's de facto standard GUI.Tkinter is
included with standard Linux, Microsoft Windowsand Mac OS X installs of
Python.The name Tkinter comes from Tk interface. Tkinter was written by Fredrik
Lundh.Tkinter is free software released under a Python license. As with most other
modern Tk bindings, Tkinter is implemented as a Python wrapper around a
complete Tcl interpreter embedded in the Python interpreter. Tkinter calls are
translated into Tcl commands which are fed to this embedded interpreter, thus making
it possible to mix Python and Tcl in a single application. Python 2.7 and Python 3.1
incorporate the "themed Tk" ("ttk") functionality of Tk 8.5. This allows Tk widgets to
be easily themed to look like the native desktop environment in which the application
is running, thereby addressing a long-standing criticism of Tk (and hence of Tkinter).
There are several popular GUI library alternatives available, such
as wxPython, PyQt (PySide), Pygame, Pyglet, and PyGTK
Creating a GUI application using Tkinter is an easy task. All you need to do is
perform the following steps −
Functions
Pyttsx3 –Pyttsx is a good text to speech conversion library in python but it was
written only in python2 until now ! Even some fair amount of googling didn’t help
much to get tts library compatible with pyton3.
There is however , one library gTTS which works perfectly in python3 but it needs
internet connection to work since it relies on google to get the audio data.ButPyttsx is
completely offline and works seemlesly and has multiple tts-engine support.The codes
in this repos are slightly modified version of the pyttsx module of python 2.x and is a
clone from westonpace’s repo. The purpose of creating this repo is to help those who
want to have an offline tts lib for Python3 and don’t want to port it from python2 to
python3 themselves.
Usage
import pyttsx3;
engine = pyttsx3.init();
The development of a software is not an easy or one day task. It requires a lot of time
and discussion where the real need for the software is considered and analysed . The
software was first tested for feasibility then requirements were specified and analysed.
Then designing was done followed by coding, and testing. The Iterative Waterfall
Model was used in order to provide feedback and make necessary changes even after
the completion of a module. The detailed description of the steps followed while the
development of the project as described as follows:
The first step to be considered for the development of a project is the need of project
as mentioned earlier. First, we considered the reason behind making this project. As
mentioned earlier, the main goal behind choosing this text-to-speech converter is its
feature of helping the people. This software is helpful for the visually challenged
people to read and understand various documents and those who cannot speak would
have a voice. This technology can also be utilized for various purposes, e.g. car
navigation, announcements in railway stations, response services in
telecommunications, and e-mail reading.
Then requirement analysis was done where we analyze what all we require for this
project and what would be the best technology for it. We have chosen python for our
project as it is widely used nowadays and there won’t be any problem in acceptance
of this technology. Moreover, this project would be a great help for teaching purpose,
and for those who are blind and cannot speak by giving a voice and an eye for
reading. All these gave us a green signal to move ahead towards the development of
the project. Then we thought of a system that would speak what is typed by a user and
this would be a great help for those who cannot speak. The blinds cannot read the
documents and sometimes the lessons needs to be dictated in a class, giving us the
need for reading from documents. The text on various signals and images also needs
to be read which gave us the need of reading from the images. All these requirements
showed us a way towards the developments of text to speech converter with three
modules – first, Interactive Mode, that can convert the input typed by a user, second,
Convert from File, which would convert text from text and pdf files, and third,
Convert from Image, that would convert the text from image.
Then a design was be prepared where we demonstrated our expected design, the
design which specify how our project is going to look and how the modules would be
represented to the user. This was the most time consuming part of the project
development as the User Interface is an essential element of any software. If a
software is not convenient for a user then the software is not considered good. So
proper GUI was to be created that would be simple to use and should provide the
efficient output.
Then the design was then corrected and modified manier times according to the
suggestions of our friends and mentor and all the changes were meant to make the
system more efficient and for making it look more attractive. Colors were modified
the functionalities were added and the interface was more simplified to provide the
ease to use.
After that, the implementation was done using python which was the coding for the
project. The project was divided into modules, as mentioned earlier, where first
module converted the written text into speech and second module convert files text
into speech which is further divided into conversion of text and pdf files and the third
module, last one convert the text in an image to speech. The testing of each module
after its coding was done. Following it, testing was done where all the errors or
unexpected results are corrected again by coding while integration of the modules.
The new additions were also be done. The overall system was again tested for its
correct functioning. Thus, we used the Unit Testing, Integration Testing, and System
Testing for the project Development so that the correctness of individual module and
overall system can be verified.
Then after the implementation and testing of the software, we got it tested by our
friends to know about their reviews and the suggestion were welcomed and the
required modifications were done and again got tested.
4 DESIGN
The above screenshot displays the home screen for the text to speech converter. This page
provides three options for the user to convert text to speech – first, interactive mode that
converts the copied or typed text. Second, convert from file that converts the text and pdf file
and third, convert from image that convert the text written on an image to speech.
After selecting interactive mode the above frame opens up where the user can paste a copied
text or type text in the textarea provided and this typed or copied text is converted to speech
when the user clicks on the convert button.
The above frame appears when convert from file is chosen in main menu. The screen here
provides the option of converting either a text or a pdf file to speech. Then the contents of file
gets displayed in the textarea and then the convert button gives the audio output.
The above shown screen spears when a file is to be chosen for converting it to speech.
The above frame opens up when convert from file is clicked in main menu. Here a text file is
chosen and its text appears in the textare and then after clicking convert we can listen to the
audio.
The above screenshot is taken when the pdf file is chosen to be converted to speech. Here
convert button would convert the text from pdf file to speech and the back button would take
us to the home page.
The above screen again gets displayed when we press the back button present at the top right
corner. The back button helps us to jump back from a module to the home page.
The above screen appears when we click on Convert from Image button in the homepage.
The above screen pops up which gives us the option of browsing image by clicking on
Upload Image. The convert button converts the text in the image to audio.
The above screenshot displays the screen where an image was selected for its text be
converted to speech. The selected image then appears in the space in the middle as shown in
figure. Then by clicking the convert button we get our desired output.
5 CONCLUSION AND FUTURE SCOPE
Text - to - speech conversion software project is windows based application that reads a text
file to the user. The software reads a text file or entered text or the image selected and
associated pronunciations in its temporary database. The program then reads an entire word
to the user. The softwarecan be effectively used to help read the text, pdf documents or image
text or entered text for the user so that the user does not constantly need to look at the screen
and read the entire document or image or text.
Text to speech converter is a recent software project that allows even the visually challenged
to read and understand various documents. The blinds cannot read a document, so this
software can be an assistant to them who would read out those documents for them. It can
also be a great help for those who cannot speak. The person can simply type what he/she
wants to say and the software would give a voice to them by speaking what they wanted to
say. The user just have to select the Interactive mode and then write what he wants to say in
the textarea and then he can easily express what he wanted to say by simply clicking the
convert button. So, this software is not just an advancement towards the future development
but also a boon for those who cannot speak and see. This technology can also be utilized for
various purposes, e.g. car navigation, announcements in railway stations, response services in
telecommunications, and e-mail reading. Thus, if we think more innovatively, we can easily
get more applications out of it.
6 REFERENCES
https://www.geeksforgeeks.org/software-engineering-iterative-waterfall-model/
https://www.tutorialspoint.com/python/python_overview.htm
https://www.sestek.com/2014/10/introduction-to-text-to-speech/
https://pythonspot.com/tag/tkinter/
https://pyttsx.readthedocs.io/en/latest/
https://pypi.org/project/pytesseract/