Introduction To Natural Language Processing and NLTK
Introduction To Natural Language Processing and NLTK
Introduction To Natural Language Processing and NLTK
Natural Language
Processing and NLTK
What is Natural Language Processing
(NLP)?
• NLP is a subfield of Artificial Intelligence (AI) concerned with the
interaction between computers and human language.
• It aims to bridge the gap between how humans communicate and
how machines process information.
• NLP techniques enable computers to understand, interpret, and
generate human language in various forms like text, speech, and
handwriting.
Components of NLP
There are two components of NLP as given −
1. Natural Language Understanding (NLU)
Understanding involves the following tasks −
Mapping the given input in natural language into useful
representations.
Analyzing different aspects of the language.
2. Natural Language Generation
(NLG)
It is the process of producing meaningful phrases and
sentences in the form of natural language from some internal
representation.
It involves −
Text planning − It includes retrieving the relevant content from
knowledge base.
Sentence planning − It includes choosing required words, forming
meaningful phrases, setting tone of the sentence.
Text Realization − It is mapping sentence plan into sentence
structure.
The NLU is harder than NLG.
Difficulties in NLU
• NL has an extremely rich form and structure.
• It is very ambiguous. There can be different levels of ambiguity −
• Lexical ambiguity − It is at very primitive level such as word-level.
• For example, treating the word “board” as noun or verb?
• Syntax Level ambiguity − A sentence can be parsed in different ways.
• For example, “He lifted the beetle with red cap.” − Did he use cap to lift
the beetle or he lifted a beetle that had red cap?
• Referential ambiguity − Referring to something using pronouns. For
example, Rima went to Gauri. She said, “I am tired.” − Exactly who is
tired?
• One input can mean different meanings.
NLP Terminology