What Is Natural Language Processing?

The document discusses natural language processing (NLP), which uses computational linguistics and machine learning to give computers the ability to understand, process, and generate human language. It describes common NLP tasks like speech recognition, part-of-speech tagging, and sentiment analysis. Statistical NLP and deep learning models have improved NLP capabilities. Popular tools for NLP include Python/NLTK and IBM Watson. The document provides examples of NLP applications such as machine translation, chatbots, spam detection, and text summarization.

Uploaded by

Shija Mafulahya

Natural Language Processing (NLP)

Natural language processing strives to build machines that understand text or voice
data and respond with text or speech of their own, in much the same way humans do.

What is natural language processing?

Natural language processing (NLP) refers to the branch of computer science—and
more specifically, the branch of artificial intelligence or AI—concerned with giving
computers the ability to understand text and spoken words in much the same way
human beings can.

NLP combines computational linguistics (rule-based modeling of human language)
with statistical, machine learning, and deep learning models. Together, these
technologies enable computers to process human language in the form of text or voice
data and to ‘understand’ its full meaning, complete with the speaker or writer’s intent
and sentiment.

NLP drives computer programs that translate text from one language to another,
respond to spoken commands, and summarize large volumes of text rapidly—even in
real time. There’s a good chance you’ve interacted with NLP in the form of voice-
operated GPS systems, digital assistants, speech-to-text dictation software, customer
service chatbots, and other consumer conveniences. But NLP also plays a growing role
in enterprise solutions that help streamline business operations, increase employee
productivity, and simplify mission-critical business processes.

NLP tasks

Human language is filled with ambiguities that make it incredibly difficult to write
software that accurately determines the intended meaning of text or voice data.
Homonyms, homophones, sarcasm, idioms, metaphors, grammar and usage
exceptions, variations in sentence structure: these are just a few of the irregularities
of human language that take humans years to learn, but that programmers must teach
natural language-driven applications to recognize and understand accurately from the
start, if those applications are going to be useful.

Several NLP tasks break down human text and voice data in ways that help the
computer make sense of what it's ingesting. Some of these tasks include the following:

• Speech recognition, also called speech-to-text, is the task of reliably
converting voice data into text data. Speech recognition is required for any
application that follows voice commands or answers spoken questions. What
makes speech recognition especially challenging is the way people talk:
quickly, slurring words together, with varying emphasis and intonation, in
different accents, and often using incorrect grammar.
• Part of speech tagging, also called grammatical tagging, is the process of
determining the part of speech of a particular word or piece of text based on its
use and context. Part of speech tagging identifies ‘make’ as a verb in ‘I can make
a paper plane,’ and as a noun in ‘What make of car do you own?’
• Word sense disambiguation is the selection of the meaning of a word with
multiple meanings through a process of semantic analysis that determines the
meaning that makes the most sense in the given context. For example, word sense
disambiguation helps distinguish the meaning of the verb 'make' in ‘make the
grade’ (achieve) vs. ‘make a bet’ (place).
• Named entity recognition, or NER, identifies words or phrases as useful
entities. NER identifies ‘Kentucky’ as a location or ‘Fred’ as a man's name.
• Co-reference resolution is the task of identifying if and when two words refer
to the same entity. The most common example is determining the person or
object to which a certain pronoun refers (e.g., ‘she’ = ‘Mary’), but it can also
involve identifying a metaphor or an idiom in the text (e.g., an instance in which
'bear' isn't an animal but a large hairy person).
• Sentiment analysis attempts to extract subjective qualities (attitudes,
emotions, sarcasm, confusion, suspicion) from text.
• Natural language generation is sometimes described as the opposite of
speech recognition or speech-to-text; it's the task of putting structured
information into human language.
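
As a rough illustration of how context can decide a part of speech, the sketch below
tags the word 'make' in the two example sentences above by looking only at the word
that precedes it. The cue lists and the function name tag_word are hypothetical,
chosen just to cover these examples; real taggers are trained on annotated corpora
rather than hand-written lists.

```python
import re

# Hypothetical cue words, picked only to cover the two example sentences.
VERB_CUES = {"can", "will", "to", "i", "you", "we", "they"}
NOUN_CUES = {"what", "which", "the", "a", "this", "that"}


def tag_word(sentence, target="make"):
    """Return one tag per occurrence of `target`, based on the preceding word."""
    tokens = re.findall(r"[a-z']+", sentence.lower())
    tags = []
    for i, token in enumerate(tokens):
        if token != target:
            continue
        previous = tokens[i - 1] if i > 0 else ""
        if previous in NOUN_CUES:
            tags.append("NOUN")
        elif previous in VERB_CUES:
            tags.append("VERB")
        else:
            tags.append("UNKNOWN")  # no cue matched; a real tagger would back off
    return tags


if __name__ == "__main__":
    print(tag_word("I can make a paper plane"))
    print(tag_word("What make of car do you own?"))
```

A single preceding word is a very narrow context window; it is used here only because
it is enough to separate the two sentences quoted in the list.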

See the blog post “NLP vs. NLU vs. NLG: the differences between three natural
language processing concepts” for a deeper look into how these concepts relate.

NLP tools and approaches

Python and the Natural Language Toolkit (NLTK)

The Python programming language provides a wide range of tools and libraries for
attacking specific NLP tasks. Many of these are found in the Natural Language Toolkit,
or NLTK, an open source collection of libraries, programs, and educational resources
for building NLP programs.

The NLTK includes libraries for many of the NLP tasks listed above, plus libraries for
subtasks, such as sentence parsing, word segmentation, stemming and lemmatization
(methods of trimming words down to their roots), and tokenization (for breaking
phrases, sentences, paragraphs and passages into tokens that help the computer better
understand the text). It also includes libraries for implementing capabilities such as
semantic reasoning, the ability to reach logical conclusions based on facts extracted
from text.
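
To make the subtasks above more concrete, here is a minimal sketch of sentence
splitting, tokenization, and stemming written with only the Python standard library.
It is not NLTK's implementation: the function names and the suffix list are
assumptions made for illustration, and the stemmer is deliberately crude compared
with the tokenizers and stemmers that NLTK ships.

```python
import re


def split_sentences(text):
    """Split text after ., ! or ? followed by whitespace (a naive heuristic)."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def tokenize(text):
    """Lowercase the text and return runs of letters, digits, and apostrophes."""
    return re.findall(r"[a-z0-9']+", text.lower())


def crude_stem(word):
    """Strip one common English suffix, keeping at least three characters."""
    for suffix in ("ing", "ed", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


if __name__ == "__main__":
    text = "NLP breaks text into tokens. Stemming trims words down."
    for sentence in split_sentences(text):
        tokens = tokenize(sentence)
        print(tokens, [crude_stem(t) for t in tokens])
```

Suffix stripping of this kind produces stems that are not always dictionary words,
which is one reason lemmatization is listed alongside stemming as a separate method.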

Statistical NLP, machine learning, and deep learning

The earliest NLP applications were hand-coded, rules-based systems that could
perform certain NLP tasks, but couldn't easily scale to accommodate a seemingly
endless stream of exceptions or the increasing volumes of text and voice data.

Enter statistical NLP, which combines computer algorithms with machine learning
and deep learning models to automatically extract, classify, and label elements of text
and voice data and then assign a statistical likelihood to each possible meaning of those
elements. Today, deep learning models and learning techniques based on convolutional
neural networks (CNNs) and recurrent neural networks (RNNs) enable NLP systems
that 'learn' as they work and extract ever more accurate meaning from huge volumes of
raw, unstructured, and unlabeled text and voice data sets.
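
The idea of assigning a statistical likelihood to each possible meaning can be
sketched with a tiny naive Bayes classifier. The example below is an assumption-laden
toy: the six training phrases are invented for illustration, the labels reuse the two
senses of 'make' mentioned earlier (achieve vs. place), and add-one smoothing is one
common choice among several. It is not how any particular production system works.

```python
import math
from collections import Counter, defaultdict

# Invented training phrases; the labels are the two senses of 'make' used earlier.
EXAMPLES = [
    ("make the grade", "achieve"),
    ("make the team", "achieve"),
    ("make the cut", "achieve"),
    ("make a bet", "place"),
    ("make a wager", "place"),
    ("make a bid", "place"),
]


def train(examples):
    """Count labels and per-label word frequencies."""
    label_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in examples:
        label_counts[label] += 1
        for word in text.lower().split():
            word_counts[label][word] += 1
            vocab.add(word)
    return label_counts, word_counts, vocab


def likelihoods(text, model):
    """Return a probability for each label using add-one smoothed naive Bayes."""
    label_counts, word_counts, vocab = model
    total = sum(label_counts.values())
    log_scores = {}
    for label, count in label_counts.items():
        score = math.log(count / total)
        denominator = sum(word_counts[label].values()) + len(vocab)
        for word in text.lower().split():
            if word in vocab:  # words never seen in training are ignored
                score += math.log((word_counts[label][word] + 1) / denominator)
        log_scores[label] = score
    # Normalize the log scores into probabilities that sum to 1.
    highest = max(log_scores.values())
    exps = {label: math.exp(s - highest) for label, s in log_scores.items()}
    norm = sum(exps.values())
    return {label: value / norm for label, value in exps.items()}


if __name__ == "__main__":
    model = train(EXAMPLES)
    print(likelihoods("make a wager", model))
```

With more data the same counting scheme scales without anyone writing new rules,
which is the contrast with hand-coded systems drawn above.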

For a deeper dive into the nuances between these technologies and their learning
approaches, see “AI vs. Machine Learning vs. Deep Learning vs. Neural Networks:
What’s the Difference?”

NLP use cases

Natural language processing is the driving force behind machine intelligence in many
modern real-world applications. Here are a few examples:

• Spam detection: You may not think of spam detection as an NLP solution, but
the best spam detection technologies use NLP's text classification capabilities
to scan emails for language that often indicates spam or phishing. These
indicators can include overuse of financial terms, characteristic bad grammar,
threatening language, inappropriate urgency, misspelled company names, and
more. Spam detection is one of a handful of NLP problems that experts
consider 'mostly solved' (although you may argue that this doesn’t match your
email experience).
• Machine translation: Google Translate is an example of widely available NLP
technology at work. Truly useful machine translation involves more than
replacing words in one language with words of another. Effective translation
has to capture accurately the meaning and tone of the input language and
translate it to text with the same meaning and desired impact in the output
language. Machine translation tools are making good progress in terms of
accuracy. A great way to test any machine translation tool is to translate text to
one language and then back to the original. An oft-cited classic example: Not
long ago, translating “The spirit is willing but the flesh is weak” from English to
Russian and back yielded “The vodka is good but the meat is rotten.” Today,
the result is “The spirit desires, but the flesh is weak,” which isn’t perfect, but
inspires much more confidence in the English-to-Russian translation.
• Virtual agents and chatbots: Virtual agents such as Apple's Siri and
Amazon's Alexa use speech recognition to recognize patterns in voice
commands and natural language generation to respond with appropriate action
or helpful comments. Chatbots perform the same magic in response to typed
text entries. The best of these also learn to recognize contextual clues about
human requests and use them to provide even better responses or options over
time. The next enhancement for these applications is question answering, the
ability to respond to our questions, anticipated or not, with relevant and
helpful answers in their own words.
• Social media sentiment analysis: NLP has become an essential business tool
for uncovering hidden data insights from social media channels. Sentiment
analysis can analyze language used in social media posts, responses, reviews,
and more to extract attitudes and emotions in response to products,
promotions, and events. Companies can use this information in product designs,
advertising campaigns, and more.
• Text summarization: Text summarization uses NLP techniques to digest huge
volumes of digital text and create summaries and synopses for indexes,
research databases, or busy readers who don't have time to read full text. The
best text summarization applications use semantic reasoning and natural
language generation (NLG) to add useful context and conclusions to
summaries.
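
For a sense of what the simplest form of text summarization looks like, the sketch
below picks the sentences whose words occur most often in the whole text. This is a
basic extractive, frequency-based technique, not the semantic reasoning and NLG
approach described above, and the stopword list, scoring rule, and function name are
all assumptions made for illustration.

```python
import re
from collections import Counter

# A small, hypothetical stopword list; real systems use much larger ones.
STOPWORDS = {
    "the", "a", "an", "of", "to", "and", "in", "is", "it",
    "that", "for", "on", "with", "as", "are", "was",
}


def content_words(text):
    """Lowercase word tokens with stopwords removed."""
    return [w for w in re.findall(r"[a-z']+", text.lower()) if w not in STOPWORDS]


def summarize(text, max_sentences=1):
    """Keep the highest-scoring sentences, returned in their original order."""
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    frequency = Counter(content_words(text))

    def score(sentence):
        words = content_words(sentence)
        # Average word frequency, so long sentences are not favored automatically.
        return sum(frequency[w] for w in words) / max(len(words), 1)

    ranked = sorted(range(len(sentences)), key=lambda i: score(sentences[i]), reverse=True)
    keep = sorted(ranked[:max_sentences])
    return " ".join(sentences[i] for i in keep)


if __name__ == "__main__":
    sample = (
        "Spam filters classify email. "
        "Spam filters scan email text for spam signals. "
        "The weather was nice."
    )
    print(summarize(sample, max_sentences=1))
```

Because it only copies existing sentences, a summarizer like this cannot add context
or conclusions of its own.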

Natural language processing and IBM Watson

• IBM has innovated in the artificial intelligence space by pioneering NLP-driven
tools and services that enable organizations to automate their complex
business processes while gaining essential business insights. These tools
include:
  ◦ Watson Discovery - Surface high-quality answers and rich insights from
  your complex enterprise documents - tables, PDFs, big data and more -
  with AI search. Enable your employees to make more informed decisions
  and save time with real-time search engine and text mining capabilities
  that perform text extraction and analyze relationships and patterns buried
  in unstructured data. Watson Discovery leverages custom NLP models
  and machine learning methods to provide users with AI that understands
  the unique language of their industry and business.
  ◦ Watson Natural Language Understanding (NLU) - Analyze text in
  unstructured data formats including HTML, webpages, social media, and
  more. Increase your understanding of human language by leveraging this
  natural language tool kit to identify concepts, keywords, categories,
  semantics, and emotions, and to perform text classification, entity
  extraction, named entity recognition (NER), sentiment analysis, and
  summarization.
  ◦ Watson Assistant - Improve the customer experience while reducing
  costs. Watson Assistant is an AI chatbot with an easy-to-use visual
  builder so you can deploy virtual agents across any channel, in
  minutes.
  ◦ IBM Watson Annotator for Clinical Data - Purpose-built for healthcare
  and life sciences domains, it extracts key clinical concepts from natural
  language text, like conditions, medications, allergies and procedures.
  Deep contextual insights and values for key clinical attributes develop
  more meaningful data. Potential data sources include clinical notes,
  discharge summaries, clinical trial protocols and literature data.
• For more information on how to get started with one of IBM Watson's natural
language processing technologies, visit the IBM Watson Natural Language
Processing page.
