DELHI TECHNOLOGICAL
UNIVERSITY
SE-316
NATURAL LANGUAGE PROCESSING
Department of Software Engineering
Delhi Technological University
Bawana Road, Delhi-110042
Submitted by
Prashant Tiwari
Roll Number: 2K20/IT/103
Batch: IT-B
Submitted to: Dr. Divyashikha Sethia
Department of Software Engineering
Delhi Technological University
INDEX

S. No.  Experiment                                                              Date
1.      Import nltk and download the 'stopwords' and 'punkt' packages           13-01-2023
2.      Import spacy and load the language model.                               13-01-2023
3.      WAP in Python to tokenize a given text.                                 20-01-2023
4.      WAP in Python to get the sentences of a text document.                  03-03-2023
5.      WAP in Python to tokenize text with stopwords as delimiters.            03-02-2023
6.      WAP in Python to add custom stop words in spaCy.                        03-02-2023
7.      WAP to remove punctuations, perform stemming, lemmatize given           24-02-2023
        text, and extract usernames from emails
8.      WAP to do spell correction and extract all nouns, pronouns,             07-03-2023
        and verbs in a given text
9.      WAP to find similarity between two words and classify a text            31-03-2023
        as positive/negative sentiment
EXPERIMENT - 1
AIM : Import nltk and download the ‘stopwords’ and ‘punkt’
packages
CODE :
import nltk
nltk.download('stopwords')
nltk.download('punkt')
OUTPUT :
EXPERIMENT - 2
AIM : Import spacy and load the language model
CODE :
import spacy

# Both models must be downloaded first, e.g. `python -m spacy download en_core_web_sm`
nlp_eng = spacy.load('en_core_web_sm')    # English pipeline
nlp_multi = spacy.load('xx_ent_wiki_sm')  # multilingual NER pipeline
OUTPUT :
EXPERIMENT - 3
AIM : WAP in Python to tokenize a given text
CODE :
from nltk import word_tokenize

text = ("Last week, the University of Cambridge shared its own research "
        "that shows if everyone wears a mask outside home, dreaded ‘second wave’ "
        "of the pandemic can be avoided.")
tokens = word_tokenize(text)
for t in tokens:
    print(t)
OUTPUT :
EXPERIMENT - 4
AIM : WAP in Python to get the sentences of a text document.
CODE :
with open('04.txt') as file:       # closes the file automatically
    input_text = file.read()

sentences = input_text.split('.')  # naive split on full stops
for sentence in sentences:
    print(sentence, '\n')
OUTPUT :
EXPERIMENT - 5
AIM : WAP in Python to tokenize text with stopwords as delimiters.
CODE :
text = ("Walter was feeling anxious. He was diagnosed today. He probably "
        "is the best person I know.")
stop_words_and_delims = ['was', 'is', 'the', '.', ',', '-', '!', '?']
for r in stop_words_and_delims:
    text = text.replace(r, 'DELIM')

words = [t.strip() for t in text.split('DELIM')]
words_filtered = [w for w in words if w != '']
for word in words_filtered:
    print(word)
OUTPUT :
EXPERIMENT - 6
AIM : WAP in Python to add custom stop words in spaCy.
CODE :
import spacy
nlp = spacy.load('en_core_web_sm')
custom_stop_words = ['was', 'is', 'the', 'JUNK', 'NIL', 'of', 'more',
                     '.', ',', '-', '!', '?', 'a']
for word in custom_stop_words:
    nlp.vocab[word].is_stop = True

doc = nlp("Jonas was a JUNK great guy NIL Adam was evil NIL Martha JUNK "
          "was more of a fool")
for token in doc:
    if not token.is_stop:
        print(token.text, end=" ")
OUTPUT :
EXPERIMENT - 7
AIM : WAP to remove punctuations, perform stemming,
lemmatize given text and extract usernames from emails
CODE :
punctuations = '''!()-[]{};:'"\,<>./?@#$%^&*_~'''
string = "Jonas!!! great \\guy <> Adam --evil [Martha] ;;fool() ."
ans = ""
for char in string:
    if char not in punctuations:
        ans += char
print(ans)
from nltk.stem import PorterStemmer
from nltk.tokenize import word_tokenize

text = ("Dancing is an art. Students should be taught dance as a subject "
        "in schools. I danced in many of my school functions. Some people "
        "are always hesitant to dance.")
stemmer = PorterStemmer()
tokens = word_tokenize(text)
ans = ""
for token in tokens:
    ans += stemmer.stem(token) + " "
print(ans)

from nltk.corpus import wordnet
from nltk.stem.wordnet import WordNetLemmatizer

lemmatizer = WordNetLemmatizer()
ans = ""
for token in word_tokenize(text):
    # lemmatize each token, treating it as a verb
    ans += lemmatizer.lemmatize(token, wordnet.VERB) + " "
print(ans)
from nltk.tokenize import word_tokenize

text = ("The new registrations are potter709@gmail.com , "
        "elixir101@gmail.com. If you find any disruptions, kindly contact "
        "granger111@gamil.com or severus77@gamil.com ")
tokens = word_tokenize(text)
usernames = []
for i in range(len(tokens)):
    # relies on the tokenizer splitting '@' into its own token
    if tokens[i] == "@":
        usernames.append(tokens[i - 1])
print(usernames)
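The tokenizer-based approach above depends on how word_tokenize splits email addresses. A sketch of an alternative using a regular expression, which captures the part before the '@' directly:

```python
import re

text = ("The new registrations are potter709@gmail.com , "
        "elixir101@gmail.com. If you find any disruptions, kindly contact "
        "granger111@gamil.com or severus77@gamil.com ")
# capture the username before '@'; the domain itself is matched but discarded
usernames = re.findall(r'(\w+)@[\w.]+', text)
print(usernames)
```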
OUTPUT :
EXPERIMENT - 8
AIM : WAP to do spell correction, extract all nouns, pronouns
and verbs in a given text
CODE :
from textblob import TextBlob
text="He is a gret person. He beleives in bod"
textb = TextBlob(text)
correct_text = textb.correct()
print(correct_text)
import nltk
from nltk import word_tokenize, pos_tag
text = ("James works at Microsoft. She lives in manchester and likes to "
        "play the flute")
tokens = word_tokenize(text)
parts_of_speech = nltk.pos_tag(tokens)
# NN = singular noun, NNP = proper noun, PRP = personal pronoun (per the AIM)
nouns_and_pronouns = [x for x in parts_of_speech if x[1] in ("NN", "NNP", "PRP")]
for word, tag in nouns_and_pronouns:
    print(word)
from nltk import pos_tag, word_tokenize

text = ("I may bake a cake for my birthday. The talk will introduce "
        "reader about Use of baking")
words = word_tokenize(text)
tags = pos_tag(words)       # tag the sentence once, not inside the loop
verb_phrases = []
for i in range(1, len(words)):
    if tags[i][1] == 'VB':  # base-form verb, e.g. following a modal
        verb_phrases.append(words[i - 1] + ' ' + words[i])
for phrase in verb_phrases:
    print(phrase)
OUTPUT :
EXPERIMENT - 9
AIM : WAP to find similarity between two words and classify a
text as positive/negative sentiment
CODE :
import spacy

# en_core_web_md ships word vectors; the small (sm) model does not
nlp = spacy.load('en_core_web_md')
tokens = nlp("amazing terrible excellent")
token1, token2, token3 = tokens[0], tokens[1], tokens[2]
print(f"Similarity between {token1} and {token2}:", token1.similarity(token2))
print(f"Similarity between {token1} and {token3}:", token1.similarity(token3))
from textblob import TextBlob
text = "It was a very pleasant day"
print(TextBlob(text).sentiment)
OUTPUT :