Skip to content

natthanaphop-isa/nlp-tutorials-codebasics

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

46 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

nlp-tutorials

Tutorials For Beginners For Natural Language Processing

NLP Tasks

  1. Text Classification
  2. Text Similarity
  • Sentence Encoder
  • Cosine Similarity
  1. Information Extraction
  • Sentence Segmentation
  • Word Tokenization
  • Part of Speech Tagging
    • Named Entity Recognition
    • Key Phrase Extraction
  • Syntactic Parsing
  • Coreference Resolution
  • Relation Extraction
  • Event Extraction
  1. Information Retrieval
  • TFIDF score
  • BERT
  • Google Search
  • Return the relevant website sorted by order
  1. Chatbots
  2. Machine Translator
  3. Language Modeling
  4. Text Summarization
  5. Voice Assistants
  6. Topic Modeling

NLP Pipeline

  1. Data Acquisition
  2. Text Extraction & Cleanup
  3. Pre-processing
  • Sentence Segmentation/tokenization
  • Word Tokenization
  • Stemming & Lemmatization
  1. Model Building
  • Feature Engineeering
  • Machine Learning
  1. Evaluation
  • Confusion Matrix 2D,3D
  • Accuracy, Precision, Recall, F1-score
  1. Model Deployment
  2. Monitor & Update

Text Presentation

  • One Hot Encoding
  • Bag of Words
  • TF-IDF
  • Word Embeddings

"Often in NLP, feeding a good text representation to an ordinary algorithm will get you much further compared to applying a top-notch algorithm to an ordinary text representation."

One hot encoding

  • Doesn't capture the meaning of a word
  • High memory consumption
  • Out of vocabulary problem
  • No fixed length representation

Bag of Words

  • Large memeory consumption
  • Doesn't capture the meaning of words properly

About

Tutorials For Beginners For Natural Language Processing

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 100.0%