
BITI 3413: NATURAL LANGUAGE PROCESSING

SEM 1, 2023/2024

ASSIGNMENT 2

LECTURER’S NAME:

NAME MATRIC NO

Muhammad Adam Hafizi bin Hashim Tee B032110306

Muhammad Fakhrul Hazwan Bin Fahrurazi B032110357


i) Who is the creator and when was it introduced?

The Text-to-Text Transfer Transformer (T5) was created by a team of researchers at Google: Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, and Peter J. Liu. It was introduced in 2020 in their paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer".

ii) Purpose of the LLM model in NLP

In natural language processing (NLP), T5 was developed to provide a unified framework for many NLP tasks by transforming them into a text-to-text format, where both the input and the output are expressed as natural language text. This design simplifies the execution of various NLP tasks because a single model and a single training objective are used for all of them. T5 handles tasks such as translation, summarization, question answering and many more. The goal of T5 is to unify multiple NLP tasks in one framework, improving performance across diverse NLP applications and speeding up the process of building and deploying models.
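
As a concrete illustration of this text-to-text interface, the short sketch below runs two different tasks through one model by changing only the task prefix in the input string. It assumes the Hugging Face transformers library and the publicly released "t5-small" checkpoint; it is an illustrative example rather than part of the original T5 codebase.

from transformers import T5ForConditionalGeneration, T5Tokenizer

tokenizer = T5Tokenizer.from_pretrained("t5-small")
model = T5ForConditionalGeneration.from_pretrained("t5-small")

# Two different tasks, expressed purely as input text with a task prefix.
prompts = [
    "translate English to German: The house is wonderful.",
    "question: What does T5 stand for? context: T5 is the "
    "Text-to-Text Transfer Transformer introduced by Raffel et al.",
]

for prompt in prompts:
    input_ids = tokenizer(prompt, return_tensors="pt").input_ids
    output_ids = model.generate(input_ids, max_new_tokens=40)
    # The answer to every task comes back as decoded text.
    print(tokenizer.decode(output_ids[0], skip_special_tokens=True))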


iii) Model architecture (with diagram, if any)
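
T5 follows the standard encoder-decoder Transformer layout described by Raffel et al. (2020): a stack of encoder blocks reads the input text and a stack of decoder blocks generates the output text token by token, with each block built from attention and feed-forward sublayers. In place of a diagram, the structure can be inspected directly from a loaded checkpoint; the sketch below assumes the Hugging Face transformers library and the public "t5-small" weights.

from transformers import T5ForConditionalGeneration

model = T5ForConditionalGeneration.from_pretrained("t5-small")

# Printing the model lists the shared token embedding, the encoder and
# decoder stacks, the attention and feed-forward sublayers inside each
# block, and the language-model head that maps back to vocabulary tokens.
print(model)

# The config summarises the main architectural hyperparameters
# (d_model, number of layers, number of attention heads, ...).
print(model.config)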

iv) The methodologies of the LLM model development

a) Transformer Architecture

- The Transformer architecture, first presented by Vaswani et al. in their paper "Attention Is All You Need," serves as the foundation for T5. The Transformer is well suited to capturing long-range dependencies in sequential data such as natural language because it processes input sequences in parallel using a self-attention mechanism (sketched below).
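
The following is a minimal NumPy sketch of the scaled dot-product attention at the heart of that mechanism; the single head, the tiny dimensions, and the identity projections are simplifications of what a full Transformer block uses.

import numpy as np

def scaled_dot_product_attention(Q, K, V):
    # Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                   # token-to-token scores
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)    # row-wise softmax
    return weights @ V                                # weighted sum of values

# Toy input: a sequence of 4 tokens with 8-dimensional representations.
rng = np.random.default_rng(0)
x = rng.normal(size=(4, 8))

# In self-attention the queries, keys and values all come from the same
# sequence, so every token can attend to every other token in parallel.
out = scaled_dot_product_attention(x, x, x)
print(out.shape)  # (4, 8)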

b) Pre-training

- T5 is pre-trained on a large corpus containing diverse styles of text. During pre-training the model learns to predict missing segments of the input sequence, which teaches it to generate text that is consistent and contextually appropriate (an example of this input/target format is given below). This pre-training phase is essential for the model to capture general language patterns and semantic understanding.
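
A concrete way to picture this "predict the missing segments" objective is the span-corruption format described in the T5 paper, where dropped spans are replaced by sentinel tokens and the target reproduces only the missing text. The snippet below simply writes out such an input/target pair by hand; it illustrates the data format rather than the actual preprocessing pipeline.

# A sentence from the pre-training corpus.
original = "Thank you for inviting me to your party last week."

# Randomly chosen spans are replaced with sentinel tokens in the input,
# and the target lists each sentinel followed by the text it replaced,
# terminated by a final sentinel.
corrupted_input = "Thank you <extra_id_0> me to your party <extra_id_1> week."
target = "<extra_id_0> for inviting <extra_id_1> last <extra_id_2>"

print("pre-training input :", corrupted_input)
print("pre-training target:", target)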

c) Text-to-Text Framework

- T5 stands out due to its text-to-text framework: instead of relying on task-specific architectures, all NLP tasks share a common text generation format, so every task takes natural language text as input and produces natural language text as output. This unified approach simplifies training and enables a single model to handle a wide variety of NLP tasks, as illustrated below.
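
The illustrative (input text, target text) pairs below show how quite different tasks reduce to the same string-to-string format; the prefixes follow the conventions used for the public T5 checkpoints, and the targets are made up for illustration.

# Illustrative training pairs: every task is just "text in, text out".
examples = [
    ("translate English to German: That is good.", "Das ist gut."),
    ("summarize: <long news article> ...", "<short summary> ..."),
    ("cola sentence: The course is jumping well.", "not acceptable"),
    ("question: Who created T5? context: T5 was introduced by researchers "
     "at Google in 2020.", "researchers at Google"),
]

for source, target in examples:
    print(f"{source!r} -> {target!r}")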

d) Task Formulation

- For fine-tuning on specific NLP tasks, T5 uses task-specific prompts that frame each task as a text generation problem. This framing allows T5 to adapt to different tasks with a consistent methodology: the model is guided to approach every task as if it were generating text, even when the desired output is not strictly free-form text (for example, a class label). By framing tasks in this way, T5 can leverage its core text generation capabilities to tackle a wide range of NLP challenges; a minimal fine-tuning sketch follows.
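
The sketch below shows one such fine-tuning step, assuming PyTorch and the Hugging Face transformers library: a made-up sentiment example is framed as text generation by tokenizing a prefixed input and a short text target, and the model's built-in sequence-to-sequence loss is optimized.

import torch
from transformers import T5ForConditionalGeneration, T5Tokenizer

tokenizer = T5Tokenizer.from_pretrained("t5-small")
model = T5ForConditionalGeneration.from_pretrained("t5-small")
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)

# One (input, target) pair for a sentiment task, both expressed as text.
source = "sst2 sentence: this movie was surprisingly touching"
target = "positive"

batch = tokenizer(source, return_tensors="pt")
labels = tokenizer(target, return_tensors="pt").input_ids

# With labels provided, the model returns the cross-entropy loss over the
# target tokens, so one fine-tuning step is an ordinary optimization step.
outputs = model(**batch, labels=labels)
outputs.loss.backward()
optimizer.step()
optimizer.zero_grad()
print(float(outputs.loss))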

e) Multi-Task and Large-Scale Learning

- T5, or Text-To-Text Transfer Transformer, demonstrates improved performance through a combination of multi-task learning and large-scale training. Multi-task learning is employed in both the pre-training and fine-tuning stages, enabling the model to tackle multiple tasks simultaneously. This approach capitalizes on the knowledge shared across tasks, enhancing the model's overall capabilities (one common way of mixing tasks is sketched below). Additionally, T5 leverages the advantages of large-scale training, involving extensive datasets and powerful hardware such as GPUs or TPUs. Exposure to a diverse range of data allows the model to learn intricate patterns and relationships. The synergy of multi-task learning and large-scale training contributes to T5's effectiveness in understanding and generating human-like text across various language tasks.
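
One simple way to realise this multi-task setup, sketched below under simplified assumptions, is to mix examples from several task-specific datasets into a single training stream, sampling each task in proportion to its size (the T5 paper also studies other mixing strategies).

import random

# Toy task datasets; in practice these are large corpora of
# (input text, target text) pairs, one collection per task.
tasks = {
    "translation":   [("translate English to German: Hello.", "Hallo.")] * 100,
    "summarization": [("summarize: <document> ...", "<summary> ...")] * 300,
    "qa":            [("question: ... context: ...", "<answer> ...")] * 50,
}

def sample_mixed_batch(tasks, batch_size=8, seed=0):
    """Examples-proportional mixing: bigger tasks are sampled more often."""
    rng = random.Random(seed)
    names = list(tasks)
    weights = [len(tasks[name]) for name in names]
    batch = []
    for _ in range(batch_size):
        name = rng.choices(names, weights=weights, k=1)[0]
        batch.append(rng.choice(tasks[name]))
    return batch

for source, target in sample_mixed_batch(tasks):
    print(source[:45], "->", target)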

f) Evaluation and Iterative Improvement

- The development of the model follows an iterative process of continuous evaluation and refinement. Researchers assess the model's performance on benchmark datasets covering diverse NLP tasks, pinpoint areas that need improvement, and iteratively adjust both the model architecture and the training methodology.
v) Advantages and Weakness of the LLM

The main advantage of T5 is its flexibility: the text-to-text design can handle different kinds of natural language processing tasks simply by changing the input and output formats. This makes model development easier, since there is no need for task-specific architectures. T5 can translate, summarize, answer questions, and classify text, which shows that it is versatile and efficient at solving many language problems.

Another key strength of T5 lies in its extensive pre-training on massive datasets, allowing it to glean insights from a wide range of language patterns and structures. This large-scale pre-training contributes significantly to the model's proficiency in capturing nuanced linguistic features, thereby enhancing its overall performance on downstream tasks. The foundational knowledge acquired during pre-training positions T5 as a robust and effective language model, capable of understanding and generating coherent text across diverse contexts.

T5's strength is further exemplified by its consistently strong performance, achieving state-of-the-art results on prominent NLP benchmarks such as GLUE and SuperGLUE. This indicates an exceptional ability to grasp complex language structures and patterns, translating into high-quality outputs across a multitude of tasks. The model's success on these benchmarks underscores its effectiveness and competitiveness in the rapidly evolving landscape of NLP research and applications.


Moreover, T5 leverages transfer learning as a key methodology to bolster its performance on downstream tasks. By first pre-training on a vast corpus of data, T5 acquires a broad understanding of general language patterns and is then fine-tuned for specific applications. This transfer learning approach enhances T5's adaptability, allowing it to apply previously gained knowledge to new, task-specific challenges. Its versatility in handling various NLP tasks positions it as a powerful tool for researchers and practitioners seeking a comprehensive and adaptable solution.

Despite its impressive performance in natural language processing, the T5 model also introduces notable challenges. One significant drawback is its substantial size, with the largest variant more than thirty times larger than models like BERT; this hinders accessibility for researchers and practitioners relying on commodity GPU hardware and increases cost and difficulty. In addition, the model's susceptibility to brittleness and un-human-like failures underscores the ongoing complexity of achieving robust, human-like language understanding, particularly in real-world applications.

Additionally, the success of T5 highlights the pressing need for improved evaluation methodologies in the NLP community. Creating clean, challenging, and realistic test datasets remains difficult, which emphasizes the necessity of establishing fair benchmarks that accurately assess the capabilities of these advanced language models. This recognition of evaluation shortcomings is a call for continued efforts to improve the reliability of assessments and to drive progress in the field.


Furthermore, the ethical implications of biases present in the training data of models like T5 are a significant concern. Learned biases related to race, gender, and nationality can make the deployment of such models in real-world applications potentially illegal or unethical, necessitating careful debiasing efforts by product engineers. This underscores the importance of addressing biases in a task-independent manner, which remains a substantial open problem in NLP, and emphasizes the critical role of ethical considerations in the deployment of advanced language models.

In conclusion, T5 represents a groundbreaking advance in natural language processing, showcasing remarkable flexibility with its text-to-text design. Through extensive pre-training on massive datasets, T5 attains a deep understanding of linguistic nuances and consistently achieves state-of-the-art performance on benchmarks like GLUE and SuperGLUE. While recognizing these strengths, it is crucial to acknowledge the challenges tied to its substantial size and the ethical considerations surrounding bias. As T5 shapes the NLP landscape, its successes and challenges propel ongoing research, fostering progress and ethical deployment in the dynamic field of large language models.

vi) Include one NLP application that uses the LLM

One application that uses the T5 large language model is text summarization, which involves generating concise and coherent summaries that capture the important information from longer pieces of text. When T5 is used for text summarization, the model is fine-tuned on a dataset that contains pairs of longer documents and their corresponding human-written summaries. During training, the input is the document and the output is the summary; the model learns to understand the content of the document and to produce a summary that captures the key information in a human-like way.
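
As a sketch of what inference looks like for such a summarization model, assuming the Hugging Face transformers library and the public "t5-small" checkpoint (whose training mixture already included a "summarize:" prefix), the document is passed in as prefixed text and the summary is decoded from the generated tokens.

from transformers import T5ForConditionalGeneration, T5Tokenizer

tokenizer = T5Tokenizer.from_pretrained("t5-small")
model = T5ForConditionalGeneration.from_pretrained("t5-small")

document = (
    "summarize: The Text-to-Text Transfer Transformer (T5) frames every NLP "
    "task as mapping an input string to an output string. It is pre-trained "
    "on a large corpus and can then be fine-tuned on task-specific data such "
    "as pairs of documents and human-written summaries."
)

input_ids = tokenizer(document, return_tensors="pt", truncation=True).input_ids
summary_ids = model.generate(
    input_ids,
    num_beams=4,          # beam search tends to give more fluent summaries
    max_new_tokens=60,
    early_stopping=True,
)
print(tokenizer.decode(summary_ids[0], skip_special_tokens=True))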

T5 is powerful, but the quality of its summaries depends on the training data and the fine-tuning process. Continuous evaluation and refinement are necessary to ensure that the generated summaries meet high standards of accuracy and informativeness.
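
One standard way to carry out that evaluation is to compare generated summaries against the human-written references with ROUGE. The sketch below assumes the rouge_score package (installable as rouge-score) and uses made-up strings in place of real model output.

from rouge_score import rouge_scorer

reference = ("T5 casts every NLP task as text-to-text and is fine-tuned on "
             "pairs of documents and human-written summaries.")
generated = ("T5 treats all NLP tasks as text-to-text and is fine-tuned for "
             "summarization on document-summary pairs.")

scorer = rouge_scorer.RougeScorer(["rouge1", "rouge2", "rougeL"], use_stemmer=True)
scores = scorer.score(reference, generated)

# Each score has precision, recall and F1; higher overlap with the human
# reference generally indicates a more faithful summary.
for name, value in scores.items():
    print(name, round(value.fmeasure, 3))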

vii) References (include 2-5 article papers that you referred when preparing your article)

Raffel, C., Shazeer, N., Roberts, A., Lee, K., Narang, S., Matena, M., Zhou, Y., Li, W., & Liu, P. J. (2020). Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer. Journal of Machine Learning Research, 21(140), 1–67. https://jmlr.org/papers/volume21/20-074/20-074.pdf

T5 - a lazy data science guide. (n.d.). https://mohitmayank.com/a_lazy_data_science_guide/natural_language_processing/T5/

Mishra, P. (2021, December 14). Understanding T5 Model: Text to Text Transfer Transformer model. Medium. https://towardsdatascience.com/understanding-t5-model-text-to-text-transfer-transformer-model-69ce4c165023

Bahani, M., Ouaazizi, A. E., & Maalmi, K. (2023). The effectiveness of T5, GPT-2, and BERT on text-to-image generation task. Pattern Recognition Letters, 173, 57–63. https://doi.org/10.1016/j.patrec.2023.08.001

T5. (n.d.). Hugging Face Transformers documentation. https://huggingface.co/docs/transformers/model_doc/t5
