
Compiler Design Chapter 03 : Syntax analysis

CHAPTER 03:
SYNTAX ANALYSIS

The second phase of a compiler is called syntax analysis. The input to this phase is the stream of tokens produced by the lexical analysis phase. These tokens are checked for proper syntax, i.e., the compiler checks that statements and expressions are correctly formed. Some examples of syntax errors in Java are:

x = (2+3) * 9); // mismatched parentheses

if x>y x = 2; // missing parentheses

while (x==3) do f1(); // invalid keyword do

When the compiler encounters such an error, it should emit an informative message for the user. At this point, it is not necessary for the compiler to generate an object program. A compiler is not expected to guess the intended purpose of a program with syntax errors. A good compiler, however, will continue scanning the input for additional syntax errors.

1. Context-Free Grammars (CFG)


A CFG is used to specify the syntax of a language. A grammar naturally describes the hierarchical
structure of most programming language constructs.
1.1. Formal Definition of Grammars
A context-free grammar has four components:
1. A set of terminal symbols, sometimes referred to as "tokens." The terminals are the
elementary symbols of the language defined by the grammar.
2. A set of nonterminals, sometimes called "syntactic variables." Each non-terminal represents
a set of strings of terminals, in a manner we shall describe.


3. A set of productions, where each production consists of a nonterminal, called the head or left
side of the production, an arrow, and a sequence of terminals and/or nonterminals, called the
body or right side of the production. The intuitive intent of a production is to specify one of the
written forms of a construct; if the head nonterminal represents a construct, then the body
represents a written form of that construct.
4. A designation of one of the nonterminals as the start symbol. A production is for a nonterminal
if the nonterminal is the head of the production. A string of terminals is a sequence of zero or
more terminals. The string of zero terminals, written as ε, is called the empty string.
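The four components can be sketched as a small Python data structure; the expression grammar below is a hypothetical example chosen for illustration, not one defined in this chapter:

```python
# A minimal sketch of the four CFG components. The grammar shown
# (terminals, nonterminals, productions, start symbol) is an assumed
# arithmetic-expression grammar used only for illustration.

grammar = {
    "terminals":    {"+", "*", "(", ")", "id"},
    "nonterminals": {"E"},
    # Each production maps a head nonterminal to a list of bodies;
    # a body is a tuple of terminals and/or nonterminals.
    "productions": {
        "E": [("E", "+", "E"), ("E", "*", "E"), ("(", "E", ")"), ("id",)],
    },
    "start": "E",
}

# Sanity check: every head is a nonterminal, every body symbol is known.
for head, bodies in grammar["productions"].items():
    assert head in grammar["nonterminals"]
    for body in bodies:
        for sym in body:
            assert sym in grammar["terminals"] | grammar["nonterminals"]
print("grammar is well-formed")
```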
1.2. Derivations
A grammar derives strings by beginning with the start symbol and repeatedly replacing a
nonterminal by the body of a production for that nonterminal. The terminal strings that can be
derived from the start symbol form the language defined by the grammar.

Leftmost and Rightmost Derivation of a String

• Leftmost derivation − a leftmost derivation is obtained by applying a production to the
leftmost nonterminal in each step.

• Rightmost derivation − a rightmost derivation is obtained by applying a production to the
rightmost nonterminal in each step.
Example: let the production rules of a CFG be
X → X+X | X*X | X | a over the alphabet {a}.
The leftmost derivation for the string "a+a*a" is:
X → X+X → a+X → a+X*X → a+a*X → a+a*a
The rightmost derivation for the same string "a+a*a" is:
X → X*X → X*a → X+X*a → X+a*a → a+a*a
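The leftmost derivation above can be traced mechanically; this short Python sketch (with a helper function of our own naming) replaces the leftmost X at each step:

```python
# Replay the leftmost derivation of "a+a*a" from the text,
# using the grammar X -> X+X | X*X | a.
productions = {"X": ["X+X", "X*X", "a"]}   # grammar from the text

def apply_leftmost(sentential, body):
    """Replace the leftmost occurrence of X with a production body."""
    i = sentential.index("X")
    return sentential[:i] + body + sentential[i + 1:]

# X => X+X => a+X => a+X*X => a+a*X => a+a*a
steps = ["X"]
for body in ["X+X", "a", "X*X", "a", "a"]:
    steps.append(apply_leftmost(steps[-1], body))
print(" => ".join(steps))   # prints X => X+X => a+X => a+X*X => a+a*X => a+a*a
```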


1.3. Derivation or Yield of a Tree


The derivation or the yield of a parse tree is the final string obtained by concatenating the labels
of the leaves of the tree from left to right, ignoring nulls. If all the leaves are null, the
yield is null. A parse tree pictorially shows how the start symbol of a grammar derives a
string in the language. If nonterminal A has a production A → XYZ, then a parse tree may have
an interior node labeled A with three children labeled X, Y, and Z, from left to right:

Given a context-free grammar, a parse tree according to the grammar is a tree with the following
properties:
1. The root is labeled by the start symbol.
2. Each leaf is labeled by a terminal or by ε.
3. Each interior node is labeled by a nonterminal.
If A is the nonterminal labeling some interior node and X1, X2, ..., Xn are the labels of the
children of that node from left to right, then there must be a production A → X1X2...Xn. Here,
X1, X2, ..., Xn each stand for a symbol that is either a terminal or a nonterminal. As a special
case, if A → ε is a production, then a node labeled A may have a single child labeled ε.
2. Parse Tree
A parse tree is a graphical representation of a derivation that filters out the order in which
productions are applied to replace nonterminals.
- Each interior node of a parse tree represents the application of a production.
- All interior nodes are nonterminals and all leaf nodes are terminals.
- The leaf nodes, read from left to right, form the yield of the parse tree.
If a node n is labeled A and has children n1, n2, ..., nk with labels X1, X2, ..., Xk respectively,
then there must be a production A → X1X2...Xk in the grammar.
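These properties can be illustrated with a minimal parse-tree sketch; the Node class and the example tree for (id + id), built under an assumed grammar E → (E) | E+E | id, are our own illustrations:

```python
# A minimal parse-tree sketch: interior nodes carry nonterminal labels,
# leaves carry terminals (or "" for an epsilon leaf). The yield is the
# left-to-right concatenation of the leaf labels, as described above.

class Node:
    def __init__(self, label, children=None):
        self.label = label
        self.children = children or []   # empty list => leaf

def tree_yield(node):
    """Concatenate leaf labels left to right; epsilon leaves carry ""."""
    if not node.children:
        return node.label
    return "".join(tree_yield(c) for c in node.children)

# Parse tree for "(id+id)" under the assumed grammar E -> (E) | E+E | id:
tree = Node("E", [
    Node("("),
    Node("E", [Node("E", [Node("id")]), Node("+"), Node("E", [Node("id")])]),
    Node(")"),
])
print(tree_yield(tree))   # prints (id+id)
```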
Example 1: the parse tree for the input string (id + id) using the above context-free grammar is:


The following figure shows the step-by-step construction of the parse tree for the input
string (id + id) using the CFG.

Example 2: the parse tree for the input string id+id*id using the above context-free grammar is:

3. Ambiguity
A grammar can have more than one parse tree generating a given string of terminals. Such a
grammar is said to be ambiguous. To show that a grammar is ambiguous, all we need to do is
find a terminal string that is the yield of more than one parse tree. Since a string with more than
one parse tree usually has more than one meaning, we need either to design unambiguous grammars
for compiling applications, or to use ambiguous grammars with additional rules to resolve the
ambiguities.
Example: the string W = id+id+id has two distinct parse trees under such a grammar, so the
grammar is ambiguous.
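One way to see the ambiguity concretely is to count the parse trees a string admits. The sketch below does this for the assumed grammar E → E+E | id by brute force; the function name is ours:

```python
# Count distinct parse trees for a token list under the assumed ambiguous
# grammar E -> E+E | id. For id+id+id there are two trees, corresponding
# to the groupings (id+id)+id and id+(id+id).

def count_trees(tokens):
    """Count parse trees for the tokens under E -> E+E | id."""
    if tokens == ["id"]:
        return 1
    total = 0
    # Try every '+' as the top-level operator of an E -> E+E production.
    for i, tok in enumerate(tokens):
        if tok == "+":
            total += count_trees(tokens[:i]) * count_trees(tokens[i + 1:])
    return total

print(count_trees(["id", "+", "id", "+", "id"]))  # prints 2
```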

4. Eliminating ambiguous grammars

A grammar whose ambiguity produces more than one parse tree (equivalently, more than one
leftmost or rightmost derivation) for some string can often be made unambiguous by rewriting
the grammar.

4.1. Eliminating left-recursion

Because we try to generate a leftmost derivation by scanning the input from left to right,
grammars of the form A → Aα may cause endless recursion. Such grammars are called left-
recursive, and they must be transformed if we want to use a top-down parser.

• A grammar is left-recursive if, for some nonterminal A, there is a derivation A ⇒+ Aα.


• To eliminate direct left recursion, replace

1) A → Aα | β with A → βA', A' → αA' | ε

2) A → Aα1 | Aα2 | ... | Aαm | β1 | β2 | ... | βn with A → β1B | β2B | ... | βnB

B → α1B | α2B | ... | αmB | ε
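The replacement rules above can be sketched as a small Python function; representing production bodies as tuples of symbols, and the function name itself, are our own choices:

```python
def eliminate_left_recursion(head, bodies):
    """Rewrite A -> Aα1|...|Aαm | β1|...|βn  as
       A -> β1A'|...|βnA'  and  A' -> α1A'|...|αmA' | ε."""
    alphas = [b[1:] for b in bodies if b and b[0] == head]   # A -> A alpha
    betas  = [b     for b in bodies if not b or b[0] != head]  # A -> beta
    if not alphas:
        return {head: bodies}           # no direct left recursion
    new = head + "'"
    return {
        head: [beta + (new,) for beta in betas],
        new:  [alpha + (new,) for alpha in alphas] + [()],   # () denotes ε
    }

# Classic example: E -> E+T | T  becomes  E -> T E',  E' -> + T E' | ε
print(eliminate_left_recursion("E", [("E", "+", "T"), ("T",)]))
```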

4.2. Left-factoring

Left factoring is a grammar transformation that is useful for producing a grammar suitable for
predictive parsing. When it is not clear which of two alternative productions to use to expand a
non-terminal A, we can rewrite the A-productions to defer the decision until we have seen
enough of the input to make the right choice.
• Consider S → if E then S else S | if E then S


• Which of the two productions should we use to expand nonterminal S when the next
token is if?
We can solve this problem by factoring out the common part of these rules.
A → αβ1 | αβ2 | ... | αβn | γ becomes A → αB | γ, where B → β1 | β2 | ... | βn
Consider the grammar G:
S → iEtS | iEtSeS | a
E → b
After left factoring, this grammar becomes
S → iEtSS' | a
S' → eS | ε
E → b
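The transformation can be sketched as a one-step left-factoring function in Python; the tuple representation and helper name are assumptions of ours, and the dangling-else grammar above is used as the test case:

```python
from collections import defaultdict

def left_factor(head, bodies):
    """One step of left factoring: pull out the longest common prefix
    of alternatives that begin with the same symbol."""
    groups = defaultdict(list)
    for body in bodies:
        groups[body[:1]].append(body)            # group by first symbol
    new_rules = {head: []}
    for group in groups.values():
        if len(group) == 1:
            new_rules[head].append(group[0])     # no factoring needed
            continue
        # Longest common prefix of the group.
        k = 0
        while all(len(b) > k for b in group) and \
              all(b[k] == group[0][k] for b in group):
            k += 1
        new = head + "'"
        new_rules[head].append(group[0][:k] + (new,))
        new_rules[new] = [b[k:] for b in group]  # () plays the role of ε
    return new_rules

# S -> iEtS | iEtSeS | a   (i = if, t = then, e = else, as in the text)
rules = left_factor("S", [("i", "E", "t", "S"),
                          ("i", "E", "t", "S", "e", "S"),
                          ("a",)])
print(rules)   # S -> iEtSS' | a,  S' -> ε | eS
```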
