Hw2sol PDF

Download as pdf or txt
Download as pdf or txt
You are on page 1of 7

CSE 105: Introduction to the Theory of Computation, Spring 2003 D.

Micciancio
Solutions of Problem Set 2 (Revised) May 7, 2003
Solutions of Problem Set 2 (Revised)
2.1 Given the grammar G:
E E +T | T
T T F | F
F ( E ) | a
Give the parse trees and derivations for each string.
a. The derivation is E T F a. The parse tree is shown in Figure 1.
b. The derivation is E E +T T +T F +T a +T a +F a + a. The parse
tree is shown in Figure 1.
c. The derivation is E E +T E +T +T T +T +T F +T +T a +T +T
a +F +T a + a +T a + a +F a + a + a. The parse tree is shown in Figure 1.
d. The derivation is E T F (E) (T) (F) ((E)) ((T)) ((F)) ((a)).
The parse tree is shown in Figure 1.
E
E
T
F
a
+
T
F
a
E
T
F
a
T
F
a
E
T
F
E
T
F
E
T
F
E
T
F
a
E
T
F
a
+
E
+
a
( )
(
)
Figure 1: 2.1 (a), (b), (c), and (d)
2.3 The given context-free grammar G is
1
CSE 105, Solutions of Problem Set 2 (Revised) 2
R XRX | S
S aTb | bTa
T XTX | X |
X a | b
a. The variables of G are { R, X, S, T }. The terminals of G are { a, b }. The start variable is
R.
b. Strings in L(G) are ab, ba, and aaaabbaaa.
c. Strings not in L(G) are aaa, aba, and bb.
d. T aba: false.
e. T

aba: true.
f. T T: false.
g. T

T: true.
h. XXX

aba: true.
i. X

aba: false.
j. T

XX: true.
k. T

XXX: true.
l. S

: false
m. The language generated by L is the language of all strings w over { a, b } such that w is
not palindrome, that is, w = w
R
.
2.6 b. L is the complement of the language { a
n
b
n
: n 0 }.
First, lets see what the complement of L looks like:
L = { a
n
b
m
: n = m} { (a b)

ba(a b)

}
Lets call the leftmost language L
1
and the rightmost L
2
.
The context-free grammar that generate L
1
is
S
1
aS
1
b | T | U
T aT | a
U Ub | b
The context-free grammar that generate L
2
is
S
2
RbaR
R RR| a | b |
Therefore, the context-free grammar G that generate L = L
1
L
2
is
S S
1
| S
2
S
1
aS
1
b | T | U
S
2
RbaR
T aT | a
U Ub | b
R RR| a | b |
CSE 105, Solutions of Problem Set 2 (Revised) 3
c. L = { w#x : w
R
is a substring of x for w, x {0, 1}

}.
The context-free grammar G that generate L is
S TR
T 0T0 | 1T1 | #R
R RR| 0 | 1 |
2.7 a). The language of strings over the alphabet { a, b } with twice as many as as bs. The PDA
that recognizes this language is shown in Figure 2.
a, h
, h
, $
, $
q
0
q
1
q
2
q
3
q
4
q
5
q
6
q
7
q
8
b, $ $
b, h
, h
, $
a, $ $
b, a
, a
a, a
a, a
b, h
, $ $
, a
Figure 2: 2.7 (a)
d). L = {x
1
#x
2
#x
k
| k 1, each x
i
{ a, b } and for some i and j, x
i
= x
R
j
}.
The PDA of Figure 3 recognizes L.
, $
q
0
q
1
, $
q
2
q
4
, $
,
,
a, a
b, b
,
a,
b,
b,
a,
q
3
q
5
q
6
q
7
#,
,
#,
,
#,
#,
,
a, a
b, b
Figure 3: 2.7 (d)
CSE 105, Solutions of Problem Set 2 (Revised) 4
2.8 This sentence the girl touches the boy with the ower has two distinct parse trees as shown
in Figure 4.
This shows that this CFG is ambiguous.
S
NP VP
CN
A N
the girl
CV
PP
touches
NP
P CN
A N
the ower
with
CN
A N
the boy
S
NP VP
CN CV
A N
NP
CN PP
A N P CN
A N
the girl
touches
the boy with
the ower
Figure 4: Problem 2.8
2.13 Given the grammar G:
S TT | U
T 0T | T0 | #
U 0U00 | #
a). The language generated by L = L(G) is the set of strings that either are composed by the
concatenation of 3 arbitrary-length strings of zeroes (delimited by the symbol #) or strings
of the form 0
k
#0
2k
for k 0. More formally, a word w in L is of the form w = 0
k
1
#0
k
2
#0
k
3
(where k
1
, k
2
, k
3
0) or of the form w = 0
k
#0
2k
for k 0.
b). Lets prove L is not regular. Toward a contradiction, assume L is regular. By the pumping
lemma, there exists a pumping length p > 0 such that for any word w L, |w| p, we can
split word w into 3 pieces w = xyz, |y| > 0, |xy| p, such that for any integer i 0, w

= xy
i
z
belongs to language L too. Now, consider the word w = 0
p
#0
2p
. Clearly this word belongs
to L and |w| p so it must sutisfy the conditions of the theorem. Any partition of word w
into 3 pieces xyz much be such that y is composed only by 0s from the leftmost sequence
of 0s (otherwise |xy| would be larger than p which is not allowed). Therefore, y = 0
k
for
k > 0. We now pump down this word, obtaining w

= xy
0
z. Clearly, word w

= 0
pk
#0
2p
does not belong to L because k > 0. So, any possible partition of w cannot be pumped for
all i > 0 (since i = 0 does not work!). We have obtained a contradiction. Therefore, L must
be non regular.
CSE 105, Solutions of Problem Set 2 (Revised) 5
2.25 Notice that Y generates all possible strings. The grammar generates all strings NOT of the
forma
k
b
k
for k 0. Thus the complement of the language generated is L(G) = {a
k
b
k
: k 0}.
The CFG for this language is very easy to generate:
S aSb|
2.26 C = { x#y : x, y {0, 1}

, and x = y }.
Solution: (Revised) The fundamental idea is that if x and y dier, there must be a position k
in both words such that x
k
= y
k
. Our PDA, while reading the rst string, will count symbols
(by pushing some other symbol x into the stack) until nondeterministically will decide to
store (remember) the current symbol in the stack. Then, when reading the second string it,
the PDA will count symbols (by poping the xs from the stack) until nondeterministically
will decide to check whether the current symbol matches the symbol stored in the stack. If
they dier, it will accept. Otherwise it will reject.
q
1
q
2
q
0
0, 0
, $ 1, 1
0, x
1, x
0,
1,
, $
0,
1,
0,
1,
q
3
0, x
1, x
q
4
0, x
1, x
q
5
q
6
#, 0
#, 1
Figure 5: 2.26. PDA recognizing language C.
1.20 Give a formal denition of the nite state transducer (FST) model from exercise 1.19, fol-
lowing the patterns of Denition 1.1 (page 35 in the textbook). Assume that an FST has
an input alphabet and an output alphabet but not a set of accepting states. Include a
formal denition of the computation of an FST.
A nite state transducer (FST) is a 5-tuple (Q, , , , q
0
) where
Q is a nite set of states,
is a nite of symbols, called the input alphabet,
is a nite of symbols, called the output alphabet,
: Q Q

is the transition function (where

= { }), and
q
0
is the start state.
The computation of an FST proceeds as follows. Let M = (Q, , , , q
0
) be a FST, w =
w
1
, . . . , w
n
be a string over alphabet and =
1
, . . . ,
n
be a string over alphabet

(ie. some of the


i
can be ). The computation of M on input w produces output if there
exists a sequence of states r
0
, r
1
, . . . , r
n
in Q such that the following conditions hold:
1. r
0
= q
0
,
CSE 105, Solutions of Problem Set 2 (Revised) 6
2. (r
i1
, w
i
) = (r
i
,
i
), for i = 1, . . . , n.
1.21 Using the solution you gave to Exercise 1.20, give a formal description of the machines T
1
and T
2
pictured in Exercise 1.19.
The description of FST T
1
= (Q
1
,
1
,
1
,
1
, s
1
) is as follows:
1. Q
1
= { q
1
, q
2
}
2.
1
= { 0, 1, 2 }
3.
1
= { 0, 1 }
4. s
1
= q
1
5.
1
is given by the following table:

1
0 1 2
q
1
q
1
/0 q
1
/1 q
2
/1
q
2
q
1
/0 q
2
/1 q
2
/1
The description of FST T
2
= (Q
2
,
2
,
2
,
2
, s
2
) is as follows:
1. Q
2
= { q
1
, q
2
, q
3
}
2.
2
= { a, b }
3.
2
= { 0, 1 }
4. s
2
= q
1
5.
2
is given by the following table:

2
a b
q
1
q
2
/1 q
3
/1
q
2
q
3
/1 q
1
/0
q
3
q
1
/0 q
2
/1
1.32 Say that string x is a prex of a string y if a string z exists where xz = y and that x is
a proper prex of y if in addition x = y. Show that the class of regular languages is closed
under the following operation:
a) NOPREFIX(A) = { w A : no proper prex of w is a member of A}.
Before doing anything, we need to be sure we understand the problem. To prove that a
language A is closed under a given operation (like NOPREFIX) we need to show that,
given regular language A we can show that the language L = NOPREFIX(A) is also
regular. How do we proceed? First, since the given language A is regular, we know there
exits a DFA (say M) that recognizes it. We will prove that L = NOPREFIX(A) is regular
by showing that there is a NFA (say N) that recognizes it. And how do we nd N? We build
N by properly modifying DFA M we say we construct an NFA for L using the DFA M.
Once this is done, the nal step is proving that the NFA we built is correct, namely that
it recognizes language L (formally, this means proving that if string x is recognized by NFA
CSE 105, Solutions of Problem Set 2 (Revised) 7
N then x belongs to L and, conversely, that if x belongs to L then it is recognized by N).
Details follow.
Assume there is a DFA M = (Q, , , q
0
, F) that accepts A. This means we are given all the
components of N, namely Q, , , q
0
and F we can used them as we wish.
The language we want to build an NFA for is
NOPREFIX(A) = { w A : no proper prex of w is a member of A}
An NFA for it doesnt seem quite straightforward to build. So then? A strategy that usually
works is to consider the complement of this language (or something close to the complement).
We consider the language formed by ALL the words w that do have a proper prex in A:
L
1
= { w

: some string y in A is a proper prex of w }


= { w

: there is y A and v

s.t. w = yv } .
We construct an NFA N for this language. Building a NFA means showing all its components
explicitly, meaning Q
1
,
1
,
1
and F
1
such that N = (Q
1
,
1
,
1
, F
1
). Here it is,
Q
1
= Q { q
f
}, where q
f
Q is a new state we have introduced,

1
= (no reason to change the input alphabet),

1
is dened as follows. For any q Q
1
and any s { }:
NOPREFIX(A) =

{ (q, s) } if q QF and s
{ (q, s) } { q
f
} if q F and s
{ q
f
} if q = q
f
and s
otherwise.
q
1
= q
0
F
1
= { q
f
}
We now prove the construction is correct.
If w is a string in L
1
, there is a string y in A which is a proper prex of w, or w = yx,
where x is not empty. If w is taken as the input of N, some computation on y will end
at an accepting state in M, and then some computation on the x part will end at the
state q
f
. This means w is accepted by N.
If w is a string accepted by N. Then, there is some computation that ends at q
f
. By
construction of N, that computation must arrive at one of the accepting states in M
before it ends at q
f
. If we call y to the proper prex of w used in that part of the
computation, we can see that M on input y will end up at one of its accepting states.
This means y is in A and w is a member of L
1
.
We conclude noticing that NOPREFIX(A) = A L
1
. Since the class of regular languages
is closed under intersection and complement, we have that NOPREFIX(A) is regular too.

You might also like