pumping lemma (thing) by ariels

Let L be a regular language. That is, L is a set of words which is recognised by a finite state automaton (or, equivalently, by some regular expression). Consider a DFA recognising L, and suppose L is infinite. Suppose a DFA D recognising L has k states. We may write the steps D takes when recognising a word w=w₁w₂...w_n above the letters:

s₁s₂...s_n
w₁w₂...w_n

This just means that after reading w₁...w_k, D is in state s_k.

Then for a word w in L of length >k, due to the pigeonhole principle, some state s must repeat itself: s=s_i=s_j, i < j. Then D must also accept

w₁w₂...w_iw_j+1...w_n
w₁w₂...w_i...w_jw_i+1...w_j...w_n

and so on. In fact, if we write x=w₁...w_i, y=w_i+1...w_j, z=w_j+1...w_n (x and y may both be empty), then w=xyz, and for all nonnegative integers m, the word xy^mz is also in L.

We've just proved

[The pumping] lemma: Let L be an infinite regular language. Then there exists some k such that if w∈L has more than k letters, then w=xyz where
For all j≥0, xy^kz∈L.

The pumping lemma is useful for proving certain languages are not regular. This approach is usually taken in introductory computation courses (see e..g pumping lemma proof that the balanced braces language is not regular). However, in practice the Myhill / Nerode theorem is much more useful for proving that languages are not regular.

Pumping lemmas also exist for CFGs (context free grammars); this is noded below.

Myhill / Nerode Theorem	balanced braces language	Words that sound dirty but really aren't	Now here it is, your moment of Zen
DFA	pumping lemma proof that the balanced braces language is not regular	pigeonhole principle	regular expression
Lemma	finite state automaton	pumping lemma proof that the "a^n b^n" language is not regular	regular language
Proof by intimidation	context-free language	GNU	"1^p" p prime language
Stirling numbers	Node your homework	Zorn's lemma	CFL
context-free grammar	the expressive power of regular expressions	Derivable	Information wants to be free