Properties of Regular Languages

So far we have seen different ways of specifying regular language: DFA, NFA, ε-NFA, regular expressions and regular grammar. We noted that all these different expressions are equal in power by showing the equivalences. Regular expressions and grammars are considered as generators of regular language while the machines (DFA, NFA, ε-NFA) are considered as acceptors of the language.

Now we will look at the properties of regular language. The properties can be broadly classified as two parts: (A) Closure properties and (B) Decision properties

(A) Closure Properties

1. Complementation

If a language L is regular its complement L' is regular.

Let DFA(L) denote the DFA for the language L. Modify the DFA as follows to obtain DFA(L').

Change the final states to non-final states.
Change the non-final states to final states.

Since there exists a DFA(L') now, L' is regular.

This can be shown by an example using a DFA. Let L denote the language containing strings that begins and ends with a. Σ = {a, b}. The DFA for L is given below.

Note: q₃ denotes the dead state.
Once you enter q₃, you remain in it forever.

L' denotes the language that does not contain strings that begin and end with a. This implies L' contains strings that

begins with a and ends with b
begins with b and ends with a
begins with b and ends with b

The DFA for L' is obtained by flipping the final states of DFA(L) to non-final states and vice-versa. The DFA for L' is given below.

q₀ ensures ε is accepted

q₁ ensures all strings that begin with a and end with b are accepted.

q₃ ensures all strings that begin with b (ending with either a or b) are accepted.

Important Note: While specifying the DFA for L, we have also included the dead state q₃. It is important to include the dead state(s) if we are going to derive the complement DFA since, the dead state(s) too would become final in the complementation. If we didn't add the dead state(s) originally, the complement will not accept all strings supposed to be accepted.

In the above example, if we didn't include q₃ originally, the complement will not accept strings starting with b. It will only accept strings that begin with a and end with b which is only a subset of the complement.

CONCLUSION: REGULAR LANGUAGES ARE CLOSED UNDER COMPLEMENTATION.

2. Union

If L₁ and L₂ are regular, then L₁ ∪ L₂ is regular.

This is easier proved using regular expressions. If L₁ is regular, there exists a regular expression R1 to describe it. Similarly, if L₂ is regular, there exists a regular expression R2 to describe it. R1 + R2 denotes the regular expression that describe L₁ ∪ L₂. Therefore, L₁ ∪ L₂ is regular.

This again can be shown using an example. If L₁ is a language that contains strings that begin with a and L₂ is a language that contain strings that end with a, then L₁ ∪ L₂ denotes the language the contain strings that either begin with a or end with a.

- a(a+b)* is the regular expression that denotes L₁.

- (a+b)*a is the regular expression that denotes L₂.

- L₁ ∪ L₂ is denoted by the regular expression a(a+b)* + (a+b)*a. Therefore, L₁ ∪ L₂ is regular.

In terms of DFA, we can say that a DFA(L₁ ∪ L₂) accepts those strings that are accepted by either DFA(L₁) or DFA(L₂) or both.

DFA(L₁ ∪ L₂) can be constructed by adding a new start state and new final state.
The new start state connects to the two start states of DFA(L₁) and DFA(L₂) by εtransitions.
Similarly, two ε transitions are added from the final states of DFA(L₁) and DFA(L₂) to the new final state.
Convert this resulting NFA to its equivalent DFA.

As an exercise you can try this approach of DFA construction for union for the given example.

CONCLUSION: REGULAR LANGUAGES ARE CLOSED UNDER UNION.

3. Intersection

If L₁ and L₂ are regular, then L₁ ∩ L₂ is regular.

Since a language denotes a set of (possibly infinite) strings and we have shown above that regular languages are closed under union and complementation, by De Morgan's law can be applied to show that regular languages are closed under intersection too.

L₁ and L₂ are regular ⇒ L₁' and L₂' are regular (by Complementation property)
L₁' ∪ L₂' is regular (by Union property)
L₁ ∩ L₂ is regular (by De Morgan's law)

In terms of DFA, we can say that a DFA(L₁ ∩ L₂) accepts those strings that are accepted by both DFA(L₁) and DFA(L₂).

CONCLUSION: REGULAR LANGUAGES ARE CLOSED UNDER INTERSECTION.

4. Concatenation

If L₁ and L₂ are regular, then L₁ . L₂ is regular.

This can be easily proved by regular expressions. If R1 is a regular expression denoting L₁ and R2 is a regular expression denoting L₂, then we R1 . R2 denotes the regular expression denoting L₁ . L₂. Therefore, L₁ . L₂ is regular.

In terms of DFA, we can say that a DFA(L₁ . L₂) can be constructed by adding an ε-trainstion from the final state of DFA(L₁) - which now ceases to be the final state - to the start state of DFA(L₂). You can try showing this using an example.

CONCLUSION: REGULAR LANGUAGES ARE CLOSED UNDER CONCATENATION.

5. Kleene star

If L is regular, then L* is regular.

This can be easily proved by regular expression. If L is regular, then there exists a regular expression R. We know that if R is a regular expression, R* is a regular expression too. R* denotes the language L*. Therefore L* is regular.

In terms of DFA, in the DFA(L) we add two ε transitions, one from start state to final state and another from final state to start state. This denotes DFA(L*). You can try showing this for an example.

CONCLUSION: REGULAR LANGUAGES ARE CLOSED UNDER KLEENE STAR.

6. Difference

If L₁ and L₂ are regular, then L₁ - L₂ is regular.

We know that L₁ - L₂ = L₁ ∩ L₂'

L₁ and L₂ are regular ⇒ L₁ and L₂' are regular (by Complementation property)
L₁ ∩ L₂' is regular (by Intersection property)
L₁ - L₂ is regular (by De Morgan's law)

In terms of DFA, we can say that a DFA(L₁ - L₂) accepts those strings that are accepted by both DFA(L₁) and not accepted by DFA(L₂). You can try showing this for an example.

CONCLUSION: REGULAR LANGUAGES ARE CLOSED UNDER DIFFERENCE.

7. Reverse

If L is regular, then L^R is regular.

Let DFA(L) denote the DFA of L. Make the following modifications to construct DFA(L^R).

Change the start state of DFA(L) to the final state.
Change the final state of DFA(L) to the start state.

In case there are more than one final state in DFA(L), first add a new final state and add ε- transitions from the final states (which now cease to be final states any more) and perform this step.

Reverse the direction of the arrows.

You can try showing this using an example.

CONCLUSION: REGULAR LANGUAGES ARE CLOSED UNDER REVERSAL.