Lecture notes 20190329

Denotational Semantics 2

Remark. Some material in this lecture is from << Software Foundation >> volume 1 and volume 2.

Require Import PL.Imp6.

Review: Programs' Denotational Semantics

We have learnt how to define integer expression's denotational semantics. We can define it using Coq's recursive function.

Module Relation_Operators.

Definition id {A: Type}: A → A → Prop := fun a b ⇒ a = b.

Definition empty {A B: Type}: A → B → Prop := fun a b ⇒ False.

Definition concat {A B C: Type} (r₁: A → B → Prop) (r₂: B → C → Prop): A → C → Prop :=
fun a c ⇒ ∃b, r₁ a b ∧ r₂ b c.

Definition filter1 {A B: Type} (f: A → Prop): A → B → Prop :=
fun a b ⇒ f a.

Definition filter2 {A B: Type} (f: B → Prop): A → B → Prop :=
fun a b ⇒ f b.

Definition union {A B: Type} (r₁ r₂: A → B → Prop): A → B → Prop :=
fun a b ⇒ r₁ a b ∨ r₂ a b.

Definition intersection {A B: Type} (r₁ r₂: A → B → Prop): A → B → Prop :=
fun a b ⇒ r₁ a b ∧ r₂ a b.

Definition omega_union {A B: Type} (rs: nat → A → B → Prop): A → B → Prop :=
fun st₁ st₂ ⇒ ∃n, rs n st₁ st₂.

End Relation_Operators.

Import Relation_Operators.

Definition aexp_dequiv (d₁ d₂: state → Z): Prop :=
∀st, d₁ st = d₂ st.

Definition aexp_equiv (a₁ a₂: aexp): Prop :=
aexp_dequiv (aeval a₁) (aeval a₂).

Definition if_sem
  (b: bexp)
  (then_branch else_branch: state → state → Prop)
  : state → state → Prop
:=
  union
    (intersection then_branch (filter1 (beval b)))
    (intersection else_branch (filter1 (beval (BNot b)))).

Soppose then_branch and else_branch are denotations of the then-branch and else-branch of an if-command. Here, the first clause of union the set of program state pairs (st₁, st₂) such that:

(st₁, st₂) belongs to then_branch and b is true on st₁

And similarly, the second clause of union the set of program state pairs (st₁, st₂) such that:

(st₁, st₂) belongs to else_branch and b is false on st₁

The union of them is the semantics of an if-command.

Fixpoint iter_loop_body
  (b: bexp)
  (loop_body: state → state → Prop)
  (n: nat)
  : state → state → Prop
:=
  match n with
  | O ⇒
         intersection
           id
           (filter1 (beval (BNot b)))
  | S n' ⇒
            intersection
              (concat
                loop_body
                (iter_loop_body b loop_body n'))
              (filter1 (beval b))
  end.

Definition loop_sem (b: bexp) (loop_body: state → state → Prop)
: state → state → Prop
:=
omega_union (iter_loop_body b loop_body).

Suppose loop_body is the denotation of loop body, then

iter_loop_body b loop_body n

defines the begining-ending-program-state pairs of excuting the loop body for exactly n times. And

loop_sem b loop_body

defines whole while-loop's semantics.

Fixpoint ceval (c: com): state → state → Prop :=
  match c with
  | CSkip ⇒ id
  | CAss X E ⇒
      fun st₁ st₂ ⇒
        st₂ X = aeval E st₁ ∧
        ∀Y, X ≠ Y → st₁ Y = st₂ Y
  | CSeq c₁ c₂ ⇒ concat (ceval c₁) (ceval c₂)
  | CIf b c₁ c₂ ⇒ if_sem b (ceval c₁) (ceval c₂)
  | CWhile b c ⇒ loop_sem b (ceval c)
  end.

Definition com_dequiv (d₁ d₂: state → state → Prop): Prop :=
∀st₁ st₂, d₁ st₁ st₂ ↔ d₂ st₁ st₂.

Definition cequiv (c₁ c₂: com): Prop :=
com_dequiv (ceval c₁) (ceval c₂).

Adding all components up, we can define denotational semantics of progrmas and program equivalence.

Bourbaki-Witt Theorem

For now, we have successfully defined a fixpoint construction loop_sem which satisfies the recursive equation loop_recur:

Theorem loop_recur: ∀b loop_body,
  com_dequiv
    (loop_sem b loop_body)
    (union
      (intersection
        (concat loop_body
          (loop_sem b loop_body))
        (filter1 (beval b)))
      (intersection
        id
        (filter1 (beval (BNot b))))).
Proof.

  intros.
  unfold com_dequiv.
  intros.
  split.
  + intros.
    unfold loop_sem, omega_union in H.
    unfold union.
    destruct H as [n H].
    destruct n as [| n'].
    - right.
      simpl in H.
      exact H.
    - left.
      simpl in H.
      unfold concat, intersection in H.
      unfold concat, intersection.
      destruct H as [[st' [? ?]] ?].
      split.
      * ∃st'.
        split.
        { exact H. }
        unfold loop_sem, omega_union.
        ∃n'.
        exact H₀.
      * exact H₁.
  + intros.
    unfold loop_sem, omega_union.
    unfold union in H.
    destruct H.
    - unfold intersection, concat in H.
      destruct H as [[st' [? ?]] ?].
      unfold loop_sem, omega_union in H₀.
      destruct H₀ as [n ?].
      ∃(S n).
      simpl.
      unfold intersection, concat.
      split.
      * ∃st'.
        split.
        { exact H. }
        { exact H₀. }
      * exact H₁.
    - ∃O.
      simpl.
      exact H.
Qed.

This lemma that we proved last time is actually one special case of Bourbaki-Witt fixpoint theorem.

Partial Order

A partial order (偏序) on a set A is a binary relation R (usually written as ≤) which is reflexive (自反), transitive (传递), and antisymmetric (反对称). Formally,

    ∀x: A, x ≤ x;
    ∀x y z: A, x ≤ y → y ≤ z → x ≤ z;
    ∀x y: A, x ≤ y → y ≤ x → x = y.

The least element of A w.r.t. a partial order ≤ is also called bottom:

∀x: A, bot ≤ x

Chain

A subset of elements in A is called a chain w.r.t. a partial order ≤ if any two elements in this subset are comparable. For example, if a sequence xs: nat → A is monotonically increasing:

∀n: nat, xs n ≤ xs (n + 1),

then it forms a chain.

A partial order ≤ is called complete if every chain has its least upper bound lub and greatest lower bound glb. In short, the set A (companied with order ≤) is called a complete partial ordering, CPO (完备偏序集). Some text books require chains to be nonempty. We do not put such restriction on chain's definition here. Thus, the empty set is a chain. Its least upper bound is the least element of A, in other words, bot.

Monotonic and Continuous Functions

Given two CPOs A, ≤A= and B, ≤B=, a function F: A → B is called monotonic (单调) if it preserves order. Formally,

∀x y: A, x ≤A= y → F(x) ≤B= F(y).

A function F: A → B is called continuous (连续) if it preserves lub. Formally,

∀xs: chain(A), lub(F(xs)) = F(lub(xs))

Here, the lub function on the left hand side means the least upper bound defined by B and the one on the right hand side is defined by A.

The definition of continuous does not require the preservation of glb becasue CPOs are usually defined in a direction that larger elements are more defined .

Least fixpoint

Given a CPO A, we can always construct a sequence of elements as follows:

bot, F(bot), F(F(bot)), F(F(F(bot))), ...

Obviously, bot ≤ F(bot) is true due to the definition of bot. If F is monotonic, it is immediately followed by F(bot) ≤ F(F(bot)). Similarly,

F(F(bot)) ≤ F(F(F(bot))), F(F(F(bot))) ≤ F(F(F(F(bot)))) ...

In other words, if F is monotonic, this sequence is a chain.

Main theorem: given a CPO A, if it has a least element, then every monotonic continuous function F has a fixpoint and the least fixpoint of F is:

lub [bot, F(bot), F(F(bot)), F(F(F(bot))), ...].

Proof.

On one hand, this least upper bound is a fixpoint:

    F (lub [bot, F(bot), F(F(bot)), F(F(F(bot))), ...]) =
    lub [F(bot), F(F(bot)), F(F(F(bot))), F(F(F(F(bot)))), ...] =
    lub [bot, F(bot), F(F(bot)), F(F(F(bot))), ...].

The first equality is true because F is continuous. The second equality is true because bot is less than or equal to all other elements in the sequence.

On the other hand, this fixpoint is the least one. For any other fixpoint x, in other words, suppose F(x) = x. Then,

bot ≤ x

Thus,

F(bot) ≤ F(x) = x

due to the fact that F is monotonic and x is a fixpoint. And so on,

F(F(bot)) ≤ x, F(F(F(bot))) ≤ x, F(F(F(F(bot)))) ≤ x, ...

That means, x is an upper bound of bot, F(bot), F(F(bot)), .... It must be greater than or equal to

lub [bot, F(bot), F(F(bot)), F(F(F(bot))), ...].

QED.

Denotation of Loops as Bourbaki-Witt Fixpoint

Our definition loop_sem is actually a Bourbaki-Witt fixpoint of the recursive equation defined by loop_recur. In this case, set A is the set of binary relations between program stats, i.e. A := state → state → Prop.

The equivalence relation defined on A is com_dequiv. The partial order defined on A is the subset relation, i.e.

"d₁ ≤ d₂" := ∀st₁ st₂, (d₁ st₁ st₂) → (d₂ st₁ st₂).

We can easily show that this binary relation is actually a partial order, i.e., it is reflexive, transitive and antisymmetric.

Moreoever, this partial ordering is a CPO. The least upper bound, lub, of a chain is the union of all binary relations in the chain. Specifically, omega_union defines the lub of a sequence of relations.

In the end, the function that maps d to

    (union
      (concat
        (intersection loop_body
          (filter1 (beval b)))
        d)
      (intersection
        id
        (filter1 (beval (BNot b)))))

is monotonic and continuous. And loop_sem is exactly the Bourbaki-Witt fixpoint of this function.

Program Equivalence

For examples of command equivalence, let's start by looking at some trivial program transformations involving Skip:

Theorem skip_left : ∀c,
  cequiv
    (Skip;; c)
    c.
Proof.
  intros.
  unfold cequiv, com_dequiv.
  intros.
  split; intros.
  + simpl in H.
    unfold concat, id in H.
    destruct H as [st' [? ?]].
    rewrite H.
    exact H₀.
  + simpl.
    unfold concat, id.
    ∃st₁.
    split.
    - reflexivity.
    - exact H.
Qed.

Also, we can prove that adding a Skip after a command results in an equivalent program.

Theorem skip_right : ∀c,
  cequiv
    (c ;; Skip)
    c.
Proof.
(* WORKED IN CLASS *)
  intros.
  unfold cequiv, com_dequiv.
  intros.
  split; intros.
  + simpl in H.
    unfold concat, id in H.
    destruct H as [st' [? ?]].
    rewrite <- H₀.
    exact H.
  + simpl.
    unfold concat, id.
    ∃st₂.
    split.
    - exact H.
    - reflexivity.
Qed.

Now we show that we can swap the branches of an IF if we also negate its guard.

Theorem swap_if_branches : ∀b e₁ e₂,
  cequiv
    (If b Then e₁ Else e₂ EndIf)
    (If (BNot b) Then e₂ Else e₁ EndIf).
Proof.
  intros.
  unfold cequiv, com_dequiv.
  intros.
  simpl.
  unfold if_sem.
  unfold union, intersection, filter1.
  split; intros.
  + (* -> *)
    destruct H as [[? ?] | [? ?]].
    - (* b is true *)
      right.
      split.
      * exact H.
      * simpl.

The next line tauto reads "tautology" (重言式). This tactic can be used to reason able normal logic connectives, including implication, conjunction, disjuction, negation and logical equivalence.

        tauto.
    - (* b is false *)
      left.
      split.
      * exact H.
      * exact H₀.
  + (* <- *)
    destruct H as [[? ?] | [? ?]].
    - (* b is false *)
      right.
      split.
      * exact H.
      * exact H₀.
    - (* b is true *)
      left.
      split.
      * exact H.
      * simpl in H₀.
        tauto.
Qed.

An interesting fact about While commands is that any number of copies of the body can be "unrolled" without changing meaning. Loop unrolling is a common transformation in real compilers.

Theorem loop_unrolling : ∀b c,
  cequiv
    (While b Do c EndWhile)
    (If b Then (c ;; While b Do c EndWhile) Else Skip EndIf).
Proof.
  intros.
  unfold cequiv, com_dequiv.
  intros.
  simpl.
  pose proof loop_recur b (ceval c).
  unfold com_dequiv in H.
  specialize (H st₁ st₂).
  unfold if_sem.
  exact H.
Qed.

Usually, we do not distinguish (c₁;;c₂);;c₃ with c₁;;(c₂;;c₃) when we write real programs. We do not have to because they have the same behavior.

Theorem seq_assoc : ∀c₁ c₂ c₃,
  cequiv ((c₁;;c₂);;c₃) (c₁;;(c₂;;c₃)).
Proof.
  intros.
  unfold cequiv, com_dequiv.
  intros st₁ st₄.
  simpl.
  split; unfold concat; intros.
  + (* -> *)
    destruct H as [st₃ [H H₃₄]].
    destruct H as [st₂ [H₁₂ H₂₃]].
    ∃st₂.
    split.
    - exact H₁₂.
    - ∃st₃.
      split.
      * exact H₂₃.
      * exact H₃₄.
  + (* <- *)
    destruct H as [st₂ [H₁₂ H]].
    destruct H as [st₃ [H₂₃ H₃₄]].
    ∃st₃.
    split.
    - ∃st₂.
      split.
      * exact H₁₂.
      * exact H₂₃.
    - exact H₃₄.
Qed.

The following theorem says, if two expressions are equivalent, then assigning their value into the same variable has the same behavior.

Theorem CAss_congruence : ∀(X: var) (E E': aexp),
  aexp_equiv E E' →
  cequiv (CAss X E) (CAss X E').
Proof.
  intros.
  unfold cequiv, com_dequiv.
  intros st₁ st₂.
  simpl.
  split; intros.
  - (* -> *)
    destruct H₀.
    split.
    + (* For X *)
      unfold aexp_equiv, aexp_dequiv in H.
      specialize (H st₁).
      rewrite <- H.
      exact H₀.
    + (* For other program variables *)
      exact H₁.
  - (* <- *)
    destruct H₀.
    split.
    + (* For X *)
      unfold aexp_equiv, aexp_dequiv in H.
      specialize (H st₁).
      rewrite H.
      exact H₀.
    + (* For other program variables *)
      exact H₁.
Qed.

We will verify that the equivalence on coms really are equivalences — i.e., that they are reflexive, symmetric, and transitive. The proofs are all easy.

Lemma refl_com_dequiv : ∀(d : state → state → Prop),
com_dequiv d d.

Proof.
  unfold com_dequiv.
  intros.
  tauto.
Qed.

Lemma refl_cequiv : ∀(c : com), cequiv c c.
Proof.
  unfold cequiv.
  intros.
  apply refl_com_dequiv.
Qed.

Lemma sym_com_dequiv : ∀(d₁ d₂: state → state → Prop),
com_dequiv d₁ d₂ → com_dequiv d₂ d₁.

Proof.
  unfold com_dequiv.
  intros.
  specialize (H st₁ st₂).
  tauto.
Qed.

Lemma sym_cequiv : ∀(c₁ c₂ : com),
  cequiv c₁ c₂ → cequiv c₂ c₁.
Proof.
  unfold cequiv.
  intros.
  apply sym_com_dequiv.
  exact H.
Qed.

Lemma trans_com_dequiv : ∀(d₁ d₂ d₃ : state → state → Prop),
com_dequiv d₁ d₂ → com_dequiv d₂ d₃ → com_dequiv d₁ d₃.

Proof.
  unfold com_dequiv.
  intros.
  specialize (H st₁ st₂).
  specialize (H₀ st₁ st₂).
  tauto.
Qed.

Lemma trans_cequiv : ∀(c₁ c₂ c₃ : com),
  cequiv c₁ c₂ → cequiv c₂ c₃ → cequiv c₁ c₃.
Proof.
  unfold cequiv.
  intros.
  pose proof trans_com_dequiv _ _ _ H H₀.
  exact H₁.
Qed.

Behavioral equivalence is also a congruence. That is, the equivalence of two subprograms implies the equivalence of the larger programs in which they are embedded.

The main idea is that the congruence property allows us to replace a small part of a large program with an equivalent small part and know that the whole large programs are equivalent without doing an explicit proof about the non-varying parts — i.e., the "proof burden" of a small change to a large program is proportional to the size of the change, not the program.

We will prove the congruence property for loops in class. Other congruence properties' proofs will be left as homework.

Theorem: Equivalence is a congruence for WHILE — that is, if c is equivalent to c', then While b Do c EndWhile is equivalent to While b Do c' EndWhile.

Theorem CWhile_congruence : ∀b c c',
  cequiv c c' →
  cequiv (While b Do c EndWhile) (While b Do c' EndWhile).
Proof.
  unfold cequiv, com_dequiv.
  intros.
  simpl.
  unfold loop_sem.
  unfold omega_union.

Hmmm, it seems that we need an auxilliary lemma for iter_loop_body's congruence.

Abort.

Lemma iter_loop_body_congruence: ∀b loop_body loop_body' n,
  com_dequiv loop_body loop_body' →
  com_dequiv (iter_loop_body b loop_body n) (iter_loop_body b loop_body' n).
Proof.
  intros.

Here, we need to do induction over n.

  induction n.
  + simpl.
    apply refl_com_dequiv.
  + simpl.

Hmm, it would be better if we have auxilliary lemmas for relation operators.

Abort.

Here, we first prove that concat has congruence property.

Lemma concat_congruence: ∀(d₁ d₂ d₁' d₂': state → state → Prop),
  com_dequiv d₁ d₁' →
  com_dequiv d₂ d₂' →
  com_dequiv (concat d₁ d₂) (concat d₁' d₂').
Proof.
  unfold com_dequiv.
  intros.
  unfold concat.
  split; intros H₁; destruct H₁ as [st [? ?]].
  + ∃st.
    split.
    - specialize (H st₁ st).
      tauto.
    - specialize (H₀ st st₂).
      tauto.
  + ∃st.
    split.
    - specialize (H st₁ st).
      tauto.
    - specialize (H₀ st st₂).
      tauto.
Qed.

Also, intersection has congruence property.

Lemma intersection_congruence: ∀(d₁ d₂ d₁' d₂': state → state → Prop),
  com_dequiv d₁ d₁' →
  com_dequiv d₂ d₂' →
  com_dequiv (intersection d₁ d₂) (intersection d₁' d₂').
Proof.
  unfold com_dequiv.
  intros.
  unfold intersection.
  specialize (H st₁ st₂).
  specialize (H₀ st₁ st₂).
  tauto.
Qed.

Third, union has congruence property. Its proof is similar.

Lemma union_congruence: ∀(d₁ d₂ d₁' d₂': state → state → Prop),
  com_dequiv d₁ d₁' →
  com_dequiv d₂ d₂' →
  com_dequiv (union d₁ d₂) (union d₁' d₂').
Proof.
  unfold com_dequiv.
  intros.
  unfold union.
  specialize (H st₁ st₂).
  specialize (H₀ st₁ st₂).
  tauto.
Qed.

In addition, we prove the congruence property of omega_union.

Lemma omega_union_congruence: ∀(ds₁ ds₂: nat → state → state → Prop),
  (∀n, com_dequiv (ds₁ n) (ds₂ n)) →
  com_dequiv (omega_union ds₁) (omega_union ds₂).
Proof.
  unfold com_dequiv.
  intros.
  unfold omega_union.
  split; intros H₀; destruct H₀ as [n ?]; ∃n.
  + specialize (H n st₁ st₂).
    tauto.
  + specialize (H n st₁ st₂).
    tauto.
Qed.

Now, we are ready to prove the congruence property of iter_loop_body.

Lemma iter_loop_body_congruence: ∀b loop_body loop_body' n,
  com_dequiv loop_body loop_body' →
  com_dequiv (iter_loop_body b loop_body n) (iter_loop_body b loop_body' n).
Proof.
  intros.
  induction n.
  + simpl.
    apply refl_com_dequiv.
  + simpl.
    apply intersection_congruence.
    - apply concat_congruence.
      * exact H.
      * exact IHn.
    - apply refl_com_dequiv.
Qed.

And eventually, we will prove our main theorem.

Theorem CWhile_congruence : ∀b c c',
  cequiv c c' →
  cequiv (While b Do c EndWhile) (While b Do c' EndWhile).
Proof.
  unfold cequiv.
  intros.
  simpl.
  unfold loop_sem.
  apply omega_union_congruence.
  intros.
  apply iter_loop_body_congruence.
  exact H.
Qed.

(* Wed Mar 27 17:33:26 UTC 2019 *)