Lecture notes 20190412

Small Step Semantics 1

Remark. Some material in this lecture is from Software Foundations, volumes 1 and 2.

Require Import PL.Imp6.
Require Coq.Relations.Relation_Operators.
Require Coq.Relations.Relation_Definitions.

General Idea: Small Step

So far, we have learned how to define the denotational semantics of a programming language. Most commonly, the denotation of a program is defined as a binary relation between program states. Such denotational semantics are also called big-step operational semantics: the denotation of a program tells the result of program execution (given the starting state) "all in one step".

Small-step semantics, on the other hand, talks more about intermediate states. It defines how a program executes, step by step. Take the evaluation of the following expression as an example:

2 + 2 + 3 * 4.

Denotational semantics says:

2 + 2 + 3 * 4 ==> 16

while small step operational semantics says:

      2 + 2 + 3 * 4
      --> 4 + 3 * 4
      --> 4 + 12
      --> 16.

In short, we will learn how to define a "small-step" relation that specifies, for a given program, how the "atomic steps" of computation are performed.

Small Step Semantics for Expression Evaluation

It can be useful to think of this step relation of expression evaluation as an abstract machine.

At any moment, the state of the machine is an integer expression.
A step of this machine is an atomic unit of computation: here, a single arithmetic operation or the loading of a program variable's value.
The halting states of the machine are ones where there is no more computation to be done.

Given an expression a, we can compute its value as follows:

Take a as the starting state of the machine.
Repeatedly use the step relation to find a sequence of machine states, starting with a, where each state steps to the next.
When no more forward step is possible, "read out" the final state of the machine as the result of computation.
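The three steps above can be sketched as a small executable machine. This is only an illustration under simplifying assumptions: the toy type tm (constants and addition over nat, no variables), the function step, the driver run and its fuel bound are all hypothetical names that do not appear in PL.Imp6, and the lecture itself defines the step as a relation rather than a function.

```coq
Module Machine_Sketch.

(* Hypothetical toy language: constants and addition only. *)
Inductive tm : Type :=
  | Num (n : nat)
  | Plus (a b : tm).

(* One atomic step, left operand first. None means the machine has halted. *)
Fixpoint step (t : tm) : option tm :=
  match t with
  | Num _ => None
  | Plus a b =>
      match a, b with
      | Num n1, Num n2 => Some (Num (Nat.add n1 n2))
      | Num _, _ => option_map (Plus a) (step b)
      | _, _ => option_map (fun a' => Plus a' b) (step a)
      end
  end.

(* Repeat step until it returns None; fuel bounds the number of steps. *)
Fixpoint run (fuel : nat) (t : tm) : tm :=
  match fuel with
  | O => t
  | S fuel' =>
      match step t with
      | Some t' => run fuel' t'
      | None => t
      end
  end.

(* (2 + 2) + 3 steps to 4 + 3, then to 7, and then halts. *)
Example run_example :
  run 10%nat (Plus (Plus (Num 2%nat) (Num 2%nat)) (Num 3%nat)) = Num 7%nat.
Proof. reflexivity. Qed.

End Machine_Sketch.
```

A function like step can only describe a deterministic machine. A relation, which is what we use in the rest of this lecture, can describe nondeterministic ones as well.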

Intuitively, it is clear that the final states of the machine are always constant expressions ANum n for some n.

Inductive aexp_halt: aexp → Prop :=
| AH_num : ∀n, aexp_halt (ANum n).

Of course, we could define it using a Coq function instead of an inductive predicate. Here is this alternative (but equivalent) approach.

Module Playground.

Definition aexp_halt (a: aexp): Prop :=
  match a with
  | ANum _ ⇒ True
  | _ ⇒ False
  end.

End Playground.
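The claim that the two approaches are equivalent can be checked directly. The following is a sketch; the lemma name aexp_halt_defs_agree is hypothetical, and the proof only assumes that ANum is one of the constructors of aexp in PL.Imp6.

```coq
Lemma aexp_halt_defs_agree : forall a : aexp,
  aexp_halt a <-> Playground.aexp_halt a.
Proof.
  intros a. split.
  - (* inductive predicate implies function *)
    intros H. destruct H. exact I.
  - (* function implies inductive predicate *)
    intros H.
    destruct a; unfold Playground.aexp_halt in H; try contradiction.
    apply AH_num.
Qed.
```

In the second direction, every constructor other than ANum turns the hypothesis into False, so only the ANum case is left for AH_num.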

Then we define our step relation.

Inductive astep : state → aexp → aexp → Prop :=
  | AS_Id : ∀st X,
      astep st
        (AId X) (ANum (st X))

  | AS_Plus1 : ∀st a₁ a₁' a₂,
      astep st
        a₁ a₁' →
      astep st
        (APlus a₁ a₂) (APlus a₁' a₂)
  | AS_Plus2 : ∀st a₁ a₂ a₂',
      aexp_halt a₁ →
      astep st
        a₂ a₂' →
      astep st
        (APlus a₁ a₂) (APlus a₁ a₂')
  | AS_Plus : ∀st n₁ n₂,
      astep st
        (APlus (ANum n₁) (ANum n₂)) (ANum (n₁ + n₂))

  | AS_Minus1 : ∀st a₁ a₁' a₂,
      astep st
        a₁ a₁' →
      astep st
        (AMinus a₁ a₂) (AMinus a₁' a₂)
  | AS_Minus2 : ∀st a₁ a₂ a₂',
      aexp_halt a₁ →
      astep st
        a₂ a₂' →
      astep st
        (AMinus a₁ a₂) (AMinus a₁ a₂')
  | AS_Minus : ∀st n₁ n₂,
      astep st
        (AMinus (ANum n₁) (ANum n₂)) (ANum (n₁ - n₂))

  | AS_Mult1 : ∀st a₁ a₁' a₂,
      astep st
        a₁ a₁' →
      astep st
        (AMult a₁ a₂) (AMult a₁' a₂)
  | AS_Mult2 : ∀st a₁ a₂ a₂',
      aexp_halt a₁ →
      astep st
        a₂ a₂' →
      astep st
        (AMult a₁ a₂) (AMult a₁ a₂')
  | AS_Mult : ∀st n₁ n₂,
      astep st
        (AMult (ANum n₁) (ANum n₂)) (ANum (n₁ * n₂)).

This definition looks very long, so let's read it part by part. Please keep in mind that what we define here is only a single-step evaluation relation.

The first part of this definition talks about the value of a program variable:

astep st
(AId X) (ANum (st X)).

In short, it says: the value of a program variable X is st X, the value that the program state st assigns to X, and this evaluation process has only one step.

The second part of this definition talks about how the sum of two subexpressions is computed.

  | AS_Plus1 : ∀st a₁ a₁' a₂,
      astep st
        a₁ a₁' →
      astep st
        (APlus a₁ a₂) (APlus a₁' a₂)
  | AS_Plus2 : ∀st a₁ a₂ a₂',
      aexp_halt a₁ →
      astep st
        a₂ a₂' →
      astep st
        (APlus a₁ a₂) (APlus a₁ a₂')
  | AS_Plus : ∀st n₁ n₂,
      astep st
        (APlus (ANum n₁) (ANum n₂)) (ANum (n₁ + n₂))

It says that the left side is computed first (AS_Plus1), then the right side (AS_Plus2, which applies only once the left side has halted). When both sides are constants, their sum can be computed in one more step (AS_Plus).

Combining these two parts together, we are already able to describe the evaluation process of some nontrivial examples. For example, when X's value is 1 and Y's value is 2, X + (3 + Y) will be evaluated by the following steps:

    X + (3 + Y)
    --> 1 + (3 + Y)
    --> 1 + (3 + 2)
    --> 1 + 5
    --> 6.

We can prove this in Coq.

Module Step_Example1.

Import Abstract_Pretty_Printing.

Example step_1: ∀(X Y: var) (st: state),
  st X = 1 →
  astep st (X + (3 + Y)) (1 + (3 + Y)).
Proof.
  intros.
  apply AS_Plus1.
  rewrite <- H.
  apply AS_Id.
Qed.

Example step_2: ∀(Y: var) (st: state),
  st Y = 2 →
  astep st (1 + (3 + Y)) (1 + (3 + 2)).
Proof.
  intros.
  apply AS_Plus2.
  { apply AH_num. }
  apply AS_Plus2.
  { apply AH_num. }
  rewrite <- H.
  apply AS_Id.
Qed.

Example step_3: ∀(st: state),
  astep st (1 + (3 + 2)) (1 + 5).
Proof.
  intros.
  apply AS_Plus2.
  { apply AH_num. }
  apply AS_Plus.
Qed.

Example step_4: ∀(st: state),
  astep st (1 + 5) 6.
Proof.
  intros.
  apply AS_Plus.
Qed.

End Step_Example1.
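Conversely, we can check that a constant expression really is a halting state of this machine, i.e. that it cannot take any step. The lemma name below is hypothetical; the proof relies only on the fact that no constructor of astep has ANum n as its source expression.

```coq
Lemma num_cannot_step_sketch : forall (st : state) n (a : aexp),
  ~ astep st (ANum n) a.
Proof.
  intros st n a H.
  (* No rule of astep starts from a constant, so inversion leaves no case. *)
  inversion H.
Qed.
```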

The small-step semantics for "minus" and "multiplication" are defined in a very similar way. We can also define the evaluation of boolean expressions as follows. If you have forgotten the details of bexp's inductive definition, just use Print bexp as a cheat sheet.

(* Print bexp. *)

Inductive bexp_halt: bexp → Prop :=
| BH_True : bexp_halt BTrue
| BH_False : bexp_halt BFalse.

Inductive bstep : state → bexp → bexp → Prop :=

  | BS_Eq₁ : ∀st a₁ a₁' a₂,
      astep st
        a₁ a₁' →
      bstep st
        (BEq a₁ a₂) (BEq a₁' a₂)
  | BS_Eq₂ : ∀st a₁ a₂ a₂',
      aexp_halt a₁ →
      astep st
        a₂ a₂' →
      bstep st
        (BEq a₁ a₂) (BEq a₁ a₂')
  | BS_Eq_True : ∀st n₁ n₂,
      n₁ = n₂ →
      bstep st
        (BEq (ANum n₁) (ANum n₂)) BTrue
  | BS_Eq_False : ∀st n₁ n₂,
      n₁ ≠ n₂ →
      bstep st
        (BEq (ANum n₁) (ANum n₂)) BFalse

  | BS_Le₁ : ∀st a₁ a₁' a₂,
      astep st
        a₁ a₁' →
      bstep st
        (BLe a₁ a₂) (BLe a₁' a₂)
  | BS_Le₂ : ∀st a₁ a₂ a₂',
      aexp_halt a₁ →
      astep st
        a₂ a₂' →
      bstep st
        (BLe a₁ a₂) (BLe a₁ a₂')
  | BS_Le_True : ∀st n₁ n₂,
      n₁ ≤ n₂ →
      bstep st
        (BLe (ANum n₁) (ANum n₂)) BTrue
  | BS_Le_False : ∀st n₁ n₂,
      n₁ > n₂ →
      bstep st
        (BLe (ANum n₁) (ANum n₂)) BFalse

  | BS_NotStep : ∀st b₁ b₁',
      bstep st
        b₁ b₁' →
      bstep st
        (BNot b₁) (BNot b₁')
  | BS_NotTrue : ∀st,
      bstep st
        (BNot BTrue) BFalse
  | BS_NotFalse : ∀st,
      bstep st
        (BNot BFalse) BTrue

  | BS_AndStep : ∀st b₁ b₁' b₂,
      bstep st
        b₁ b₁' →
      bstep st
       (BAnd b₁ b₂) (BAnd b₁' b₂)
  | BS_AndTrue : ∀st b,
      bstep st
       (BAnd BTrue b) b
  | BS_AndFalse : ∀st b,
      bstep st
       (BAnd BFalse b) BFalse.

Remark: when evaluating a conjunction of two boolean expressions, we use short-circuit evaluation. That is, the right-hand side will not be evaluated if the left-hand side is false. For example, when X's value is 1, X ≤ 0 && 0 ≤ X + 10 will be evaluated by the following steps:

    X ≤ 0 && 0 ≤ X + 10
    --> 1 ≤ 0 && 0 ≤ X + 10
    --> False && 0 ≤ X + 10
    --> False.

Module Step_Example2.

Import Abstract_Pretty_Printing.

Example step_1: ∀(X: var) (st: state),
  st X = 1 →
  bstep st ((X ≤ 0) && (0 ≤ X + 10)) ((1 ≤ 0) && (0 ≤ X + 10)).
Proof.
  intros.
  apply BS_AndStep.
  apply BS_Le₁.
  rewrite <- H.
  apply AS_Id.
Qed.

Example step_2: ∀(X: var) (st: state),
  bstep st ((1 ≤ 0) && (0 ≤ X + 10)) (BFalse && (0 ≤ X + 10)).
Proof.
  intros.
  apply BS_AndStep.
  apply BS_Le_False.
  omega.
Qed.

Example step_3: ∀(X: var) (st: state),
  bstep st (BFalse && (0 ≤ X + 10)) BFalse.
Proof.
  intros.
  apply BS_AndFalse.
Qed.

End Step_Example2.

Reflexive, Transitive Closure

Once our single-step relation is defined, we can derive the multi-step relation from it. That is, we want to say: on state st, from expression a₁ we can arrive at a₂ after some number of steps. More specifically, it can be zero steps, one step, two steps, etc. In math, this multi-step relation is called the reflexive, transitive closure of the step relation.

The reflexive, transitive closure of a relation R is defined as:

(Def A₁) the smallest relation that contains R and that is both reflexive and transitive.

Equivalently, it is:

(Def A₂) the result of expanding R by reflexivity and transitivity;
(Def B₁) the smallest reflexive relation which is closed under right-concatenating R;
(Def B₂) the result of expanding the identity relation by right-concatenating R repeatedly;
(Def C₁) the smallest reflexive relation which is closed under left-concatenating R;
(Def C₂) the result of expanding the identity relation by left-concatenating R repeatedly;
(Def D) the union of the following relations: the identity relation, the relation R, the concatenation of two copies of R, the concatenation of three copies of R, etc.

Let's formally describe some concepts used in these equivalent definitions.

The first three definitions below (the identity relation, concatenation, and countable union) have already appeared in our lectures about denotational semantics.

Module Relation_Definitions.

Identity relation.

Definition id {A: Type}: A → A → Prop :=
fun a b ⇒ a = b.

Concatenation of two relations.

Definition concat {A B C: Type} (R₁: A → B → Prop) (R₂: B → C → Prop): A → C → Prop:=
fun a c ⇒ ∃b, R₁ a b ∧ R₂ b c.

The union of countably many relations.

Definition omega_union {A B: Type} (Rs: nat → A → B → Prop): A → B → Prop :=
fun a b ⇒ ∃n, Rs n a b.

Then we have some new definitions. Reflexivity is a property of relations.

Definition Reflexive {A: Type} (R: A → A → Prop): Prop :=
∀x, R x x.

Transitivity is another property of relations.

Definition Transitive {A: Type} (R: A → A → Prop): Prop :=
∀x y z, R x y → R y z → R x z.

We say that a relation R₁ is a subrelation of R₂ if every pair of elements in R₁ is in R₂. In some sense, subrelation can be treated as a property of pairs of relations.

Definition subrelation {A B: Type} (R R': A → B → Prop): Prop:=
∀(x : A) (y : B), R x y → R' x y.

Furthermore, for any property Pr of relations, when we say "R is the smallest relation satisfying Pr", we mean "R satisfies Pr and for any other R', if R' satisfies Pr, then R is a subrelation of R'".

Definition is_smallest_relation {A B: Type} (Pr: (A → B → Prop) → Prop) (R: A → B → Prop) :=
Pr R ∧ ∀R', Pr R' → subrelation R R'.

End Relation_Definitions.
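With id, concat and omega_union in hand, Def D can already be written down. The following is one possible sketch, not a definition used elsewhere in this course: the module D_Sketch and the names nsteps, clos_refl_trans_D, clos_D_refl and clos_D_step are all hypothetical, and the code assumes it is placed right after the module Relation_Definitions above.

```coq
Module D_Sketch.

Import Relation_Definitions.

(* nsteps R n is the concatenation of n copies of R (the identity when n = 0). *)
Fixpoint nsteps {A : Type} (R : A -> A -> Prop) (n : nat) : A -> A -> Prop :=
  match n with
  | O => id
  | S n' => concat R (nsteps R n')
  end.

(* Def D: the union of nsteps R 0, nsteps R 1, nsteps R 2, ... *)
Definition clos_refl_trans_D {A : Type} (R : A -> A -> Prop) : A -> A -> Prop :=
  omega_union (nsteps R).

(* Zero steps give reflexivity. *)
Lemma clos_D_refl {A : Type} (R : A -> A -> Prop) :
  forall x, clos_refl_trans_D R x x.
Proof.
  intros x.
  unfold clos_refl_trans_D, omega_union.
  exists O.
  simpl. unfold id. reflexivity.
Qed.

(* One step shows that the union contains R itself. *)
Lemma clos_D_step {A : Type} (R : A -> A -> Prop) :
  forall x y, R x y -> clos_refl_trans_D R x y.
Proof.
  intros x y H.
  unfold clos_refl_trans_D, omega_union.
  exists (S O).
  simpl. unfold concat, id.
  exists y. split.
  - exact H.
  - reflexivity.
Qed.

End D_Sketch.
```

Transitivity of clos_refl_trans_D needs an induction on the number of steps and is not shown here.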

Now, we are ready to formulate those equivalent definitions of reflexive, transitive closure.

Module A₁.

Import Relation_Definitions.

The following definition says: Rc is R's reflexive, transitive closure if and only if it is the smallest relation that contains R and that is both reflexive and transitive.

Definition is_clos_refl_trans {A: Type} (R Rc: A → A → Prop): Prop :=
  is_smallest_relation
    (fun Rc' ⇒ subrelation R Rc' ∧
                Reflexive Rc' ∧
                Transitive Rc')
    Rc.

This definition does not by itself say that every relation has a reflexive, transitive closure. We have to prove that later.
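What the definition does give us for free is uniqueness: because "smallest" is stated in terms of subrelation, any two relations that satisfy the criterion for the same R contain each other. Here is a sketch of one direction (the other is symmetric); the lemma name is hypothetical.

```coq
Lemma is_clos_refl_trans_unique_sketch {A : Type} (R Rc1 Rc2 : A -> A -> Prop) :
  is_clos_refl_trans R Rc1 ->
  is_clos_refl_trans R Rc2 ->
  subrelation Rc1 Rc2.
Proof.
  unfold is_clos_refl_trans, is_smallest_relation.
  (* Keep minimality of Rc1 and the three properties of Rc2. *)
  intros [_ Hmin1] [Hpr2 _].
  apply Hmin1.
  exact Hpr2.
Qed.
```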

End A₁.

Module A₂.

Import Relation_Definitions.

The following definition says: Rc is R's reflexive, transitive closure if and only if it is the result of expanding R by reflexivity and transitivity. Here, expansion can be defined by Coq's inductive predicate.

Inductive clos_refl_trans {A: Type} (R: A → A → Prop) : A → A → Prop :=
    | rt_step x y (H : R x y) : clos_refl_trans R x y
    | rt_refl x : clos_refl_trans R x x
    | rt_trans x y z
          (Hxy : clos_refl_trans R x y)
          (Hyz : clos_refl_trans R y z) :
          clos_refl_trans R x z.

In short, for any relation R, we define clos_refl_trans R to be its reflexive, transitive closure. Now, let's read this Coq definition line-by-line and make sure that we understand it.

rt_step x y (H : R x y) : clos_refl_trans R x y

The first constructor rt_step says: we start from R to define clos_refl_trans R, i.e. for any x and y, clos_refl_trans R x y holds if R x y holds.

rt_refl x : clos_refl_trans R x x

This second constructor rt_refl says: we expand clos_refl_trans R by reflexivity.

    rt_trans x y z
          (Hxy : clos_refl_trans R x y)
          (Hyz : clos_refl_trans R y z) :
          clos_refl_trans R x z.

And most interestingly, we expand clos_refl_trans R by transitivity using this constructor rt_trans. The intuition is that we keep adding new pairs (x, z) into the expansion result until no more pairs can be added.
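As a small illustration of how the constructors work together, two consecutive R-steps already give a pair in the closure: each step enters via rt_step, and rt_trans glues them. The lemma name below is hypothetical, and the code assumes it sits inside the module where clos_refl_trans was just defined.

```coq
Lemma two_steps_sketch {A : Type} (R : A -> A -> Prop) :
  forall x y z, R x y -> R y z -> clos_refl_trans R x z.
Proof.
  intros x y z Hxy Hyz.
  apply rt_trans with y.
  - apply rt_step. exact Hxy.
  - apply rt_step. exact Hyz.
Qed.
```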

Module Example.

Local Open Scope nat.

For example, using this definition, we can prove that the reflexive and transitive closure of the next_nat relation coincides with the le relation.

Inductive next_nat (n : nat) : nat → Prop :=
nn : next_nat n (S n).

Theorem next_nat_closure_is_le : ∀n m: nat,
(n ≤ m) ↔ ((clos_refl_trans next_nat) n m).

Proof.
  intros n m. split.
  - (* -> *)
    intro H. induction H.
    + (* le_n *) apply rt_refl.
    + (* le_S *)
      apply rt_trans with m. apply IHle. apply rt_step.
      apply nn.
  - (* <- *)
    intro H. induction H.
    + (* rt_step *) inversion H. apply le_S. apply le_n.
    + (* rt_refl *) apply le_n.
    + (* rt_trans *)
      apply le_trans with y.
      apply IHclos_refl_trans1.
      apply IHclos_refl_trans2. Qed.

End Example.

End A₂.

Module A1_vs_A₂.

Import Relation_Definitions.

We mentioned that definition A₁ and definition A₂ are equivalent. Now, let's prove their equivalence. That is, we will show that the reflexive, transitive closure defined by A₂ does satisfy the criterion defined by A₁.

Theorem def_equiv: ∀(A: Type) (R: A → A → Prop),
  A₁.is_clos_refl_trans R (A₂.clos_refl_trans R).
Proof.
  intros.
  unfold A₁.is_clos_refl_trans.
  unfold is_smallest_relation.
  split.

We first prove that A₂.clos_refl_trans R is actually reflexive and transitive, and it does contain R. We then prove that it is the smallest relation that satisfies all these three properties.

  + assert (subrelation R (A₂.clos_refl_trans R)).
    {
      unfold subrelation.
      intros.
      apply A₂.rt_step.
      exact H.
    }
    assert (Reflexive (A₂.clos_refl_trans R)).
    {
      unfold Reflexive.
      intros.
      apply A₂.rt_refl.
    }
    assert (Transitive (A₂.clos_refl_trans R)).
    {
      unfold Transitive.
      intros.
      apply A₂.rt_trans with y.
      + exact H₁.
      + exact H₂.
    }
    tauto.
  + intros.
    destruct H as [? [? ?]].

Now, we suppose that R' is a relation that contains R and is both reflexive and transitive. We are going to prove that it contains A₂.clos_refl_trans R, i.e. that A₂.clos_refl_trans R is a subrelation of R'.

    unfold subrelation.
    intros.
    induction H₂.
    - (* rt_step case *)
      unfold subrelation in H.
      specialize (H x y H₂).
      exact H.
    - (* rt_refl case *)
      unfold Reflexive in H₀.
      apply H₀.
    - (* rt_trans case *)
      unfold Transitive in H₁.
      specialize (H₁ x y z).
      tauto.
Qed.

End A1_vs_A₂.

(* Among these two definitions, A₂ is obviously easier to use in Coq. This
definition is also part of Coq's standard library. *)

Import Coq.Relations.Relation_Operators.
Import Coq.Relations.Relation_Definitions.

Print clos_refl_trans.
(* Inductive clos_refl_trans (A : Type) (R : relation A) (x : A) : A -> Prop :=
       rt_step : forall y : A, R x y -> clos_refl_trans A R x y
     | rt_refl : clos_refl_trans A R x x
     | rt_trans : forall y z : A,
                  clos_refl_trans A R x y ->
                  clos_refl_trans A R y z ->
                  clos_refl_trans A R x z *)

Print relation.
(* relation = fun A : Type => A -> A -> Prop
: Type -> Type *)
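With the standard library's clos_refl_trans, the four single steps of the earlier example X + (3 + Y) can be chained into one multi-step statement. This is a sketch under assumptions about PL.Imp6 that are not shown in these notes: that ANum takes a Z, that states map variables to Z, and that the + in AS_Plus is addition on Z. The example name is hypothetical, and the expressions are written with explicit constructors rather than the pretty-printing notation.

```coq
Example multi_step_sketch : forall (X Y : var) (st : state),
  st X = 1%Z ->
  st Y = 2%Z ->
  clos_refl_trans aexp (astep st)
    (APlus (AId X) (APlus (ANum 3%Z) (AId Y)))
    (ANum 6%Z).
Proof.
  intros X Y st HX HY.
  (* X + (3 + Y) --> 1 + (3 + Y) *)
  apply rt_trans with (APlus (ANum 1%Z) (APlus (ANum 3%Z) (AId Y))).
  { apply rt_step. apply AS_Plus1. rewrite <- HX. apply AS_Id. }
  (* 1 + (3 + Y) --> 1 + (3 + 2) *)
  apply rt_trans with (APlus (ANum 1%Z) (APlus (ANum 3%Z) (ANum 2%Z))).
  { apply rt_step.
    apply AS_Plus2. { apply AH_num. }
    apply AS_Plus2. { apply AH_num. }
    rewrite <- HY. apply AS_Id. }
  (* 1 + (3 + 2) --> 1 + 5 *)
  apply rt_trans with (APlus (ANum 1%Z) (ANum 5%Z)).
  { apply rt_step. apply AS_Plus2. { apply AH_num. } apply AS_Plus. }
  (* 1 + 5 --> 6 *)
  apply rt_step. apply AS_Plus.
Qed.
```

Each use of rt_trans names the next intermediate expression, so the proof script reads like the evaluation trace itself.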

At the beginning of this part, we also mentioned other definitions of reflexive, transitive closure. Definitions B₂ and C₂ are also included in Coq's standard library.

Print clos_refl_trans_n1.

(* Inductive clos_refl_trans_n1 (A : Type) (R : relation A) (x : A) : A -> Prop :=
       rtn1_refl : clos_refl_trans_n1 A R x x
     | rtn1_trans : forall y z : A,
                    R y z ->
                    clos_refl_trans_n1 A R x y ->
                    clos_refl_trans_n1 A R x z *)

Print clos_refl_trans_1n.
(* Inductive clos_refl_trans_1n (A : Type) (R : relation A) (x : A) : A -> Prop :=
       rt1n_refl : clos_refl_trans_1n A R x x
     | rt1n_trans : forall y z : A,
                    R x y ->
                    clos_refl_trans_1n A R y z ->
                    clos_refl_trans_1n A R x z *)

All these definitions are proved equivalent to each other in the standard library. You, as students of this course, can try to write down the other definitions (B₁, C₁ and D) and understand their equivalence.
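For instance, the standard library module Coq.Relations.Operators_Properties provides the lemma clos_rt_rt1n, which turns a proof about clos_refl_trans into one about clos_refl_trans_1n. The following sketch only shows how such a lemma can be used; it assumes the extra Require Import line is acceptable at this point in the file.

```coq
Require Import Coq.Relations.Operators_Properties.

Goal forall (A : Type) (R : relation A) (x y : A),
  clos_refl_trans A R x y -> clos_refl_trans_1n A R x y.
Proof.
  intros A R x y H.
  apply clos_rt_rt1n.
  exact H.
Qed.
```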

(* Thu Apr 11 14:39:22 UTC 2019 *)