tools/memory-model/Documentation: Put redefinition of rcu-fence into explanation.txt

[linux.git] / tools / memory-model / Documentation / explanation.txt
diff --git a/tools/memory-model/Documentation/explanation.txt b/tools/memory-model/Documentation/explanation.txt

index 488f11f6c588a6e51f62af2ff5a3d28cfdef2e2b..ecf6cccea5c367e44893440debe089e79e4f69b1 100644 (file)
--- a/tools/memory-model/Documentation/explanation.txt
+++ b/tools/memory-model/Documentation/explanation.txt
@@ -27,7 +27,7 @@ Explanation of the Linux-Kernel Memory Consistency Model
    19. AND THEN THERE WAS ALPHA
    20. THE HAPPENS-BEFORE RELATION: hb
    21. THE PROPAGATES-BEFORE RELATION: pb
-  22. RCU RELATIONS: rcu-link, rcu-gp, rcu-rscsi, rcu-fence, and rb
+  22. RCU RELATIONS: rcu-link, rcu-gp, rcu-rscsi, rcu-order, rcu-fence, and rb
    23. LOCKING
    24. ODDS AND ENDS
  
@@ -206,7 +206,7 @@ goes like this:
         P0 stores 1 to buf before storing 1 to flag, since it executes
         its instructions in order.
  
-       Since an instruction (in this case, P1's store to flag) cannot
+       Since an instruction (in this case, P0's store to flag) cannot
         execute before itself, the specified outcome is impossible.
  
  However, real computer hardware almost never follows the Sequential
@@ -419,7 +419,7 @@ example:
  
  The object code might call f(5) either before or after g(6); the
  memory model cannot assume there is a fixed program order relation
-between them.  (In fact, if the functions are inlined then the
+between them.  (In fact, if the function calls are inlined then the
  compiler might even interleave their object code.)
  
  
@@ -499,7 +499,7 @@ different CPUs (external reads-from, or rfe).
  
  For our purposes, a memory location's initial value is treated as
  though it had been written there by an imaginary initial store that
-executes on a separate CPU before the program runs.
+executes on a separate CPU before the main program runs.
  
  Usage of the rf relation implicitly assumes that loads will always
  read from a single store.  It doesn't apply properly in the presence
@@ -955,7 +955,7 @@ atomic update.  This is what the LKMM's "atomic" axiom says.
  THE PRESERVED PROGRAM ORDER RELATION: ppo
  -----------------------------------------
  
-There are many situations where a CPU is obligated to execute two
+There are many situations where a CPU is obliged to execute two
  instructions in program order.  We amalgamate them into the ppo (for
  "preserved program order") relation, which links the po-earlier
  instruction to the po-later instruction and is thus a sub-relation of
@@ -1425,8 +1425,8 @@ they execute means that it cannot have cycles.  This requirement is
  the content of the LKMM's "propagation" axiom.
  
  
-RCU RELATIONS: rcu-link, rcu-gp, rcu-rscsi, rcu-fence, and rb
--------------------------------------------------------------
+RCU RELATIONS: rcu-link, rcu-gp, rcu-rscsi, rcu-order, rcu-fence, and rb
+------------------------------------------------------------------------
  
  RCU (Read-Copy-Update) is a powerful synchronization mechanism.  It
  rests on two concepts: grace periods and read-side critical sections.
@@ -1536,29 +1536,29 @@ Z's CPU before Z begins but doesn't propagate to some other CPU until
  after X ends.)  Similarly, X ->rcu-rscsi Y ->rcu-link Z says that X is
  the end of a critical section which starts before Z begins.
  
-The LKMM goes on to define the rcu-fence relation as a sequence of
+The LKMM goes on to define the rcu-order relation as a sequence of
  rcu-gp and rcu-rscsi links separated by rcu-link links, in which the
  number of rcu-gp links is >= the number of rcu-rscsi links.  For
  example:
  
         X ->rcu-gp Y ->rcu-link Z ->rcu-rscsi T ->rcu-link U ->rcu-gp V
  
-would imply that X ->rcu-fence V, because this sequence contains two
+would imply that X ->rcu-order V, because this sequence contains two
  rcu-gp links and one rcu-rscsi link.  (It also implies that
-X ->rcu-fence T and Z ->rcu-fence V.)  On the other hand:
+X ->rcu-order T and Z ->rcu-order V.)  On the other hand:
  
         X ->rcu-rscsi Y ->rcu-link Z ->rcu-rscsi T ->rcu-link U ->rcu-gp V
  
-does not imply X ->rcu-fence V, because the sequence contains only
+does not imply X ->rcu-order V, because the sequence contains only
  one rcu-gp link but two rcu-rscsi links.
  
-The rcu-fence relation is important because the Grace Period Guarantee
-means that rcu-fence acts kind of like a strong fence.  In particular,
-E ->rcu-fence F implies not only that E begins before F ends, but also
-that any write po-before E will propagate to every CPU before any
-instruction po-after F can execute.  (However, it does not imply that
-E must execute before F; in fact, each synchronize_rcu() fence event
-is linked to itself by rcu-fence as a degenerate case.)
+The rcu-order relation is important because the Grace Period Guarantee
+means that rcu-order links act kind of like strong fences.  In
+particular, E ->rcu-order F implies not only that E begins before F
+ends, but also that any write po-before E will propagate to every CPU
+before any instruction po-after F can execute.  (However, it does not
+imply that E must execute before F; in fact, each synchronize_rcu()
+fence event is linked to itself by rcu-order as a degenerate case.)
  
  To prove this in full generality requires some intellectual effort.
  We'll consider just a very simple case:
@@ -1572,7 +1572,7 @@ and there are events X, Y and a read-side critical section C such that:
  
         2. X comes "before" Y in some sense (including rfe, co and fr);
  
-       2. Y is po-before Z;
+       3. Y is po-before Z;
  
         4. Z is the rcu_read_unlock() event marking the end of C;
  
@@ -1585,7 +1585,26 @@ G's CPU before G starts must propagate to every CPU before C starts.
  In particular, the write propagates to every CPU before F finishes
  executing and hence before any instruction po-after F can execute.
  This sort of reasoning can be extended to handle all the situations
-covered by rcu-fence.
+covered by rcu-order.
+
+The rcu-fence relation is a simple extension of rcu-order.  While
+rcu-order only links certain fence events (calls to synchronize_rcu(),
+rcu_read_lock(), or rcu_read_unlock()), rcu-fence links any events
+that are separated by an rcu-order link.  This is analogous to the way
+the strong-fence relation links events that are separated by an
+smp_mb() fence event (as mentioned above, rcu-order links act kind of
+like strong fences).  Written symbolically, X ->rcu-fence Y means
+there are fence events E and F such that:
+
+       X ->po E ->rcu-order F ->po Y.
+
+From the discussion above, we see this implies not only that X
+executes before Y, but also (if X is a store) that X propagates to
+every CPU before Y executes.  Thus rcu-fence is sort of a
+"super-strong" fence: Unlike the original strong fences (smp_mb() and
+synchronize_rcu()), rcu-fence is able to link events on different
+CPUs.  (Perhaps this fact should lead us to say that rcu-fence isn't
+really a fence at all!)
  
  Finally, the LKMM defines the RCU-before (rb) relation in terms of
  rcu-fence.  This is done in essentially the same way as the pb
@@ -1596,7 +1615,7 @@ before F, just as E ->pb F does (and for much the same reasons).
  Putting this all together, the LKMM expresses the Grace Period
  Guarantee by requiring that the rb relation does not contain a cycle.
  Equivalently, this "rcu" axiom requires that there are no events E
-and F with E ->rcu-link F ->rcu-fence E.  Or to put it a third way,
+and F with E ->rcu-link F ->rcu-order E.  Or to put it a third way,
  the axiom requires that there are no cycles consisting of rcu-gp and
  rcu-rscsi alternating with rcu-link, where the number of rcu-gp links
  is >= the number of rcu-rscsi links.
@@ -1750,7 +1769,7 @@ addition to normal RCU.  The ideas involved are much the same as
  above, with new relations srcu-gp and srcu-rscsi added to represent
  SRCU grace periods and read-side critical sections.  There is a
  restriction on the srcu-gp and srcu-rscsi links that can appear in an
-rcu-fence sequence (the srcu-rscsi links must be paired with srcu-gp
+rcu-order sequence (the srcu-rscsi links must be paired with srcu-gp
  links having the same SRCU domain with proper nesting); the details
  are relatively unimportant.