Rust pretty blatantly just inherits C11's memory model for atomics. This is not
due to this model being particularly excellent or easy to understand. Indeed,
this model is quite complex and known to have [several flaws][C11-busted].
Rather, it is a pragmatic concession to the fact that *everyone* is pretty bad
at modeling atomics. At the very least, we can benefit from existing tooling
and research around C.

Trying to fully explain the model in this book is fairly hopeless. It's defined
in terms of madness-inducing causality graphs that require a full book to
properly understand in a practical way. If you want all the nitty-gritty
details, you should check out [C's specification (Section 7.17)][C11-model].
Still, we'll try to cover the basics and some of the problems Rust developers
face.

The C11 memory model is fundamentally about trying to bridge the gap between
the semantics we want, the optimizations compilers want, and the inconsistent
chaos our hardware wants. *We* would like to just write programs and have them
do exactly what we said but, you know, *fast*. Wouldn't that be great?

# Compiler Reordering

Compilers fundamentally want to be able to do all sorts of complicated
transformations to reduce data dependencies and eliminate dead code. In
particular, they may radically change the actual order of events, or make
events never occur! If we write something like

```
x = 1;
y = 3;
x = 2;
```

The compiler may conclude that it would be best if your program did

```
x = 2;
y = 3;
```

This has inverted the order of events *and* completely eliminated one event.
From a single-threaded perspective this is completely unobservable: after all
the statements have executed we are in exactly the same state. But if our
program is multi-threaded, we may have been relying on `x` to *actually* be
assigned to 1 before `y` was assigned. We would *really* like the compiler to
be able to make these kinds of optimizations, because they can seriously
improve performance. On the other hand, we'd really like to be able to depend
on our program *doing the thing we said*.
# Hardware Reordering
On the other hand, even if the compiler totally understood what we wanted and
respected our wishes, our *hardware* might instead get us in trouble. Trouble
comes from CPUs in the form of memory hierarchies. There is indeed a global
shared memory space somewhere in your hardware, but from the perspective of
each CPU core it is *so very far away* and *so very slow*. Each CPU would
rather work with its local cache of the data and go through all the *anguish*
of talking to shared memory *only* when it doesn't actually have that memory
in cache.

After all, that's the whole *point* of the cache, right? If every read from the
cache had to run back to shared memory to double check that it hadn't changed,
what would the point be? The end result is that the hardware doesn't guarantee
that events that occur in the same order on *one* thread occur in the same
order on *another* thread. To guarantee this, we must issue special
instructions to the CPU telling it to be a bit less smart.
For instance, say we convince the compiler to emit this logic:

```
initial state: x = 0, y = 1

THREAD 1        THREAD 2
y = 3;          if x == 1 {
x = 1;              y *= 2;
                }
```
Ideally this program has 2 possible final states:

* `y = 3`: (thread 2 did the check before thread 1 completed)
* `y = 6`: (thread 2 did the check after thread 1 completed)
However there's a third potential state that the hardware enables:

* `y = 2`: (thread 2 saw `x = 1`, but not `y = 3`, and then overwrote `y = 3`)
It's worth noting that different kinds of CPU provide different guarantees. It
is common to separate hardware into two categories: strongly-ordered and
weakly-ordered. Most notably x86/64 provides strong ordering guarantees, while
ARM provides weak ordering guarantees. This has two consequences for
concurrent programming:
* Asking for stronger guarantees on strongly-ordered hardware may be cheap or
  even *free* because they already provide strong guarantees unconditionally.
  Weaker guarantees may only yield performance wins on weakly-ordered hardware.

* Asking for guarantees that are *too* weak on strongly-ordered hardware is
  more likely to *happen* to work, even though your program is strictly
  incorrect. If possible, concurrent algorithms should be tested on
  weakly-ordered hardware.
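
To make the first point concrete, here's a small sketch (our example, not the
original text's) of how the ordering you ask for maps to hardware. The exact
instructions depend on the compiler, but the commonly cited mappings look like
this:

```rust
use std::sync::atomic::{AtomicUsize, Ordering};

pub static X: AtomicUsize = AtomicUsize::new(0);

// On x86-64 all three of these typically compile to the same plain `mov`:
// the hardware provides acquire semantics on loads unconditionally, so the
// stronger orderings are free here (and too-weak code may "happen" to work).
pub fn load_relaxed() -> usize { X.load(Ordering::Relaxed) }
pub fn load_acquire() -> usize { X.load(Ordering::Acquire) }
pub fn load_seqcst() -> usize { X.load(Ordering::SeqCst) }

// On AArch64 the distinction is real: the Relaxed load can be a plain `ldr`,
// while the Acquire/SeqCst loads become the more expensive `ldar`.
```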
# Data Accesses
The C11 memory model attempts to bridge the gap by allowing us to talk about
the *causality* of our program. Generally, this is by establishing a *happens
before* relationship between parts of the program and the threads that are
running them. This gives the hardware and compiler room to optimize the
program more aggressively where a strict happens-before relationship isn't
established, but forces them to be more careful where one *is* established.
The way we communicate these relationships is through *data accesses* and
*atomic accesses*.
Data accesses are the bread-and-butter of the programming world. They are
fundamentally unsynchronized and compilers are free to aggressively optimize
them. In particular, data accesses are free to be reordered by the compiler on
the assumption that the program is single-threaded. The hardware is also free
to propagate the changes made in data accesses to other threads as lazily and
inconsistently as it wants. Most critically, data accesses are how data races
happen. Data accesses are very friendly to the hardware and compiler, but as
we've seen they offer *awful* semantics to try to write synchronized code
with. Actually, that's too weak. *It is literally impossible to write correct
synchronized code using only data accesses*.
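
To make that concrete, here's a deliberately broken sketch (the `FLAG`/`DATA`
names and setup are ours, purely for illustration) of a flag-based handoff
written with plain data accesses. Nothing prevents the compiler or hardware
from reordering the writes or hoisting the reads, so if two threads ran these
functions it would be a data race, and therefore Undefined Behavior:

```rust
// DELIBERATELY WRONG: plain, unsynchronized data accesses shared between
// threads. There is no way to fix this without atomics.
static mut DATA: u32 = 0;
static mut FLAG: bool = false;

fn producer() {
    unsafe {
        DATA = 42;   // these two writes may be reordered by the compiler
        FLAG = true; // or hardware, so FLAG can become visible before DATA
    }
}

fn consumer() -> u32 {
    unsafe {
        while !FLAG {} // may be "optimized" into `if !FLAG { loop {} }`
        DATA           // no happens-before edge: may see 0, 42, or anything
    }
}
```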
Atomic accesses are how we tell the hardware and compiler that our program is
multi-threaded. Each atomic access can be marked with an *ordering* that
specifies what kind of relationship it establishes with other accesses. In
practice, this boils down to telling the compiler and hardware certain things
they *can't* do. For the compiler, this largely revolves around re-ordering of
instructions. For the hardware, this largely revolves around how writes are
propagated to other threads. The set of orderings Rust exposes are:

* Sequentially Consistent (SeqCst)
* Release
* Acquire
* Relaxed

(Note: We explicitly do not expose the C11 *consume* ordering)

TODO: negative reasoning vs positive reasoning?
TODO: "can't forget to synchronize"
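
For reference, here's a minimal sketch of where these show up in the API:
every operation on the `std::sync::atomic` types takes an explicit `Ordering`
(the particular operations below are arbitrary, just to show the shape):

```rust
use std::sync::atomic::{AtomicUsize, Ordering};

fn main() {
    let x = AtomicUsize::new(0);

    x.store(1, Ordering::SeqCst);              // strongest: a single global order
    let _ = x.load(Ordering::Acquire);         // one half of a release/acquire pair
    x.store(2, Ordering::Release);             // the other half
    let _ = x.fetch_add(1, Ordering::Relaxed); // weakest: atomicity, nothing more
}
```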
# Sequentially Consistent
Sequentially Consistent is the most powerful of all, implying the restrictions
of all other orderings. Intuitively, a sequentially consistent operation
*cannot* be reordered: all accesses on one thread that happen before and after
a SeqCst access *stay* before and after it. A data-race-free program that uses
only sequentially consistent atomics and data accesses has the very nice
property that there is a single global execution of the program's instructions
that all threads agree on. This execution is also particularly nice to reason
about: it's just an interleaving of each thread's individual executions. This
*does not* hold if you start using the weaker atomic orderings.
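
For instance, consider this sketch (our own example: the classic "store
buffering" litmus test). Because all four SeqCst operations belong to a single
global order that respects each thread's program order, at least one of the
loads must observe the other thread's store:

```rust
use std::sync::atomic::{AtomicUsize, Ordering};
use std::thread;

static X: AtomicUsize = AtomicUsize::new(0);
static Y: AtomicUsize = AtomicUsize::new(0);

fn main() {
    let t1 = thread::spawn(|| {
        X.store(1, Ordering::SeqCst);
        Y.load(Ordering::SeqCst) // r1
    });
    let t2 = thread::spawn(|| {
        Y.store(1, Ordering::SeqCst);
        X.load(Ordering::SeqCst) // r2
    });
    let (r1, r2) = (t1.join().unwrap(), t2.join().unwrap());

    // In the single global execution, whichever store comes second already
    // has the other thread's store before it, so both loads can't miss:
    assert!(r1 == 1 || r2 == 1);
}
```

With `Relaxed` (or even `Acquire`/`Release`) instead of `SeqCst`, both loads
reading 0 becomes a legal outcome.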
The relative developer-friendliness of sequential consistency doesn't come for
free. Even on strongly-ordered platforms sequential consistency involves
emitting memory fences.

In practice, sequential consistency is rarely necessary for program
correctness. However sequential consistency is definitely the right choice if
you're not confident about the other memory orders. Having your program run a
bit slower
than it needs to is certainly better than it running incorrectly! It's also
*mechanically* trivial to downgrade atomic operations to have a weaker
consistency later on. Just change `SeqCst` to e.g. `Relaxed` and you're done!
Of course, proving that this transformation is *correct* is a whole other
matter.
# Acquire-Release

Acquire and Release are largely intended to be paired. Their names hint at
their use case: they're perfectly suited for acquiring and releasing locks,
and ensuring that critical sections don't overlap.

Intuitively, an acquire access ensures that every access after it *stays* after
it. However operations that occur before an acquire are free to be reordered to
occur after it. Similarly, a release access ensures that every access before it
*stays* before it. However operations that occur after a release are free to
be reordered to occur before it.
When thread A releases a location in memory and then thread B subsequently
acquires *the same* location in memory, causality is established. Every write
that happened *before* A's release will be observed by B *after* its release.
However no causality is established with any other threads. Similarly, no
causality is established if A and B access *different* locations in memory.
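
The basic use of release-acquire is therefore simple: you acquire a location
of memory to begin the critical section, and then release that location to end
it. As a minimal sketch (using `compare_exchange`; in real code the lock would
be shared between threads, e.g. with an `Arc`), a simple spinlock might look
like:

```rust
use std::sync::atomic::{AtomicBool, Ordering};

fn main() {
    let lock = AtomicBool::new(false); // false means "unlocked"

    // Acquire the lock by swapping in `true`. The Acquire ordering ensures
    // the critical section stays after this point, and pairs with the
    // Release store below (and with releases done by other threads).
    while lock
        .compare_exchange(false, true, Ordering::Acquire, Ordering::Relaxed)
        .is_err()
    {}

    // ... critical section: we have exclusive access here ...

    // Release the lock. The Release ordering ensures the critical section
    // stays before this point, publishing it to the next acquirer.
    lock.store(false, Ordering::Release);
}
```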