Skip to content

Commit a2ba952

Browse files
committed
Mention NFKC, reformat lines to reflect 'semantic clause' structure.
1 parent 49d00b2 commit a2ba952

File tree

1 file changed

+7
-10
lines changed

1 file changed

+7
-10
lines changed

doc/rust.md

Lines changed: 7 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -118,19 +118,16 @@ production. See [tokens](#tokens) for more information.
118118

119119
## Input format
120120

121-
Rust input is interpreted as a sequence of Unicode codepoints encoded in
122-
UTF-8. No normalization is performed during input processing. Most Rust
123-
grammar rules are defined in terms of printable ASCII-range codepoints, but
124-
a small number are defined in terms of Unicode properties or explicit
125-
codepoint lists. ^[Surrogate definitions for the special Unicode productions
126-
are provided to the grammar verifier, restricted to ASCII range, when
127-
verifying the grammar in this document.]
121+
Rust input is interpreted as a sequence of Unicode codepoints encoded in UTF-8,
122+
normalized to Unicode normalization form NFKC.
123+
Most Rust grammar rules are defined in terms of printable ASCII-range codepoints,
124+
but a small number are defined in terms of Unicode properties or explicit codepoint lists.
125+
^[Substitute definitions for the special Unicode productions are provided to the grammar verifier, restricted to ASCII range, when verifying the grammar in this document.]
128126

129127
## Special Unicode Productions
130128

131-
The following productions in the Rust grammar are defined in terms of
132-
Unicode properties: `ident`, `non_null`, `non_star`, `non_eol`, `non_slash`,
133-
`non_single_quote` and `non_double_quote`.
129+
The following productions in the Rust grammar are defined in terms of Unicode properties:
130+
`ident`, `non_null`, `non_star`, `non_eol`, `non_slash`, `non_single_quote` and `non_double_quote`.
134131

135132
### Identifiers
136133

0 commit comments

Comments
 (0)