Skip to content

Commit 3aca40d

Browse files
authored
closes bpo-36861: Update Unicode database to 12.1.0. (GH-13214)
Adds ㋿.
1 parent 289f1f8 commit 3aca40d

File tree

10 files changed

+15381
-15377
lines changed

10 files changed

+15381
-15377
lines changed

Doc/library/stdtypes.rst

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -351,7 +351,7 @@ Notes:
351351
The numeric literals accepted include the digits ``0`` to ``9`` or any
352352
Unicode equivalent (code points with the ``Nd`` property).
353353

354-
See http://www.unicode.org/Public/12.0.0/ucd/extracted/DerivedNumericType.txt
354+
See http://www.unicode.org/Public/12.1.0/ucd/extracted/DerivedNumericType.txt
355355
for a complete list of code points with the ``Nd`` property.
356356

357357

Doc/library/unicodedata.rst

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -17,8 +17,8 @@
1717

1818
This module provides access to the Unicode Character Database (UCD) which
1919
defines character properties for all Unicode characters. The data contained in
20-
this database is compiled from the `UCD version 12.0.0
21-
<http://www.unicode.org/Public/12.0.0/ucd>`_.
20+
this database is compiled from the `UCD version 12.1.0
21+
<http://www.unicode.org/Public/12.1.0/ucd>`_.
2222

2323
The module uses the same names and symbols as defined by Unicode
2424
Standard Annex #44, `"Unicode Character Database"
@@ -175,6 +175,6 @@ Examples:
175175

176176
.. rubric:: Footnotes
177177

178-
.. [#] http://www.unicode.org/Public/12.0.0/ucd/NameAliases.txt
178+
.. [#] http://www.unicode.org/Public/12.1.0/ucd/NameAliases.txt
179179
180-
.. [#] http://www.unicode.org/Public/12.0.0/ucd/NamedSequences.txt
180+
.. [#] http://www.unicode.org/Public/12.1.0/ucd/NamedSequences.txt

Doc/reference/lexical_analysis.rst

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -316,7 +316,7 @@ The Unicode category codes mentioned above stand for:
316316
* *Nd* - decimal numbers
317317
* *Pc* - connector punctuations
318318
* *Other_ID_Start* - explicit list of characters in `PropList.txt
319-
<http://www.unicode.org/Public/12.0.0/ucd/PropList.txt>`_ to support backwards
319+
<http://www.unicode.org/Public/12.1.0/ucd/PropList.txt>`_ to support backwards
320320
compatibility
321321
* *Other_ID_Continue* - likewise
322322

Doc/whatsnew/3.8.rst

Lines changed: 2 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -510,9 +510,8 @@ Added new clock :data:`~time.CLOCK_UPTIME_RAW` for macOS 10.12.
510510
unicodedata
511511
-----------
512512

513-
* The :mod:`unicodedata` module has been upgraded to use the `Unicode 12.0.0
514-
<http://blog.unicode.org/2019/03/announcing-unicode-standard-version-120.html>`_
515-
release.
513+
* The :mod:`unicodedata` module has been upgraded to use the `Unicode 12.1.0
514+
<http://blog.unicode.org/2019/05/unicode-12-1-en.html>`_ release.
516515

517516
* New function :func:`~unicodedata.is_normalized` can be used to verify a string
518517
is in a specific normal form. (Contributed by Max Belanger and David Euresti in

Lib/test/test_unicodedata.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -80,7 +80,7 @@ class UnicodeFunctionsTest(UnicodeDatabaseTest):
8080

8181
# Update this if the database changes. Make sure to do a full rebuild
8282
# (e.g. 'make distclean && make') to get the correct checksum.
83-
expectedchecksum = '4cb02a243aed7c251067386dd738189146fddf94'
83+
expectedchecksum = 'c44a49ca7c5cb6441640fe174ede604b45028652'
8484
def test_function_checksum(self):
8585
data = []
8686
h = hashlib.sha1()
Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1 @@
1+
Update the Unicode database to version 12.1.0.

Modules/unicodedata_db.h

Lines changed: 1032 additions & 1032 deletions
Some generated files are not rendered by default. Learn more about customizing how changed files appear on GitHub.

Modules/unicodename_db.h

Lines changed: 14337 additions & 14333 deletions
Large diffs are not rendered by default.

Objects/unicodetype_db.h

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2925,7 +2925,7 @@ static const unsigned short index2[] = {
29252925
5, 5, 5, 5, 5, 5, 27, 27, 27, 27, 27, 27, 27, 27, 27, 27, 27, 27, 27, 27,
29262926
27, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5,
29272927
5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5,
2928-
5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 0, 55, 55, 55, 55, 55,
2928+
5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 55, 55, 55, 55, 55,
29292929
388, 55, 55, 55, 55, 55, 55, 55, 55, 55, 55, 55, 55, 55, 55, 55, 55, 55,
29302930
55, 55, 55, 55, 55, 55, 55, 55, 55, 55, 55, 55, 55, 55, 55, 55, 55, 55,
29312931
55, 55, 55, 55, 55, 55, 55, 55, 55, 55, 55, 55, 55, 55, 55, 55, 55, 55,

Tools/unicode/makeunicodedata.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -41,7 +41,7 @@
4141
# * Doc/library/stdtypes.rst, and
4242
# * Doc/library/unicodedata.rst
4343
# * Doc/reference/lexical_analysis.rst (two occurrences)
44-
UNIDATA_VERSION = "12.0.0"
44+
UNIDATA_VERSION = "12.1.0"
4545
UNICODE_DATA = "UnicodeData%s.txt"
4646
COMPOSITION_EXCLUSIONS = "CompositionExclusions%s.txt"
4747
EASTASIAN_WIDTH = "EastAsianWidth%s.txt"

0 commit comments

Comments
 (0)