Releases: PyThaiNLP/pythainlp
PyThaiNLP v4.0.2 Released!
PyThaiNLP v4.0.2
is a bug fix release of PyThaiNLP v4.0
.
Upgrade: pip install -U pythainlp
Documentation: https://pythainlp.github.io/docs/4.0
Report bug: https://github.com/PyThaiNLP/pythainlp/issues
What's Changed
- fixed bug by @kangkengkhadev in #798
- fig เอือน อวน by @kangkengkhadev in #799
Full Changelog: v4.0.1...v4.0.2
Contributors
Thanks all the contributors. (Image made with contributors-img)
If you want to contributing to PyThaiNLP, you can read Contributing to PyThaiNLP.
PyThaiNLP v4.0.1 Released!
PyThaiNLP v4.0.1
is a bug fix release of PyThaiNLP v4.0
.
Upgrade: pip install -U pythainlp
Documentation: https://pythainlp.github.io/docs/4.0
Report bug: https://github.com/PyThaiNLP/pythainlp/issues
What's Changed
- Fix mishandling Karun in Kavee Matra Checker by @HRNPH in #793
- adding tonemark removal to fix mattra checking by @HRNPH in #795
Full Changelog: v4.0.0...v4.0.1
Contributors
Thanks all the contributors. (Image made with contributors-img)
If you want to contributing to PyThaiNLP, you can read Contributing to PyThaiNLP.
PyThaiNLP 4.0 Released!
PyThaiNLP published the first version is 0.0.4 to PyPI at 6 years ago, so PyThaiNLP 4.0 will have special codename. The codename for PyThaiNLP 4.0 is PyThaiNLP 4.0 (Real).
See 4.0 Milestone.
Documentation: https://pythainlp.github.io/docs/4.0
Report bug: https://github.com/PyThaiNLP/pythainlp/issues
If you want to contribute to PyThaiNLP, you can read Contributing to PyThaiNLP.
What is new?
Deprecation and other API changes
- Delete all LST20 model #728
- 947c7be Change pythainlp.tools.misspell to pythainlp.tools.misspell.misspell
Improve
Tokenizer
Tag
Util
Transliterate
- Add Thai2Rom ONNX model #743
Khavee
Parse
- Add ud_goeswith #757
Corpus
- Add new science word #763
Full Changelog
- Improve: Reduce import time by @wannaphong in #719
- Create CITATION.cff by @wannaphong in #721
- Fix/broken numeric data format (#652) by @noppayut in #723
- Add blackboard pos_tag to cls by @wannaphong in #734
- Update perceptron.py by @wannaphong in #736
- Feature/integrate transliteration dictionary (#681) by @noppayut in #735
- Delete all LST20 model by @wannaphong in #728
- Add blackboard cls by @wannaphong in #732
- Add blackboard pos_tag by @wannaphong in #733
- Add style.css: extend docs page width by @LXZE in #742
- Add rule to TCC and Change TCC rule for newmm by @wannaphong in #741
- Setup action to check for code formatting by @new5558 in #746
- Add more test for TCC by @wannaphong in #747
- Add Thai2Rom ONNX model by @new5558 in #743
- Add pythainlp.util.count_thai_chars by @wannaphong in #748
- Feature: keyword extraction with keybert and frequency ranking by @noppayut in #751
- Add ud_goeswith by @wannaphong in #757
- Bump tensorflow from 2.7.2 to 2.9.3 by @dependabot in #758
- Add new science word by @wannaphong in #763
- Add thai_strptime and convert_years by @wannaphong in #767
- Fix typo in thai_full_month_lists for February by @PhakphumV in #770
- Add pythainlp.util.phoneme by @wannaphong in #772
- Add remove tone ipa by @wannaphong in #776
- add khavee to pythainlp by @kangkengkhadev in #777
- Add khavee docs tests by @wannaphong in #778
- add aek/too checker function to khavee by @HRNPH in #779
- Add Thai NER 2.0 by @wannaphong in #781
- Add Copyright to the header files by @wannaphong in #782
- Fixed some issues in Khavee. It's a problem with use อ by @kangkengkhadev in #785
- PyThaiNLP 4.0 beta 1 by @wannaphong in #786
- fix some bugs and add check_karu_lahu function by @kangkengkhadev in #787
- PyThaiNLP 4.0 Released! by @wannaphong in #789
Full Changelog: v3.1.0...v4.0.0
Contributors
Thanks all the contributors. (Image made with contributors-img)
If you want to contributing to PyThaiNLP, you can read Contributing to PyThaiNLP.
New Contributors
- @LXZE made their first contribution in #742
- @new5558 made their first contribution in #746
- @PhakphumV made their first contribution in #770
- @kangkengkhadev made their first contribution in #777
- @HRNPH made their first contribution in #779
PyThaiNLP v4.0.0-beta1
This post will give you the change log for PyThaiNLP 4.0. PyThaiNLP published the first version is 0.0.4 to PyPI at 6 years ago, so PyThaiNLP 4.0 will have special codename. The codename for PyThaiNLP 4.0 is PyThaiNLP 4.0 (Real).
This release is the first beta release of PyThaiNLP 4.0.
Schedule
- Beta release: 1 April 2023
- Production release: 14 April 2023
See 4.0 Milestone.
What is new?
Deprecation and other API changes
- Delete all LST20 model #728
- 947c7be Change pythainlp.tools.misspell to pythainlp.tools.misspell.misspell
Improve
Tokenizer
Tag
Util
Transliterate
- Add Thai2Rom ONNX model #743
Khavee
Parse
- Add ud_goeswith #757
Corpus
- Add new science word #763
What's Changed
- Improve: Reduce import time by @wannaphong in #719
- Create CITATION.cff by @wannaphong in #721
- Fix/broken numeric data format (#652) by @noppayut in #723
- Add blackboard pos_tag to cls by @wannaphong in #734
- Update perceptron.py by @wannaphong in #736
- Feature/integrate transliteration dictionary (#681) by @noppayut in #735
- Delete all LST20 model by @wannaphong in #728
- Add blackboard cls by @wannaphong in #732
- Add blackboard pos_tag by @wannaphong in #733
- Add style.css: extend docs page width by @LXZE in #742
- Add rule to TCC and Change TCC rule for newmm by @wannaphong in #741
- Setup action to check for code formatting by @new5558 in #746
- Add more test for TCC by @wannaphong in #747
- Add Thai2Rom ONNX model by @new5558 in #743
- Add pythainlp.util.count_thai_chars by @wannaphong in #748
- Feature: keyword extraction with keybert and frequency ranking by @noppayut in #751
- Add ud_goeswith by @wannaphong in #757
- Bump tensorflow from 2.7.2 to 2.9.3 by @dependabot in #758
- Add new science word by @wannaphong in #763
- Add thai_strptime and convert_years by @wannaphong in #767
- Fix typo in thai_full_month_lists for February by @PhakphumV in #770
- Add pythainlp.util.phoneme by @wannaphong in #772
- Add remove tone ipa by @wannaphong in #776
- add khavee to pythainlp by @kangkengkhadev in #777
- Add khavee docs tests by @wannaphong in #778
- add aek/too checker function to khavee by @HRNPH in #779
- Add Thai NER 2.0 by @wannaphong in #781
- Add Copyright to the header files by @wannaphong in #782
- Fixed some issues in Khavee. It's a problem with use อ by @kangkengkhadev in #785
- PyThaiNLP 4.0 beta 1 by @wannaphong in #786
New Contributors
- @LXZE made their first contribution in #742
- @new5558 made their first contribution in #746
- @PhakphumV made their first contribution in #770
- @kangkengkhadev made their first contribution in #777
- @HRNPH made their first contribution in #779
Full Changelog: v3.1.0...v4.0.0-beta1
PyThaiNLP v3.1.1 Released!
PyThaiNLP v3.1.1 is the releases updates of PyThaiNLP v3.1.0.
What's Changed
pythainlp.tools.misspell
changed topythainlp.tools.misspell.misspell
.- Add Reduce import time #719 to PyThaiNLP 3.1.1 #753
- Doc: Lst20 deprecation warning for 3.1.1 (#749) #752 (Thank you @noppayut)
Full Changelog: v3.1.0...v3.1.1
You can install or upgrade by pip install pythainlp==3.1.1
.
Documentation: https://pythainlp.github.io/docs/3.1
Report bug: https://github.com/PyThaiNLP/pythainlp/issues
See 3.1 Milestone.
Contributors
Thanks all the contributors. (Image made with contributors-img)
PyThaiNLP v3.1.0 Released!
This is the release version for PyThaiNLP v3.1.0
You can install by pip install pythainlp==3.1.0
.
Documentation: https://pythainlp.github.io/docs/3.1
Report bug: https://github.com/PyThaiNLP/pythainlp/issues
See 3.1 Milestone.
What is new?
Deprecation and other API changes
#687 Remove deprecated function
- pythainlp.word_vector; doesnt_match, get_model, most_similar_cosmul, sentence_vectorizer, similarity. use WordVector class instead
- pythainlp.util.delete_tone. use pythainlp.util.remove_tonemark instead
- Remove pythainlp.util.time_time. use pythainlp.util.time_to_thaiword instead
- pythainlp.tokenize.syllable_tokenize. use pythainlp.tokenize.subword_tokenize instead
Dependency Parsing
- Now, PyThaiNLP support dependency_parsing 🎉 Add pythainlp.parse.dependency_parsing #706
Name Entity Tagging
- #665 Add Thai-NNER
pythainlp.tag.NNER
- #658 Add LST20NER onnx model. It is LST20NER model to onnx model from fine-turning by WangchanBERTa model.
Transliteration
- #659 Add ISO 11940 transliteration
- #660 Add Thai W2P v0.2
- #686 Add wunsen
- #694 Wunsen Mandarin and Japanese update
PyThaiNLP Corpus downloader
- #656 Add support zip/tar.gz to download corpus
Text normalization
- #673 Add a normalising rule for Lakkhangyao ๅ
Translate
- #674 add gpu option
Text summarize
- #679 Add mt5 cpe kmutt thai sentence sum
Util
- #682 Add live-dead syllable classification
- #684 Add live dead syllable classify
- #690 Add tone detector
Soundex
- #699 Add Thai-English Cross-Language Transliterated Word Retrieval using Soundex Technique
Other
- #689 map NG tag to PART
- #691 Remove TinyDB as a dependency
- #692 Fix notifications that newer versions of corpora are available
- Add warning about LST20 license
Contributors
New Contributors
- @chameleonTK made their first contribution in #673
- @vikimark made their first contribution in #674
- @BLKSerene made their first contribution in #691
- @cakimpei made their first contribution in #694
Full Changelog: v3.0.10...v3.1.0
All Contributors
Thanks all the contributors. (Image made with contributors-img)
We build Thai NLP.
PyThaiNLP
PyThaiNLP v3.1.0-beta0
This is the beta version for PyThaiNLP v3.1.
You can install by pip install --pre pythainlp==3.1.0b0
.
Documentation: https://pythainlp.github.io/dev-docs/
Report bug: https://github.com/PyThaiNLP/pythainlp/issues
See 3.1 Milestone.
What is new?
Deprecation and other API changes
#687 Remove deprecated function
- pythainlp.word_vector; doesnt_match, get_model, most_similar_cosmul, sentence_vectorizer, similarity. use WordVector class instead
- pythainlp.util.delete_tone. use pythainlp.util.remove_tonemark instead
- Remove pythainlp.util.time_time. use pythainlp.util.time_to_thaiword instead
- pythainlp.tokenize.syllable_tokenize. use pythainlp.tokenize.subword_tokenize instead
Dependency Parsing
- Now, PyThaiNLP support dependency_parsing 🎉 Add pythainlp.parse.dependency_parsing #706
Name Entity Tagging
- #665 Add Thai-NNER
pythainlp.tag.NNER
- #658 Add LST20NER onnx model. It is LST20NER model to onnx model from fine-turning by WangchanBERTa model.
Transliteration
- #659 Add ISO 11940 transliteration
- #660 Add Thai W2P v0.2
- #686 Add wunsen
- #694 Wunsen Mandarin and Japanese update
PyThaiNLP Corpus downloader
- #656 Add support zip/tar.gz to download corpus
Text normalization
- #673 Add a normalising rule for Lakkhangyao ๅ
Translate
- #674 add gpu option
Text summarize
- #679 Add mt5 cpe kmutt thai sentence sum
Util
- #682 Add live-dead syllable classification
- #684 Add live dead syllable classify
- #690 Add tone detector
Soundex
- #699 Add Thai-English Cross-Language Transliterated Word Retrieval using Soundex Technique
Other
- #689 map NG tag to PART
- #691 Remove TinyDB as a dependency
- #692 Fix notifications that newer versions of corpora are available
- Add warning about LST20 license
What's Changed
- Add more words from Royal Society by @wannaphong in #653
- Add support zip/tar.gz to download corpus by @wannaphong in #656
- Update from dev by @wannaphong in #657
- Add ISO 11940 transliteration by @wannaphong in #659
- Add Thai W2P v0.2 and PyThaiNLP v3.0.6dev0 by @wannaphong in #660
- Add LST20NER onnx model by @wannaphong in #658
- Add Thai-NNER by @wannaphong in #665
- Update dev base from 3.0 base by @wannaphong in #668
- PyThaiNLP 3.0.7 by @wannaphong in #670
- Update dev branche from pythainlp-3.0 branche by @wannaphong in #672
- Normalise Lakkhangyao by @chameleonTK in #673
- add gpu option by @vikimark in #674
- Bump tensorflow from 2.5.3 to 2.6.4 by @dependabot in #677
- Bump tensorflow from 2.6.4 to 2.7.2 by @dependabot in #678
- Add mt5 cpe kmutt thai sentence sum by @wannaphong in #679
- Add live-dead syllable classification by @wannaphong in #682
- Fixed CI Bug by @wannaphong in #683
- Add live dead syllable classify by @wannaphong in #684
- Add wunsen by @wannaphong in #686
- Add ThaiSum sentence segmentor by @chameleonTK in #688
- map NG tag to PART by @chameleonTK in #689
- Add tone detector by @wannaphong in #690
- Remove deprecated function by @wannaphong in #687
- Remove TinyDB as a dependency by @BLKSerene in #691
- Fix notifications that newer versions of corpora are available by @BLKSerene in #692
- Start PyThaiNLP v3.1.0-dev0 by @wannaphong in #693
- Wunsen Mandarin and Japanese update by @cakimpei in #694
- Add Thai-English Cross-Language Transliterated Word Retrieval using Soundex Technique by @wannaphong in #699
- Fixed #700 by @wannaphong in #701
- Update add-word_detokenize from dev by @wannaphong in #703
- Add word_detokenize by @wannaphong in #697
- Move model by @wannaphong in #705
- Add pythainlp.parse.dependency_parsing by @wannaphong in #706
New Contributors
- @chameleonTK made their first contribution in #673
- @vikimark made their first contribution in #674
- @BLKSerene made their first contribution in #691
- @cakimpei made their first contribution in #694
Full Changelog: v3.0.9...v3.1.0-beta0
All Contributors
Thanks all the contributors. (Image made with contributors-img)
We build Thai NLP.
PyThaiNLP
PyThaiNLP v3.0.10 Released!
PyThaiNLP v3.0.10
is This release is a bug fix release of PyThaiNLP v3.0.9
.
Bug Fixed
- Fixed Wrong tag mapping from lst20 to UD #711
Full Changelog: v3.0.9...v3.0.10
You can install by pip install pythainlp
or upgrade by pip install -U pythainlp
.
Documentation: https://pythainlp.github.io/docs/3.0/index.html
Report bug: https://github.com/PyThaiNLP/pythainlp/issues
Contributors
Thanks all the contributors. (Image made with contributors-img)
PyThaiNLP v3.1.0-dev3
This is a development release for PyThaiNLP v3.1.
You can install by pip install --pre pythainlp==3.1.0.dev3
.
Documentation: https://pythainlp.github.io/dev-docs/
Report bug: https://github.com/PyThaiNLP/pythainlp/issues
See 3.1 Milestone.
What's Changed
- Move model by @wannaphong in #705
- Add pythainlp.parse.dependency_parsing by @wannaphong in #706
Full Changelog: v3.1.0-dev2...v3.1.0-dev3
All Contributors
Thanks all the contributors. (Image made with contributors-img)
We build Thai NLP.
PyThaiNLP
PyThaiNLP v3.1.0-dev2
This is the development release for PyThaiNLP v3.1.
You can install by pip install --pre pythainlp==3.1.0.dev2
.
Documentation: https://pythainlp.github.io/dev-docs/
Report bug: https://github.com/PyThaiNLP/pythainlp/issues
See 3.1 Milestone.
What's Changed
- Add Thai-English Cross-Language Transliterated Word Retrieval using Soundex Technique by @wannaphong in #699
- Fixed #700 by @wannaphong in #701
- Update add-word_detokenize from dev by @wannaphong in #703
- Add word_detokenize by @wannaphong in #697
Full Changelog: v3.1.0-dev1...v3.1.0-dev2
All Contributors
Thanks all the contributors. (Image made with contributors-img)