Releases: PyThaiNLP/pythainlp
PyThaiNLP 2.0
PyThaiNLP is a Python library for natural language processing (NLP) of the Thai language.
PyThaiNLP includes Thai word tokenizers, transliterators, soundex converters, part-of-speech taggers, and spell checkers.
📖 For details on upgrading from PyThaiNLP 1.7 to PyThaiNLP 2.0, see "From PyThaiNLP 1.7 to PyThaiNLP 2.0".
📖 ThaiNER users upgrading from PyThaiNLP 1.7 to PyThaiNLP 2.0, see "Upgrade ThaiNER from PyThaiNLP 1.7 to PyThaiNLP 2.0".
📫 Follow us on Facebook: PyThaiNLP
What's new in version 2.0?
- New NorvigSpellChecker spell checker class, which can be initialized with a custom dictionary.
- End Python 2 support. Remove all Python 2 compatibility code.
- Remove old, obsolete, deprecated, and experimental code.
- Thai2fit (upgrade ULMFiT-related code to fastai 1.0)
- ThaiNER 1.0
- Remove sentiment analysis
- Improved word_tokenize (newmm, mm) and dict_word_tokenize
- Improved POS-tagging
- More and improved examples
- For the full list, see the PyThaiNLP 2.0 change log
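To illustrate what a Norvig-style spell checker built from a custom dictionary does, here is a minimal standard-library sketch. It is not PyThaiNLP's NorvigSpellChecker: the class name `SimpleNorvigChecker`, its method names, and the toy word frequencies are all assumptions made for this example. It generates every string one edit away from the input and returns the known word with the highest frequency.

```python
from collections import Counter


class SimpleNorvigChecker:
    """Hypothetical stand-in for illustration; not PyThaiNLP's class."""

    def __init__(self, custom_dict):
        # custom_dict: iterable of (word, frequency) pairs (assumed format).
        self.freq = Counter(dict(custom_dict))
        # The edit alphabet is every character seen in the dictionary.
        self.alphabet = sorted({ch for word in self.freq for ch in word})

    def _edits1(self, word):
        """All strings one delete, transpose, replace, or insert away."""
        splits = [(word[:i], word[i:]) for i in range(len(word) + 1)]
        deletes = [a + b[1:] for a, b in splits if b]
        transposes = [a + b[1] + b[0] + b[2:] for a, b in splits if len(b) > 1]
        replaces = [a + c + b[1:] for a, b in splits if b for c in self.alphabet]
        inserts = [a + c + b for a, b in splits for c in self.alphabet]
        return set(deletes + transposes + replaces + inserts)

    def known(self, words):
        """Keep only the words that appear in the dictionary."""
        return {w for w in words if w in self.freq}

    def spell(self, word):
        """Candidate corrections, most frequent first (ties broken by word)."""
        candidates = (
            self.known([word]) or self.known(self._edits1(word)) or {word}
        )
        return sorted(candidates, key=lambda w: (-self.freq[w], w))

    def correct(self, word):
        """The single most likely correction."""
        return self.spell(word)[0]


# Toy dictionary with made-up frequencies.
checker = SimpleNorvigChecker([("แมว", 50), ("แม่", 80), ("หมา", 40)])
print(checker.correct("แมว"))  # already a known word, returned unchanged
print(checker.spell("แมา"))    # known words one edit away, by frequency
```

Deriving the alphabet from the dictionary keeps the sketch script-agnostic; a production checker would more likely use a fixed Thai character set and filter out rare or very short entries.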
Links
- User guide: English, ภาษาไทย (Thai)
- Docs: https://thainlp.org/pythainlp/docs/2.0/
- GitHub: https://github.com/PyThaiNLP/pythainlp
- Issues: https://github.com/PyThaiNLP/pythainlp/issues
Thank you for choosing us.
PyThaiNLP team
PyThaiNLP 2.0 Beta
PyThaiNLP is a Python package for text processing and linguistic analysis, similar to NLTK but with a focus on the Thai language.
PyThaiNLP 2.0 Beta is a pre-release for beta testing PyThaiNLP 2.0.
What's new in PyThaiNLP 2.0?
- Consolidate documentation files
- Thai2fit (upgrade ULMFiT-related code to fastai 1.0)
- Remove Python 2 compatibility code
- Remove temporary files, experimental files, and obsolete files
- Remove sentiment analysis
- More consistent indentation in source code
- Improved word_tokenize (newmm, mm) and dict_word_tokenize
- Improved POS-tagging
- More and improved examples
- Improved test coverage with more test cases
More details: #118
Install
pip install https://github.com/PyThaiNLP/pythainlp/archive/2.0b.zip
Docs : https://thainlp.org/pythainlp/docs/2.0/index.html
Website : https://pythainlp.github.io/
GitHub : https://github.com/PyThaiNLP/pythainlp
Issues : https://github.com/PyThaiNLP/pythainlp/issues
Thank you for choosing us.
PyThaiNLP team
PyThaiNLP 1.7.4
PyThaiNLP 1.7.3
- Fix import of thai_syllable.txt
PyThaiNLP 1.7.2
- Fix sent_tokenize also splitting text on the vertical line character (#166)
PyThaiNLP 1.7.1
PyThaiNLP 1.7.0.1
- Remove import test in PyThaiNLP
- Update README.md
PyThaiNLP 1.7.0
PyThaiNLP is a Python library for natural language processing (NLP) of the Thai language.
What's new in PyThaiNLP 1.7?
- Deprecate Python 2 support
- Refactor pythainlp.tokenize.pyicu for readability
- Add Thai NER model to pythainlp.ner
- thai2vec v0.2 - larger vocab, benchmarking results on Wongnai dataset
- Sentiment classifier based on ULMFiT and various product review datasets
- Add ULMFiT utility to PyThaiNLP
- Add Thai romanization model thai2rom
- Retrain POS-tagging model
- Improve word_tokenize (newmm, mm) and dict_word_tokenize
- Documentation added
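Thai text has no spaces between words, so the dictionary-based tokenizers named above need a word list to find boundaries. The sketch below shows the basic idea with a greedy longest-match scan. It is a simpler relative of maximal matching and is not the algorithm behind newmm, mm, or dict_word_tokenize; the function name `longest_match_tokenize` and the toy dictionaries are assumptions for this example.

```python
def longest_match_tokenize(text, dictionary):
    """Greedy longest-match segmentation (illustrative sketch only)."""
    words = set(dictionary)
    max_len = max((len(w) for w in words), default=1)
    tokens = []
    i = 0
    while i < len(text):
        match = None
        # Try the longest possible dictionary word starting at position i.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if candidate in words:
                match = candidate
                break
        if match is None:
            # No dictionary word starts here: emit one character as a token.
            match = text[i]
        tokens.append(match)
        i += len(match)
    return tokens


print(longest_match_tokenize("ฉันรักแมว", ["ฉัน", "รัก", "แมว"]))
# The classic ambiguous string: greedy matching commits to "ตาก" first.
print(longest_match_tokenize("ตากลม", ["ตา", "กลม", "ตาก", "ลม"]))
```

The second call shows why greedy matching alone is not enough: "ตากลม" can be read as "ตา" + "กลม" or "ตาก" + "ลม", and a greedy scan always takes the second reading. Tokenizers that consider all candidate segmentations can choose between them.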
Install
pip install https://github.com/PyThaiNLP/pythainlp/archive/1.7.0.zip
Docs : https://thainlp.org/pythainlp/docs/1.7/
GitHub : https://github.com/PyThaiNLP/pythainlp
Issues : https://github.com/PyThaiNLP/pythainlp/issues
Thank you for choosing us.
PyThaiNLP team
PyThaiNLP 1.7 Beta 1
PyThaiNLP is a Python library for natural language processing (NLP) of the Thai language.
PyThaiNLP 1.7 Beta 1 is a pre-release for beta testing PyThaiNLP 1.7.
What's new in PyThaiNLP 1.7?
- Deprecate Python 2 support
- Refactor pythainlp.tokenize.pyicu for readability
- Add Thai NER model to pythainlp.ner
- thai2vec v0.2 - larger vocab, benchmarking results on Wongnai dataset
- Sentiment classifier based on ULMFiT and various product review datasets
- Add ULMFiT utility to PyThaiNLP
- Add Thai romanization model thai2rom
- Retrain POS-tagging model
- Improve word_tokenize (newmm, mm) and dict_word_tokenize
- Documentation added
Install
pip install https://github.com/PyThaiNLP/pythainlp/archive/1.7b1.zip
Docs : https://thainlp.org/pythainlp/docs/1.7/ (in progress)
Website : https://thainlp.org/pythainlp/ (in progress)
GitHub : https://github.com/PyThaiNLP/pythainlp
Issues : https://github.com/PyThaiNLP/pythainlp/issues
Thank you for choosing us.
PyThaiNLP team
PyThaiNLP 1.7 Alpha 2
PyThaiNLP 1.7 Alpha 2 is a test version for developers. It is not recommended for production use.
What's new in PyThaiNLP 1.7?
Key points
- Add pythainlp.ner, a named-entity recognition (NER) module for PyThaiNLP
- Officially end support for Python 2.7
- Add ULMFiT utility to PyThaiNLP
- Improve the word tokenizers, both newmm and mm
- thai2vec v0.2
- New sentiment analysis based on deep learning
- Add thai2rom, a character-level Thai romanization model based on deep learning
- Retrain the POS tagger with additional training on top of the previous model
Install
pip install https://github.com/PyThaiNLP/pythainlp/archive/1.7a2.zip
Report bugs or send suggestions at https://github.com/PyThaiNLP/pythainlp/issues