Skip to content

Releases: PyThaiNLP/pythainlp

PyThaiNLP 2.0

31 Mar 19:01
Compare
Choose a tag to compare

PyThaiNLP 2.0

Codacy Badgepypi
Build Status
Build status
Coverage Status
License

PyThaiNLP is a Python library for natural language processing (NLP) of Thai language.

PyThaiNLP includes Thai word tokenizers, transliterators, soundex converters, part-of-speech taggers, and spell checkers.

📖 For details on upgrading from PyThaiNLP 1.7 to PyThaiNLP 2.0, see From PyThaiNLP 1.7 to PyThaiNLP 2.0

📖 For ThaiNER user after upgrading from PyThaiNLP 1.7 to PyThaiNLP 2.0, see Upgrade ThaiNER from PyThaiNLP 1.7 to PyThaiNLP 2.0

📫 follow us on Facebook Pythainlp

What's new in version 2.0 ?

  • New NorvigSpellChecker spell checker class, which can be initialized with custom dictionary.
  • Terminate Python 2 support. Remove all Python 2 compatibility code.
  • Remove old, obsolated, deprecated, and experimental code.
  • Thai2fit (Upgrade ULMFiT-related codes to fastai 1.0)
  • ThaiNER 1.0
  • Remove sentiment analysis
  • Improved word_tokenize (newmm, mm) and dict_word_tokenize
  • Improved POS-tagging
  • More and improved examples
  • see PyThaiNLP 2.0 change log

Links

Thank you for choosing us.

PyThaiNLP team

PyThaiNLP 2.0 Beta

29 Mar 16:32
8a9acdc
Compare
Choose a tag to compare
PyThaiNLP 2.0 Beta Pre-release
Pre-release

PyThaiNLP is a Python package for text processing and linguistic analysis, similar to nltk but with focus on Thai language.

PyThaiNLP 2.0 Beta for beta testing PyThaiNLP 2.0.

What's new in PyThaiNLP 2.0 ?

  • Consolidate documentation files
  • Thai2fit (Upgrade ULMFiT-related codes to fastai 1.0)
  • Remove Python 2 compatibility code
  • Remove temporary files, experiment files, and obsoleted files
  • Remove sentiment analysis
  • More consistent indentations in source code
  • Improved word_tokenize (newmm, mm) and dict_word_tokenize
  • Improved POS-tagging
  • More and improved examples
  • Improved test coverages with more test case

More details #118

Install

pip install https://github.com/PyThaiNLP/pythainlp/archive/2.0b.zip

Docs : https://thainlp.org/pythainlp/docs/2.0/index.html

Website : https://pythainlp.github.io/

GitHub : https://github.com/PyThaiNLP/pythainlp

Issues : https://github.com/PyThaiNLP/pythainlp/issues

Thank you for choosing us.

PyThaiNLP team

PyThaiNLP 1.7.4

09 Mar 11:35
ab79eab
Compare
Choose a tag to compare
  • Fixed #176
  • removed conllu from requirements.txt #175

PyThaiNLP 1.7.3

10 Feb 09:50
8035bf2
Compare
Choose a tag to compare
  • fixed import thai_syllable.txt

PyThaiNLP 1.7.2

28 Dec 05:37
6c75b10
Compare
Choose a tag to compare
  • fix sent_tokenize also split text by vertical line #166

PyThaiNLP 1.7.1

31 Oct 06:49
Compare
Choose a tag to compare
  • Remove duplicated codes , More meaningful exception message, report unknown engine name (@bact )
  • Move test folder , Fix Flake8 errors (@zkan )

and more

PyThaiNLP 1.7.0.1

29 Sep 18:24
6bc4b9d
Compare
Choose a tag to compare
  • remove import test in PyThaiNLP
  • update README.md

PyThaiNLP 1.7.0

22 Sep 13:17
7abc2ef
Compare
Choose a tag to compare

PyThaiNLP is a Python library for natural language processing (NLP) of Thai language.

What's new in PyThaiNLP 1.7 ?

  • Deprecate Python 2 support
  • Refactor pythainlp.tokenize.pyicu for readability
  • Add Thai NER model to pythainlp.ner
  • thai2vec v0.2 - larger vocab, benchmarking results on Wongnai dataset
  • Sentiment classifier based on ULMFit and various product review datasets
  • Add ULMFit utility to PyThaiNLP
  • Add Thai romanization model thai2rom
  • Retrain POS-tagging model
  • Improve word tokenize (newmm,mm) and dict_word_tokenize
  • Documentation added

Install

pip install https://github.com/PyThaiNLP/pythainlp/archive/1.7.0.zip

Docs : https://thainlp.org/pythainlp/docs/1.7/

GitHub : https://github.com/PyThaiNLP/pythainlp

Issues : https://github.com/PyThaiNLP/pythainlp/issues

Thank you for choosing us.

PyThaiNLP team

PyThaiNLP 1.7 Beta 1

15 Sep 15:53
Compare
Choose a tag to compare
PyThaiNLP 1.7 Beta 1 Pre-release
Pre-release

PyThaiNLP is a Python library for natural language processing (NLP) of Thai language.

PyThaiNLP 1.7 Beta 1 for beta testing PyThaiNLP 1.7

What's new in PyThaiNLP 1.7 ?

  • Deprecate Python 2 support
  • Refactor pythainlp.tokenize.pyicu for readability
  • Add Thai NER model to pythainlp.ner
  • thai2vec v0.2 - larger vocab, benchmarking results on Wongnai dataset
  • Sentiment classifier based on ULMFit and various product review datasets
  • Add ULMFit utility to PyThaiNLP
  • Add Thai romanization model thai2rom
  • Retrain POS-tagging model
  • Improve word tokenize (newmm,mm) and dict_word_tokenize
  • Documentation added

Install

pip install https://github.com/PyThaiNLP/pythainlp/archive/1.7b1.zip

Docs : https://thainlp.org/pythainlp/docs/1.7/ (in progress)

Website : https://thainlp.org/pythainlp/ (in progress)

GitHub : https://github.com/PyThaiNLP/pythainlp

Issues : https://github.com/PyThaiNLP/pythainlp/issues

Thank you for choosing us.

PyThaiNLP team

PyThaiNLP 1.7 Alpha 2

06 Sep 15:09
d2f18de
Compare
Choose a tag to compare
PyThaiNLP 1.7 Alpha 2 Pre-release
Pre-release

PyThaiNLP 1.7 Alpha 2 เป็นเวชั่นทดสอบสำหรับนักพัฒนา ไม่แนะนำให้นำไปใช้งานจริง

มีอะไรใหม่ใน PyThaiNLP 1.7

สรุปประเด็นสำคัญ

  • เพิ่ม pythainlp.ner เป็น NER สำหรับ PyThaiNLP
  • ยกเลิกการสนับสนุน Python 2.7 อย่างเป็นทางการ
  • เพิ่ม ULMFit utility เข้ามาใน PyThaiNLP
  • ปรับปรุงระบบตัดคำใหม่ ทั้ง newmm และ mm
  • thai2vec v0.2
  • sentiment analysis ตัวใหม่ทำงานด้วย Deep learning
  • เพิ่ม thai2rom เป็น Thai Romanization ทำด้วย Deep learning ในระดับตัวอักษร
  • Train Pos tag ใหม่เพิ่มเติมจากเดิม

การติดตั้ง

ใช้คำสั่ง pip install https://github.com/PyThaiNLP/pythainlp/archive/1.7a2.zip

แจ้งข้อผิดพลาดหรือเสนอแนะนำได้ที่ https://github.com/PyThaiNLP/pythainlp/issues