Botok is a powerful Python library for tokenizing Tibetan text. It segments text into words with high accuracy and provides optional attributes such as lemma, part-of-speech (POS) tags, and clean ...