ULex

Ubiquitous Lexicography tool (restricted charset demo), Version 0.2.11

Uyghur demo text in Uyghur Latin Yëziqi (ULY) script:

Sources for Uyghur

ISO 639-3: uig
Ethnologue: Uyghur: a language of China
Encoding: Saimaiti, Maimaitimin & Zhiwei Feng. 2007. A syllabification algorithm and syllable statistics of written Uyghur. Proceedings of the 4th Corpus Linguistics Conference, Birmingham, UK.
Omniglot: Writing systems and languages of the world.
Wikipedia: Uyghur alphabets.
Data: Universal Declaration of Human Rights in Uyghur
Keywords: Uyghur, concordance, ILG, interlinear glossing, transliteration, wordlist, frequency list, language documentation, computational lexicography.
Registration: Code tables must be registered for incorporation into ULex. Please contact Dafydd Gibbon for detailed specifications. In essence, a code table is a 4-column CSV table: orthographic UTF-8 characters from a Unicode font with their UTF-8 codes (input), and IPA characters from a Unicode font with their Unicode entity codes (output).
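
As a rough illustration of what such a 4-column code table might look like and how it could be checked before submission, here is a minimal Python sketch. It is not the ULex implementation. The column order (orthographic character, UTF-8 code, IPA character, Unicode entity code), the hexadecimal notation for UTF-8 codes, the decimal entity notation, the three sample rows, and the function names load_table and transliterate are all assumptions made for this example; the detailed specifications are available from the author.

```python
import csv
import io

# Hypothetical sample rows, NOT the registered ULex table for Uyghur.
# Assumed columns: orthographic, UTF-8 code (hex), IPA, Unicode entity (decimal).
# "\u00eb" is the precomposed letter e-diaeresis used in ULY.
SAMPLE = "a,61,ɑ,&#593;\n\u00eb,C3AB,e,&#101;\ngh,6768,ʁ,&#641;\n"


def load_table(text):
    """Parse the CSV text and return {orthographic: ipa}, checking both code columns."""
    table = {}
    for orth, utf8_hex, ipa, entities in csv.reader(io.StringIO(text)):
        # The UTF-8 code column must match the actual encoding of the input string.
        if orth.encode("utf-8").hex().upper() != utf8_hex.upper():
            raise ValueError(f"UTF-8 code mismatch for {orth!r}")
        # The entity column must match the code points of the IPA string.
        expected = "".join(f"&#{ord(ch)};" for ch in ipa)
        if entities != expected:
            raise ValueError(f"entity code mismatch for {ipa!r}")
        table[orth] = ipa
    return table


def transliterate(word, table):
    """Map orthography to IPA by greedy longest match (an assumed strategy)."""
    longest = max(len(key) for key in table)
    out, i = [], 0
    while i < len(word):
        for n in range(longest, 0, -1):
            chunk = word[i:i + n]
            if chunk in table:
                out.append(table[chunk])
                i += n
                break
        else:
            # Characters missing from the table are passed through unchanged.
            out.append(word[i])
            i += 1
    return "".join(out)


if __name__ == "__main__":
    table = load_table(SAMPLE)
    print(transliterate("gha", table))
```

Greedy longest match is used here only so that digraphs such as "gh" are tried before single letters; whether ULex resolves digraphs this way is not stated in the source.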

Dafydd Gibbon 2011-10-16 (Original background image, inspired by Ulex Europaeus © Carl Farmer 2004)