Tatoeba
Quielin, March 5, 2016 at 4:01:59 PM UTC

Hi! Does anyone know what Tatoeba uses for its Pinyin converter? Is it sinoparserd? Does it also provide a tokenizing function? Thank you!

gillux, March 5, 2016 at 6:17:12 PM UTC

Hello Quielin, welcome to Tatoeba. As you guessed, the Pinyin converter currently used on Tatoeba is sinoparserd. I don’t understand Chinese, but it seems the generated Pinyin is tokenized: Pinyin “words” are separated by spaces.
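
If the Pinyin really does come back with words separated by spaces, getting tokens out of it could be as simple as splitting on whitespace. Here is a minimal sketch under that assumption. The function name `pinyin_tokens` and the sample string are made up for illustration; they are not part of sinoparserd, and this does not show how sinoparserd itself is called.

```python
from typing import List


def pinyin_tokens(pinyin: str) -> List[str]:
    """Split a space-separated Pinyin string into word tokens.

    Assumes the converter has already grouped syllables into words and
    separated the words with whitespace.
    """
    # str.split() with no argument splits on any run of whitespace
    # and ignores leading/trailing whitespace.
    return pinyin.split()


# Hypothetical converter output, not a real sinoparserd response.
sample = "wǒ xǐhuan xuéxí zhōngwén"
tokens = pinyin_tokens(sample)
print(tokens)
```

This only recovers the word boundaries the converter already chose; it does not segment the original Chinese text.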

Quielin, March 5, 2016 at 7:57:54 PM UTC

Thank you! I didn't notice that; you are absolutely right. I will try to install it. The information on git is so limited that I wasn't sure it provides that functionality.