menu
Tatoeba
language
Loo kasutaja Logi sisse
language Eesti
menu
Tatoeba

chevron_right Loo kasutaja

chevron_right Logi sisse

Sirvi

chevron_right Näita suvalist lauset

chevron_right Sirvi keelte kaupa

chevron_right Sirvi nimekirja kaupa

chevron_right Browse by tag

chevron_right Sirvi helisalvestusi

Suhtle

chevron_right Sein

chevron_right Kõigi liikmete nimekiri

chevron_right Liikmete keeled

chevron_right Emakeele rääkijad

search
clear
swap_horiz
search

Menüü

Tagasi seinale

JimBreen JimBreen 21. märts 2010 21. märts 2010 06:11:23 UTC flag Report link Püsilink

Traditional and Simplified Chinese

I saw the comment about converting hanzi on-the-fly. Be very cautious about that, as there are many cases where it simply doesn't work. Proper Traditional<->Simplified conversion needs to work at the lexeme level and in some cases needs some context for disambiguation.

Jack Halpern wrote a very good paper about this about 10 years ago:
http://www.cjk.org/cjk/c2c/c2cbasis.htm

PS: how do I make a comment on another posting?

{{vm.hiddenReplies[377] ? 'expand_more' : 'expand_less'}} peida vastused näita vastuseid
JimBreen JimBreen 21. märts 2010 21. märts 2010 06:40:28 UTC flag Report link Püsilink

OK, I worked out how to do a follow-on. I'd clicked "reply" but it hadn't worked. Now it does.

sysko sysko 21. märts 2010 21. märts 2010 11:10:11 UTC flag Report link Püsilink

the traditional to simplified chinese is not made at "character by character" level, but try to decompose the sentence (you can see how the sentence has been segmented by looking to pinyin)
As I've said I'm in conctact with the guy who develop it, so don't hesitate to report any bad segmentations, I will report to him