menu
Tatoeba
language
Registrieren Anmelden
language Deutsch
menu
Tatoeba

chevron_right Registrieren

chevron_right Anmelden

Durchsuchen

chevron_right Zufälligen Satz anzeigen

chevron_right Nach Sprache durchsuchen

chevron_right Nach Liste durchsuchen

chevron_right Nach Etikett durchsuchen

chevron_right Audiodateien durchsuchen

Mitglieder

chevron_right Pinnwand

chevron_right Mitgliederliste

chevron_right Mitglieder nach Sprachen

chevron_right Muttersprachler

search
clear
swap_horiz
search
JimBreen JimBreen 21. März 2010 21. März 2010 um 06:11:23 UTC flag Report link zur Pinnwand

Traditional and Simplified Chinese

I saw the comment about converting hanzi on-the-fly. Be very cautious about that, as there are many cases where it simply doesn't work. Proper Traditional<->Simplified conversion needs to work at the lexeme level and in some cases needs some context for disambiguation.

Jack Halpern wrote a very good paper about this about 10 years ago:
http://www.cjk.org/cjk/c2c/c2cbasis.htm

PS: how do I make a comment on another posting?

{{vm.hiddenReplies[377] ? 'expand_more' : 'expand_less'}} Antworten verbergen Antworten anzeigen
JimBreen JimBreen 21. März 2010 21. März 2010 um 06:40:28 UTC flag Report link zur Pinnwand

OK, I worked out how to do a follow-on. I'd clicked "reply" but it hadn't worked. Now it does.

sysko sysko 21. März 2010 21. März 2010 um 11:10:11 UTC flag Report link zur Pinnwand

the traditional to simplified chinese is not made at "character by character" level, but try to decompose the sentence (you can see how the sentence has been segmented by looking to pinyin)
As I've said I'm in conctact with the guy who develop it, so don't hesitate to report any bad segmentations, I will report to him