menu
Tatoeba
language
Cofrestru Mewngofnodi
language Cymraeg
menu
Tatoeba

chevron_right Cofrestru

chevron_right Mewngofnodi

Pori

chevron_right Show random sentence

chevron_right Pori yn ôl iaith

chevron_right Pori yn ôl rhestr

chevron_right Pori yn ôl tag

chevron_right Pori sain

Community

chevron_right Mur

chevron_right Rhestr o bob aelod

chevron_right Languages of members

chevron_right Native speakers

search
clear
swap_horiz
search

Dewislen

Yn ôl i'r mur

JimBreen JimBreen 21 Mawrth 2010 21 Mawrth 2010 am 06:11:23 UTC flag Report link Permalink

Traditional and Simplified Chinese

I saw the comment about converting hanzi on-the-fly. Be very cautious about that, as there are many cases where it simply doesn't work. Proper Traditional<->Simplified conversion needs to work at the lexeme level and in some cases needs some context for disambiguation.

Jack Halpern wrote a very good paper about this about 10 years ago:
http://www.cjk.org/cjk/c2c/c2cbasis.htm

PS: how do I make a comment on another posting?

{{vm.hiddenReplies[377] ? 'expand_more' : 'expand_less'}} cuddio'r ymatebion dangos yr ymatebion
JimBreen JimBreen 21 Mawrth 2010 21 Mawrth 2010 am 06:40:28 UTC flag Report link Permalink

OK, I worked out how to do a follow-on. I'd clicked "reply" but it hadn't worked. Now it does.

sysko sysko 21 Mawrth 2010 21 Mawrth 2010 am 11:10:11 UTC flag Report link Permalink

the traditional to simplified chinese is not made at "character by character" level, but try to decompose the sentence (you can see how the sentence has been segmented by looking to pinyin)
As I've said I'm in conctact with the guy who develop it, so don't hesitate to report any bad segmentations, I will report to him