menu
Tatoeba
language
Magrehistro Pumasok
language Tagalog
menu
Tatoeba

chevron_right Magrehistro

chevron_right Pumasok

Magtingin-tingin

chevron_right Show random sentence

chevron_right Magtingin-tingin ayon sa wika

chevron_right Magtingin-tingin ayon sa talaan

chevron_right Magtingin-tingin ayon sa etiketa

chevron_right Magtingin-tingin ng audio

Pamayanan

chevron_right Wall

chevron_right Talaan ng lahat ng mga kasapi

chevron_right Wika ng mga kasapi

chevron_right Mga katutubong tagapagsalita

search
clear
swap_horiz
search
JimBreen JimBreen Marso 21, 2010 Marso 21, 2010 nang 6:11:23 AM UTC flag Report link Permakawing

Traditional and Simplified Chinese

I saw the comment about converting hanzi on-the-fly. Be very cautious about that, as there are many cases where it simply doesn't work. Proper Traditional<->Simplified conversion needs to work at the lexeme level and in some cases needs some context for disambiguation.

Jack Halpern wrote a very good paper about this about 10 years ago:
http://www.cjk.org/cjk/c2c/c2cbasis.htm

PS: how do I make a comment on another posting?

{{vm.hiddenReplies[377] ? 'expand_more' : 'expand_less'}} itago ang mga tugon ipakita ang mga tugon
JimBreen JimBreen Marso 21, 2010 Marso 21, 2010 nang 6:40:28 AM UTC flag Report link Permakawing

OK, I worked out how to do a follow-on. I'd clicked "reply" but it hadn't worked. Now it does.

sysko sysko Marso 21, 2010 Marso 21, 2010 nang 11:10:11 AM UTC flag Report link Permakawing

the traditional to simplified chinese is not made at "character by character" level, but try to decompose the sentence (you can see how the sentence has been segmented by looking to pinyin)
As I've said I'm in conctact with the guy who develop it, so don't hesitate to report any bad segmentations, I will report to him