menu
Tatoeba
language
Регистрация Вход
language Русский
menu
Tatoeba

chevron_right Регистрация

chevron_right Вход

Просмотр

chevron_right Показать случайное предложение

chevron_right Обзор по языку

chevron_right Обзор по спискам

chevron_right Обзор по тегам

chevron_right Обзор аудио

Сообщество

chevron_right Стена

chevron_right Все участники

chevron_right Участники по языкам

chevron_right Носители языка

search
clear
swap_horiz
search
JimBreen JimBreen 21 марта 2010 г. 21 марта 2010 г., 06:11:23 UTC flag Report link Пермалинк

Traditional and Simplified Chinese

I saw the comment about converting hanzi on-the-fly. Be very cautious about that, as there are many cases where it simply doesn't work. Proper Traditional<->Simplified conversion needs to work at the lexeme level and in some cases needs some context for disambiguation.

Jack Halpern wrote a very good paper about this about 10 years ago:
http://www.cjk.org/cjk/c2c/c2cbasis.htm

PS: how do I make a comment on another posting?

{{vm.hiddenReplies[377] ? 'expand_more' : 'expand_less'}} спрятать ответы показать ответы
JimBreen JimBreen 21 марта 2010 г. 21 марта 2010 г., 06:40:28 UTC flag Report link Пермалинк

OK, I worked out how to do a follow-on. I'd clicked "reply" but it hadn't worked. Now it does.

sysko sysko 21 марта 2010 г. 21 марта 2010 г., 11:10:11 UTC flag Report link Пермалинк

the traditional to simplified chinese is not made at "character by character" level, but try to decompose the sentence (you can see how the sentence has been segmented by looking to pinyin)
As I've said I'm in conctact with the guy who develop it, so don't hesitate to report any bad segmentations, I will report to him