menu
Tatoeba
language
Đăng ký Đăng nhập
language Tiếng Việt
menu
Tatoeba

chevron_right Đăng ký

chevron_right Đăng nhập

Duyệt

chevron_right Hiện câu ngẫu nhiên

chevron_right Duyệt theo ngôn ngữ

chevron_right Duyệt theo danh sách

chevron_right Duyệt theo thẻ

chevron_right Duyệt âm thanh

Cộng đồng

chevron_right Tường

chevron_right Danh sách thành viên

chevron_right Ngôn ngữ thành viên

chevron_right Người bản xứ

search
clear
swap_horiz
search
JimBreen JimBreen 21 tháng 3, 2010 06:11:23 UTC 21 tháng 3, 2010 flag Report link Permalink

Traditional and Simplified Chinese

I saw the comment about converting hanzi on-the-fly. Be very cautious about that, as there are many cases where it simply doesn't work. Proper Traditional<->Simplified conversion needs to work at the lexeme level and in some cases needs some context for disambiguation.

Jack Halpern wrote a very good paper about this about 10 years ago:
http://www.cjk.org/cjk/c2c/c2cbasis.htm

PS: how do I make a comment on another posting?

{{vm.hiddenReplies[377] ? 'expand_more' : 'expand_less'}} ẩn câu trả lời hiển thị câu trả lời
JimBreen JimBreen 21 tháng 3, 2010 06:40:28 UTC 21 tháng 3, 2010 flag Report link Permalink

OK, I worked out how to do a follow-on. I'd clicked "reply" but it hadn't worked. Now it does.

sysko sysko 21 tháng 3, 2010 11:10:11 UTC 21 tháng 3, 2010 flag Report link Permalink

the traditional to simplified chinese is not made at "character by character" level, but try to decompose the sentence (you can see how the sentence has been segmented by looking to pinyin)
As I've said I'm in conctact with the guy who develop it, so don't hesitate to report any bad segmentations, I will report to him