JimBreen, March 21, 2010 at 6:11:23 AM UTC

Traditional and Simplified Chinese

I saw the comment about converting hanzi on-the-fly. Be very cautious about that, as there are many cases where it simply doesn't work. Proper Traditional<->Simplified conversion needs to work at the lexeme level and in some cases needs some context for disambiguation.
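To make the failure mode concrete, here is a minimal sketch of a naive per-character converter. The three-entry table and the function name `convert_by_char` are made up for illustration and are not taken from any real conversion tool. The simplified character 发 corresponds to two traditional characters, 發 (as in 發現, "discover") and 髮 (as in 頭髮, "hair"), so a table that stores one target per character has to get one of them wrong.

```python
# Hypothetical, deliberately tiny Simplified -> Traditional character table.
# A per-character table can hold only one target for each source character.
CHAR_TABLE = {
    "头": "頭",
    "发": "發",  # 发 can also be 髮; the table has to pick one
    "现": "現",
}


def convert_by_char(text: str) -> str:
    """Convert each character independently, ignoring the word it belongs to."""
    return "".join(CHAR_TABLE.get(ch, ch) for ch in text)


print(convert_by_char("发现"))  # 發現 (correct)
print(convert_by_char("头发"))  # 頭發 (wrong: the word for "hair" is 頭髮)
```

Swapping the table entry to 髮 would fix 头发 but break 发现, which is why the choice has to be made per word rather than per character.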

Jack Halpern wrote a very good paper about this about 10 years ago:
http://www.cjk.org/cjk/c2c/c2cbasis.htm

PS: how do I make a comment on another posting?

JimBreen, March 21, 2010 at 6:40:28 AM UTC

OK, I worked out how to do a follow-on. I'd clicked "reply" but it hadn't worked. Now it does.

sysko, March 21, 2010 at 11:10:11 AM UTC

The Traditional-to-Simplified Chinese conversion is not done character by character; it tries to segment the sentence into words first (you can see how the sentence was segmented by looking at the pinyin).
As I've said, I'm in contact with the guy who develops it, so don't hesitate to report any bad segmentations and I will pass them on to him.
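
A segment-then-convert approach can be sketched roughly as follows. This is only an illustration under stated assumptions, not the code of the tool mentioned above: the lexicon, the fallback table, and the names `segment` and `convert` are all hypothetical, and greedy longest-match segmentation is just one simple strategy. The example runs in the Simplified-to-Traditional direction because the ambiguity of 发 (發 or 髮) is easy to show there.

```python
# Hypothetical word-level lexicon (Simplified -> Traditional).
LEXICON = {
    "头发": "頭髮",  # "hair"
    "发现": "發現",  # "discover"
}
# Hypothetical per-character fallback for text the lexicon does not cover.
CHAR_FALLBACK = {"头": "頭", "发": "發", "现": "現"}

MAX_WORD_LEN = max(len(word) for word in LEXICON)


def segment(text: str) -> list[str]:
    """Greedy longest-match segmentation against LEXICON."""
    words = []
    i = 0
    while i < len(text):
        # Try the longest candidate first; a single character always matches.
        for size in range(min(MAX_WORD_LEN, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if size == 1 or candidate in LEXICON:
                words.append(candidate)
                i += size
                break
    return words


def convert(text: str) -> str:
    """Convert word by word, falling back to characters for unknown words."""
    out = []
    for word in segment(text):
        if word in LEXICON:
            out.append(LEXICON[word])
        else:
            out.append("".join(CHAR_FALLBACK.get(ch, ch) for ch in word))
    return "".join(out)


print(segment("我发现头发"))  # ['我', '发现', '头发']
print(convert("我发现头发"))  # 我發現頭髮
```

A bad segmentation shows up directly as a wrong conversion, which is why reports of mis-segmented sentences are useful to whoever maintains the word list.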