gillux, 17 September 2014, edited 17 September 2014 at 9:06:03 PM UTC, edited 17 September 2014 at 9:18:04 PM UTC

Hello!

I recently worked on improving the furigana for Japanese sentences. The furigana is now displayed in hiragana instead of katakana. In addition, it is no longer attached to words that are already written in kana. (Actually, it’s not perfect: when a word contains a mix of kana and kanji, the whole word, including the kana parts, is displayed in the furigana.)

In other words, we now have (#3501384):
言い訳[いいわけ] ばっか すん な よ 。
Instead of:
言い訳[イイワケ] ばっか[バッカ] すん[スン] な[ナ] よ[ヨ] 。[。]
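
To illustrate the new display rule, here is a minimal Python sketch. It is not the actual Tatoeba code. It assumes the sentence has already been split into (surface, katakana reading) pairs by some morphological analyzer, and the names `kata_to_hira`, `contains_kanji` and `annotate` are made up for this example. The kanji test only covers the basic CJK Unified Ideographs block plus 々, which is a simplification.

```python
def kata_to_hira(text: str) -> str:
    """Convert katakana to hiragana.

    Katakana U+30A1..U+30F6 sit exactly 0x60 code points above the
    corresponding hiragana; everything else is left unchanged.
    """
    return "".join(
        chr(ord(ch) - 0x60) if "\u30a1" <= ch <= "\u30f6" else ch
        for ch in text
    )


def contains_kanji(text: str) -> bool:
    """Rough kanji test (assumption: basic CJK block plus the 々 mark)."""
    return any("\u4e00" <= ch <= "\u9fff" or ch == "々" for ch in text)


def annotate(tokens):
    """Attach a hiragana reading only to tokens that contain kanji."""
    parts = []
    for surface, reading in tokens:
        if contains_kanji(surface):
            parts.append(f"{surface}[{kata_to_hira(reading)}]")
        else:
            # Kana-only words and punctuation get no furigana.
            parts.append(surface)
    return " ".join(parts)


# Hypothetical analyzer output for sentence #3501384.
tokens = [
    ("言い訳", "イイワケ"),
    ("ばっか", "バッカ"),
    ("すん", "スン"),
    ("な", "ナ"),
    ("よ", "ヨ"),
    ("。", "。"),
]
print(annotate(tokens))
```

Note that, like the current behaviour described above, this sketch still puts the whole reading いいわけ on 言い訳, kana part included.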

Last but not least, the furigana should contain fewer errors than it used to. For instance, 来ない is now correctly read as こない instead of *きない. But beware, the furigana is still not 100% accurate.

EDIT: On a side note, I’d like to mention that deploying the updated version of our (terrible) furigana generation software on tatoeba.org was a piece of cake, thanks to the work of pallavshah, one of the GSoC students who worked on Tatoeba this summer. In other words, he saved us hours of tedious work, and we can now develop faster and more safely.

tommy_san, 18 September 2014, edited 18 September 2014 at 1:36:52 AM UTC, edited 18 September 2014 at 2:53:34 AM UTC

Great! It really looks much better now. Thank you for your hard work, gillux and pallavshah.

I'm looking forward to seeing perfect furigana. I guess the trickiest cases are words like 飼い犬(かいいぬ), since it's probably difficult for a machine to decide whether it's 飼(か)い犬(いぬ) or 飼(かい)い犬(ぬ). I'm willing to help if there's anything I can do.
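
To make the ambiguity concrete, here is a small Python sketch that lists every way a reading can be split over a word, assuming each kanji takes one or more kana and each kana in the word must match itself in the reading. The function names `is_kanji` and `alignments` are invented for this illustration, the kanji test only covers the basic CJK block, and this is not how the site's furigana software works as far as I know.

```python
def is_kanji(ch: str) -> bool:
    """Rough kanji test (assumption: basic CJK Unified Ideographs only)."""
    return "\u4e00" <= ch <= "\u9fff"


def alignments(surface: str, reading: str):
    """Return all splits of `reading` over the characters of `surface`.

    Each result is a list of (character, reading part) pairs.
    """
    if not surface:
        # A split is valid only if the whole reading has been consumed.
        return [[]] if not reading else []
    head, rest = surface[0], surface[1:]
    results = []
    if is_kanji(head):
        # A kanji may take any non-empty prefix of the remaining reading.
        for n in range(1, len(reading) + 1):
            for tail in alignments(rest, reading[n:]):
                results.append([(head, reading[:n])] + tail)
    elif reading.startswith(head):
        # A kana in the word must appear as itself in the reading.
        for tail in alignments(rest, reading[1:]):
            results.append([(head, head)] + tail)
    return results


for split in alignments("飼い犬", "かいいぬ"):
    print(split)
```

For 飼い犬 this yields exactly the two candidates above, so the surface form and the reading alone cannot settle it; some extra knowledge, such as a per-kanji reading dictionary, would be needed to pick 飼(か)い犬(いぬ).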