menu
Tatoeba
language
Register Log in
language English
menu
Tatoeba

chevron_right Register

chevron_right Log in

Browse

chevron_right Show random sentence

chevron_right Browse by language

chevron_right Browse by list

chevron_right Browse by tag

chevron_right Browse audio

Community

chevron_right Wall

chevron_right List of all members

chevron_right Languages of members

chevron_right Native speakers

search
clear
swap_horiz
search
gillux gillux September 17, 2014, edited September 17, 2014 September 17, 2014 at 9:06:03 PM UTC, edited September 17, 2014 at 9:18:04 PM UTC link Permalink

Hello!

I recently worked to improve the furiganas for sentences of the Japanese language. The furiganas are now displayed as hiraganas instead of katakanas. In addition, they are no longer attached to words already in kanas. (Actually, it’s not perfect: when a word contains a mix of kanas and kanjis, the whole word, including the kana parts, is displayed in the furigana.)

In other words, we now have (#3501384):
言い訳[いいわけ] ばっか すん な よ 。
Instead of:
言い訳[イイワケ] ばっか[バッカ] すん[スン] な[ナ] よ[ヨ] 。[。]

Last but not least, the furiganas should contain less errors than they used to. For instance, 来ない is now correctly read as こない instead of *きない. But beware, furiganas are still not 100% accurate.

EDIT: On a side note, I’d like to mention that deploying the updated version of our (terrible) furigana generation software on tatoeba.org was a piece of cake, thanks to the work of pallavshah, one of the GSOC student who worked on Tatoeba this summer. In other words, he saved us hours of tedious work and we can develop faster and safer.

{{vm.hiddenReplies[20438] ? 'expand_more' : 'expand_less'}} hide replies show replies
tommy_san tommy_san September 18, 2014, edited September 18, 2014 September 18, 2014 at 1:36:52 AM UTC, edited September 18, 2014 at 2:53:34 AM UTC link Permalink

Great! It really looks much better now. Thank you for your hard work, gillux and pallavshah.

I'm looking forward to seeing perfect furigana. I guess the trickiest are words like 飼い犬(かいいぬ), since it's probably difficult for a machine to decide whether it's 飼(か)い犬(いぬ) or 飼(かい)い犬(ぬ). I'm willing to help you if there's anything I can do.