For some time now, I've wanted to know more about how the MeCab analyzer works and why it produces erroneous furigana. These furigana errors translate into erroneous romaji in external applications such as the Imiwa? (née Kotoba!) app. These errors apparently make a significant impression on new learners.
The erroneous furigana assigned to the proverbs and idioms that I've submitted to Tatoeba Project so far result from a rule not being followed. With this sentence comes a new problem: archaic readings of the kana themselves.
Critical reading is an important part of foreign language learning. I use it as a student and as a teacher. It reflects my background in critical thinking and analysis from my primary school days. Native and non-native speakers of Japanese alike struggle to understand readings of kanji and archaic kana. Nonetheless, visits to temples and jinja, and participation in traditional culture, make up a significant industry and undertaking that should not be dismissed. It's important.
Here, the ひ in 貰ひ is read as い, and the furigana, though not explicit, does a fine job of capturing this. However, a new learner might not make the connection, since the い doesn't hover over the ひ.
The analyzer isn't doing a fine job. Look at the romaji!
貰 (X もらい) もら
ひ = い
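As a rough illustration of the ひ = い rule, here is a minimal Python sketch of the general pattern in historical kana usage, where a non-initial は, ひ, ふ, へ, or ほ is usually read as わ, い, う, え, or お. The table and the function name `modern_reading` are my own, purely for illustration; this is not how MeCab works, and the rule has exceptions that the sketch does not handle.

```python
# Hypothetical helper, for illustration only: approximate the modern reading
# of a word written in historical kana by rewriting non-initial ha-row kana.
HA_ROW_TO_MODERN = {"は": "わ", "ひ": "い", "ふ": "う", "へ": "え", "ほ": "お"}


def modern_reading(historical_kana: str) -> str:
    """Return an approximate modern reading for a historical-kana spelling.

    The first kana is kept as-is; only later ha-row kana are rewritten.
    Exceptions to the rule are NOT handled.
    """
    if not historical_kana:
        return historical_kana
    head, tail = historical_kana[0], historical_kana[1:]
    return head + "".join(HA_ROW_TO_MODERN.get(ch, ch) for ch in tail)


print(modern_reading("もらひ"))  # the reading of 貰ひ: もらい
```

With this, the furigana could be split so that もら sits over 貰 and い sits over ひ, which is the alignment a new learner needs to see.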
License: CC BY 2.0 FR
This sentence is original and was not derived from translation.
added by archer_root, February 25, 2013