MeCab maintainer (管理人)
Could we get a volunteer to take charge of MeCab / MeCab dictionary improvements? There are some things that look like easy fixes (一晩中 ひとばんじゅう) and some things that are just weird glitches (ご覧 has no okurigana). I don't really have the time to look into it myself so I think another person is needed.
As I have commented, adding to MeCab's dictionary won't fix all these. You really have to fix it on a case-by-case basis. The WWWJDIC indices would help here, although not all sentences are indexed.
Yes, but _some_ of them should be fixable via the dictionary. Looking at the list
http://tatoeba.org/eng/sentences_lists/show/113
I'd suspect that most of them could be.
Looking at the first one (アマゾン川は延々と北ブ流れている) I'd say it's a mistake by whoever programmed the MeCab interface in Tatoeba. アマゾン川 is handled correctly by MeCab:
アマゾン川 名詞,固有名詞,一般,*,*,*,アマゾン川,アマゾンガワ,アマゾンガワ
The second one (母にひと月に一度手紙を書きます) is the same. MeCab handles ひと月 OK, but Tatoeba ignores it.
ひと月 名詞,一般,*,*,*,*,ひと月,ヒトツキ,ヒトツキ
I think someone is not noticing the kanji at the end of the string returned.
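For whoever picks this up, here is a rough sketch of how one line of MeCab output could be split so that the reading is taken from the right field. It assumes the IPADIC-style feature layout seen in the two examples above (base form in the 7th feature field, katakana reading in the 8th). The names `parse_mecab_line` and `katakana_to_hiragana` are made up for illustration and are not taken from Tatoeba's code.

```python
from typing import NamedTuple, Optional


class Token(NamedTuple):
    surface: str            # the text as it appears in the sentence
    base: str               # dictionary (base) form
    reading: Optional[str]  # katakana reading, or None if MeCab gave none


def parse_mecab_line(line: str) -> Token:
    """Split one 'surface<whitespace>feature,feature,...' line.

    Assumes an IPADIC-style layout. Unknown words come back with
    fewer fields, so the reading may be missing.
    """
    surface, features = line.rstrip("\n").split(None, 1)
    fields = features.split(",")
    base = fields[6] if len(fields) > 6 and fields[6] != "*" else surface
    reading = fields[7] if len(fields) > 7 else None
    return Token(surface, base, reading)


def katakana_to_hiragana(text: str) -> str:
    """Shift katakana (ァ..ヶ) down to hiragana; leave other characters alone."""
    return "".join(
        chr(ord(c) - 0x60) if "ァ" <= c <= "ヶ" else c for c in text
    )


if __name__ == "__main__":
    tok = parse_mecab_line(
        "アマゾン川 名詞,固有名詞,一般,*,*,*,アマゾン川,アマゾンガワ,アマゾンガワ"
    )
    if tok.reading is not None:
        print(tok.surface, katakana_to_hiragana(tok.reading))
```

The point is only that the surface form, kanji included, is kept whole and paired with the full reading, rather than being cut at the first kanji. A real fix would depend on which dictionary and output format Tatoeba actually uses.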
Which dictionary is being used, BTW?