Menu
About Romanized Languages
What about sentences for romanized languages like romanized japanese, chinese, etc? I do not see any romanization in the collections.
Facebook FastText language classifier (language detection) is using Tatoeba as training / validation set (see https://fasttext.cc/blog/2017/10/02/blog-post.html ), and it detects 142 languages with a very good accuracy (better than other language detectors), but it lacks romanized languages versions (like Google Translate, Chrome CLD2, etc.).
There is no such thing as "Romanized languages". Romanization is just a linguistic process transforming one system into another one, the Latin script system. That's a very selfish, self-centered vision of the world, if you want my opinion. But let's put opinions aside.
"Romanized Japanese" is not a language and does not have its place here on Tatoeba. Even if one wanted to make sentences with ローマ字, they would have to agree on how to write them, and this would never happened unless somebody force everybody else to use one system.
And if such a thing happened, then I would ask a transcript into Japanese Katakana of every single language available :) And you can see where it will lead...
It makes sense. My question was wrong. The right question could be: is there a way to add transcription support for those languages than has a transcription system like Japanese, Chinese, etc in Tatoeba.
Thanks for your answer.
I have just found this interesting blog post "Tools for Japanese romanization" on Tatoeba about what I was discussing here:
http://blog.tatoeba.org/2009/02...anization.html