Menu
Lately the language detection has marked quite a few of my Finnish sentences as English in the first place. Some of those sentences are these:
Tom justiinsa teki niin. http://tatoeba.org/fin/sentences/show/3785040
Nouse ylös ja taistele. http://tatoeba.org/fin/sentences/show/3785043
Tom hikoili. http://tatoeba.org/fin/sentences/show/3785774
No onko edes siistiä? http://tatoeba.org/fin/sentences/show/3786584
Onko Tom vielä hereillä? http://tatoeba.org/fin/sentences/show/3789015
Onko Tom edelleen hereillä? http://tatoeba.org/fin/sentences/show/3789014
Puhuvatko he ranskaa? http://tatoeba.org/fin/sentences/show/3790314
Emme me puhu ranskaa. http://tatoeba.org/eng/sentences/show/3793029
Ole varovainen. Älä heitä pois noita papereita. http://tatoeba.org/eng/sentences/show/3794756
Yes, there are some words that could also be English in those sentences (no, me), but otherwise I really can't understand why these are detected as English. If I remember correctly, the language detector needs to be updated from to time, so that it "learns" better what kind of combination of letters should be detected as which language. Has this update been made recently?
I'm not complaning, because it's really something like 1 out of 100 sentences that are detected wrongly and it's no big deal to correct them manually, but I'm just curious. :)
> Has this update been made recently?
Yes, on the 17th of November, 2014.
I’m not familiar with the language detection tool so I can’t tell you much about its weaknesses.