Menu
** Stats & Graphs **
Tatoeba Stats, Graphs & Charts have been updated:
https://tatoeba.j-langtools.com/allstats/
Thanks for the data. Have you ever considered including numbers of native speakers for each language in the stats? Watching their development might be interesting. The file "user_languages.tar.bz2" on the downloads page has the necessary information for that, I believe.
Technically it's possible and it would be relatively easy to add numbers of native speakers to the stats. The only note I'd like to make is one about the fact that data contained in the "user_languages.tar.bz2" file doesn't reflect the real situation since many members have never specified their native languages. There are also members who specified native languages incorrectly. All in all, I like the idea and will implement it eventually. Thanks for your advice. :-)
Thank you. I'm aware that native speaker counts might not be so reliable, but they would at least give a rough picture.
The last stats show that there's a sudden boost in Turkmen this month. Nearly 50 self-declared native Turkmen accounts have been created recently.
https://tatoeba.org/eng/users/for_language/tuk
Unfortunately, I suspect that most, if not all of those accounts might belong to the same person, possibly a non-native speaker. I explained my suspicion here: https://tatoeba.org/eng/sentenc...omment-1098662
The real problem here isn't using multiple accounts or contributing in non-native languages, but falsely stating oneself as a native speaker. Turkmen sentences here shouldn't be trusted as a reliable source until this issue is cleared up.
Having native speaker counts in the stats would help us noticing such unusual changes, too.
> Unfortunately, I suspect that most, if not all of those accounts might belong to the same person
If your suspicions are based only on the fact that those accounts were created at about the same time, this might be a wrong assumption.
Tatoeba might have been advertised in some Turkmen community, thus a boost in user registrations. This happens from time to time to many language, and is especially noticeable when this language hadn't had a lot of contributors before that.
The use of the letter ı, since it doesn't exist in Turkmen, is suspicious though, indeed.
Having read this:
https://en.wikipedia.org/wiki/Turkmen_alphabet
and this:
https://en.wikipedia.org/wiki/C...urkic_Alphabet
I was left under impression that Common Turkic Alphabet can be used in some cases to write in Turkmen, and it does have the letter ı, but, well, I'm not a specialist.
I sent you a PM.
Ok, I've reviewed all the evidence and it does seem very probable that all those accounts belong to the same person.
However, before anyone does anything about that we still need a trusted native speaker to verify how good those translations are, I'm afraid it's virtually impossible to do anything about that without such verification.
thanks
What the heck are these data?