clear
{{language.name}} No language found.
swap_horiz
{{language.name}} No language found.
search
sharptoothed
26 days ago
** Stats & Graphs **

Tatoeba Stats, Graphs & Charts have been updated:
https://tatoeba.j-langtools.com/allstats/
hide replies
soliloquist
26 days ago
Thanks for the data. Have you ever considered including numbers of native speakers for each language in the stats? Watching their development might be interesting. The file "user_languages.tar.bz2" on the downloads page has the necessary information for that, I believe.
hide replies
sharptoothed
25 days ago
Technically it's possible and it would be relatively easy to add numbers of native speakers to the stats. The only note I'd like to make is one about the fact that data contained in the "user_languages.tar.bz2" file doesn't reflect the real situation since many members have never specified their native languages. There are also members who specified native languages incorrectly. All in all, I like the idea and will implement it eventually. Thanks for your advice. :-)
hide replies
soliloquist
25 days ago - 25 days ago
Thank you. I'm aware that native speaker counts might not be so reliable, but they would at least give a rough picture.

The last stats show that there's a sudden boost in Turkmen this month. Nearly 50 self-declared native Turkmen accounts have been created recently.

https://tatoeba.org/eng/users/for_language/tuk

Unfortunately, I suspect that most, if not all of those accounts might belong to the same person, possibly a non-native speaker. I explained my suspicion here: https://tatoeba.org/eng/sentenc...omment-1098662

The real problem here isn't using multiple accounts or contributing in non-native languages, but falsely stating oneself as a native speaker. Turkmen sentences here shouldn't be trusted as a reliable source until this issue is cleared up.

Having native speaker counts in the stats would help us noticing such unusual changes, too.
hide replies
deniko
25 days ago - 25 days ago
> Unfortunately, I suspect that most, if not all of those accounts might belong to the same person

If your suspicions are based only on the fact that those accounts were created at about the same time, this might be a wrong assumption.

Tatoeba might have been advertised in some Turkmen community, thus a boost in user registrations. This happens from time to time to many language, and is especially noticeable when this language hadn't had a lot of contributors before that.

The use of the letter ı, since it doesn't exist in Turkmen, is suspicious though, indeed.

Having read this:

https://en.wikipedia.org/wiki/Turkmen_alphabet

and this:

https://en.wikipedia.org/wiki/C...urkic_Alphabet

I was left under impression that Common Turkic Alphabet can be used in some cases to write in Turkmen, and it does have the letter ı, but, well, I'm not a specialist.
hide replies
soliloquist
25 days ago
I sent you a PM.
hide replies
deniko
21 days ago
Ok, I've reviewed all the evidence and it does seem very probable that all those accounts belong to the same person.

However, before anyone does anything about that we still need a trusted native speaker to verify how good those translations are, I'm afraid it's virtually impossible to do anything about that without such verification.
Guybrush88
25 days ago
thanks
Colbo
25 days ago
What the heck are these data?