Menü
[not needed anymore- removed by CK]
What are the directions to interpret this data?
What's the meaning of the figure "Native Minus Other Non-Native"?
Did you intend it to be All Sentences Minus Other Non-Native? Or, which is the same, Native PLUS Dead & Constructed? I would definitely understood the figure and why it makes sense to sort by it.
[not needed anymore- removed by CK]
Sherlock Holmes and Doctor Watson flying hot air balloon. They get lost, so they have to land. They have no idea where they are. Sherlock Holmes asks passing by guy: "Excuse me sir, but where are we ?" Guy says: "You are in hot air balloon" and he walks away.
After some thinking Sherlock says : "That person was a mathematician". "How did you know that ?" asks Doctor Watson. Sherlock Holmes replies: "Well first he gave a very precise answer, and second - the answer was completely useless"
deniko magyarul:
Sherlock Holmes és doktor Watson hőlégballonnal repülnek.
Eltévednek, így le kell ereszkedniük. Fogalmuk sincs, hogy hol vannak.
Sherlock Holmes megkérdez egy arra haladó férfit:
- Elnézést uram, de hol vagyunk?
A férfi azt mondja: Ti egy hőlégballonon vagytok. - és elmegy.
Kis gondolkodás után Sherlock azt mondja:
- Ez a személy matematikus volt.
- Honnan tudod? - kérdezi doktor Watson.
Sherlock Holmes válaszol:
- Először nagyon pontos választ adott, másodszor a válasz teljesen haszontalan volt.
If one considers contributions in a native language as positive, and contributions in a non-native language as negative, and does not care about contributions in dead or constructed languages, then the the score reflects how good a contributor one is: It increases as one adds native sentences and decreases as one adds non-native sentences, but does not change with contributions to dead or constructed languages.
"than the the score reflects how good a contributor one is"
This is a value judgment, and one that not everyone shares. I personally prefer to just know exactly what is factored into the score and come to my own conclusion.
Someone asked about interpretation of the data. That is the most straightforward one. I do not necessarily agree with it.
[not needed anymore- removed by CK]
> contribute in their strongest language, which helps our
> project more. I think this is the culture that we should
> be hoping to develop here.
I think this is deeply harmful for the project, because for smaller languages, there might be very few speakers who speak the language as their strongest language. It discourages contributions in these languages.
E.g. the might be some speakers who speak Lower Sorbian better than German, but they are from the older generation and very unlikely to contribute to Tatoeba. Speakers who *could* contribute to Tatoeba would almost certainly speak German as their strongest language, so this policy discourages them from contributing in Lower Sorbian and encourages them to contribute in German.
I understand that the 'use your strongest language' policy kinda-works for English, but please consider the broader picture. *Most* languages in the world are smaller than English, and by designing policies around English and a handful of 'bigger' languages, you're harming *most* languages in the world. Please reconsider.
[not needed anymore- removed by CK]
Even so, there is still a large part of choosing the languages that is subjective. If we compare Swedish and Quechua for instance, they have about the same number of native speakers (more or less 10 million people). Swedish is very much alive, and present everywhere online (although there aren't any active contributor currently on tatoeba), whereas Quechua is slowly getting crushed by Spanish, for different reasons, and its native speakers are unlikely to have access to a computer, even less to understand how it works.
I tend to agree with your arguments, especially after some truly terrible sentences were added to French (from a bot, and from a well-known "team"), but I don't think there is a clear line between a positive and a negative contribution.
Thanks for your answer!
> Can anybody send me a list of such languages?
Wikipedia has a List of endangered languages, which could be a useful starting point, but each subpage has a different format and it's hard to parse automatically:
https://en.wikipedia.org/wiki/L...ered_languages :((