menu
Tatoeba
language
Registrera Logga in
language Svenska
menu
Tatoeba

chevron_right Registrera

chevron_right Logga in

Bläddra

chevron_right Visa framslumpad mening

chevron_right Bläddra efter språk

chevron_right Bläddra efter lista

chevron_right Bläddra efter tagg

chevron_right Bläddra bland ljudinspelningar

Community

chevron_right Vägg

chevron_right Medlemslista

chevron_right Medlemmarnas språk

chevron_right Modersmålstalare

search
clear
swap_horiz
search
doemaar14 doemaar14 1 maj 2023 1 maj 2023 08:28:35 UTC flag Report link Permalänk

Could someone tell @amastan to stop adding so many sentences about Algeria?
14,031 occurrences in English-language sentences, compared to:
'France', 1,001 occurrences,
'India', 381
'China' , 1,296 ,
'America', 1,098 ,
'United States', 1,112
'Japan', 1,734
'Germany', 871

All these countries have bigger populations, and are, dare I say, significantly more recognizable than Algeria.

(While ''French'' does have 13k+ occurrences, almost all of them are only about the language (as opposed to the culture/country), which is spoken by about 100 million people world-wide. ''Spanish'', with 486 million speakers yields only 870 results,)

{{vm.hiddenReplies[39804] ? 'expand_more' : 'expand_less'}} dölj svar visa svar
Nuel Nuel 1 maj 2023 1 maj 2023 08:34:59 UTC flag Report link Permalänk

He also seems obsessed with transgender people.

{{vm.hiddenReplies[39805] ? 'expand_more' : 'expand_less'}} dölj svar visa svar
Teashrock Teashrock 1 maj 2023, redigerad 1 maj 2023 1 maj 2023 12:36:28 UTC, redigerad 1 maj 2023 12:36:59 UTC flag Report link Permalänk

Does being obsessed with something go against Tatoeba's rules?

Teashrock Teashrock 1 maj 2023 1 maj 2023 12:26:22 UTC flag Report link Permalänk

Excuse me, I was just passing by, but does writing sentences about countries other than “recognized” really go against the rules of this website?

{{vm.hiddenReplies[39806] ? 'expand_more' : 'expand_less'}} dölj svar visa svar
doemaar14 doemaar14 1 maj 2023 1 maj 2023 13:57:33 UTC flag Report link Permalänk

As @Nuel pointed out, Tatoeba's own guideline: ''Avoid using the same words, names, topics or patterns over and over again.''
And it's definitely not coincidence: almost all of these 14,000 sentences are from one single user.
Mere happenstance could never result in that many sentences about Algeria. This project should be representative of the entire world, or at the very least of places most people will be familiar with or actually live in (India, USA, China).
Translating on here gets boring/annoying pretty fast when every other sentence is about Algeria, not to mention the fact you have to filter out sentences containing ''Algeria'' each time you download sentences. We could start spamming different countries as a knee-jerk reaction, but that'd be just as bad.

User55521 User55521 1 maj 2023 1 maj 2023 18:43:35 UTC flag Report link Permalänk

How Algeria is worse than Tom and Mary?

{{vm.hiddenReplies[39810] ? 'expand_more' : 'expand_less'}} dölj svar visa svar
maaster maaster 20 maj 2023 20 maj 2023 04:26:53 UTC flag Report link Permalänk

+1

Yorwba Yorwba 1 maj 2023 1 maj 2023 19:02:11 UTC flag Report link Permalänk

Everyone here has the same means of contacting Amastan, which is to send a private message. So you can in fact tell him to stop adding so many sentences about Algeria, as can anyone else. Doing so directly in private is also more likely to reach the intended recipient than posting on the wall. (I'll send him a link to this wall post so he has the opportunity to respond.)

But if you're actually hoping for an admin to order Amastan to stop, that's unlikely to happen, since the last time a similar issue came up the verdict was that

> As a general rule, no action will be taken against a contributor based on the sole fact that they are creating new sentences with a name that has been overused.

https://blog.tatoeba.org/2019/0...h-tom-and.html

If you want to avoid English sentences which overuse certain words, you might want to restrict your searches to lbdx's "Pruned English Corpus" list, i.e. https://tatoeba.org/en/sentence...ny&sort=random

It's also worth pondering what Amastan is supposed to do instead. After all, telling someone to stop doing something they think is the right thing to do is unlikely to be effective, but convincing them that there's something else they could do that would be even better might work.

As far as I know, Amastan is Algerian, so it is indeed not mere happenstance that he added so many sentences about Algeria, but also not terribly surprising. However, it does seem like many of these sentences could be equally said about pretty much any country, which might be why you consider them boring.

But surely Amastan doesn't want us to think Algeria is boring! So maybe there is room for mutually beneficial cooperation here. Do you think some of the sentences are less boring than others? For example, I think that sentences that are less generic and more specifically about Algeria, like #7811221 "Algeria and Morocco are the only North African nations that recognize Berber as an official language." are a bit more interesting. What do you think?

Maybe Amastan would also be up for creating more sentences that are about Algeria without explicitly stating as much. (In keeping with the old adage that writers should "show, not tell.")

maaster maaster 9 maj 2023, redigerad 9 maj 2023 9 maj 2023 04:32:02 UTC, redigerad 9 maj 2023 04:41:44 UTC flag Report link Permalänk

If everyone stop adding almost useless and empty short sentences with the same words over and over again, perhaps, Amastan will also stop acting so.
I think that may be his reaction against Tom- and/or Australia-sentences.

(I still think these sentences are the first reason that the most contributors finish using Tatoeba after translating three (or much more) sentences.)

(It may be just a double standard against Amastan. Am I wrong?)

{{vm.hiddenReplies[39817] ? 'expand_more' : 'expand_less'}} dölj svar visa svar
lbdx lbdx 9 maj 2023 9 maj 2023 06:30:44 UTC flag Report link Permalänk

> It may be just a double standard against Amastan. Am I wrong?

You're right, Amastan isn't the first to stuff his sentences with pervasive words. But he's by far the one who's been producing the most of them lately. Last month, he added 16,000 original English sentences. The other two main English contributors added only 1,000. The figures are available at https://colab.research.google.c...=5&uniqifier=1

Unfortunately, the people who seem to be in charge of the project refuse to see this as a problem. How about a monthly cap on the number of original sentences in one language? 3,000 seems like a reasonable number to me...

As this issue comes up regularly on the wall, I would like the Tatoeba community to comment on this measure. If you think such a cap is appropriate, please post a plus sign in the comments of this post. If you want to show your disapproval, please post a minus.

{{vm.hiddenReplies[39818] ? 'expand_more' : 'expand_less'}} dölj svar visa svar
lbdx lbdx 9 maj 2023 9 maj 2023 06:31:14 UTC flag Report link Permalänk

+

sundown sundown 9 maj 2023 9 maj 2023 07:09:57 UTC flag Report link Permalänk

+

ddnktr ddnktr 9 maj 2023 9 maj 2023 20:11:32 UTC flag Report link Permalänk

+

User55521 User55521 10 maj 2023 10 maj 2023 06:02:12 UTC flag Report link Permalänk

-

User55521 User55521 10 maj 2023 10 maj 2023 06:11:39 UTC flag Report link Permalänk

When the corpus is domated by Western names Tom and Mary, everyone is OK with that. But when a non-Western place name Algeria gets used, people want some caps.

This is cultural colonialism, plain and simple.

{{vm.hiddenReplies[39825] ? 'expand_more' : 'expand_less'}} dölj svar visa svar
sundown sundown 10 maj 2023, redigerad 10 maj 2023 10 maj 2023 06:33:15 UTC, redigerad 10 maj 2023 07:35:42 UTC flag Report link Permalänk

You're wrong. "Everyone" is not OK with Tom and Mary ad infinitum.

I still remember when a certain member here was uploading tens of thousands of sentences in one go once a month (using scripts), which is partly why we're lumbered now with Tom and Mary. Above all, it's that sort of behaviour, now being exhibited by another user, that I'd like to see some "caps" on. I wish it had been done years ago.

{{vm.hiddenReplies[39826] ? 'expand_more' : 'expand_less'}} dölj svar visa svar
User55521 User55521 10 maj 2023 10 maj 2023 06:36:52 UTC flag Report link Permalänk

Well, if that gets instituted after Algeria and not after Tom and Mary, that still shows the bias of the community.

{{vm.hiddenReplies[39827] ? 'expand_more' : 'expand_less'}} dölj svar visa svar
sundown sundown 10 maj 2023 10 maj 2023 06:39:10 UTC flag Report link Permalänk

Maybe. But you're still mischaracterising people here. We're not all the same.

{{vm.hiddenReplies[39828] ? 'expand_more' : 'expand_less'}} dölj svar visa svar
User55521 User55521 10 maj 2023 10 maj 2023 06:43:14 UTC flag Report link Permalänk

You're right, I guess.

{{vm.hiddenReplies[39829] ? 'expand_more' : 'expand_less'}} dölj svar visa svar
sundown sundown 10 maj 2023, redigerad 10 maj 2023 10 maj 2023 06:53:55 UTC, redigerad 10 maj 2023 07:13:25 UTC flag Report link Permalänk

That said, though, I personally don't like some users' use of this site to, as they see it, push their agenda.

However, my point in supporting a cap is about *volume*. As I said, I wish it had been done years before this latest user started the deluge. Tatoeba seems to be at the mercy of anyone motivated enough to dominate its contents.

{{vm.hiddenReplies[39830] ? 'expand_more' : 'expand_less'}} dölj svar visa svar
User55521 User55521 10 maj 2023 10 maj 2023 06:59:33 UTC flag Report link Permalänk

> some users' use of this site to, as they see it, push their agenda

I don't believe in "not having an agenda". We all have our views, and the sentences we add necessarily reflect our views. Everyone has an agenda.

{{vm.hiddenReplies[39831] ? 'expand_more' : 'expand_less'}} dölj svar visa svar
sundown sundown 10 maj 2023, redigerad 10 maj 2023 10 maj 2023 07:02:21 UTC, redigerad 10 maj 2023 07:03:19 UTC flag Report link Permalänk

We all have views. That's stating the obvious.

Polgar1 Polgar1 17 maj 2023 17 maj 2023 12:26:11 UTC flag Report link Permalänk

There is a difference between "having views" and "having an agenda". Anyway, if you will: my "agenda" is that the only accepted agenda on Tatoeba should be the passion for creating (collecting, in some cases) high-quality linguistic content across the planet.

shekitten shekitten 18 maj 2023, redigerad 18 maj 2023 18 maj 2023 04:12:21 UTC, redigerad 18 maj 2023 04:31:28 UTC flag Report link Permalänk

What it looks like to me is that there are two separate issues, and that they've been conflated. I see these two issues as:

1. A single person is adding far more sentences to the English corpus than the next two people combined.

2. People don't like that a lot of this person's sentences are about Algeria. That's too bad. More people should add sentences about their countries and cultures. He's doing nothing wrong by writing about his own, even if a lot of it amounts to political propaganda.

To address 1, I'll echo those who have suggested caps. I think the caps suggested are more than reasonable and would not prevent most users from adding the number of sentences that they are already adding.

{{vm.hiddenReplies[39857] ? 'expand_more' : 'expand_less'}} dölj svar visa svar
sundown sundown 21 maj 2023 21 maj 2023 09:44:00 UTC flag Report link Permalänk

I don't object to sentences about any country, just as I don't object to sentences about any city. As far I'm concerned, some of the best English contributions here are written by non-native speakers. They put to shame my own attempts to write in other languages. I take my hat off to those users. I'm not one of those who discourage non-native speakers from adding English sentences.

What I do object to is sentences being uploaded to the site on an industrial scale. The main priority seems to be to pump out as many sentences as possible – to what purpose, we can only guess – and let the rest of us find and correct the mistakes (a service we provide voluntarily). We all make mistakes, but in this case it's the sheer volume. Whoever you are here – native speaker or non-native speaker, admin, corpus maintainer or whatever – taking this approach is not community-minded.

{{vm.hiddenReplies[39869] ? 'expand_more' : 'expand_less'}} dölj svar visa svar
shekitten shekitten 21 maj 2023, redigerad 21 maj 2023 21 maj 2023 13:16:47 UTC, redigerad 21 maj 2023 13:52:37 UTC flag Report link Permalänk

Did you still have a tab open from three days ago? I got rid of the native part of that post within an hour of making it (and my original comment said non-native contributions were fine, but that the volume of non-native contributions was the problem)

At any rate, the answer seems to be a cap on daily contributions.

shekitten shekitten 26 maj 2023, redigerad 26 maj 2023 26 maj 2023 15:05:42 UTC, redigerad 26 maj 2023 15:06:59 UTC flag Report link Permalänk

There are currently 17,183 occurrences of the "wildcard" country "Australia" in the corpus, and 14,651 occurrences of "Algeria." Australia is the best thing to compare Algeria to, rather than Tom and Mary, which are names (and Amastan uses many names in his sentences).

If the cap is instituted once the number of occurrences of Algeria comes to roughly equal that of Australia, will that allay your concerns? It will mean both Algeria and Australia have equally benefited from the pre-cap situation. No one could say that Algeria has been disadvantaged; in fact, one could say that instituting this cap at any point (even now) makes it harder for any country to play catch-up to Algeria.

Disclaimer: I am not an administrator and do not have the power to make offers.

Cabo Cabo 26 maj 2023, redigerad 26 maj 2023 26 maj 2023 17:56:29 UTC, redigerad 26 maj 2023 18:31:59 UTC flag Report link Permalänk

It's not the first talk about wildcard words.

{{vm.hiddenReplies[39877] ? 'expand_more' : 'expand_less'}} dölj svar visa svar
shekitten shekitten 26 maj 2023, redigerad 26 maj 2023 26 maj 2023 19:47:22 UTC, redigerad 26 maj 2023 19:47:48 UTC flag Report link Permalänk

The point is Algeria is already almost as well represented in the corpus as Australia - better represented even, proportionally to population. And that it will soon catch up to Australia in raw numbers.

Before Amastan was adding 16,000 new sentences in a month, someone else apparently used to add a similarly high amount, but their contributions were never capped. If we did the cap after Algeria caught up to Australia, you could say that both Algeria and Australia had benefitted equally from the situation before the cap was instituted. I don't know if this is less prejudicial or not; it's an idea.

{{vm.hiddenReplies[39879] ? 'expand_more' : 'expand_less'}} dölj svar visa svar
Cabo Cabo 26 maj 2023 26 maj 2023 20:08:16 UTC flag Report link Permalänk

We were talking about a cap for years now. When finally we will have a cap, that long list will contain 160 thousand sentences, not just 16.

{{vm.hiddenReplies[39880] ? 'expand_more' : 'expand_less'}} dölj svar visa svar
shekitten shekitten 26 maj 2023, redigerad 26 maj 2023 26 maj 2023 20:20:48 UTC, redigerad 26 maj 2023 21:28:06 UTC flag Report link Permalänk

A cap is a limit. I'm talking about a limit on the number of daily contributions.

16,000 sentences by one user in a day is too many. It's not natural. I'm disabled and I don't contribute anywhere near that much, nor has anyone contributed anywhere near that much before without using scripts.

{{vm.hiddenReplies[39882] ? 'expand_more' : 'expand_less'}} dölj svar visa svar
Thanuir Thanuir 27 maj 2023 27 maj 2023 09:32:24 UTC flag Report link Permalänk

Assuming 16 hour workday, that is 1000 sentences per hour, or 17 per minute, or one per 4 seconds. Just for reference. Or one per 2 seconds if only working 8 hours on Tatoeba that day.

{{vm.hiddenReplies[39883] ? 'expand_more' : 'expand_less'}} dölj svar visa svar
shekitten shekitten 28 maj 2023 28 maj 2023 01:37:31 UTC flag Report link Permalänk

My apologies. It's 16,000 a month, not 16,000 a day. I misremembered what I read.

Either way, that is 16 times as much as the people who add the second-most and third-most sentences. That's something we have good reason to want to prevent, for the sake of the future of this site.

But having this discussion in the midst of a discussion asking to limit sentences about Algeria specifically taints this.

{{vm.hiddenReplies[39887] ? 'expand_more' : 'expand_less'}} dölj svar visa svar
Thanuir Thanuir 28 maj 2023 28 maj 2023 05:53:36 UTC flag Report link Permalänk

About 500 a day is much more manageable, even by hand. Thank you for correcting.

Cabo Cabo 27 maj 2023, redigerad 27 maj 2023 27 maj 2023 11:45:51 UTC, redigerad 27 maj 2023 13:00:55 UTC flag Report link Permalänk

What 16 thousand sentences a day? Where? I don't see such day.

{{vm.hiddenReplies[39884] ? 'expand_more' : 'expand_less'}} dölj svar visa svar
shekitten shekitten 28 maj 2023 28 maj 2023 01:39:29 UTC flag Report link Permalänk

I misremembered - it was 16,000 a month.

deniko deniko 10 maj 2023 10 maj 2023 09:59:47 UTC flag Report link Permalänk

+

Wezel Wezel 18 maj 2023 18 maj 2023 15:36:04 UTC flag Report link Permalänk

+

maaster maaster 9 maj 2023 9 maj 2023 09:23:52 UTC flag Report link Permalänk

However, I don't like that great amount of sentences with the word "Algeria" or with any other country name, either.

Polgar1 Polgar1 17 maj 2023 17 maj 2023 12:49:57 UTC flag Report link Permalänk

> I still think these sentences are the first reason that the most contributors finish using Tatoeba after translating three (or much more) sentences.

What is the presumption and what is your statement? Do we have the means to talk about worrying tendencies within contributions to Tatoeba? If so, do you have any evidence supporting this idea that trivial sentences, even in excessive amounts, somehow pose a threat to Tatoeba?

My personal impression is that it is *easy* to find useful sentences and a whole lot of mental cycles are wasted, with little constructivity, over something that isn't even consensual within the community. I can easily agree with a sort of "rate limit" on contributions as a broader measure against bruteforce takeover.

Last but not least... I have no intention to rank the reasons why somebody stops using Tatoeba but you yourself are a dubious example in my opinion, and you should at least think about that before coming up with theories. You seem to fancy far-fetched or downright undecipherable translations and you are very vocal not only against near-literal translations but upon your personal ad-hoc prescriptivism as well.
Since I was rather involved in Tatoeba via Clozemaster, I definitely see a bigger problem in coming across "maasterisms" or your annoying insistence to change translations that I consider plausible and practical because they are apparently not idiomatic enough for you.

{{vm.hiddenReplies[39856] ? 'expand_more' : 'expand_less'}} dölj svar visa svar
maaster maaster 19 maj 2023, redigerad 19 maj 2023 19 maj 2023 06:19:39 UTC, redigerad 19 maj 2023 06:54:32 UTC flag Report link Permalänk

That isn't just a presumption.

Clozemaster is not my business.
It's a voluntary job. I don't do it for other ones' pleasure.
Add much more translations on Tatoeba and read them on Clozemaster.

Perhaps, there're not enough cynical sentences on Tatoeba. You can add some.

{{vm.hiddenReplies[39859] ? 'expand_more' : 'expand_less'}} dölj svar visa svar
Polgar1 Polgar1 19 maj 2023 19 maj 2023 08:13:38 UTC flag Report link Permalänk

If it's not a presumption, then maybe argue for it, or show evidence.

Also, it's kind of a strawman to talk about someone's pleasure when I said you are actively causing DISpleasure, both to users with your trademark undecipherable sentences, and to contributors with the trademark prescriptivist gatekeeping over absolutely non-representative personal fixations. This is something you ought to think about, not to deflect.

Cabo Cabo 19 maj 2023, redigerad 19 maj 2023 19 maj 2023 17:35:45 UTC, redigerad 19 maj 2023 17:41:34 UTC flag Report link Permalänk

Using clozemaster after the change (now you only can do 30 sentences a day per language (as a free user)), my biggest concern no "maasterism", no "amastanism", but the mood what the sentences create, there are soooo many bad view about the world, sooo many pessimistic sentences, sooo many about death and dying.
I know, I contributed to those. But in the big picture, there are too much of them. It makes me sad. It's a sad collection about sad sentences.
Our creation is what sadeness, anger was inside us.

People probably stop using the site because they are not full of hatred or sadeness.

{{vm.hiddenReplies[39863] ? 'expand_more' : 'expand_less'}} dölj svar visa svar
Polgar1 Polgar1 20 maj 2023 20 maj 2023 08:10:51 UTC flag Report link Permalänk

I also stopped using Clozemaster after that ridiculous restriction but honestly, I can't recall the sentences to be particularly bitter or depressive.

Anyway, it doesn't literally have to be Clozemaster. If you want to learn about a language by taking sentences and their translations, it's an unwanted challenge that somebody makes up odd artsy translations on a regular basis, while also telling others off for translating too literally, or using certain colloquialisms that go against his peculiar view of language protectionism. I have more confidence in my language use than to fall victim for the latter attempts but I think even the attempt is completely off for a project like Tatoeba. And the translations that barely resemble the original sentence have dubious value, the least to say.

{{vm.hiddenReplies[39866] ? 'expand_more' : 'expand_less'}} dölj svar visa svar
Cabo Cabo 20 maj 2023, redigerad 20 maj 2023 20 maj 2023 08:21:24 UTC, redigerad 20 maj 2023 08:23:05 UTC flag Report link Permalänk

It became small pocets of sadeness. When you on a daily basis read something about death, it seems depressive. (not only death, but suicide, suicidal thoughts, marital misbehavement... etc. (when I did 400 sentences a day I didn't recognize such thing, but this way I picked up on it)

maaster maaster 28 maj 2023, redigerad 2 juni 2023 28 maj 2023 18:42:37 UTC, redigerad 2 juni 2023 16:32:44 UTC flag Report link Permalänk

I wrote "I think".
Nevertheless, if you had been here in 2016, you could have read supposedly the last comment and opinion of freddy1 about "empty" sentences–before leaving the project.

Ja, meg a kiseva33 vagy a jegaevi is írta erről időközben megváltozott véleményét és abbahagyta az egészet.

nipbud nipbud 19 maj 2023 19 maj 2023 12:06:42 UTC flag Report link Permalänk

> Users who participated in the last 200 contributions.
>Amastan: 169

85% of the last 200, he's adding a sentence every 6 seconds, all of them are "Algeria" or "Antonio"…

shekitten shekitten 19 maj 2023 19 maj 2023 13:21:52 UTC flag Report link Permalänk

To me the answer to "disproportionate number of sentences about Algeria" is to add more sentences about France, India, China, the U.S., Japan, Germany, Kenya, etc..

Granted, this is hard when someone adds such a volume of sentences that it dwarfs all others, so caps are a good idea

Thanuir Thanuir 20 maj 2023 20 maj 2023 09:07:52 UTC flag Report link Permalänk

Aluksi Tatoebassa oli paljon Japania koskevia lauseita.
Jossain vaiheessa alkoi Tom- ja Mary-aalto, samoin kuin ranskan kieli.
Samoin Ziri, Mennad, Sami ja mitä näitä nyt on.
Nyt sitten Algeria.

Toisaalta: tämäkin menee ohi ja uusia aaltoja tulee.

Toisaalta: ilmeisesti tämä aktiivisesti ärsyttää ihmisiä.