Wall (7129 threads)
Astúcias
Before asking a question, make sure to read the FAQ.
We aim to maintain a healthy atmosphere for civilized discussions. Please read our rules against bad behavior.
hecko
2 hours ago
frpzzd
23 hours ago
araneo
1 days ago
gillux
1 days ago
araneo
1 days ago
gillux
1 days ago
gillux
1 days ago
PaulP
1 days ago
frpzzd
1 days ago
Waldelfe
1 days ago
The content of this message goes against our rules and was therefore hidden. It is displayed only to admins and to the author of the message.

✹✹ Stats & Graphs ✹✹
Tatoeba Stats, Graphs & Charts have been updated:
https://tatoeba.j-langtools.com/allstats/

In the Interlingua (ia) localization of the web interface, the "show {number} more translations" button always appears as "monstrar 1 altere traduction" no matter how many altere traductiones there actually are to show.
It appears that the same also happens with br, gos, zza, pt-br, tl, kab, and bal locales. But I don't know these languages so I can't tell for sure.
Looking at it I also noticed that the page title on sentence pages renders incorrectly with pt-br, but this is a simple error in the PO file (field names got translated: "{frase} - {idioma} exemplo de frase"); and for the same reason the button to show more translations in the el locale correctly chooses the plural form but doesn't show the {αριθμός} in it.

I have opened an issue on the GitHub page.
https://github.com/Tatoeba/tatoeba2/issues/3087

Oh, ok. Thanks.
The content of this message goes against our rules and was therefore hidden. It is displayed only to admins and to the author of the message.

🍎 Dashboard for Translating English Sentences with Audio
http://a4esl.org/temporary/tato...e/searches.php
Updated
► I added the option for the "minimum" and "maximum" number of words in the search.
► I also added the following "find all eng not yet in _the_specified_langauge_" for the random selection.
◼ Random 3-to-10 word sentences, not yet in _the_specified_langauge_

** November 2023 Updates **
- Tatominer https://tatominer.netlify.app
- Tatolead https://tatolead.netlify.app
- Spread by Tatoebans ✨ https://tatoeba.org/en/sentences_lists/show/170280
- Rated as 'not OK' 🔴 https://tatoeba.org/en/sentences_lists/show/170380
- Rated as 'unsure' 🟠 https://tatoeba.org/en/sentences_lists/show/170383
- Pruned English ✂️ https://tatoeba.org/en/sentences_lists/show/171182
- JMdict - Japanese 🇯🇵 https://tatoeba.org/en/sentences_lists/show/171073
- JMdict - English 🇬🇧 https://tatoeba.org/en/sentences_lists/show/171072
More information about these tools at my profile page: https://tatoeba.org/en/user/profile/lbdx
The content of this message goes against our rules and was therefore hidden. It is displayed only to admins and to the author of the message.

Is there a way to delete a translation? I'm new to this website.

Edit the sentence, using the pencil icon, to the one word "delete" and a corpus maintainer will delete it for you.

Thank you. You're a great moderator.

I'm interested in what happens when sentences merge.
When two sentences merge – one that has audio with another that hasn't – is ownership always retained by the owner of the sentence with audio, regardless of which sentence was created first?
If both sentences have audio, is ownership always retained by the owner of the sentence that was created first?
Does the number of translations a sentence has play any role? Does anything else play a role, such as sentence ratings?
I'm not a programmer. If I haven't worded these questions clearly, please let me know.

Among the French sentences there are some duplicates without audio.
I don't know why.

Das kann ich Dir sagen: es gibt vier verschiedene Leerzeichen, die vor oder nach gewissen anderen Satzzeichen gesetzt werden, und jedes davon hat bei den Autoren Anhänger. 😊

Maybe @Trang can answer this. She may know best. Normally the older sentence is retained, and the newer one is deleted. It is entirely possible that audio is considered above the creation date. I've seen certain comments. What happened in those cases?

The logic for choosing the "main" sentence is specified here: https://github.com/Tatoeba/horu...licate.py#L102
If both sentences have audio, the one with the lower ID (created first) is kept.
If only one sentence has audio, that one is kept.
If neither sentence has audio and both are owned by a user, the one with the lower ID is kept.
If neither sentence has audio and only one is owned by a user, that one is kept.
If neither sentence has audio and they're both orphans, the one with the lower ID is kept.
Nothing else plays a role. (I think.)

Danke, Yorwba! 😊

Thanks, everyone, for your replies.
> If only one sentence has audio, that one is kept.
This is what I thought, though I wasn't sure.
This would seem to reward any users out there who have the ability to upload audio and who are motivated by numbers to 'capture' existing sentences:
add a corrected version with audio of a sentence that's already in the corpus
= inherit any translations linked to those older, 'original' sentences when they're corrected
= accrue contribution points
= gain a sentence at the expense of someone else (for those with a zero-sum mentality)
It encourages acquisitive behaviour rather than giving help to others to improve their sentences and the quality of the corpus.
I've been on the receiving end of this a few times. It's only because I kicked up a fuss for all to see that the user in question 'generously' released the sentences he'd gained. Over the years I've seen it happen to others, I would say, multiple times: a comment is left on an incorrect sentence, and when it's eventually edited, it then merges with a newer sentence which has audio.
I don't know about anyone else, but it doesn't seem fair to me.
Now, obviously, this can happen entirely unintentionally. But I don't believe that it always does.

> If only one sentence has audio, that one is kept.
I can see why the algorithm works this way, from both a conceptual and a technical standpoint. The audio file name is based on the number of the sentence with which it was submitted. If the algorithm were to instead choose the lower number, the audio file would have to be renamed, which is tricky and could cause problems if something happened to the system at that moment.
Without knowing examples of the sentences, it's hard to know how often you see this occurrence or how likely it might be that two versions of these sentences, one correct and one incorrect, are submitted by chance. You're welcome to send me a private message with more information if you want.
Given the number of English sentences marked "@change" at any time, many of which are not particularly suited to recording, it's difficult to imagine how this could be exploited on a scale large enough to significantly affect one's ranking in terms of the number of sentences owned, either on the part of the owner of the original sentence or the owner of the one with audio. That doesn't rule out the possibility that someone would do it, but it does lower the stakes. It goes without saying that if this behavior were deliberate, it would be petty and worthy of reprimand. But I feel I don't currently have enough information to come to that conclusion.

> The audio file name is based on the number of the sentence with which it was submitted. If the algorithm were to instead choose the lower number, the audio file would have to be renamed, which is tricky and could cause problems if something happened to the system at that moment.
Yes I think that's the reason the deduplication algorithm was designed not to remove sentences with audio. But since the introduction of "multiple audio per sentence" feature, audio files are no longer named after the sentence number (they are now named after their own, audio-specific id), so in theory we could change the sentence selection algorithm of the deduplication bot. I personally doubt it is worth the effort though, but pull requests are always welcome.
The content of this message goes against our rules and was therefore hidden. It is displayed only to admins and to the author of the message.