menu
Tatoeba
language
S'inscriure Connexion
language Occitan
menu
Tatoeba

chevron_right S'inscriure

chevron_right Connexion

Percórrer

chevron_right Afichar la frasa aleatòria

chevron_right Percórrer per lenga

chevron_right Percórrer per lista

chevron_right Percórrer per etiqueta

chevron_right Percórrer los enregistraments àudio

Community

chevron_right Paret

chevron_right Lista de totes los membres

chevron_right Languages of members

chevron_right Native speakers

search
clear
swap_horiz
search

Wall (7129 threads)

Astúcias

Before asking a question, make sure to read the FAQ.

We aim to maintain a healthy atmosphere for civilized discussions. Please read our rules against bad behavior.

Darrièrs messatges subdirectory_arrow_right

hecko

2 hours ago

subdirectory_arrow_right

frpzzd

23 hours ago

subdirectory_arrow_right

araneo

1 days ago

subdirectory_arrow_right

gillux

1 days ago

subdirectory_arrow_right

araneo

1 days ago

subdirectory_arrow_right

gillux

1 days ago

subdirectory_arrow_right

gillux

1 days ago

subdirectory_arrow_right

PaulP

1 days ago

subdirectory_arrow_right

frpzzd

1 days ago

subdirectory_arrow_right

Waldelfe

1 days ago

November 14, 2023 November 14, 2023 at 10:30:03 PM UTC link Permalink
warning

The content of this message goes against our rules and was therefore hidden. It is displayed only to admins and to the author of the message.

sharptoothed sharptoothed November 12, 2023 November 12, 2023 at 8:29:08 AM UTC flag Report link Permalink

✹✹ Stats & Graphs ✹✹

Tatoeba Stats, Graphs & Charts have been updated:
https://tatoeba.j-langtools.com/allstats/

mwgamera mwgamera November 5, 2023 November 5, 2023 at 9:42:23 PM UTC flag Report link Permalink

In the Interlingua (ia) localization of the web interface, the "show {number} more translations" button always appears as "monstrar 1 altere traduction" no matter how many altere traductiones there actually are to show.
It appears that the same also happens with br, gos, zza, pt-br, tl, kab, and bal locales. But I don't know these languages so I can't tell for sure.

Looking at it I also noticed that the page title on sentence pages renders incorrectly with pt-br, but this is a simple error in the PO file (field names got translated: "{frase} - {idioma} exemplo de frase"); and for the same reason the button to show more translations in the el locale correctly chooses the plural form but doesn't show the {αριθμός} in it.

{{vm.hiddenReplies[40270] ? 'expand_more' : 'expand_less'}} hide replies show replies
Cangarejo Cangarejo November 7, 2023 November 7, 2023 at 1:14:43 PM UTC flag Report link Permalink

I have opened an issue on the GitHub page.

https://github.com/Tatoeba/tatoeba2/issues/3087

{{vm.hiddenReplies[40274] ? 'expand_more' : 'expand_less'}} hide replies show replies
mwgamera mwgamera November 7, 2023 November 7, 2023 at 6:12:10 PM UTC flag Report link Permalink

Oh, ok. Thanks.

November 6, 2023 November 6, 2023 at 5:33:43 AM UTC link Permalink
warning

The content of this message goes against our rules and was therefore hidden. It is displayed only to admins and to the author of the message.

CK CK November 6, 2023 November 6, 2023 at 4:42:29 AM UTC flag Report link Permalink

🍎 Dashboard for Translating English Sentences with Audio

http://a4esl.org/temporary/tato...e/searches.php

Updated

► I added the option for the "minimum" and "maximum" number of words in the search.

► I also added the following "find all eng not yet in _the_specified_langauge_" for the random selection.

◼ Random 3-to-10 word sentences, not yet in _the_specified_langauge_


lbdx lbdx November 4, 2023 November 4, 2023 at 2:28:32 PM UTC flag Report link Permalink

** November 2023 Updates **

- Tatominer https://tatominer.netlify.app
- Tatolead https://tatolead.netlify.app
- Spread by Tatoebans ✨ https://tatoeba.org/en/sentences_lists/show/170280
- Rated as 'not OK' 🔴 https://tatoeba.org/en/sentences_lists/show/170380
- Rated as 'unsure' 🟠 https://tatoeba.org/en/sentences_lists/show/170383
- Pruned English ✂️ https://tatoeba.org/en/sentences_lists/show/171182
- JMdict - Japanese 🇯🇵 https://tatoeba.org/en/sentences_lists/show/171073
- JMdict - English 🇬🇧 https://tatoeba.org/en/sentences_lists/show/171072

More information about these tools at my profile page: https://tatoeba.org/en/user/profile/lbdx

November 4, 2023 November 4, 2023 at 6:57:25 AM UTC link Permalink
warning

The content of this message goes against our rules and was therefore hidden. It is displayed only to admins and to the author of the message.

azcor azcor November 2, 2023, edited November 2, 2023 November 2, 2023 at 2:35:59 AM UTC, edited November 2, 2023 at 3:24:46 AM UTC flag Report link Permalink

Is there a way to delete a translation? I'm new to this website.

{{vm.hiddenReplies[40263] ? 'expand_more' : 'expand_less'}} hide replies show replies
CK CK November 2, 2023, edited November 2, 2023 November 2, 2023 at 2:46:26 AM UTC, edited November 2, 2023 at 2:46:40 AM UTC flag Report link Permalink

Edit the sentence, using the pencil icon, to the one word "delete" and a corpus maintainer will delete it for you.

{{vm.hiddenReplies[40264] ? 'expand_more' : 'expand_less'}} hide replies show replies
azcor azcor November 2, 2023 November 2, 2023 at 6:58:33 PM UTC flag Report link Permalink

Thank you. You're a great moderator.

sundown sundown October 25, 2023, edited October 25, 2023 October 25, 2023 at 6:22:59 AM UTC, edited October 25, 2023 at 6:59:30 AM UTC flag Report link Permalink

I'm interested in what happens when sentences merge.
When two sentences merge – one that has audio with another that hasn't – is ownership always retained by the owner of the sentence with audio, regardless of which sentence was created first?
If both sentences have audio, is ownership always retained by the owner of the sentence that was created first?
Does the number of translations a sentence has play any role? Does anything else play a role, such as sentence ratings?

I'm not a programmer. If I haven't worded these questions clearly, please let me know.

{{vm.hiddenReplies[40236] ? 'expand_more' : 'expand_less'}} hide replies show replies
maaster maaster October 25, 2023 October 25, 2023 at 8:01:01 AM UTC flag Report link Permalink

Among the French sentences there are some duplicates without audio.
I don't know why.

{{vm.hiddenReplies[40238] ? 'expand_more' : 'expand_less'}} hide replies show replies
Pfirsichbaeumchen Pfirsichbaeumchen October 25, 2023, edited October 25, 2023 October 25, 2023 at 9:29:28 AM UTC, edited October 25, 2023 at 9:32:06 AM UTC flag Report link Permalink

Das kann ich Dir sagen: es gibt vier verschiedene Leerzeichen, die vor oder nach gewissen anderen Satzzeichen gesetzt werden, und jedes davon hat bei den Autoren Anhänger. 😊

Pfirsichbaeumchen Pfirsichbaeumchen October 25, 2023, edited October 25, 2023 October 25, 2023 at 9:37:21 AM UTC, edited October 25, 2023 at 9:37:36 AM UTC flag Report link Permalink

Maybe @Trang can answer this. She may know best. Normally the older sentence is retained, and the newer one is deleted. It is entirely possible that audio is considered above the creation date. I've seen certain comments. What happened in those cases?

Yorwba Yorwba October 25, 2023 October 25, 2023 at 6:31:38 PM UTC flag Report link Permalink

The logic for choosing the "main" sentence is specified here: https://github.com/Tatoeba/horu...licate.py#L102

If both sentences have audio, the one with the lower ID (created first) is kept.

If only one sentence has audio, that one is kept.

If neither sentence has audio and both are owned by a user, the one with the lower ID is kept.

If neither sentence has audio and only one is owned by a user, that one is kept.

If neither sentence has audio and they're both orphans, the one with the lower ID is kept.

Nothing else plays a role. (I think.)

{{vm.hiddenReplies[40242] ? 'expand_more' : 'expand_less'}} hide replies show replies
Pfirsichbaeumchen Pfirsichbaeumchen October 25, 2023 October 25, 2023 at 9:01:07 PM UTC flag Report link Permalink

Danke, Yorwba! 😊

sundown sundown October 26, 2023, edited October 26, 2023 October 26, 2023 at 10:48:36 PM UTC, edited October 26, 2023 at 10:49:56 PM UTC flag Report link Permalink

Thanks, everyone, for your replies.

> If only one sentence has audio, that one is kept.

This is what I thought, though I wasn't sure.

This would seem to reward any users out there who have the ability to upload audio and who are motivated by numbers to 'capture' existing sentences:

add a corrected version with audio of a sentence that's already in the corpus
= inherit any translations linked to those older, 'original' sentences when they're corrected
= accrue contribution points
= gain a sentence at the expense of someone else (for those with a zero-sum mentality)

It encourages acquisitive behaviour rather than giving help to others to improve their sentences and the quality of the corpus.

I've been on the receiving end of this a few times. It's only because I kicked up a fuss for all to see that the user in question 'generously' released the sentences he'd gained. Over the years I've seen it happen to others, I would say, multiple times: a comment is left on an incorrect sentence, and when it's eventually edited, it then merges with a newer sentence which has audio.

I don't know about anyone else, but it doesn't seem fair to me.

Now, obviously, this can happen entirely unintentionally. But I don't believe that it always does.

{{vm.hiddenReplies[40246] ? 'expand_more' : 'expand_less'}} hide replies show replies
AlanF_US AlanF_US October 27, 2023 October 27, 2023 at 2:34:03 AM UTC flag Report link Permalink

> If only one sentence has audio, that one is kept.

I can see why the algorithm works this way, from both a conceptual and a technical standpoint. The audio file name is based on the number of the sentence with which it was submitted. If the algorithm were to instead choose the lower number, the audio file would have to be renamed, which is tricky and could cause problems if something happened to the system at that moment.

Without knowing examples of the sentences, it's hard to know how often you see this occurrence or how likely it might be that two versions of these sentences, one correct and one incorrect, are submitted by chance. You're welcome to send me a private message with more information if you want.

Given the number of English sentences marked "@change" at any time, many of which are not particularly suited to recording, it's difficult to imagine how this could be exploited on a scale large enough to significantly affect one's ranking in terms of the number of sentences owned, either on the part of the owner of the original sentence or the owner of the one with audio. That doesn't rule out the possibility that someone would do it, but it does lower the stakes. It goes without saying that if this behavior were deliberate, it would be petty and worthy of reprimand. But I feel I don't currently have enough information to come to that conclusion.

{{vm.hiddenReplies[40247] ? 'expand_more' : 'expand_less'}} hide replies show replies
gillux gillux November 1, 2023 November 1, 2023 at 10:21:25 AM UTC flag Report link Permalink

> The audio file name is based on the number of the sentence with which it was submitted. If the algorithm were to instead choose the lower number, the audio file would have to be renamed, which is tricky and could cause problems if something happened to the system at that moment.

Yes I think that's the reason the deduplication algorithm was designed not to remove sentences with audio. But since the introduction of "multiple audio per sentence" feature, audio files are no longer named after the sentence number (they are now named after their own, audio-specific id), so in theory we could change the sentence selection algorithm of the deduplication bot. I personally doubt it is worth the effort though, but pull requests are always welcome.

October 31, 2023, edited October 31, 2023 October 31, 2023 at 11:11:00 AM UTC, edited October 31, 2023 at 11:12:00 AM UTC link Permalink
warning

The content of this message goes against our rules and was therefore hidden. It is displayed only to admins and to the author of the message.