Paret - Tatoeba

Wall (7129 threads)

Astúcias

Before asking a question, make sure to read the FAQ.

We aim to maintain a healthy atmosphere for civilized discussions. Please read our rules against bad behavior.

Darrièrs messatges

subdirectory_arrow_right

hecko

2 hours ago

subdirectory_arrow_right

frpzzd

23 hours ago

subdirectory_arrow_right

araneo

1 days ago

subdirectory_arrow_right

gillux

1 days ago

subdirectory_arrow_right

araneo

1 days ago

subdirectory_arrow_right

gillux

1 days ago

subdirectory_arrow_right

gillux

1 days ago

subdirectory_arrow_right

PaulP

1 days ago

subdirectory_arrow_right

frpzzd

1 days ago

subdirectory_arrow_right

Waldelfe

1 days ago

November 14, 2023 November 14, 2023 at 10:30:03 PM UTC

link

Permalink

warning

The content of this message goes against our rules and was therefore hidden. It is displayed only to admins and to the author of the message.

sharptoothed November 12, 2023 November 12, 2023 at 8:29:08 AM UTC

flag

Report

link

Permalink

✹✹ Stats & Graphs ✹✹

Tatoeba Stats, Graphs & Charts have been updated:
https://tatoeba.j-langtools.com/allstats/

mwgamera November 5, 2023 November 5, 2023 at 9:42:23 PM UTC

flag

Report

link

Permalink

In the Interlingua (ia) localization of the web interface, the "show {number} more translations" button always appears as "monstrar 1 altere traduction" no matter how many altere traductiones there actually are to show.
It appears that the same also happens with br, gos, zza, pt-br, tl, kab, and bal locales. But I don't know these languages so I can't tell for sure.

Looking at it I also noticed that the page title on sentence pages renders incorrectly with pt-br, but this is a simple error in the PO file (field names got translated: "{frase} - {idioma} exemplo de frase"); and for the same reason the button to show more translations in the el locale correctly chooses the plural form but doesn't show the {αριθμός} in it.

hide replies show replies

Cangarejo November 7, 2023 November 7, 2023 at 1:14:43 PM UTC

flag

Report

link

Permalink

I have opened an issue on the GitHub page.

https://github.com/Tatoeba/tatoeba2/issues/3087

hide replies show replies

mwgamera November 7, 2023 November 7, 2023 at 6:12:10 PM UTC

flag

Report

link

Permalink

Oh, ok. Thanks.

November 6, 2023 November 6, 2023 at 5:33:43 AM UTC

link

Permalink

warning

The content of this message goes against our rules and was therefore hidden. It is displayed only to admins and to the author of the message.

CK November 6, 2023 November 6, 2023 at 4:42:29 AM UTC

flag

Report

link

Permalink

🍎 Dashboard for Translating English Sentences with Audio

http://a4esl.org/temporary/tato...e/searches.php

Updated

► I added the option for the "minimum" and "maximum" number of words in the search.

► I also added the following "find all eng not yet in _the_specified_langauge_" for the random selection.

◼ Random 3-to-10 word sentences, not yet in _the_specified_langauge_

lbdx November 4, 2023 November 4, 2023 at 2:28:32 PM UTC

flag

Report

link

Permalink

** November 2023 Updates **

- Tatominer https://tatominer.netlify.app
- Tatolead https://tatolead.netlify.app
- Spread by Tatoebans ✨ https://tatoeba.org/en/sentences_lists/show/170280
- Rated as 'not OK' 🔴 https://tatoeba.org/en/sentences_lists/show/170380
- Rated as 'unsure' 🟠 https://tatoeba.org/en/sentences_lists/show/170383
- Pruned English ✂️ https://tatoeba.org/en/sentences_lists/show/171182
- JMdict - Japanese 🇯🇵 https://tatoeba.org/en/sentences_lists/show/171073
- JMdict - English 🇬🇧 https://tatoeba.org/en/sentences_lists/show/171072

More information about these tools at my profile page: https://tatoeba.org/en/user/profile/lbdx

November 4, 2023 November 4, 2023 at 6:57:25 AM UTC

link

Permalink

warning

The content of this message goes against our rules and was therefore hidden. It is displayed only to admins and to the author of the message.

azcor November 2, 2023, edited November 2, 2023 November 2, 2023 at 2:35:59 AM UTC, edited November 2, 2023 at 3:24:46 AM UTC

flag

Report

link

Permalink

Is there a way to delete a translation? I'm new to this website.

hide replies show replies

CK November 2, 2023, edited November 2, 2023 November 2, 2023 at 2:46:26 AM UTC, edited November 2, 2023 at 2:46:40 AM UTC

flag

Report

link

Permalink

Edit the sentence, using the pencil icon, to the one word "delete" and a corpus maintainer will delete it for you.

hide replies show replies

azcor November 2, 2023 November 2, 2023 at 6:58:33 PM UTC

flag

Report

link

Permalink

Thank you. You're a great moderator.

sundown October 25, 2023, edited October 25, 2023 October 25, 2023 at 6:22:59 AM UTC, edited October 25, 2023 at 6:59:30 AM UTC

flag

Report

link

Permalink

I'm interested in what happens when sentences merge.
When two sentences merge – one that has audio with another that hasn't – is ownership always retained by the owner of the sentence with audio, regardless of which sentence was created first?
If both sentences have audio, is ownership always retained by the owner of the sentence that was created first?
Does the number of translations a sentence has play any role? Does anything else play a role, such as sentence ratings?

I'm not a programmer. If I haven't worded these questions clearly, please let me know.

hide replies show replies

maaster October 25, 2023 October 25, 2023 at 8:01:01 AM UTC

flag

Report

link

Permalink

Among the French sentences there are some duplicates without audio.
I don't know why.

hide replies show replies

Pfirsichbaeumchen October 25, 2023, edited October 25, 2023 October 25, 2023 at 9:29:28 AM UTC, edited October 25, 2023 at 9:32:06 AM UTC

flag

Report

link

Permalink

Das kann ich Dir sagen: es gibt vier verschiedene Leerzeichen, die vor oder nach gewissen anderen Satzzeichen gesetzt werden, und jedes davon hat bei den Autoren Anhänger. 😊

Pfirsichbaeumchen October 25, 2023, edited October 25, 2023 October 25, 2023 at 9:37:21 AM UTC, edited October 25, 2023 at 9:37:36 AM UTC

flag

Report

link

Permalink

Maybe @Trang can answer this. She may know best. Normally the older sentence is retained, and the newer one is deleted. It is entirely possible that audio is considered above the creation date. I've seen certain comments. What happened in those cases?

Yorwba October 25, 2023 October 25, 2023 at 6:31:38 PM UTC

flag

Report

link

Permalink

The logic for choosing the "main" sentence is specified here: https://github.com/Tatoeba/horu...licate.py#L102

If both sentences have audio, the one with the lower ID (created first) is kept.

If only one sentence has audio, that one is kept.

If neither sentence has audio and both are owned by a user, the one with the lower ID is kept.

If neither sentence has audio and only one is owned by a user, that one is kept.

If neither sentence has audio and they're both orphans, the one with the lower ID is kept.

Nothing else plays a role. (I think.)

hide replies show replies

Pfirsichbaeumchen October 25, 2023 October 25, 2023 at 9:01:07 PM UTC

flag

Report

link

Permalink

Danke, Yorwba! 😊

sundown October 26, 2023, edited October 26, 2023 October 26, 2023 at 10:48:36 PM UTC, edited October 26, 2023 at 10:49:56 PM UTC

flag

Report

link

Permalink

Thanks, everyone, for your replies.

> If only one sentence has audio, that one is kept.

This is what I thought, though I wasn't sure.

This would seem to reward any users out there who have the ability to upload audio and who are motivated by numbers to 'capture' existing sentences:

add a corrected version with audio of a sentence that's already in the corpus
= inherit any translations linked to those older, 'original' sentences when they're corrected
= accrue contribution points
= gain a sentence at the expense of someone else (for those with a zero-sum mentality)

It encourages acquisitive behaviour rather than giving help to others to improve their sentences and the quality of the corpus.

I've been on the receiving end of this a few times. It's only because I kicked up a fuss for all to see that the user in question 'generously' released the sentences he'd gained. Over the years I've seen it happen to others, I would say, multiple times: a comment is left on an incorrect sentence, and when it's eventually edited, it then merges with a newer sentence which has audio.

I don't know about anyone else, but it doesn't seem fair to me.

Now, obviously, this can happen entirely unintentionally. But I don't believe that it always does.

hide replies show replies

AlanF_US October 27, 2023 October 27, 2023 at 2:34:03 AM UTC

flag

Report

link

Permalink

> If only one sentence has audio, that one is kept.

I can see why the algorithm works this way, from both a conceptual and a technical standpoint. The audio file name is based on the number of the sentence with which it was submitted. If the algorithm were to instead choose the lower number, the audio file would have to be renamed, which is tricky and could cause problems if something happened to the system at that moment.

Without knowing examples of the sentences, it's hard to know how often you see this occurrence or how likely it might be that two versions of these sentences, one correct and one incorrect, are submitted by chance. You're welcome to send me a private message with more information if you want.

Given the number of English sentences marked "@change" at any time, many of which are not particularly suited to recording, it's difficult to imagine how this could be exploited on a scale large enough to significantly affect one's ranking in terms of the number of sentences owned, either on the part of the owner of the original sentence or the owner of the one with audio. That doesn't rule out the possibility that someone would do it, but it does lower the stakes. It goes without saying that if this behavior were deliberate, it would be petty and worthy of reprimand. But I feel I don't currently have enough information to come to that conclusion.

hide replies show replies

gillux November 1, 2023 November 1, 2023 at 10:21:25 AM UTC

flag

Report

link

Permalink

> The audio file name is based on the number of the sentence with which it was submitted. If the algorithm were to instead choose the lower number, the audio file would have to be renamed, which is tricky and could cause problems if something happened to the system at that moment.

Yes I think that's the reason the deduplication algorithm was designed not to remove sentences with audio. But since the introduction of "multiple audio per sentence" feature, audio files are no longer named after the sentence number (they are now named after their own, audio-specific id), so in theory we could change the sentence selection algorithm of the deduplication bot. I personally doubt it is worth the effort though, but pull requests are always welcome.

October 31, 2023, edited October 31, 2023 October 31, 2023 at 11:11:00 AM UTC, edited October 31, 2023 at 11:12:00 AM UTC

link

Permalink

warning

The content of this message goes against our rules and was therefore hidden. It is displayed only to admins and to the author of the message.

Wall (7129 threads)

Astúcias

hecko

frpzzd

araneo

gillux

araneo

gillux

gillux

PaulP

frpzzd

Waldelfe

Besonh d'ajuda ?

Desvolopaires

A prepaus