menu
Tatoeba
language
Рэгістрацыя Уваход
language Беларуская
menu
Tatoeba

chevron_right Рэгістрацыя

chevron_right Уваход

Прагляд

chevron_right Show random sentence

chevron_right Прагляд па мовах

chevron_right Прагляд спісаў

chevron_right Прагляд па цэтліках

chevron_right Прагляд аўдыёзапісаў

Community

chevron_right Сцяна

chevron_right List of all members

chevron_right Languages of members

chevron_right Native speakers

search
clear
swap_horiz
search

Wall (7 146 threads)

Парады

Before asking a question, make sure to read the FAQ.

We aim to maintain a healthy atmosphere for civilized discussions. Please read our rules against bad behavior.

Апошнія паведамленні subdirectory_arrow_right

frpzzd

7 days ago

subdirectory_arrow_right

EugeneGS

7 days ago

subdirectory_arrow_right

frpzzd

7 days ago

subdirectory_arrow_right

EugeneGS

8 days ago

subdirectory_arrow_right

frpzzd

8 days ago

subdirectory_arrow_right

gillux

8 days ago

feedback

frpzzd

10 days ago

feedback

sharptoothed

11 days ago

subdirectory_arrow_right

marafon

12 days ago

subdirectory_arrow_right

Pfirsichbaeumchen

12 days ago

6 лістапада 2023 г. 6 лістапада 2023 г. у 05:33:43 UTC link Permalink
warning

Гэта паведамлення парушае нашы правілы, і таму яно прыхаванае. Яго могуць убачыць толькі адміністратары і аўтар(ка) паведамлення.

CK CK 6 лістапада 2023 г. 6 лістапада 2023 г. у 04:42:29 UTC flag Report link Permalink

🍎 Dashboard for Translating English Sentences with Audio

http://a4esl.org/temporary/tato...e/searches.php

Updated

► I added the option for the "minimum" and "maximum" number of words in the search.

► I also added the following "find all eng not yet in _the_specified_langauge_" for the random selection.

◼ Random 3-to-10 word sentences, not yet in _the_specified_langauge_


lbdx lbdx 4 лістапада 2023 г. 4 лістапада 2023 г. у 14:28:32 UTC flag Report link Permalink

** November 2023 Updates **

- Tatominer https://tatominer.netlify.app
- Tatolead https://tatolead.netlify.app
- Spread by Tatoebans ✨ https://tatoeba.org/en/sentences_lists/show/170280
- Rated as 'not OK' 🔴 https://tatoeba.org/en/sentences_lists/show/170380
- Rated as 'unsure' 🟠 https://tatoeba.org/en/sentences_lists/show/170383
- Pruned English ✂️ https://tatoeba.org/en/sentences_lists/show/171182
- JMdict - Japanese 🇯🇵 https://tatoeba.org/en/sentences_lists/show/171073
- JMdict - English 🇬🇧 https://tatoeba.org/en/sentences_lists/show/171072

More information about these tools at my profile page: https://tatoeba.org/en/user/profile/lbdx

4 лістапада 2023 г. 4 лістапада 2023 г. у 06:57:25 UTC link Permalink
warning

Гэта паведамлення парушае нашы правілы, і таму яно прыхаванае. Яго могуць убачыць толькі адміністратары і аўтар(ка) паведамлення.

azcor azcor 2 лістапада 2023 г., edited 2 лістапада 2023 г. 2 лістапада 2023 г. у 02:35:59 UTC, edited 2 лістапада 2023 г. у 03:24:46 UTC flag Report link Permalink

Is there a way to delete a translation? I'm new to this website.

{{vm.hiddenReplies[40263] ? 'expand_more' : 'expand_less'}} hide replies show replies
CK CK 2 лістапада 2023 г., edited 2 лістапада 2023 г. 2 лістапада 2023 г. у 02:46:26 UTC, edited 2 лістапада 2023 г. у 02:46:40 UTC flag Report link Permalink

Edit the sentence, using the pencil icon, to the one word "delete" and a corpus maintainer will delete it for you.

{{vm.hiddenReplies[40264] ? 'expand_more' : 'expand_less'}} hide replies show replies
azcor azcor 2 лістапада 2023 г. 2 лістапада 2023 г. у 18:58:33 UTC flag Report link Permalink

Thank you. You're a great moderator.

sundown sundown 25 кастрычніка 2023 г., edited 25 кастрычніка 2023 г. 25 кастрычніка 2023 г. у 06:22:59 UTC, edited 25 кастрычніка 2023 г. у 06:59:30 UTC flag Report link Permalink

I'm interested in what happens when sentences merge.
When two sentences merge – one that has audio with another that hasn't – is ownership always retained by the owner of the sentence with audio, regardless of which sentence was created first?
If both sentences have audio, is ownership always retained by the owner of the sentence that was created first?
Does the number of translations a sentence has play any role? Does anything else play a role, such as sentence ratings?

I'm not a programmer. If I haven't worded these questions clearly, please let me know.

{{vm.hiddenReplies[40236] ? 'expand_more' : 'expand_less'}} hide replies show replies
maaster maaster 25 кастрычніка 2023 г. 25 кастрычніка 2023 г. у 08:01:01 UTC flag Report link Permalink

Among the French sentences there are some duplicates without audio.
I don't know why.

{{vm.hiddenReplies[40238] ? 'expand_more' : 'expand_less'}} hide replies show replies
Pfirsichbaeumchen Pfirsichbaeumchen 25 кастрычніка 2023 г., edited 25 кастрычніка 2023 г. 25 кастрычніка 2023 г. у 09:29:28 UTC, edited 25 кастрычніка 2023 г. у 09:32:06 UTC flag Report link Permalink

Das kann ich Dir sagen: es gibt vier verschiedene Leerzeichen, die vor oder nach gewissen anderen Satzzeichen gesetzt werden, und jedes davon hat bei den Autoren Anhänger. 😊

Pfirsichbaeumchen Pfirsichbaeumchen 25 кастрычніка 2023 г., edited 25 кастрычніка 2023 г. 25 кастрычніка 2023 г. у 09:37:21 UTC, edited 25 кастрычніка 2023 г. у 09:37:36 UTC flag Report link Permalink

Maybe @Trang can answer this. She may know best. Normally the older sentence is retained, and the newer one is deleted. It is entirely possible that audio is considered above the creation date. I've seen certain comments. What happened in those cases?

Yorwba Yorwba 25 кастрычніка 2023 г. 25 кастрычніка 2023 г. у 18:31:38 UTC flag Report link Permalink

The logic for choosing the "main" sentence is specified here: https://github.com/Tatoeba/horu...licate.py#L102

If both sentences have audio, the one with the lower ID (created first) is kept.

If only one sentence has audio, that one is kept.

If neither sentence has audio and both are owned by a user, the one with the lower ID is kept.

If neither sentence has audio and only one is owned by a user, that one is kept.

If neither sentence has audio and they're both orphans, the one with the lower ID is kept.

Nothing else plays a role. (I think.)

{{vm.hiddenReplies[40242] ? 'expand_more' : 'expand_less'}} hide replies show replies
Pfirsichbaeumchen Pfirsichbaeumchen 25 кастрычніка 2023 г. 25 кастрычніка 2023 г. у 21:01:07 UTC flag Report link Permalink

Danke, Yorwba! 😊

sundown sundown 26 кастрычніка 2023 г., edited 26 кастрычніка 2023 г. 26 кастрычніка 2023 г. у 22:48:36 UTC, edited 26 кастрычніка 2023 г. у 22:49:56 UTC flag Report link Permalink

Thanks, everyone, for your replies.

> If only one sentence has audio, that one is kept.

This is what I thought, though I wasn't sure.

This would seem to reward any users out there who have the ability to upload audio and who are motivated by numbers to 'capture' existing sentences:

add a corrected version with audio of a sentence that's already in the corpus
= inherit any translations linked to those older, 'original' sentences when they're corrected
= accrue contribution points
= gain a sentence at the expense of someone else (for those with a zero-sum mentality)

It encourages acquisitive behaviour rather than giving help to others to improve their sentences and the quality of the corpus.

I've been on the receiving end of this a few times. It's only because I kicked up a fuss for all to see that the user in question 'generously' released the sentences he'd gained. Over the years I've seen it happen to others, I would say, multiple times: a comment is left on an incorrect sentence, and when it's eventually edited, it then merges with a newer sentence which has audio.

I don't know about anyone else, but it doesn't seem fair to me.

Now, obviously, this can happen entirely unintentionally. But I don't believe that it always does.

{{vm.hiddenReplies[40246] ? 'expand_more' : 'expand_less'}} hide replies show replies
AlanF_US AlanF_US 27 кастрычніка 2023 г. 27 кастрычніка 2023 г. у 02:34:03 UTC flag Report link Permalink

> If only one sentence has audio, that one is kept.

I can see why the algorithm works this way, from both a conceptual and a technical standpoint. The audio file name is based on the number of the sentence with which it was submitted. If the algorithm were to instead choose the lower number, the audio file would have to be renamed, which is tricky and could cause problems if something happened to the system at that moment.

Without knowing examples of the sentences, it's hard to know how often you see this occurrence or how likely it might be that two versions of these sentences, one correct and one incorrect, are submitted by chance. You're welcome to send me a private message with more information if you want.

Given the number of English sentences marked "@change" at any time, many of which are not particularly suited to recording, it's difficult to imagine how this could be exploited on a scale large enough to significantly affect one's ranking in terms of the number of sentences owned, either on the part of the owner of the original sentence or the owner of the one with audio. That doesn't rule out the possibility that someone would do it, but it does lower the stakes. It goes without saying that if this behavior were deliberate, it would be petty and worthy of reprimand. But I feel I don't currently have enough information to come to that conclusion.

{{vm.hiddenReplies[40247] ? 'expand_more' : 'expand_less'}} hide replies show replies
gillux gillux 1 лістапада 2023 г. 1 лістапада 2023 г. у 10:21:25 UTC flag Report link Permalink

> The audio file name is based on the number of the sentence with which it was submitted. If the algorithm were to instead choose the lower number, the audio file would have to be renamed, which is tricky and could cause problems if something happened to the system at that moment.

Yes I think that's the reason the deduplication algorithm was designed not to remove sentences with audio. But since the introduction of "multiple audio per sentence" feature, audio files are no longer named after the sentence number (they are now named after their own, audio-specific id), so in theory we could change the sentence selection algorithm of the deduplication bot. I personally doubt it is worth the effort though, but pull requests are always welcome.

31 кастрычніка 2023 г., edited 31 кастрычніка 2023 г. 31 кастрычніка 2023 г. у 11:11:00 UTC, edited 31 кастрычніка 2023 г. у 11:12:00 UTC link Permalink
warning

Гэта паведамлення парушае нашы правілы, і таму яно прыхаванае. Яго могуць убачыць толькі адміністратары і аўтар(ка) паведамлення.

30 кастрычніка 2023 г. 30 кастрычніка 2023 г. у 10:38:06 UTC link Permalink
warning

Гэта паведамлення парушае нашы правілы, і таму яно прыхаванае. Яго могуць убачыць толькі адміністратары і аўтар(ка) паведамлення.

sharptoothed sharptoothed 29 кастрычніка 2023 г. 29 кастрычніка 2023 г. у 07:05:20 UTC flag Report link Permalink

✹✹ Stats & Graphs ✹✹

Tatoeba Stats, Graphs & Charts have been updated:
https://tatoeba.j-langtools.com/allstats/

27 кастрычніка 2023 г. 27 кастрычніка 2023 г. у 04:11:54 UTC link Permalink
warning

Гэта паведамлення парушае нашы правілы, і таму яно прыхаванае. Яго могуць убачыць толькі адміністратары і аўтар(ка) паведамлення.