menu
Tatoeba
language
Register Log in
language English
menu
Tatoeba

chevron_right Register

chevron_right Log in

Browse

chevron_right Show random sentence

chevron_right Browse by language

chevron_right Browse by list

chevron_right Browse by tag

chevron_right Browse audio

Community

chevron_right Wall

chevron_right List of all members

chevron_right Languages of members

chevron_right Native speakers

search
clear
swap_horiz
search
CK CK September 16, 2014, edited October 30, 2019 September 16, 2014 at 10:48:31 AM UTC, edited October 30, 2019 at 7:28:17 AM UTC link Permalink

[not needed anymore- removed by CK]

{{vm.hiddenReplies[20423] ? 'expand_more' : 'expand_less'}} hide replies show replies
Pfirsichbaeumchen Pfirsichbaeumchen September 17, 2014 September 17, 2014 at 7:27:01 AM UTC link Permalink

Has any progress been made on the development of a new duplicate merging script?

{{vm.hiddenReplies[20433] ? 'expand_more' : 'expand_less'}} hide replies show replies
AlanF_US AlanF_US September 17, 2014 September 17, 2014 at 4:08:51 PM UTC link Permalink

I just sent saeb a note to ask.

sacredceltic sacredceltic September 22, 2014 September 22, 2014 at 12:44:28 AM UTC link Permalink

The former merging script worked wonders. Why on Earth was it ever changed ?!

{{vm.hiddenReplies[20453] ? 'expand_more' : 'expand_less'}} hide replies show replies
CK CK September 22, 2014, edited October 30, 2019 September 22, 2014 at 4:20:01 AM UTC, edited October 30, 2019 at 7:28:01 AM UTC link Permalink

[not needed anymore- removed by CK]

{{vm.hiddenReplies[20455] ? 'expand_more' : 'expand_less'}} hide replies show replies
sacredceltic sacredceltic September 22, 2014 September 22, 2014 at 10:18:05 AM UTC link Permalink

Data is lost when Corpus maintainers manually merge sentences while forgetting to merge comments, tags and lists attachments.

{{vm.hiddenReplies[20456] ? 'expand_more' : 'expand_less'}} hide replies show replies
sacredceltic sacredceltic September 24, 2014 September 24, 2014 at 10:32:51 AM UTC link Permalink

and update histories, without which comments don't make sense...

{{vm.hiddenReplies[20473] ? 'expand_more' : 'expand_less'}} hide replies show replies
corvard corvard September 24, 2014 September 24, 2014 at 1:17:32 PM UTC link Permalink

update history must be editable at least

{{vm.hiddenReplies[20475] ? 'expand_more' : 'expand_less'}} hide replies show replies
sacredceltic sacredceltic September 24, 2014 September 24, 2014 at 1:41:05 PM UTC link Permalink

what do you mean ? History shouldn't be changed...

tommy_san tommy_san September 27, 2014 September 27, 2014 at 5:54:02 PM UTC link Permalink

This is true indeed. We should check if someone has not put the sentence into a list if we manually delete a duplicate.

Guybrush88 Guybrush88 September 29, 2014, edited September 29, 2014 September 29, 2014 at 3:27:39 PM UTC, edited September 29, 2014 at 3:36:42 PM UTC link Permalink

Also, I think that merging with a script would be preferrable to make sure that any effort made by any user to add a sentence, even if it's a duplicate, will be credited in the sentences logs. if two different users add the same sentence, why should we credit only one user for having added it and completely ignore the other one?

{{vm.hiddenReplies[20528] ? 'expand_more' : 'expand_less'}} hide replies show replies
wallebot wallebot September 30, 2014 September 30, 2014 at 11:13:25 PM UTC link Permalink

Tal vez se podrian añadir el autor y los comentarios al final de los comentarios de la frase no borrada.

Aunque que criterio se usa para decidir cual de las dos frases se borra y cual se mantiene?

{{vm.hiddenReplies[20552] ? 'expand_more' : 'expand_less'}} hide replies show replies
AlanF_US AlanF_US October 1, 2014, edited October 1, 2014 October 1, 2014 at 11:21:02 PM UTC, edited October 1, 2014 at 11:29:26 PM UTC link Permalink

wallebot wanted to know what criteria would be used to decide which sentences are deleted and which are kept. A number of developers had a brainstorming session where we collected requirements for the new script. You can see it on the wiki: http://en.wiki.tatoeba.org/arti...icate-merging# The discussion was written for developers rather than users, but you still may find it useful. Throughout the discussion, you can see some mention of the criteria. For instance, the new script should save a sentence with audio preferentially over a sentence without. The old script did not do this, which meant that audio was lost when the script was run.

I asked saeb when he expected to have the script ready. He told me on Saturday, when we were chatting on IRC, that he thought he could submit it that day, along with some tests. However, apparently it didn't happen. I'll ask him again.

{{vm.hiddenReplies[20559] ? 'expand_more' : 'expand_less'}} hide replies show replies
wallebot wallebot October 2, 2014 October 2, 2014 at 12:46:45 PM UTC link Permalink

Muchas gracias.
Una frase puede tener mas de un audio?

Seria util que las frases pudiesen tener mas de un audio, para oir diferentes acentos por ejemplo.

{{vm.hiddenReplies[20562] ? 'expand_more' : 'expand_less'}} hide replies show replies
al_ex_an_der al_ex_an_der October 2, 2014, edited October 2, 2014 October 2, 2014 at 2:25:52 PM UTC, edited October 2, 2014 at 2:41:23 PM UTC link Permalink

ENGLISH
Well, it would be certainly advantageous to hear the examples pronounced both by a male and a female voice. Also the pronunciation in different regions would be interesting. But I think this is not the most urgent matter, as at the time being an overwhelmimg majority of the sentences don't have audio at all.

ESPERANTO
Nu, estus certe avantaĝe aŭdi la ekzemplojn prononcitajn kaj de vira kaj de virina voĉo. Ankaŭ la prononco en diversaj regionoj estus interesa. Sed mi kredas, ke tio ne estas la plej urĝa afero, ĉar nuntempe la grandega plimulto de la frazoj tute ne havas sonon.

DEUTSCH
Nun, es wäre sicherlich von Vorteil, könnte man die Beispiele sowohl von einer männlichen als auch von einer weiblichen Stimme gesprochen hören. Auch die Aussprache in verschiedenen Regionen wäre interessant. Aber ich denke, das ist nicht die dringendste Angelegenheit, da zur Zeit die übergroße Mehrheit der Sätze gänzlich ohne Vertonung ist.

{{vm.hiddenReplies[20563] ? 'expand_more' : 'expand_less'}} hide replies show replies
wallebot wallebot October 4, 2014 October 4, 2014 at 11:55:28 PM UTC link Permalink

Pienso que no es prioritario cambiar el sofware para que acepte varios audios de frases. Pero pienso que deberia haber un plan B, si hay dos audio para una sola frase.

Pienso que incluir un comentario con un enlace a los otros audios seria ya suficiente, Si en el Futuro se ve necesario añadir varios audios, se pueden recuperar de los comentarios.

wallebot wallebot October 5, 2014 October 5, 2014 at 1:16:09 PM UTC link Permalink

In chinese writed, a sentence can be speeaked in more than one languaje, by example mandarin and cantonesse.

{{vm.hiddenReplies[20588] ? 'expand_more' : 'expand_less'}} hide replies show replies
Objectivesea Objectivesea October 5, 2014 October 5, 2014 at 2:15:00 PM UTC link Permalink

Even if they are pronounced radically different to each other, my understanding is that the various Chinese dialects use essentially the same form of written communication. There is sometimes a distinction between traditional hanzi and the simplified ones now used in modern China, but as I understand it, the two styles of hanzi are not specifically associated with individual dialects like Mandarin, Cantonese, Xiang, Min, Wu, etc.

CK CK September 21, 2014, edited October 30, 2019 September 21, 2014 at 4:57:24 PM UTC, edited October 30, 2019 at 7:28:09 AM UTC link Permalink

[not needed anymore- removed by CK]

{{vm.hiddenReplies[20451] ? 'expand_more' : 'expand_less'}} hide replies show replies
sacredceltic sacredceltic September 23, 2014, edited September 23, 2014 September 23, 2014 at 7:12:12 PM UTC, edited September 23, 2014 at 7:12:49 PM UTC link Permalink

Duplicates can be introduced in different ways :

1) because the same sentence is being simultaneously translated in the same target language, by different people who are unaware of it.
This is particularly a side-effect of showing last sentences on the front page => every newcomer jumps on what is being shown and translates it...
Maybe we should not show latest contributions on the front page but on a different, dedicated page...

But this might also happen because several people translate simultaneously from the same list, and I can't see how to avoid this

2) because different sentences, in the same or different languages, end up being translated into the same target sentence.
That is unavoidable, especially since the two source sentences that are being translated are not directly linked.

When I translate "I show her" and then "I show him" into French, I have to create a duplicate, because these 2 sentences are initially unlinked, but they have the same translation in French : « Je lui montre ». I can't see how to avoid this.
All the procedures I have seen so far, based on javascript gimmicks are awkward (you need to copy sentences numbers...) and inconvenient and they don't work on all devices.
I want to be able to translate what I want from any device, at any time.

One way I could see would be to be able to link my latest translations to a sentence from a list. But that would work only if the 2 sentences are close to each other in the list I'm translating...

VilhelmsPort VilhelmsPort September 30, 2014 September 30, 2014 at 12:54:21 AM UTC link Permalink

Does Tatoeba tell if a sentence has already been added? If not, it would be a good idea to develop a system to do so. :)

{{vm.hiddenReplies[20533] ? 'expand_more' : 'expand_less'}} hide replies show replies
wallebot wallebot September 30, 2014 September 30, 2014 at 11:16:15 PM UTC link Permalink

The search don't found recently added sentences. Maybe works for older sentences, but I think is a heavy work for server.