menu
Tatoeba
language
Registriĝi Ensaluti
language Esperanto
menu
Tatoeba

chevron_right Registriĝi

chevron_right Ensaluti

Foliumi

chevron_right Montri hazardan frazon

chevron_right Foliumi laŭ lingvo

chevron_right Foliumi laŭ listo

chevron_right Foliumi laŭ etikedo

chevron_right Foliumi sonregistraĵojn

Komunumo

chevron_right Muro

chevron_right Listo de ĉiuj membroj

chevron_right Lingvoj de la membroj

chevron_right Denaskaj parolantoj

search
clear
swap_horiz
search
lilygilder lilygilder 2009-decembro-23 2009-decembro-23 11:57:15 UTC link Konstanta ligilo

Hi there,

What can I do with repeated sentences? Is there a way to link one entry to the other or maybe even merge them?

Anyways, thank you for this wonderful project.

{{vm.hiddenReplies[80] ? 'expand_more' : 'expand_less'}} kaŝi la respondojn montri la respondojn
TRANG TRANG 2009-decembro-23 2009-decembro-23 12:24:16 UTC link Konstanta ligilo

You don't have to worry about them. We take care of merging them :) We actually already launched a loooong cleaning process a few weeks ago, it removed about 10,000 exact duplicate sentences.
We're going to launch it again sometime, after we've cleaned the sentences from typos or extra spaces where there shouldn't be or things like that.

Anyways, thank you for your contributions. I'm happy to see German getting popular again :D It used to be the 4th language in Tatoeba, until extremely motivated contributors in Chinese and Spanish came along...

{{vm.hiddenReplies[81] ? 'expand_more' : 'expand_less'}} kaŝi la respondojn montri la respondojn
lilygilder lilygilder 2009-decembro-23 2009-decembro-23 12:42:36 UTC link Konstanta ligilo

Does this cleaning programm also remove nearly identical sentences? I found a pair where the only difference is the punctuation mark... I'm glad you don't have to do that manually...

I'd be happy if German took the fourth place again. I'll see what I can do and show some competitive spirit. =) This is a fun way to pass time and help other language learners. :)

{{vm.hiddenReplies[82] ? 'expand_more' : 'expand_less'}} kaŝi la respondojn montri la respondojn
TRANG TRANG 2009-decembro-23 2009-decembro-23 13:08:08 UTC link Konstanta ligilo

No it doesn't remove nearly identical sentences. I've seen sentences which differ only from the punctuation, but... Well this is a bit tricky.

If you take Japanese, there is supposedly no question mark or exclamation mark (although I suppose it's changing). Instead you have particles to express a question or an exclamation.
The fact that you write "I'm cold." or "I'm cold!" can change something in the Japanese sentence (samui desu / samui desu yo).

So to be safe, I wouldn't delete a sentence that has a nearly identical twin, with only a difference of punctuation.