Menu
Hi there,
What can I do with repeated sentences? Is there a way to link one entry to the other or maybe even merge them?
Anyways, thank you for this wonderful project.
You don't have to worry about them. We take care of merging them :) We actually already launched a loooong cleaning process a few weeks ago, it removed about 10,000 exact duplicate sentences.
We're going to launch it again sometime, after we've cleaned the sentences from typos or extra spaces where there shouldn't be or things like that.
Anyways, thank you for your contributions. I'm happy to see German getting popular again :D It used to be the 4th language in Tatoeba, until extremely motivated contributors in Chinese and Spanish came along...
Does this cleaning programm also remove nearly identical sentences? I found a pair where the only difference is the punctuation mark... I'm glad you don't have to do that manually...
I'd be happy if German took the fourth place again. I'll see what I can do and show some competitive spirit. =) This is a fun way to pass time and help other language learners. :)
No it doesn't remove nearly identical sentences. I've seen sentences which differ only from the punctuation, but... Well this is a bit tricky.
If you take Japanese, there is supposedly no question mark or exclamation mark (although I suppose it's changing). Instead you have particles to express a question or an exclamation.
The fact that you write "I'm cold." or "I'm cold!" can change something in the Japanese sentence (samui desu / samui desu yo).
So to be safe, I wouldn't delete a sentence that has a nearly identical twin, with only a difference of punctuation.