papabear's messages on the Wall (88 total)

papabear, June 10, 2011, 14:10:21 UTC

We have over 5000 members now!
¡Ahora tenemos más de 5000 miembros!
今は、メンバーが5000人以上いますよ!

papabear, June 3, 2011, 12:27:25 UTC

And, for the love of Russell's teapot, can the sentence-adding code reject anything without a final punctuation mark? It'd save us so much trouble.
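
To make the request concrete, here is a minimal sketch of such a check. It is not Tatoeba's actual sentence-adding code; the function name `ends_with_punctuation` is hypothetical, and it assumes that "final punctuation mark" can be approximated as "any Unicode punctuation character", which also lets closing quotes and brackets through.

```python
import unicodedata


def ends_with_punctuation(sentence: str) -> bool:
    """Return True if the last non-whitespace character is punctuation.

    Hypothetical helper: it accepts any character whose Unicode general
    category starts with "P", so it covers marks such as ".", "!", "?",
    and the CJK full stop, but also closing quotes and brackets.
    """
    text = sentence.rstrip()
    if not text:
        # Empty or whitespace-only input has no final mark at all.
        return False
    return unicodedata.category(text[-1]).startswith("P")
```

A submission form could call this before saving and ask the contributor to confirm or fix the sentence when it returns `False`.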

papabear, June 3, 2011, 12:19:58 UTC

Might that make it more difficult to add sentences, parse corpus sentences by computer, and have audio sentences match their text?

I do agree with your basic premise, however, and I have a couple more ideas whose pros and cons we could explore:

* A moratorium on adding sentences of a certain length in well-established languages. Ex: No more sentences of fewer than three words in English.
* A heuristic anti-duplication algorithm that forbids not just whole duplicate sentences, but near-duplicates.
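
Both ideas can be sketched in a few lines. This is only an illustration under stated assumptions, not an existing Tatoeba feature: the helper names, the three-word minimum, and the 0.9 similarity threshold are all hypothetical, the word count splits on whitespace (so it would not work for languages written without spaces), and the similarity measure is simply the one provided by Python's standard `difflib` module.

```python
import difflib
import string


def normalize(sentence: str) -> str:
    """Lowercase, drop ASCII punctuation, and collapse whitespace."""
    table = str.maketrans("", "", string.punctuation)
    return " ".join(sentence.lower().translate(table).split())


def too_short(sentence: str, min_words: int = 3) -> bool:
    """Return True if the sentence has fewer than min_words words.

    Words are counted by splitting on whitespace (an assumption).
    """
    return len(sentence.split()) < min_words


def is_near_duplicate(candidate: str, existing, threshold: float = 0.9) -> bool:
    """Return True if candidate closely matches any existing sentence.

    Similarity is difflib's ratio on the normalized strings; the
    threshold is an arbitrary value chosen for illustration.
    """
    cand = normalize(candidate)
    return any(
        difflib.SequenceMatcher(None, cand, normalize(other)).ratio() >= threshold
        for other in existing
    )
```

With these definitions, "I like tea!" would count as a near-duplicate of "I like tea." because the two differ only in punctuation. Comparing against every existing sentence is slow on a large corpus, so a real implementation would need some kind of index.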

papabear, June 3, 2011, 12:00:45 UTC

+1

I've run into this problem before. My solution, a cumbersome one, has been to comment on every single sentence linked to the one I changed.

papabear, June 3, 2011, 11:58:04 UTC

It's not something I can take to its logical extreme, but I try to mix up the ethnicities of names I use. I wonder if lists of baby names around the world might help.

papabear, June 2, 2011, 10:02:18 UTC

I've disowned all my Spanish sentences so that native speakers can adopt them.
He desconocido todas mis frases españoles para que hablantes nativos puedan adoptarlas.

papabear, May 26, 2011, 10:50:58 UTC

To clarify, an example search for what I want might be:

All sentences of inmachan in Spanish NOT translated into English

You could even combine it with sacredceltic's suggestion:

All sentences NOT of papabear OR sysko NOT translated into French OR German
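
The two example searches could be modeled roughly as follows. This is a sketch over an in-memory list, not the real search engine: the `Sentence` class, the `search` function, and its parameter names are hypothetical, and "NOT translated into French OR German" is read here as "translated into neither French nor German".

```python
from dataclasses import dataclass, field


@dataclass
class Sentence:
    """Hypothetical record: text, owner, language, translation languages."""
    text: str
    owner: str
    lang: str
    translated_into: set = field(default_factory=set)


def search(sentences, owners=None, exclude_owners=(), lang=None,
           not_translated_into=()):
    """Return the sentences that pass every filter that was supplied.

    owners: keep only sentences owned by one of these users.
    exclude_owners: drop sentences owned by any of these users.
    lang: keep only sentences written in this language.
    not_translated_into: drop sentences that already have a translation
    in any of these languages.
    """
    excluded_langs = set(not_translated_into)
    results = []
    for s in sentences:
        if owners is not None and s.owner not in owners:
            continue
        if s.owner in exclude_owners:
            continue
        if lang is not None and s.lang != lang:
            continue
        if s.translated_into & excluded_langs:
            continue
        results.append(s)
    return results
```

The first search would then be `search(corpus, owners={"inmachan"}, lang="spa", not_translated_into={"eng"})`, and the second `search(corpus, exclude_owners={"papabear", "sysko"}, not_translated_into={"fra", "deu"})`, where `corpus` is any list of `Sentence` objects and the three-letter language codes are used for illustration.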

papabear, May 26, 2011, 9:26:56 UTC

It might help free up the view of browsing untranslated sentences by source language. When I do that, I often find huge, 10-page blocks of sentences from one person.

papabear, May 26, 2011, 9:06:36 UTC

When translating a specific person's sentences, I'd like to be able to filter them by language and whether they are already translated, as we can already do when we browse sentences by language.

papabear, May 24, 2011, 22:21:55 UTC

If I recall correctly, the search engine database updates once a week. This will not be an issue in the upcoming C++ database.

papabear, May 23, 2011, 21:07:32 UTC

There's already a ticket for that.

papabear, May 23, 2011, 11:46:55 UTC

I'm reconfirming that the duplicate detector seems to work, at least in English. Also, Sysko, are you going to blow off this version of the database when you migrate TatoebaCPP to this domain?

papabear, May 12, 2011, 8:21:26 UTC

I, for one, try to switch things up, but I don't have a precise algorithm for it.

papabear, May 12, 2011, 4:09:39 UTC

But what if you haven't already registered? Let's keep in mind that we aim to serve non-registered visitors, and that the (limited) interfaces should be as usable and understandable as possible. Wikipedia already has separate interfaces for different languages, probably for similar reasons.

papabear, May 10, 2011, 6:19:02 UTC

Would that better denote the hard B sound? To me, you see, that looks like Tatoempa.

papabear, May 9, 2011, 23:03:56 UTC

True.

papabear, May 9, 2011, 22:43:31 UTC

We should translate the word "project" in each image as well.

papabear, May 9, 2011, 22:41:32 UTC

Matching the interface was what I meant.

papabear, May 9, 2011, 12:33:48 UTC

Perhaps we should have different logos for each alphabet: one for Tatoeba, and others for Τατοεβα, Татоэба, タトエバ....

papabear, May 8, 2011, 9:01:24 UTC

There is currently no way to search for sentences with a certain word count. However, CK painstakingly tagged a bunch of English sentences with fewer than 8 words and 8 to 10 words.

http://tatoeba.org/eng/tags/sho...s_than_8_words
http://tatoeba.org/eng/tags/sho.../8_to_10_words

When Sysko finishes his new version of the Tatoeba database, he may begin implementing features like word count.
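
For what it's worth, a word-count feature along the lines of CK's tags could start from something like the sketch below. The function names and bucket labels are hypothetical (they are not the actual tag names), and counting words by splitting on whitespace is an assumption that only suits languages such as English.

```python
def word_count(sentence: str) -> int:
    """Count whitespace-separated words (an assumption suited to English)."""
    return len(sentence.split())


def length_bucket(sentence: str) -> str:
    """Assign a sentence to a length bucket with a hypothetical label."""
    n = word_count(sentence)
    if n < 8:
        return "fewer than 8 words"
    if n <= 10:
        return "8 to 10 words"
    return "more than 10 words"
```

A search by word count could then filter on `word_count` directly instead of relying on hand-applied tags.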