iznan n papabear ɣef uɣrab-is

papabear {{ icon }}

keyboard_arrow_right

Amaɣnu

keyboard_arrow_right

Tifyar

keyboard_arrow_right

Amawal

keyboard_arrow_right

Iceggiren

keyboard_arrow_right

Tibdarin

keyboard_arrow_right

Inurifen

keyboard_arrow_right

Iwenniten

keyboard_arrow_right

Iwenniten ɣef tifyar n papabear

keyboard_arrow_right

Iznan n uɣrab

keyboard_arrow_right

Aɣmis

keyboard_arrow_right

Imesli

keyboard_arrow_right

Iḍrisen

translate

Suqqel tifyar n papabear

papabear 10 Yunyu 2011 10 Yunyu 2011 2:10:21 n tmeddit UTC

link

Aseɣwen yezgan

We have over 5000 members now!
¡Ahora tenemos más de 5000 miembros!
今は、メンバーが５０００人以上いますよ！

papabear 3 Yunyu 2011 3 Yunyu 2011 12:27:25 n tmeddit UTC

link

Aseɣwen yezgan

And, for the love of Russell's teapot, can the sentence-adding code reject anything without a final punctuation mark? It'd save us so much trouble.

papabear 3 Yunyu 2011 3 Yunyu 2011 12:19:58 n tmeddit UTC

link

Aseɣwen yezgan

Might that make it more difficult to add sentences, parse corpus sentences by computer, and have audio sentences match their text?

I do agree with your basic premise, however, and I have a couple more ideas we could explore the pros and cons of:

* A moratorium on adding sentences of a certain length in well-established languages. Ex: No more sentences of less than three words in English.
* A heuristic anti-duplication algorithm that forbids not just whole duplicate sentences, but near-duplicates.

papabear 3 Yunyu 2011 3 Yunyu 2011 12:00:45 n tmeddit UTC

link

Aseɣwen yezgan

+1

I've run into this problem before. My solution, a cumbersome one, has been to comment on every single sentence linked to the one I changed.

papabear 3 Yunyu 2011 3 Yunyu 2011 11:58:04 n tufat UTC

link

Aseɣwen yezgan

It's not something I can take to its logical extreme, but I try to mix up the ethnicities of names I use. I wonder if lists of baby names around the world might help.

papabear 2 Yunyu 2011 2 Yunyu 2011 10:02:18 n tufat UTC

link

Aseɣwen yezgan

I've disowned all my Spanish sentences so that native speakers can adopt them.
He desconocido todas mis frases españoles para que hablantes nativos puedan adoptarlas.

papabear 26 Mayyu 2011 26 Mayyu 2011 10:50:58 n tufat UTC

link

Aseɣwen yezgan

To clarify, an example search for what I want might be:

All sentences of inmachan in Spanish NOT translated into English

You could even combine it with sacredceltic's suggestion:

All sentences NOT of papabear OR sysko NOT translated into French OR German

papabear 26 Mayyu 2011 26 Mayyu 2011 9:26:56 n tufat UTC

link

Aseɣwen yezgan

It might help free up the view of browsing untranslated sentences by source language. When I do that, I often find huge, 10-page blocks of sentences from one person.

papabear 26 Mayyu 2011 26 Mayyu 2011 9:06:36 n tufat UTC

link

Aseɣwen yezgan

When translating a specific person's sentences, I'd like to be able to filter them by language and whether they are already translated, as we can already do when we browse sentences by language.

papabear 24 Mayyu 2011 24 Mayyu 2011 10:21:55 n tmeddit UTC

link

Aseɣwen yezgan

If I recall correctly, the search engine database updates once a week. This will not be an issue in the upcoming C++ database.

papabear 23 Mayyu 2011 23 Mayyu 2011 9:07:32 n tmeddit UTC

link

Aseɣwen yezgan

There's already a ticket for that.

papabear 23 Mayyu 2011 23 Mayyu 2011 11:46:55 n tufat UTC

link

Aseɣwen yezgan

I'm reconfirming that the duplicate detector seems to work, at least in English. Also, Sysko, are you going to blow off this version of the database when you migrate TatoebaCPP to this domain?

papabear 12 Mayyu 2011 12 Mayyu 2011 8:21:26 n tufat UTC

link

Aseɣwen yezgan

I, for one, try to switch things up, but I don't have a precise algorithm for it.

papabear 12 Mayyu 2011 12 Mayyu 2011 4:09:39 n tufat UTC

link

Aseɣwen yezgan

But what if you haven't already registered? Let's keep in mind that we aim to serve non-registered visitors, and that the (limited) interfaces should be as usable and understandable as possible. Wikipedia already has separate interfaces for different languages, probably for similar reasons.

papabear 10 Mayyu 2011 10 Mayyu 2011 6:19:02 n tufat UTC

link

Aseɣwen yezgan

Would that better denote the hard B sound? To me, you see, that looks like Tatoempa.

papabear 9 Mayyu 2011 9 Mayyu 2011 11:03:56 n tmeddit UTC

link

Aseɣwen yezgan

True.

papabear 9 Mayyu 2011 9 Mayyu 2011 10:43:31 n tmeddit UTC

link

Aseɣwen yezgan

We should translate the word "project" in each image as well.

papabear 9 Mayyu 2011 9 Mayyu 2011 10:41:32 n tmeddit UTC

link

Aseɣwen yezgan

Matching the interface was what I meant.

papabear 9 Mayyu 2011 9 Mayyu 2011 12:33:48 n tmeddit UTC

link

Aseɣwen yezgan

Perhaps we should have different logos for each alphabet: one for Tatoeba, and others for Τατοεβα, Татоэба, タトエバ....

papabear 8 Mayyu 2011 8 Mayyu 2011 9:01:24 n tufat UTC

link

Aseɣwen yezgan

There is currently no way to search for sentences with a certain word count. However, CK painstakingly tagged a bunch of English sentences with less than 8 words and 8 to 10 words.

http://tatoeba.org/eng/tags/sho...s_than_8_words
http://tatoeba.org/eng/tags/sho.../8_to_10_words

When Sysko finishes his new version of the Tatoeba database, he may begin implementing features like word count.

Tesriḍ tallelt?

Ineflayen

Ɣef

iznan n papabear ɣef uɣrab-is (amatu 88)

Tesriḍ tallelt?

Ineflayen

Ɣef