menu
ٹٹوبا
language
درج کن لاگِن
language بلۏچی
menu
ٹٹوبا

chevron_right درج کن

chevron_right لاگِن

براوز کن

chevron_right پَدیمیں گالِد ءَ پیشدار

chevron_right زبان ءِ رِد ءَ براوز کن

chevron_right لَڑ ءِ رِد ءَ براوز کن

chevron_right ٹیگ ءِ رِد ءَ براوز کن

chevron_right تواربَند ءَ براوز کن

ٹَکّ

chevron_right دیوال

chevron_right دراھیں باسکانی لَڑ

chevron_right باسکانی زبان

chevron_right پیدائشی گپ جنوک

search
clear
swap_horiz
search
papabear {{ icon }} keyboard_arrow_right

پروفائل

keyboard_arrow_right

گالِد

keyboard_arrow_right

گالَمبار

keyboard_arrow_right

چمشانک

keyboard_arrow_right

لَڑ

keyboard_arrow_right

دلپسند

keyboard_arrow_right

دانک

keyboard_arrow_right

ءِ گالِدانی باروا دانکpapabear

keyboard_arrow_right

دیوال ءِ کلوہ

keyboard_arrow_right

لاگ

keyboard_arrow_right

تواربند

keyboard_arrow_right

نَدگِیر

translate

papabear ءِ گالِداں رجانک کن

ءِ دیوال ءِ سر ءِ کلوہ (درستی ءُ این بی ایس پی؛88)۔papabear

papabear papabear June 10, 2011 June 10, 2011 at 2:10:21 PM UTC link دائمکڑی

We have over 5000 members now!
¡Ahora tenemos más de 5000 miembros!
今は、メンバーが5000人以上いますよ!

papabear papabear June 3, 2011 June 3, 2011 at 12:27:25 PM UTC link دائمکڑی

And, for the love of Russell's teapot, can the sentence-adding code reject anything without a final punctuation mark? It'd save us so much trouble.

papabear papabear June 3, 2011 June 3, 2011 at 12:19:58 PM UTC link دائمکڑی

Might that make it more difficult to add sentences, parse corpus sentences by computer, and have audio sentences match their text?

I do agree with your basic premise, however, and I have a couple more ideas we could explore the pros and cons of:

* A moratorium on adding sentences of a certain length in well-established languages. Ex: No more sentences of less than three words in English.
* A heuristic anti-duplication algorithm that forbids not just whole duplicate sentences, but near-duplicates.

papabear papabear June 3, 2011 June 3, 2011 at 12:00:45 PM UTC link دائمکڑی

+1

I've run into this problem before. My solution, a cumbersome one, has been to comment on every single sentence linked to the one I changed.

papabear papabear June 3, 2011 June 3, 2011 at 11:58:04 AM UTC link دائمکڑی

It's not something I can take to its logical extreme, but I try to mix up the ethnicities of names I use. I wonder if lists of baby names around the world might help.

papabear papabear June 2, 2011 June 2, 2011 at 10:02:18 AM UTC link دائمکڑی

I've disowned all my Spanish sentences so that native speakers can adopt them.
He desconocido todas mis frases españoles para que hablantes nativos puedan adoptarlas.

papabear papabear May 26, 2011 May 26, 2011 at 10:50:58 AM UTC link دائمکڑی

To clarify, an example search for what I want might be:

All sentences of inmachan in Spanish NOT translated into English

You could even combine it with sacredceltic's suggestion:

All sentences NOT of papabear OR sysko NOT translated into French OR German

papabear papabear May 26, 2011 May 26, 2011 at 9:26:56 AM UTC link دائمکڑی

It might help free up the view of browsing untranslated sentences by source language. When I do that, I often find huge, 10-page blocks of sentences from one person.

papabear papabear May 26, 2011 May 26, 2011 at 9:06:36 AM UTC link دائمکڑی

When translating a specific person's sentences, I'd like to be able to filter them by language and whether they are already translated, as we can already do when we browse sentences by language.

papabear papabear May 24, 2011 May 24, 2011 at 10:21:55 PM UTC link دائمکڑی

If I recall correctly, the search engine database updates once a week. This will not be an issue in the upcoming C++ database.

papabear papabear May 23, 2011 May 23, 2011 at 9:07:32 PM UTC link دائمکڑی

There's already a ticket for that.

papabear papabear May 23, 2011 May 23, 2011 at 11:46:55 AM UTC link دائمکڑی

I'm reconfirming that the duplicate detector seems to work, at least in English. Also, Sysko, are you going to blow off this version of the database when you migrate TatoebaCPP to this domain?

papabear papabear May 12, 2011 May 12, 2011 at 8:21:26 AM UTC link دائمکڑی

I, for one, try to switch things up, but I don't have a precise algorithm for it.

papabear papabear May 12, 2011 May 12, 2011 at 4:09:39 AM UTC link دائمکڑی

But what if you haven't already registered? Let's keep in mind that we aim to serve non-registered visitors, and that the (limited) interfaces should be as usable and understandable as possible. Wikipedia already has separate interfaces for different languages, probably for similar reasons.

papabear papabear May 10, 2011 May 10, 2011 at 6:19:02 AM UTC link دائمکڑی

Would that better denote the hard B sound? To me, you see, that looks like Tatoempa.

papabear papabear May 9, 2011 May 9, 2011 at 11:03:56 PM UTC link دائمکڑی

True.

papabear papabear May 9, 2011 May 9, 2011 at 10:43:31 PM UTC link دائمکڑی

We should translate the word "project" in each image as well.

papabear papabear May 9, 2011 May 9, 2011 at 10:41:32 PM UTC link دائمکڑی

Matching the interface was what I meant.

papabear papabear May 9, 2011 May 9, 2011 at 12:33:48 PM UTC link دائمکڑی

Perhaps we should have different logos for each alphabet: one for Tatoeba, and others for Τατοεβα, Татоэба, タトエバ....

papabear papabear May 8, 2011 May 8, 2011 at 9:01:24 AM UTC link دائمکڑی

There is currently no way to search for sentences with a certain word count. However, CK painstakingly tagged a bunch of English sentences with less than 8 words and 8 to 10 words.

http://tatoeba.org/eng/tags/sho...s_than_8_words
http://tatoeba.org/eng/tags/sho.../8_to_10_words

When Sysko finishes his new version of the Tatoeba database, he may begin implementing features like word count.