Perfil
Frasas
Vocabulary
Reviews
Lists
Marcapaginas
Comentaris
Comentaris sus las frasas de Demetrius
Cabinats
Jornals
Audio
Transcriptions
Translate Demetrius's sentences

I’m not sure whether we should translate By- and From- tags into English. Doing this will immediately make them less useful for non-English applications re-using Tatoeba’s data [unless someone's willing to maintain a database of tag translations ;o].
On the other hand, they’re used relatively rarely (compared to other tags anyway), so ability to view quotes of some person in all the languages shouldn't be very useful.
BTW, not all %NAME% tags should become By-%NAME%. E.g. Plato, Caesar (‘Platon is my friend, but...’ is said by Aristoteles, ‘Caesar non supra grammaticos’). I suggest these to be deleted from auto-delete list and dealt with manually.
I believe By- and For- should be converted into a DB fields describing a sentence and should be kept in the language of the sentence, since they're not really repeatable anyway.
Белый Ослик is ‘[A] little white donkey’, but I’m not sure whether it’s how this book(?) should be translated.

You've replied about the Qazan that is a capital of Tatarstan, not about the qazan that is a pot.
http://tatoeba.org/eng/sentences/show/441304

But it's much easier to copy-paste than Arabic script. :)
BTW, have you seen my comment about qazanda/qazangha?

Min tatarça BIK naçar belim. My Tatar is not good enough for this. (That's why there are so many Tatar sentences about apples ;)
BTW, you can PM him/her in latinized Uyghur. ;)

I don't agree with this nomination. Turkish is a language spoken in Germany. Since I'm learning German, I will do without Turkish. ;) Plus, I don't like purism, and Turkish people are known for their purism.
I think a person who knows other Turkic language well should start learning Turkish, because it's much easier to pick up a second language when you know a related one. ;)

> Anyone interested in learning Turkish? :D
By the way, most of the new sentence he has added are very simple and easy to understand, so they're good for people beginning to study Turkish. ;)

IMHO, since it's clear that he is online, but unwilling to change or discuss his sentences, moderators should be given the right to correct them immediately, not waiting for 2 weeks.

Hmm. Nice.
I’ll read about it, thank you.

Well, this may be a viable solution for Japanese or Chinese, but I’ve never heard about international tests in Russian, let alone in Ukrainian or Belarusian. Even if they do exist, they’re not divided into levels.
I feel that simply checking words in the frequency dictionary is the only way for Russian...

** Difficulty tagging **
I feel that tags like ‘difficult’, ‘easy’ and so on would be very helpful. However, I’m reluctant to add a lot of these unless they have a definite meaning.
IMHO we need a clear guidlines about what can be tagged as ‘easy’. Maybe we can even break it up into 2 different tags: ‘easy vocabulary’ (e.g. all words are in the first 2000 in the frequency dictionary), ‘easy grammar’ (a simple sentence w/out any complex grammar structures, etc.).
Are there any established systems of calculating a sentence difficulty?

Sentences can be used by anyone, see page in the bottom.
Currently we have not contacted Google, but since it uses the data from the Internet, it may use it... who knows? At least all Tatoeba is indexed by Google.
The main idea is that anyone who needs a database of sentences can use Tatoeba to make applications. DB can be downloaded here: http://tatoeba.org/eng/download...mple_sentences
One purpose is learning.
For example, an online dictionary WWWJDIC uses English-Japanese pairs from Tatoeba.
Language learning programmes can be created. For example, CK has also created a Japanese reading practice program based on data from Tatoeba (I don't remember the link exactly).
Well, Tatoeba can be used for any application. For example, Braulio Bezerra created a typing game with these sentences.
(Also, Tatoeba itself has some functions for learning too. For example, you can create a list and export it to Anki.)

> 4. Do not translate word for word
>
> We are not interested in having sentences that sound like
> they were written by a robot. We want sentences that really
> are what a native speaker would say.
Tatoeba Blog, ‘How to be a good contributor’, rule 4 (http://blog.tatoeba.org/2010/02...ba.html#rule4)
I guess this covers machine translations. ^^

I think a mss tagging ‘tag all sentences linked to this’ would be useful. I.e. if you see it's a ‘fact’, most linked sentences would usually be facts too.

Azeri is a language spoken in Azerbaijan.

PG-13 is a normal tag.
be-1959acad shows that this is applicable only in the Academic variery of Belarusian; it's a normal language tag
Leopolis is a latin duplicate of Lviv ;o
Urdu is to be deleted, since we have a flag now
IMHO spoken to male should be replaced with ‘said to male’

By the way, about the XXX tag...
How it should be used? Consider the following groups:
a) sentence describes a sexual intercourse in rude words,
b) sentence uses the rude words with an indirect meaning, to describe something other,
c) sentence describes a sexual intercourse with euphemisms
In which cases XXX tag should be used?

[@sysko and anyone interested]
Uzbek script-switching code is here:
http://uyghur.webatu.com/uzb/uzbek_script.zip
The Cyrillic>Latin should work fine, Latin>Cyrillic may have problems with Russian loanwords. Also, it may be slow, since it’s a simple PHP str_replace w/ 2 arrays.
A simple page for anyone to toy with the transliteration:
http://uyghur.webatu.com/uzb/

Are there any Bosnian and Croatian sentences these NOT matching Serbian?.. :o

BTW, we should be careful when adding sentences with periods.
Sometimes there are non-standard 1337-like ‘punctuation’ like "!!1" in the end.
I’m not sure whether we should keep such sentences though...

Leaving a comment lets a non-native owing it learn the correct variant or be confident in their translations.