menu
Tatoeba
language
S'inscriure Connexion
language Occitan
menu
Tatoeba

chevron_right S'inscriure

chevron_right Connexion

Percórrer

chevron_right Afichar la frasa aleatòria

chevron_right Percórrer per lenga

chevron_right Percórrer per lista

chevron_right Percórrer per etiqueta

chevron_right Percórrer los enregistraments àudio

Community

chevron_right Paret

chevron_right Lista de totes los membres

chevron_right Languages of members

chevron_right Native speakers

search
clear
swap_horiz
search
Demetrius {{ icon }} keyboard_arrow_right

Perfil

keyboard_arrow_right

Frasas

keyboard_arrow_right

Vocabulary

keyboard_arrow_right

Reviews

keyboard_arrow_right

Lists

keyboard_arrow_right

Marcapaginas

keyboard_arrow_right

Comentaris

keyboard_arrow_right

Comentaris sus las frasas de Demetrius

keyboard_arrow_right

Cabinats

keyboard_arrow_right

Jornals

keyboard_arrow_right

Audio

keyboard_arrow_right

Transcriptions

translate

Translate Demetrius's sentences

Cabinets de Demetrius sus la paret (total 442)

Demetrius Demetrius August 28, 2010 August 28, 2010 at 8:16:31 PM UTC link Permalink

I’m not sure whether we should translate By- and From- tags into English. Doing this will immediately make them less useful for non-English applications re-using Tatoeba’s data [unless someone's willing to maintain a database of tag translations ;o].

On the other hand, they’re used relatively rarely (compared to other tags anyway), so ability to view quotes of some person in all the languages shouldn't be very useful.

BTW, not all %NAME% tags should become By-%NAME%. E.g. Plato, Caesar (‘Platon is my friend, but...’ is said by Aristoteles, ‘Caesar non supra grammaticos’). I suggest these to be deleted from auto-delete list and dealt with manually.

I believe By- and For- should be converted into a DB fields describing a sentence and should be kept in the language of the sentence, since they're not really repeatable anyway.

Белый Ослик is ‘[A] little white donkey’, but I’m not sure whether it’s how this book(?) should be translated.

Demetrius Demetrius August 27, 2010 August 27, 2010 at 1:33:34 AM UTC link Permalink

You've replied about the Qazan that is a capital of Tatarstan, not about the qazan that is a pot.

http://tatoeba.org/eng/sentences/show/441304

Demetrius Demetrius August 27, 2010 August 27, 2010 at 1:26:22 AM UTC link Permalink

But it's much easier to copy-paste than Arabic script. :)

BTW, have you seen my comment about qazanda/qazangha?

Demetrius Demetrius August 27, 2010 August 27, 2010 at 1:18:13 AM UTC link Permalink

Min tatarça BIK naçar belim. My Tatar is not good enough for this. (That's why there are so many Tatar sentences about apples ;)

BTW, you can PM him/her in latinized Uyghur. ;)

Demetrius Demetrius August 27, 2010 August 27, 2010 at 12:46:30 AM UTC link Permalink

I don't agree with this nomination. Turkish is a language spoken in Germany. Since I'm learning German, I will do without Turkish. ;) Plus, I don't like purism, and Turkish people are known for their purism.

I think a person who knows other Turkic language well should start learning Turkish, because it's much easier to pick up a second language when you know a related one. ;)

Demetrius Demetrius August 27, 2010 August 27, 2010 at 12:28:34 AM UTC link Permalink

> Anyone interested in learning Turkish? :D
By the way, most of the new sentence he has added are very simple and easy to understand, so they're good for people beginning to study Turkish. ;)

Demetrius Demetrius August 26, 2010 August 26, 2010 at 11:37:35 PM UTC link Permalink

IMHO, since it's clear that he is online, but unwilling to change or discuss his sentences, moderators should be given the right to correct them immediately, not waiting for 2 weeks.

Demetrius Demetrius August 26, 2010 August 26, 2010 at 2:51:52 PM UTC link Permalink

Hmm. Nice.

I’ll read about it, thank you.

Demetrius Demetrius August 26, 2010 August 26, 2010 at 2:37:39 PM UTC link Permalink

Well, this may be a viable solution for Japanese or Chinese, but I’ve never heard about international tests in Russian, let alone in Ukrainian or Belarusian. Even if they do exist, they’re not divided into levels.

I feel that simply checking words in the frequency dictionary is the only way for Russian...

Demetrius Demetrius August 26, 2010 August 26, 2010 at 2:19:40 PM UTC link Permalink

** Difficulty tagging **

I feel that tags like ‘difficult’, ‘easy’ and so on would be very helpful. However, I’m reluctant to add a lot of these unless they have a definite meaning.

IMHO we need a clear guidlines about what can be tagged as ‘easy’. Maybe we can even break it up into 2 different tags: ‘easy vocabulary’ (e.g. all words are in the first 2000 in the frequency dictionary), ‘easy grammar’ (a simple sentence w/out any complex grammar structures, etc.).

Are there any established systems of calculating a sentence difficulty?

Demetrius Demetrius August 26, 2010 August 26, 2010 at 1:01:59 PM UTC link Permalink

Sentences can be used by anyone, see page in the bottom.

Currently we have not contacted Google, but since it uses the data from the Internet, it may use it... who knows? At least all Tatoeba is indexed by Google.

The main idea is that anyone who needs a database of sentences can use Tatoeba to make applications. DB can be downloaded here: http://tatoeba.org/eng/download...mple_sentences

One purpose is learning.
For example, an online dictionary WWWJDIC uses English-Japanese pairs from Tatoeba.
Language learning programmes can be created. For example, CK has also created a Japanese reading practice program based on data from Tatoeba (I don't remember the link exactly).

Well, Tatoeba can be used for any application. For example, Braulio Bezerra created a typing game with these sentences.

(Also, Tatoeba itself has some functions for learning too. For example, you can create a list and export it to Anki.)

Demetrius Demetrius August 26, 2010 August 26, 2010 at 9:45:41 AM UTC link Permalink

> 4. Do not translate word for word
>
> We are not interested in having sentences that sound like
> they were written by a robot. We want sentences that really
> are what a native speaker would say.
Tatoeba Blog, ‘How to be a good contributor’, rule 4 (http://blog.tatoeba.org/2010/02...ba.html#rule4)

I guess this covers machine translations. ^^

Demetrius Demetrius August 25, 2010 August 25, 2010 at 11:20:57 PM UTC link Permalink

I think a mss tagging ‘tag all sentences linked to this’ would be useful. I.e. if you see it's a ‘fact’, most linked sentences would usually be facts too.

Demetrius Demetrius August 25, 2010 August 25, 2010 at 9:37:42 AM UTC link Permalink

Azeri is a language spoken in Azerbaijan.

Demetrius Demetrius August 24, 2010 August 24, 2010 at 2:46:04 PM UTC link Permalink

PG-13 is a normal tag.
be-1959acad shows that this is applicable only in the Academic variery of Belarusian; it's a normal language tag
Leopolis is a latin duplicate of Lviv ;o
Urdu is to be deleted, since we have a flag now
IMHO spoken to male should be replaced with ‘said to male’

Demetrius Demetrius August 24, 2010 August 24, 2010 at 9:59:06 AM UTC link Permalink

By the way, about the XXX tag...

How it should be used? Consider the following groups:
a) sentence describes a sexual intercourse in rude words,
b) sentence uses the rude words with an indirect meaning, to describe something other,
c) sentence describes a sexual intercourse with euphemisms
In which cases XXX tag should be used?

Demetrius Demetrius August 22, 2010 August 22, 2010 at 9:18:52 PM UTC link Permalink

[@sysko and anyone interested]
Uzbek script-switching code is here:
http://uyghur.webatu.com/uzb/uzbek_script.zip

The Cyrillic>Latin should work fine, Latin>Cyrillic may have problems with Russian loanwords. Also, it may be slow, since it’s a simple PHP str_replace w/ 2 arrays.

A simple page for anyone to toy with the transliteration:
http://uyghur.webatu.com/uzb/

Demetrius Demetrius August 22, 2010 August 22, 2010 at 9:15:31 PM UTC link Permalink

Are there any Bosnian and Croatian sentences these NOT matching Serbian?.. :o

Demetrius Demetrius August 21, 2010 August 21, 2010 at 7:56:16 PM UTC link Permalink

BTW, we should be careful when adding sentences with periods.

Sometimes there are non-standard 1337-like ‘punctuation’ like "!!1" in the end.

I’m not sure whether we should keep such sentences though...

Demetrius Demetrius August 20, 2010 August 20, 2010 at 3:47:30 PM UTC link Permalink

Leaving a comment lets a non-native owing it learn the correct variant or be confident in their translations.