menu
Tatoeba
language
Zaregistrovat se Přihlásit se
language Čeština
menu
Tatoeba

chevron_right Zaregistrovat se

chevron_right Přihlásit se

Prohlížet věty

chevron_right Náhodná věta

chevron_right Podle jazyka

chevron_right Podle seznamů

chevron_right Podle štítků

chevron_right Podle nahrávek

Komunita

chevron_right Zeď

chevron_right Seznam všech členů

chevron_right Jazyky členů

chevron_right Rodilí mluvčí

search
clear
swap_horiz
search
Demetrius {{ icon }} keyboard_arrow_right

Profil

keyboard_arrow_right

Věty

keyboard_arrow_right

Slovíčka

keyboard_arrow_right

Provedená hodnocení

keyboard_arrow_right

Seznamy

keyboard_arrow_right

Oblíbené

keyboard_arrow_right

Komentáře

keyboard_arrow_right

Komentáře u vět uživatele Demetrius

keyboard_arrow_right

Zprávy na Zdi

keyboard_arrow_right

Záznamy

keyboard_arrow_right

Audio

keyboard_arrow_right

Přepisy

translate

Překládat věty uživatele Demetrius

Zprávy uživatele Demetrius na Zdi (celkem 442)

Demetrius Demetrius 28. srpna 2010 28. srpna 2010 20:16:31 UTC link Trvalý odkaz (permalink)

I’m not sure whether we should translate By- and From- tags into English. Doing this will immediately make them less useful for non-English applications re-using Tatoeba’s data [unless someone's willing to maintain a database of tag translations ;o].

On the other hand, they’re used relatively rarely (compared to other tags anyway), so ability to view quotes of some person in all the languages shouldn't be very useful.

BTW, not all %NAME% tags should become By-%NAME%. E.g. Plato, Caesar (‘Platon is my friend, but...’ is said by Aristoteles, ‘Caesar non supra grammaticos’). I suggest these to be deleted from auto-delete list and dealt with manually.

I believe By- and For- should be converted into a DB fields describing a sentence and should be kept in the language of the sentence, since they're not really repeatable anyway.

Белый Ослик is ‘[A] little white donkey’, but I’m not sure whether it’s how this book(?) should be translated.

Demetrius Demetrius 27. srpna 2010 27. srpna 2010 1:33:34 UTC link Trvalý odkaz (permalink)

You've replied about the Qazan that is a capital of Tatarstan, not about the qazan that is a pot.

http://tatoeba.org/eng/sentences/show/441304

Demetrius Demetrius 27. srpna 2010 27. srpna 2010 1:26:22 UTC link Trvalý odkaz (permalink)

But it's much easier to copy-paste than Arabic script. :)

BTW, have you seen my comment about qazanda/qazangha?

Demetrius Demetrius 27. srpna 2010 27. srpna 2010 1:18:13 UTC link Trvalý odkaz (permalink)

Min tatarça BIK naçar belim. My Tatar is not good enough for this. (That's why there are so many Tatar sentences about apples ;)

BTW, you can PM him/her in latinized Uyghur. ;)

Demetrius Demetrius 27. srpna 2010 27. srpna 2010 0:46:30 UTC link Trvalý odkaz (permalink)

I don't agree with this nomination. Turkish is a language spoken in Germany. Since I'm learning German, I will do without Turkish. ;) Plus, I don't like purism, and Turkish people are known for their purism.

I think a person who knows other Turkic language well should start learning Turkish, because it's much easier to pick up a second language when you know a related one. ;)

Demetrius Demetrius 27. srpna 2010 27. srpna 2010 0:28:34 UTC link Trvalý odkaz (permalink)

> Anyone interested in learning Turkish? :D
By the way, most of the new sentence he has added are very simple and easy to understand, so they're good for people beginning to study Turkish. ;)

Demetrius Demetrius 26. srpna 2010 26. srpna 2010 23:37:35 UTC link Trvalý odkaz (permalink)

IMHO, since it's clear that he is online, but unwilling to change or discuss his sentences, moderators should be given the right to correct them immediately, not waiting for 2 weeks.

Demetrius Demetrius 26. srpna 2010 26. srpna 2010 14:51:52 UTC link Trvalý odkaz (permalink)

Hmm. Nice.

I’ll read about it, thank you.

Demetrius Demetrius 26. srpna 2010 26. srpna 2010 14:37:39 UTC link Trvalý odkaz (permalink)

Well, this may be a viable solution for Japanese or Chinese, but I’ve never heard about international tests in Russian, let alone in Ukrainian or Belarusian. Even if they do exist, they’re not divided into levels.

I feel that simply checking words in the frequency dictionary is the only way for Russian...

Demetrius Demetrius 26. srpna 2010 26. srpna 2010 14:19:40 UTC link Trvalý odkaz (permalink)

** Difficulty tagging **

I feel that tags like ‘difficult’, ‘easy’ and so on would be very helpful. However, I’m reluctant to add a lot of these unless they have a definite meaning.

IMHO we need a clear guidlines about what can be tagged as ‘easy’. Maybe we can even break it up into 2 different tags: ‘easy vocabulary’ (e.g. all words are in the first 2000 in the frequency dictionary), ‘easy grammar’ (a simple sentence w/out any complex grammar structures, etc.).

Are there any established systems of calculating a sentence difficulty?

Demetrius Demetrius 26. srpna 2010 26. srpna 2010 13:01:59 UTC link Trvalý odkaz (permalink)

Sentences can be used by anyone, see page in the bottom.

Currently we have not contacted Google, but since it uses the data from the Internet, it may use it... who knows? At least all Tatoeba is indexed by Google.

The main idea is that anyone who needs a database of sentences can use Tatoeba to make applications. DB can be downloaded here: http://tatoeba.org/eng/download...mple_sentences

One purpose is learning.
For example, an online dictionary WWWJDIC uses English-Japanese pairs from Tatoeba.
Language learning programmes can be created. For example, CK has also created a Japanese reading practice program based on data from Tatoeba (I don't remember the link exactly).

Well, Tatoeba can be used for any application. For example, Braulio Bezerra created a typing game with these sentences.

(Also, Tatoeba itself has some functions for learning too. For example, you can create a list and export it to Anki.)

Demetrius Demetrius 26. srpna 2010 26. srpna 2010 9:45:41 UTC link Trvalý odkaz (permalink)

> 4. Do not translate word for word
>
> We are not interested in having sentences that sound like
> they were written by a robot. We want sentences that really
> are what a native speaker would say.
Tatoeba Blog, ‘How to be a good contributor’, rule 4 (http://blog.tatoeba.org/2010/02...ba.html#rule4)

I guess this covers machine translations. ^^

Demetrius Demetrius 25. srpna 2010 25. srpna 2010 23:20:57 UTC link Trvalý odkaz (permalink)

I think a mss tagging ‘tag all sentences linked to this’ would be useful. I.e. if you see it's a ‘fact’, most linked sentences would usually be facts too.

Demetrius Demetrius 25. srpna 2010 25. srpna 2010 9:37:42 UTC link Trvalý odkaz (permalink)

Azeri is a language spoken in Azerbaijan.

Demetrius Demetrius 24. srpna 2010 24. srpna 2010 14:46:04 UTC link Trvalý odkaz (permalink)

PG-13 is a normal tag.
be-1959acad shows that this is applicable only in the Academic variery of Belarusian; it's a normal language tag
Leopolis is a latin duplicate of Lviv ;o
Urdu is to be deleted, since we have a flag now
IMHO spoken to male should be replaced with ‘said to male’

Demetrius Demetrius 24. srpna 2010 24. srpna 2010 9:59:06 UTC link Trvalý odkaz (permalink)

By the way, about the XXX tag...

How it should be used? Consider the following groups:
a) sentence describes a sexual intercourse in rude words,
b) sentence uses the rude words with an indirect meaning, to describe something other,
c) sentence describes a sexual intercourse with euphemisms
In which cases XXX tag should be used?

Demetrius Demetrius 22. srpna 2010 22. srpna 2010 21:18:52 UTC link Trvalý odkaz (permalink)

[@sysko and anyone interested]
Uzbek script-switching code is here:
http://uyghur.webatu.com/uzb/uzbek_script.zip

The Cyrillic>Latin should work fine, Latin>Cyrillic may have problems with Russian loanwords. Also, it may be slow, since it’s a simple PHP str_replace w/ 2 arrays.

A simple page for anyone to toy with the transliteration:
http://uyghur.webatu.com/uzb/

Demetrius Demetrius 22. srpna 2010 22. srpna 2010 21:15:31 UTC link Trvalý odkaz (permalink)

Are there any Bosnian and Croatian sentences these NOT matching Serbian?.. :o

Demetrius Demetrius 21. srpna 2010 21. srpna 2010 19:56:16 UTC link Trvalý odkaz (permalink)

BTW, we should be careful when adding sentences with periods.

Sometimes there are non-standard 1337-like ‘punctuation’ like "!!1" in the end.

I’m not sure whether we should keep such sentences though...

Demetrius Demetrius 20. srpna 2010 20. srpna 2010 15:47:30 UTC link Trvalý odkaz (permalink)

Leaving a comment lets a non-native owing it learn the correct variant or be confident in their translations.