menu
Tatoeba
language
Nýskráning Innskrá
language Íslenska
menu
Tatoeba

chevron_right Nýskráning

chevron_right Innskrá

Vafra

chevron_right Sýna setningu af handahófi

chevron_right Vafra eftir tungumáli

chevron_right Vafra eftir lista

chevron_right Vafra eftir merki

chevron_right Vafra upptökum

Samfélag

chevron_right Veggur

chevron_right Meðlimalisti

chevron_right Listi tungumála meðlima

chevron_right Innfæddir

search
clear
swap_horiz
search
Demetrius {{ icon }} keyboard_arrow_right

Notandasíða

keyboard_arrow_right

Setningar

keyboard_arrow_right

Orðaforði

keyboard_arrow_right

Dómar

keyboard_arrow_right

Listar

keyboard_arrow_right

Eftirlæti

keyboard_arrow_right

Ummæli

keyboard_arrow_right

Ummæli á setningum frá Demetrius

keyboard_arrow_right

Veggskilaboð

keyboard_arrow_right

Saga

keyboard_arrow_right

Upptökur

keyboard_arrow_right

Umritunir

translate

Þýða setningar frá Demetrius

Veggskilaboð frá Demetrius (samtals 442)

Demetrius Demetrius 28. ágúst 2010 28. ágúst 2010 kl. 20:16:31 UTC link Tengill

I’m not sure whether we should translate By- and From- tags into English. Doing this will immediately make them less useful for non-English applications re-using Tatoeba’s data [unless someone's willing to maintain a database of tag translations ;o].

On the other hand, they’re used relatively rarely (compared to other tags anyway), so ability to view quotes of some person in all the languages shouldn't be very useful.

BTW, not all %NAME% tags should become By-%NAME%. E.g. Plato, Caesar (‘Platon is my friend, but...’ is said by Aristoteles, ‘Caesar non supra grammaticos’). I suggest these to be deleted from auto-delete list and dealt with manually.

I believe By- and For- should be converted into a DB fields describing a sentence and should be kept in the language of the sentence, since they're not really repeatable anyway.

Белый Ослик is ‘[A] little white donkey’, but I’m not sure whether it’s how this book(?) should be translated.

Demetrius Demetrius 27. ágúst 2010 27. ágúst 2010 kl. 01:33:34 UTC link Tengill

You've replied about the Qazan that is a capital of Tatarstan, not about the qazan that is a pot.

http://tatoeba.org/eng/sentences/show/441304

Demetrius Demetrius 27. ágúst 2010 27. ágúst 2010 kl. 01:26:22 UTC link Tengill

But it's much easier to copy-paste than Arabic script. :)

BTW, have you seen my comment about qazanda/qazangha?

Demetrius Demetrius 27. ágúst 2010 27. ágúst 2010 kl. 01:18:13 UTC link Tengill

Min tatarça BIK naçar belim. My Tatar is not good enough for this. (That's why there are so many Tatar sentences about apples ;)

BTW, you can PM him/her in latinized Uyghur. ;)

Demetrius Demetrius 27. ágúst 2010 27. ágúst 2010 kl. 00:46:30 UTC link Tengill

I don't agree with this nomination. Turkish is a language spoken in Germany. Since I'm learning German, I will do without Turkish. ;) Plus, I don't like purism, and Turkish people are known for their purism.

I think a person who knows other Turkic language well should start learning Turkish, because it's much easier to pick up a second language when you know a related one. ;)

Demetrius Demetrius 27. ágúst 2010 27. ágúst 2010 kl. 00:28:34 UTC link Tengill

> Anyone interested in learning Turkish? :D
By the way, most of the new sentence he has added are very simple and easy to understand, so they're good for people beginning to study Turkish. ;)

Demetrius Demetrius 26. ágúst 2010 26. ágúst 2010 kl. 23:37:35 UTC link Tengill

IMHO, since it's clear that he is online, but unwilling to change or discuss his sentences, moderators should be given the right to correct them immediately, not waiting for 2 weeks.

Demetrius Demetrius 26. ágúst 2010 26. ágúst 2010 kl. 14:51:52 UTC link Tengill

Hmm. Nice.

I’ll read about it, thank you.

Demetrius Demetrius 26. ágúst 2010 26. ágúst 2010 kl. 14:37:39 UTC link Tengill

Well, this may be a viable solution for Japanese or Chinese, but I’ve never heard about international tests in Russian, let alone in Ukrainian or Belarusian. Even if they do exist, they’re not divided into levels.

I feel that simply checking words in the frequency dictionary is the only way for Russian...

Demetrius Demetrius 26. ágúst 2010 26. ágúst 2010 kl. 14:19:40 UTC link Tengill

** Difficulty tagging **

I feel that tags like ‘difficult’, ‘easy’ and so on would be very helpful. However, I’m reluctant to add a lot of these unless they have a definite meaning.

IMHO we need a clear guidlines about what can be tagged as ‘easy’. Maybe we can even break it up into 2 different tags: ‘easy vocabulary’ (e.g. all words are in the first 2000 in the frequency dictionary), ‘easy grammar’ (a simple sentence w/out any complex grammar structures, etc.).

Are there any established systems of calculating a sentence difficulty?

Demetrius Demetrius 26. ágúst 2010 26. ágúst 2010 kl. 13:01:59 UTC link Tengill

Sentences can be used by anyone, see page in the bottom.

Currently we have not contacted Google, but since it uses the data from the Internet, it may use it... who knows? At least all Tatoeba is indexed by Google.

The main idea is that anyone who needs a database of sentences can use Tatoeba to make applications. DB can be downloaded here: http://tatoeba.org/eng/download...mple_sentences

One purpose is learning.
For example, an online dictionary WWWJDIC uses English-Japanese pairs from Tatoeba.
Language learning programmes can be created. For example, CK has also created a Japanese reading practice program based on data from Tatoeba (I don't remember the link exactly).

Well, Tatoeba can be used for any application. For example, Braulio Bezerra created a typing game with these sentences.

(Also, Tatoeba itself has some functions for learning too. For example, you can create a list and export it to Anki.)

Demetrius Demetrius 26. ágúst 2010 26. ágúst 2010 kl. 09:45:41 UTC link Tengill

> 4. Do not translate word for word
>
> We are not interested in having sentences that sound like
> they were written by a robot. We want sentences that really
> are what a native speaker would say.
Tatoeba Blog, ‘How to be a good contributor’, rule 4 (http://blog.tatoeba.org/2010/02...ba.html#rule4)

I guess this covers machine translations. ^^

Demetrius Demetrius 25. ágúst 2010 25. ágúst 2010 kl. 23:20:57 UTC link Tengill

I think a mss tagging ‘tag all sentences linked to this’ would be useful. I.e. if you see it's a ‘fact’, most linked sentences would usually be facts too.

Demetrius Demetrius 25. ágúst 2010 25. ágúst 2010 kl. 09:37:42 UTC link Tengill

Azeri is a language spoken in Azerbaijan.

Demetrius Demetrius 24. ágúst 2010 24. ágúst 2010 kl. 14:46:04 UTC link Tengill

PG-13 is a normal tag.
be-1959acad shows that this is applicable only in the Academic variery of Belarusian; it's a normal language tag
Leopolis is a latin duplicate of Lviv ;o
Urdu is to be deleted, since we have a flag now
IMHO spoken to male should be replaced with ‘said to male’

Demetrius Demetrius 24. ágúst 2010 24. ágúst 2010 kl. 09:59:06 UTC link Tengill

By the way, about the XXX tag...

How it should be used? Consider the following groups:
a) sentence describes a sexual intercourse in rude words,
b) sentence uses the rude words with an indirect meaning, to describe something other,
c) sentence describes a sexual intercourse with euphemisms
In which cases XXX tag should be used?

Demetrius Demetrius 22. ágúst 2010 22. ágúst 2010 kl. 21:18:52 UTC link Tengill

[@sysko and anyone interested]
Uzbek script-switching code is here:
http://uyghur.webatu.com/uzb/uzbek_script.zip

The Cyrillic>Latin should work fine, Latin>Cyrillic may have problems with Russian loanwords. Also, it may be slow, since it’s a simple PHP str_replace w/ 2 arrays.

A simple page for anyone to toy with the transliteration:
http://uyghur.webatu.com/uzb/

Demetrius Demetrius 22. ágúst 2010 22. ágúst 2010 kl. 21:15:31 UTC link Tengill

Are there any Bosnian and Croatian sentences these NOT matching Serbian?.. :o

Demetrius Demetrius 21. ágúst 2010 21. ágúst 2010 kl. 19:56:16 UTC link Tengill

BTW, we should be careful when adding sentences with periods.

Sometimes there are non-standard 1337-like ‘punctuation’ like "!!1" in the end.

I’m not sure whether we should keep such sentences though...

Demetrius Demetrius 20. ágúst 2010 20. ágúst 2010 kl. 15:47:30 UTC link Tengill

Leaving a comment lets a non-native owing it learn the correct variant or be confident in their translations.