menu
Tatoeba
language
Loo kasutaja Logi sisse
language Eesti
menu
Tatoeba

chevron_right Loo kasutaja

chevron_right Logi sisse

Sirvi

chevron_right Näita suvalist lauset

chevron_right Sirvi keelte kaupa

chevron_right Sirvi nimekirja kaupa

chevron_right Browse by tag

chevron_right Sirvi helisalvestusi

Suhtle

chevron_right Sein

chevron_right Kõigi liikmete nimekiri

chevron_right Liikmete keeled

chevron_right Emakeele rääkijad

search
clear
swap_horiz
search

Wall (7 162 threads)

Vihjed

Enne küsimist, loe läbi KKK.

We aim to maintain a healthy atmosphere for civilized discussions. Please read our rules against bad behavior.

Viimased sõnumid feedback

sharptoothed

2 päeva tagasi

feedback

CK

4 päeva tagasi

subdirectory_arrow_right

Seael

5 päeva tagasi

subdirectory_arrow_right

gillux

7 päeva tagasi

subdirectory_arrow_right

sacredceltic

8 päeva tagasi

subdirectory_arrow_right

sacredceltic

8 päeva tagasi

subdirectory_arrow_right

Seael

9 päeva tagasi

subdirectory_arrow_right

Guybrush88

9 päeva tagasi

feedback

Seael

9 päeva tagasi

feedback

StanJones

11 päeva tagasi

blay_paul blay_paul 15. veebruar 2010 15. veebruar 2010 06:44:42 UTC flag Report link Püsilink

A->B or B->A

There doesn't seem to be an easy way to tell whether a pair of sentences are
A(Japanese) translated to B(English)
or
B(English) translated to A(Japanese)

I would like to make sure this feature is firmly placed in the wish list for future development.

{{vm.hiddenReplies[193] ? 'expand_more' : 'expand_less'}} peida vastused näita vastuseid
TRANG TRANG 17. veebruar 2010 17. veebruar 2010 22:31:48 UTC flag Report link Püsilink

Is there a specific case where you would need this information?

One not too difficult way is to look at the creation date of each sentence (which you can see in the first entry in the logs). If A was created before B, then it must have been A->B.

{{vm.hiddenReplies[194] ? 'expand_more' : 'expand_less'}} peida vastused näita vastuseid
blay_paul blay_paul 18. veebruar 2010 18. veebruar 2010 13:56:34 UTC flag Report link Püsilink

> Is there a specific case where you would need this information?

Not as such, but I can explain _why_ I want this information to be recorded / displayed.

One common use of the translated example sentences is to explain what the original sentence means. So it is quite normal to have, for example, obscure English translated (explained) into normal Japanese.

Imagine you have this:
http://tatoeba.org/eng/sentences/show/270663
A. Many a mickle makes a muckle. [Proverb]
B. 塵も積もれば山となる。

B is the Japanese equivalent (and 'translation') of A.

Some well meaning person might decide that hardly anybody knows what "Many a mickle makes a muckle." means and 'correct' it into a different English sentence.

A more common example would be the

http://tatoeba.org/eng/sentences/show/71044
A. I'll make you a present of a doll.
B. あなたに人形をお贈りします。

If B. is the original and A. the translation then you could well say that the English in A is a little odd and should be changed. If A is the original and B is the translation then you could say that the A demonstrates a somewhat old-fashioned phrasing and B explains is.

{{vm.hiddenReplies[195] ? 'expand_more' : 'expand_less'}} peida vastused näita vastuseid
contour contour 19. veebruar 2010 19. veebruar 2010 02:52:57 UTC flag Report link Püsilink

In those cases it would be desirable to have both phrasings, with uncommon ones marked somehow, so users should be encouraged to add alternate translations rather than 'correcting' existing ones when the translation is not erroneous.

I think this could be handled by attaching more metadata to sentences. Properties like masculine/feminine, proverb, quotation, polite, slang, dated, etc. could be tagged onto a single sentence, so you could have e.g. both a polite and a colloquial translation, and mark them as such.

That would also address the problem where sentences from the Tanaka Corpus are currently marked with tags like [F] on the English sentence, even though the property belongs to the Japanese sentence. That doesn't work that well when the sentences aren't restricted to pairs.

cburgmer cburgmer 15. veebruar 2010 15. veebruar 2010 10:07:10 UTC flag Report link Püsilink

Which languages to add, and which not to add.

Just a short note to say that Wikipedia adopted the policy to only create Wikipedias for languages that have an ISO 639-3 Code. There might be exceptions. I think this decision helped them pretty much ease the process for new languages.

{{vm.hiddenReplies[198] ? 'expand_more' : 'expand_less'}} peida vastused näita vastuseid
sysko sysko 16. veebruar 2010 16. veebruar 2010 14:14:20 UTC flag Report link Püsilink

Yep we used to have a discussion about that, and we don't plan to only add languages which have an ISO 639 alpha 3 code, wikipedia as to do this because they have tons of contributors, and an encyclopedia need much more data than just a database of example sentences, so I can understand why they don't prefer to have a lot of articles rather than a lot of dialects or so

but for us, I think as soon as the language is enough different to be not totaly intelligible with an other, then we can add it as a specific language (that's the case for shanghainese for example, the closest ISO 639 code is for Wu language, but the Wu language, for which shanghainese is a "dialect", is divided in some other "dialects" (even i don't really like the word "dialect"), which are not intelligible with shanghainese)

and as I think tatoeba can be used to keep a trace of language, especially endangered ones, I would find really pityfull to not add a small language, only because it has no iso 639-3 code (moreover I've heard iso 639-4 code will be released)

fcbond fcbond 16. veebruar 2010 16. veebruar 2010 10:14:44 UTC flag Report link Püsilink

How do I set the language I translate into?

If I click on "死ね" and try to add a translation, it comes up as German, while I mean it to be English, ...

{{vm.hiddenReplies[200] ? 'expand_more' : 'expand_less'}} peida vastused näita vastuseid
sysko sysko 16. veebruar 2010 16. veebruar 2010 13:40:28 UTC flag Report link Püsilink

for the moment (but we will change it soon) you have no way to directly specify the language
But, if Tatoeba misdetects the language, then you simply click on the flag next to the incriminate sentence and set it to the right one. It can be done whenever you want, as soon as the sentence belong to you (like editing)

{{vm.hiddenReplies[201] ? 'expand_more' : 'expand_less'}} peida vastused näita vastuseid
fcbond fcbond 16. veebruar 2010 16. veebruar 2010 13:53:25 UTC flag Report link Püsilink

Thanks. I tried clicking on the flag, I didn't realise I had to select the sentence first. I look forward to the new version where I can specify it at once.

BTW -- I seem to have added some rubbish (which I have marked with DELETE ME in the comments). Sorry.

{{vm.hiddenReplies[202] ? 'expand_more' : 'expand_less'}} peida vastused näita vastuseid
sysko sysko 16. veebruar 2010 16. veebruar 2010 14:07:32 UTC flag Report link Püsilink

ok no problem for the duplicate sentence, they will be automatically deleted, and an admin will deleted the other one

sysko sysko 16. veebruar 2010 16. veebruar 2010 14:15:58 UTC flag Report link Püsilink

you could have direcly change the language on one of the sentence you've already added :p

blay_paul blay_paul 15. veebruar 2010 15. veebruar 2010 06:47:01 UTC flag Report link Püsilink

Changed from ...

Another 'wish list' item. When looking at the log of recently changed items I would like to be able to see what changed items have been changed _FROM_.

blay_paul blay_paul 14. veebruar 2010 14. veebruar 2010 13:48:28 UTC flag Report link Püsilink

Procedure when replacing (not correcting) sentences.

See http://tatoeba.org/eng/sentences/show/218263

If a sentence is pretty much completely replaced it's position in the "Sentence X is translation of Sentence Y" system may need to change.

Example:
BEFORE
Sentence A (Japanese) was (allegedly) translated into Sentence B (English) which was in turn translated into Sentence C (German)

AFTER
Japanese is noted to be unrelated to English and so completely replaced with a new Japanese sentence.
So _now_
Sentence B (English) is translated into
Sentence C (German) and also translated into
Sentence D (Japanese).

So A -> B -> C
changes to
B -> C
-> D

JeroenHoek JeroenHoek 10. veebruar 2010 10. veebruar 2010 21:02:32 UTC flag Report link Püsilink

Here's a nice user-test for the system:

Sentence nº164914 was a Japanese sentence I wanted to edit. I edited its English translation, which went well, but because tatoeba.org responded too slow I accidentally added a new Japanese sentence (nº361150) instead of amending nº164914. I can't delete translations, so I just changed that old Japanese one to a Dutch translation of the sentence. Meanwhile, someone else added a German translation too! :)

Questions:
* What happens to the indices for the former Japanese sentence?
* Should user be able to do this?

As a developer I suspect that changing languages on an existing (Japanese) sentence is bound to cause issues, but as a user it makes perfect sense to solve the issue I ran into. Your thoughts?

{{vm.hiddenReplies[187] ? 'expand_more' : 'expand_less'}} peida vastused näita vastuseid
TRANG TRANG 11. veebruar 2010 11. veebruar 2010 18:51:38 UTC flag Report link Püsilink

> What happens to the indices for the former Japanese sentence?

As Paul said, it gets left behind. We don't have (yet) strong mechanisms that would help keep the database consistent.

> Should user be able to do this?

No, they shouldn't. Ideally, there should be guidelines (which I'm hoping to be able to write by the end of the month) to help users understand better how things work and how they can contribute in a way that doesn't give us (developers) more work than we already have ^^'
I wrote down some of the ideas in my comment here : http://tatoeba.org/eng/sentences/show/126

> Perhaps "nominate for deletion" could be added as explicit functionality?

Yes, in general, we could have various status for a sentence. Actually in the previous version of Tatoeba we used to have that, but I haven't re-implemented it. A sentence could be marked as "to delete", "checked" or "locked" (and perhaps other things, I don't remember). When a sentence was checked, it meant you could rely on it for not having mistakes. When it was locked, no one could edit it anymore.

But this is not urgent compared to other things we have to do. My priority at the moment is to make sure that people understand clearly that when they translate, they have to translate from the sentence written in big letters. I'm pretty sure that very often, people are adding translations to a Japanese sentence when they were actually translating from the English sentence.
We also have to enable people to link and unlink sentences. There are many sentences that are linked to each other without being translations of each other, and there are many sentences that could be translations of each other but are not linked to each other.

Once all of that is settled, and people understand that they have to view the corpus as a GRAPH and not a table, it will be less likely that they behave in a way that we don't want them to behave, like what you did. And perhaps "nominate for deletion" will not be *that* useful because instead of deleting, you could just edit your sentence into whatever you want and unlink it from any sentence it was linked to.

{{vm.hiddenReplies[190] ? 'expand_more' : 'expand_less'}} peida vastused näita vastuseid
JeroenHoek JeroenHoek 11. veebruar 2010 11. veebruar 2010 19:20:31 UTC flag Report link Püsilink

Educating users is desirable of course, but opportunistic contributors will make mistakes. In the case of incidental contributions, not being able to delete an entry that should not have been created, nor nominate it for deletion is likely to frustrate the user. An alternative may be to offer a grace period for sentences you created yourself, being able to delete them within a certain period as long as they are not linked to by other new sentences.

On the topic of linking translations: is it possible to link a sentence to multiple sentences? There are many cases where the translated sentences actually do function as proper translations of each other, as well the sentence they are linked to.

Visualizing the graph is challenging within the confines of HTML/CSS, good luck there. Further indenting of the non-direct translations might help.

blay_paul blay_paul 10. veebruar 2010 10. veebruar 2010 22:36:27 UTC flag Report link Püsilink

> What happens to the indices for the former Japanese sentence?

It gets left behind. I manually copied it to the new sentence in this case, but Trang will need to clear up things. I'm not sure if I can delete index entries.

> Should user be able to do this?

It's probably a bad idea.

Suppose you have

A(English) translates to B(Japanese) translates to C(German)

If you change the Japanese to a new, Dutch, sentence you get

A(English) translates to B(Dutch) (doesn't really) translates to C(German)

Because C is really the translation of the (vanished) Japanese sentence not the (new) Dutch sentence.

I think it would have been best to have left the Japanese duplicate and added a "Please delete me" comment.

{{vm.hiddenReplies[188] ? 'expand_more' : 'expand_less'}} peida vastused näita vastuseid
JeroenHoek JeroenHoek 11. veebruar 2010 11. veebruar 2010 10:00:20 UTC flag Report link Püsilink

> I think it would have been best to have left the Japanese duplicate and added a "Please delete me" comment.

Agreed. (Since Tatoeba is in beta, I try to actively break things by using it from a novice user's perspective.)

Trang:
Perhaps "nominate for deletion" could be added as explicit functionality? A way for user's to flag a sentence as undesirable (with optional comment). In time the comment system will become hard to monitor for "delete me" type messages.

blay_paul blay_paul 10. veebruar 2010 10. veebruar 2010 13:44:17 UTC flag Report link Püsilink

I do not appear to be able to change my password. Are passwords limited to a certain number of letters or something?

{{vm.hiddenReplies[159] ? 'expand_more' : 'expand_less'}} peida vastused näita vastuseid
TRANG TRANG 10. veebruar 2010 10. veebruar 2010 22:29:43 UTC flag Report link Püsilink

Okay should be fixed now.

{{vm.hiddenReplies[166] ? 'expand_more' : 'expand_less'}} peida vastused näita vastuseid
blay_paul blay_paul 10. veebruar 2010 10. veebruar 2010 22:48:24 UTC flag Report link Püsilink

It is. Thanks.

TRANG TRANG 10. veebruar 2010 10. veebruar 2010 18:09:24 UTC flag Report link Püsilink

Paul, you're back! :O (Welcome back ^^)

I just tried to change my password and it worked... Passwords shouldn't be limited to a certain number of character, although I haven't tried anything above 20ish characters.

All I can tell you is to try again... Just be careful that there isn't any leading or trailing space if you're copy-pasting your "old" password.

{{vm.hiddenReplies[160] ? 'expand_more' : 'expand_less'}} peida vastused näita vastuseid
blay_paul blay_paul 10. veebruar 2010 10. veebruar 2010 19:21:18 UTC flag Report link Püsilink

Hmm, still doesn't seem to work.

One of the passwords I tried was 11 characters with 8 letters + 4 numbers. The other was 8 characters with six letters and two symbols.

{{vm.hiddenReplies[161] ? 'expand_more' : 'expand_less'}} peida vastused näita vastuseid
blay_paul blay_paul 10. veebruar 2010 10. veebruar 2010 19:24:03 UTC flag Report link Püsilink

Just tried with

testing123

(not a real password) and it didn't work.

I get the error message "An error occured while saving." each time.

{{vm.hiddenReplies[162] ? 'expand_more' : 'expand_less'}} peida vastused näita vastuseid
TRANG TRANG 10. veebruar 2010 10. veebruar 2010 20:00:07 UTC flag Report link Püsilink

I tried with testing123 too, but didn't get any error.

Well, perhaps you can try to log out and log in again. This will make sure that the password you were entering in "old password" is the right password. I mean, perhaps you were entering a password that you tought were your current password but actually isn't.

If it's not that, then for now I don't know what it is...

TRANG TRANG 10. veebruar 2010 10. veebruar 2010 21:22:21 UTC flag Report link Püsilink

Actually, sysko made me realize that if the message is "An error occured while saving." and not "Password error. Please try again." then it means Tatoeba won't save your new password. I'm looking into this.

sysko sysko 10. veebruar 2010 10. veebruar 2010 19:34:15 UTC flag Report link Püsilink

@Trang maybe it's related to the biptaste's bug, maybe it's not really closed

lilygilder lilygilder 28. jaanuar 2010 28. jaanuar 2010 20:45:02 UTC flag Report link Püsilink

To my fellow German translators:

+5,000 sentences! (And about 1,000 sentences in 10 days, wow!) German is now #5! :D Let's have some cake.

{{vm.hiddenReplies[127] ? 'expand_more' : 'expand_less'}} peida vastused näita vastuseid
TRANG TRANG 29. jaanuar 2010 29. jaanuar 2010 00:25:03 UTC flag Report link Püsilink

Congratulations ^_^

human600 human600 30. jaanuar 2010 30. jaanuar 2010 21:42:59 UTC flag Report link Püsilink

Gratuliere!

TRANG TRANG 10. veebruar 2010 10. veebruar 2010 18:57:10 UTC flag Report link Püsilink

And now, not only are there 6,500+ sentences, but German is also #4! =D

(but for how long...)

blay_paul blay_paul 10. veebruar 2010 10. veebruar 2010 14:45:26 UTC flag Report link Püsilink

How do you edit the romaji line for Japanese sentences?

{{vm.hiddenReplies[171] ? 'expand_more' : 'expand_less'}} peida vastused näita vastuseid
TRANG TRANG 10. veebruar 2010 10. veebruar 2010 18:32:28 UTC flag Report link Püsilink

For now you don't :
http://tatoeba.org/eng/wall/index#message_24

As long as there isn't at least 2 or 3 "hardcore contributor" in Japanese, I will not implement the possibility to edit the romaji.

{{vm.hiddenReplies[172] ? 'expand_more' : 'expand_less'}} peida vastused näita vastuseid
blay_paul blay_paul 11. veebruar 2010 11. veebruar 2010 13:14:05 UTC flag Report link Püsilink

That's your decision. But do note that there are some very wrong romaji entries.

{{vm.hiddenReplies[173] ? 'expand_more' : 'expand_less'}} peida vastused näita vastuseid
TRANG TRANG 11. veebruar 2010 11. veebruar 2010 19:03:58 UTC flag Report link Püsilink

Don't worry, I'm very aware of this. I just think that people can wait to have 99,99% reliable romaji.

And there used to be a warning tooltip, when you moved your mouse over the romaji, that said it was not reliable... but you made me notice that it's not there anymore. I guess we took it out by mistake while cleaning our code.

{{vm.hiddenReplies[174] ? 'expand_more' : 'expand_less'}} peida vastused näita vastuseid
Nemo Nemo 21. veebruar 2010 21. veebruar 2010 15:55:49 UTC flag Report link Püsilink

You say that as if we are now at 99.98% accuracy. If you intend to have this project be a serious attempt for people to learn, please hide the Romaji if it's not going to be fixable. Currently, much of it is an abomination of on/kun mixups, and idiosyncratic spacing. Any beginners will just be confused by it, and anyone who knows how wrong it is will be frustrated by it. Also, what if a Japanese sentence is wrong? The romaji will be completely different, with added, subtracted, or moved words. If nothing else, add Romaji as a "Language". Long-term however, we're going to need some solution. I assume Chinese has the same issue, and Shanghainese likely does or will, too (I haven't seen any Shanghainese with ruby text yet). Eventually we'll be swimming in meta-languages. I'd say just nuke all romanization until it's done right. Also, doesn't wwwjdict already have every example sentence broken down into kana? I've used it in the past, but it was far too long ago for me to remember.

{{vm.hiddenReplies[175] ? 'expand_more' : 'expand_less'}} peida vastused näita vastuseid
sysko sysko 21. veebruar 2010 21. veebruar 2010 17:06:58 UTC flag Report link Püsilink

in fact chinese doesn't have the same issue

the problem with japanese romanization in tatoeba, is that it come from Kakasi, which is no longer a "living project", so we have no way to report the error to someone, as nobody maintain it anymore,

for chinese it comes from adso, develloped by the guy of popupchinese.com, I've regular contact with him, and as he have a strong background in both linguistic and programming, he's tool is really reliable and support by a reliable community
moreover when a bad romnization is found, a new version is released in the day most of the times
so for chinese there's no really problem

for shanghainese yep we will need to make it editable for the simple reason currently there's just no tool do this automatically :p (I'm working wit the adso guy to support shanghainese, but as I don't expect reliable result before a long time, I will make it editable)

for Japanese, I let Trang answers, as I don't speak japanese at all

{{vm.hiddenReplies[177] ? 'expand_more' : 'expand_less'}} peida vastused näita vastuseid
Nemo Nemo 21. veebruar 2010 21. veebruar 2010 17:13:45 UTC flag Report link Püsilink

Well, by "problem" I meant that the ruby can't be edited. So it does then? I understand that the mitigation of the problem is better for Chinese, but it's still read-only, correct? I also know that it's easier for Chinese, since the reading variations are more predictable, less common, and just plain easier to get right.

{{vm.hiddenReplies[178] ? 'expand_more' : 'expand_less'}} peida vastused näita vastuseid
sysko sysko 21. veebruar 2010 21. veebruar 2010 17:37:50 UTC flag Report link Püsilink

no it's not, and to be honnest i don't know if it's requested for chinese (for shanghainese absolutly it will need)
as I used adso also for other works than Tatoeba, and even in tatoeba, my chinese friends as not reported me yet a single sentence with a bad romanization, so I think letting people able to edit chinese will bring more drawbacks than advantages

* if a error is reported, i will notify it to adso guy, and waiting, the patch, we can add an hardcoded pinyin for the sentence

* if a chinese learner show a sentence where a chinese character has a very rare pronunciation, he will maybe correct it , thinking it's wrong, which is not intented
* the chinese romanization use characters with tones, and most of them cannot be input directly, neither with a IME

but after things are never stated for ever, and if we see the amount of errors from adso is too large to be covered by our current way of working, then we will make it editable

what do you think ?

(I only speak about shanghainese and chinese)

Nemo Nemo 21. veebruar 2010 21. veebruar 2010 16:29:00 UTC flag Report link Püsilink

"Also, what if a Japanese sentence is wrong? The romaji will be completely different, with added, subtracted, or moved words."

I said this before I discovered where the romaji was coming from, forgot to go back and remove it before I posted, oops.

TRANG TRANG 21. veebruar 2010 21. veebruar 2010 18:29:14 UTC flag Report link Püsilink

I've replied to Paul regarding the issue of romaji:
http://tatoeba.org/wall/index#message_223

> Also, doesn't wwwjdict already have every example sentence broken down into kana

Not exactly broken down into kana. It looks more like this: 誰も[01] が 私|1(わたし)[01] は|1 間違う{間違っている} と|1 言う{いった}
For this sentence: 誰もが私は間違っているといった。

But I know I could use this to make the romaji more reliable.

blay_paul blay_paul 10. veebruar 2010 10. veebruar 2010 13:48:28 UTC flag Report link Püsilink

How are the tags used by WWWJDIC handled?

For example the following sentence should have a sense 1 tag on 誰も, but I don't see it in the edit sentence options.
http://tatoeba.org/eng/sentences/show/136692

{{vm.hiddenReplies[168] ? 'expand_more' : 'expand_less'}} peida vastused näita vastuseid
TRANG TRANG 10. veebruar 2010 10. veebruar 2010 18:23:45 UTC flag Report link Püsilink

If by "tags" you mean the indices, or "B line", then they are accessible from a special page. I'll send you the link in a private message, you can also ask Jim for more details.

If you are referring to the feature from the old version, that was using the indices to link words from the Japanese sentences to WWWJDIC, then I haven't had time to re-implement it yet.

{{vm.hiddenReplies[169] ? 'expand_more' : 'expand_less'}} peida vastused näita vastuseid
blay_paul blay_paul 10. veebruar 2010 10. veebruar 2010 18:28:49 UTC flag Report link Püsilink

Yes, I meant the "B line".