Wall - Tatoeba

Wall (6,960 threads)

Tips

Before asking a question, make sure to read the FAQ.

We aim to maintain a healthy atmosphere for civilized discussions. Please read our rules against bad behavior.

Latest messages

subdirectory_arrow_right

marafon

4 days ago

feedback

CK

5 days ago

feedback

sharptoothed

10 days ago

subdirectory_arrow_right

Cangarejo

10 days ago

subdirectory_arrow_right

Cangarejo

13 days ago

subdirectory_arrow_right

Thanuir

13 days ago

subdirectory_arrow_right

ondo

14 days ago

subdirectory_arrow_right

ddnktr

14 days ago

feedback

ondo

14 days ago

subdirectory_arrow_right

AlanF_US

18 days ago

fs December 30, 2010 December 30, 2010 at 11:22:33 PM UTC

link

Permalink

(eo) Unu frazo (cetere malĝusta) afiŝiĝis, kiel "ŝatata" de mi. Kiel do mi povas ĝin malaperigi ?

(fr) Une phrase (d'ailleurs incorrecte) s'est accidentellement affichée comme étant pour moi une phrase "favorite". Comment puis-je la faire disparaître ?

hide replies show replies

jakov January 6, 2011 January 6, 2011 at 4:01:27 PM UTC

link

Permalink

Vi devas klaki sur la koron de la frazo:
Normale estas rugxa tiu koro, kaj kun verda pluso - simbolo por "aldoni al sxatataj frazoj". Sed se tiu frazo jam (hazarde) estas ene de viaj sxatataj frazoj la simbolo normale farigxas griza kun minuso. Tiun grizan simbolon vi klaku por fordoni la frazon for de viaj sxatataj frazoj.

hide replies show replies

fs January 7, 2011 January 7, 2011 at 10:30:41 AM UTC

link

Permalink

(eo) Dankon al 'jakpv" ! La frazo malaperis el miaj "sxatataj frazoj", kion mi celis.

Gyuri December 30, 2010 December 30, 2010 at 11:53:23 PM UTC

link

Permalink

Hodiaŭ ankaŭ al mi okazis io stranga. En la ĉefpaĝo aperis frazo kvazaŭ mi estis ĵus aldoninta ĝin. Sed ĝi estis aldonita antaŭ tagoj kaj kune kun pliaj du tradukoj. Hodiaŭ la sama frazo aperis sola kaj kun ne indikita lingvo. Do mi simple redaktis ĝin.

boracasli January 5, 2011 January 5, 2011 at 6:23:34 AM UTC

link

Permalink

I'm Tatoeba in morning, it's first time! :)

Kind regards,
boracasli

Zifre January 5, 2011 January 5, 2011 at 12:23:14 AM UTC

link

Permalink

A minor suggestion regarding the generated pinyin on Chinese sentences:

When two syllables are joined together as one word and the second begins with a vowel, put a ' between them. This usually seems to be done with the pinyin that I have seen.

For example, on http://tatoeba.org/spa/sentences/show/698913, it would be nice to see "kù'ài" rather than "kùài" (especially because "uai" is a valid syllable final).

hide replies show replies

sysko January 5, 2011 January 5, 2011 at 5:36:05 AM UTC

link

Permalink

Oh yep sure, in fact the software I've made only segments words, the words themselves, and their pinyin is already stored, so it's just a per entry correction to make :), I will do that, if you see other entries which are missing this correction, can you put them in this thread ? :)

jakov January 5, 2011 January 5, 2011 at 12:45:11 AM UTC

link

Permalink

Can Text from a project under the GNU GPL Version 2 be added without concern? I've read that Tatoeba is licenced under CC but i am not sure what consequences this has: Does it mean i have to add a comment to every sentence i copy from a gpl project?
What I would like to do is add some sentences from www.reta-vortaro.de

hide replies show replies

Zifre January 5, 2011 January 5, 2011 at 2:21:03 AM UTC

link

Permalink

Unfortunately, I don't think this is possible. According to http://www.gnu.org/licenses/lic...OtherLicenses, the GPL is not compatible with CC-BY.

sysko January 5, 2011 January 5, 2011 at 5:31:19 AM UTC

link

Permalink

you still can contact the author to ask him if he's the author of this sentences, or if at least it does have the right on these sentences, so that he can give you a copy of them relicenced under the CC-BY or a compatible licence. Because to be honest the GPL has not been made for text, books or so, there are more suitable licences for this kind of data (such as the CC-BY / CC-BY-SA , or the Gnu FDL if you want to say in Gnu's licence), so I do think the guy chose th GPL more because it was to say "my data are open" rather than for the exact terms of the licence. So it costs nothing to ask :) (1000 shanghainese sentences come from a copyrighted books for which the author given me the explicit authorization to use them under CC-BY)

frankou January 5, 2011 January 5, 2011 at 5:10:37 AM UTC

link

Permalink

But there still have lots of local word like country name, person name, and nouns.

hide replies show replies

sysko January 5, 2011 January 5, 2011 at 5:15:26 AM UTC

link

Permalink

but it's still not a different language, and it still can be written in both script. If a sentence is specific to a part of China, then it will be tagged accordingly.

frankou January 5, 2011 January 5, 2011 at 4:22:20 AM UTC

link

Permalink

Hi, I'm a traditional Chinese user.
There two types in Chinese:
Simple Chinese, use in China, and some south-east Asia country.
Traditional Chinese, use in Taiwan, Hong Kong, Macao.

Could you please add Simple Chinese and Traditional Chinese, not only "Chinese"?

Thank you!

hide replies show replies

sysko January 5, 2011 January 5, 2011 at 5:01:45 AM UTC

link

Permalink

Hi, glad to see a new Chinese user here
If you take a look you will see that each chinese sentences come with both script, and when you add a sentence, the other script version is automatically generated. Moreover as said, it's 2 different scripts, not 2 different languages.

hide replies show replies

Zifre January 5, 2011 January 5, 2011 at 8:52:49 PM UTC

link

Permalink

How accurate is the simplified to traditional conversion? As I understand it, every traditional character corresponds to one unique simplified character, but the reverse is not true, as many characters were combined. Does the software make smart guesses based on context? Or does it just always choose the most common one?

I prefer traditional characters and I would like to make sure that any auto-generated conversions are correct so I don't use the wrong characters. ;-)

hide replies show replies

sysko January 5, 2011 January 5, 2011 at 9:43:02 PM UTC

link

Permalink

the software works on "words" y "words" not on characters by characters, and looking to the number of "words" I use (around 100k). I think it's really realiable, as anyway it's the technics used by wikipedia (and their words list is far shorter than mine), and most of possible ambiguity disappear if you see the text as "words" (anyway otherwise it would have been ambigous for human reader too).
But after some "errors" are still possible, for the rare case of single characters "words" and also for errors in the words list itself. So by correcting the words list, and by permit manual edit of the generated "other script" version (which will be possible in a near future) we should be able to reach easily a 99,9% accuracy. (I think we're near 95~98% for the moment)

minshirui January 8, 2011 January 8, 2011 at 4:03:41 AM UTC

link

Permalink

Just FYI, every traditional character does not correspond to one unique simplified character. Cases in point:
著 (zhe) -> 着
著 (zhù) -> 著

minshirui January 5, 2011 January 5, 2011 at 5:49:23 PM UTC

link

Permalink

Is there any sort of tagging added to the two sentences to say that one is simplified and the other traditional? I see this as being very useful if people want to use tatoeba as part of other applications.

hide replies show replies

sysko January 5, 2011 January 5, 2011 at 6:05:03 PM UTC

link

Permalink

chinese sentences has a little icon in the icon bar to say if it's simplified or traditional script.

hide replies show replies

Pharamp January 6, 2011 January 6, 2011 at 9:49:23 AM UTC

link

Permalink

I don't see the little icon any more too, sysko!

hide replies show replies

sysko January 10, 2011 January 10, 2011 at 10:00:24 AM UTC

link

Permalink

i've just fixed it :)

minshirui January 5, 2011 January 5, 2011 at 11:13:48 PM UTC

link

Permalink

Hmm... I don't see that on this sentence: http://tatoeba.org/eng/sentences/show/692904

hide replies show replies

Zifre January 6, 2011 January 6, 2011 at 1:06:19 AM UTC

link

Permalink

It looks fine to me. The sentence appears to be traditional Chinese. Below that is the pinyin. And below that is the simplified Chinese (in light grey).

jakov January 4, 2011 January 4, 2011 at 11:58:21 PM UTC

link

Permalink

If one translates a sentence that has a tag, the tag gets "lost" i.e. One doesnt see it next to the translated sentence. I think the tags of a sentence should be displayed just like the sentences themselves: the tags of the sentence youre looking at in colour and the ones of the translations greyed out (or similar). Like this these tags could get shortcuts to add the same tag to the current sentence and (even more important) work as a kind of annotation.
E.g. if the tag next to a translation says "rude" I would rather not use it in a letter.
Would this be a good idea for the upcoming version? What do you think?

hide replies show replies

Zifre January 5, 2011 January 5, 2011 at 12:14:44 AM UTC

link

Permalink

I think this makes sense for some tags, but not for others. For example "です。", "6 syllables", and "Spain Spanish" shouldn't be showing up on translations.

On the other hand, something like "weather", "family", or "romance" probably could apply to translations.

Theoretically, the system could keep a list of tags that were okay to apply to translations automatically, but I don't think it's really a necessary feature.

What I would like to see a lot more is a system for adding metadata. It would be like tags, except that the tag would store data. For example rather than having "by Albert Einstein" and "by Mark Twain", we would have a data-tag "by" and sentences could have "by: Albert Einstein" and "by: Mark Twain". A better example might be replacing "8 to 10 words" etc. with "number of words: 9" etc. Then the search function could let you, for example, find all the English sentences with between 12 and 19 words. The system could also be automated so we didn't have to sit there counting words. :-)

hide replies show replies

jakov January 5, 2011 January 5, 2011 at 12:58:48 AM UTC

link

Permalink

Sorry, i didnt point this out clear enough: I wouldnt want to automatically copy the tags, but just have the option to quickly do so. This means that there could be "spain spanish" next to the spanish translation of the current sentence (which wouldnt make much sence to click on unless the current sentence is also spanish). But if there would be written "family" next to the spanish sentnce and the current sentence also refers to something about "family" you might decide to click it and thereby quickly add this tag to the current sentence too.
As far as I see, not much "filter" needed

This would also add an oportunity to allow adding these tags for not-yet "trusted users": They would just click the tag from the oher sentence, without the possibility to mess things up.
Also I like your metadata-idea - lets see how sysko is going to solve this.

hide replies show replies

jakov January 5, 2011 January 5, 2011 at 1:00:44 AM UTC

link

Permalink

As for necessity: i think this might promote the usage of tags.

Alois January 4, 2011 January 4, 2011 at 10:58:43 PM UTC

link

Permalink

La frazoj numero 698866 kaj 398316 estas samsignifaj. Kiu povas kunigi ilin?

hide replies show replies

jakov January 4, 2011 January 4, 2011 at 11:09:06 PM UTC

link

Permalink

Vi nu devas aldoni komenton, ke iu admin bonvole kunligu la du frazojn.
Aux vi demadu na Trang cxu sxi vin igus "trusted user". Tiam vi povus ligi iun frazon al viaj frazoj.

papabear January 4, 2011 January 4, 2011 at 8:36:20 PM UTC

link

Permalink

Are there any guidelines for sentence length?

hide replies show replies

Pharamp January 4, 2011 January 4, 2011 at 8:48:49 PM UTC

link

Permalink

No, there aren't strict guidelines, but the input form has limited characters (but I don't know the exact length allowed).

Anyway, remember that sentences should be useful for learners, so maybe you could for example split a paragraph in two sentences and put a link to the other one below, in a comment.

For long texts, like public domain books, we will have a separate section in the future, so it is better to add at least short quotes or small paragraphs you like best, and the entire novel / speech / essay will be translated when this feature will be available.

sysko January 4, 2011 January 4, 2011 at 9:20:35 PM UTC

link

Permalink

the technical limitation is 500 characters (due to the way we store sentences). After for the "guideline" part, I agree with what Pahramp said.

menix January 4, 2011 January 4, 2011 at 8:09:41 PM UTC

link

Permalink

I just bring up Ukrainian translations to 15000. Слава Україні! :)

hide replies show replies

Demetrius January 4, 2011 January 4, 2011 at 8:41:26 PM UTC

link

Permalink

Are we expected to answer „Героям слава“? :)

hide replies show replies

menix January 4, 2011 January 4, 2011 at 10:35:59 PM UTC

link

Permalink

on your discretion :*)

Pharamp January 4, 2011 January 4, 2011 at 8:40:17 PM UTC

link

Permalink

Great! Thank you very much!

Wall (6,960 threads)

Tips

marafon

CK

sharptoothed

Cangarejo

Cangarejo

Thanuir

ondo

ddnktr

ondo

AlanF_US

Need some help?

Developers

About