Wall (6,960 threads)
Tips
Before asking a question, make sure to read the FAQ.
We aim to maintain a healthy atmosphere for civilized discussions. Please read our rules against bad behavior.
marafon
4 days ago
CK
5 days ago
sharptoothed
10 days ago
Cangarejo
10 days ago
Cangarejo
13 days ago
Thanuir
13 days ago
ondo
14 days ago
ddnktr
14 days ago
ondo
14 days ago
AlanF_US
18 days ago
(eo) Unu frazo (cetere malĝusta) afiŝiĝis, kiel "ŝatata" de mi. Kiel do mi povas ĝin malaperigi ?
(fr) Une phrase (d'ailleurs incorrecte) s'est accidentellement affichée comme étant pour moi une phrase "favorite". Comment puis-je la faire disparaître ?
Vi devas klaki sur la koron de la frazo:
Normale estas rugxa tiu koro, kaj kun verda pluso - simbolo por "aldoni al sxatataj frazoj". Sed se tiu frazo jam (hazarde) estas ene de viaj sxatataj frazoj la simbolo normale farigxas griza kun minuso. Tiun grizan simbolon vi klaku por fordoni la frazon for de viaj sxatataj frazoj.
(eo) Dankon al 'jakpv" ! La frazo malaperis el miaj "sxatataj frazoj", kion mi celis.
Hodiaŭ ankaŭ al mi okazis io stranga. En la ĉefpaĝo aperis frazo kvazaŭ mi estis ĵus aldoninta ĝin. Sed ĝi estis aldonita antaŭ tagoj kaj kune kun pliaj du tradukoj. Hodiaŭ la sama frazo aperis sola kaj kun ne indikita lingvo. Do mi simple redaktis ĝin.
I'm Tatoeba in morning, it's first time! :)
Kind regards,
boracasli
A minor suggestion regarding the generated pinyin on Chinese sentences:
When two syllables are joined together as one word and the second begins with a vowel, put a ' between them. This usually seems to be done with the pinyin that I have seen.
For example, on http://tatoeba.org/spa/sentences/show/698913, it would be nice to see "kù'ài" rather than "kùài" (especially because "uai" is a valid syllable final).
Oh yep sure, in fact the software I've made only segments words, the words themselves, and their pinyin is already stored, so it's just a per entry correction to make :), I will do that, if you see other entries which are missing this correction, can you put them in this thread ? :)
Can Text from a project under the GNU GPL Version 2 be added without concern? I've read that Tatoeba is licenced under CC but i am not sure what consequences this has: Does it mean i have to add a comment to every sentence i copy from a gpl project?
What I would like to do is add some sentences from www.reta-vortaro.de
Unfortunately, I don't think this is possible. According to http://www.gnu.org/licenses/lic...OtherLicenses, the GPL is not compatible with CC-BY.
you still can contact the author to ask him if he's the author of this sentences, or if at least it does have the right on these sentences, so that he can give you a copy of them relicenced under the CC-BY or a compatible licence. Because to be honest the GPL has not been made for text, books or so, there are more suitable licences for this kind of data (such as the CC-BY / CC-BY-SA , or the Gnu FDL if you want to say in Gnu's licence), so I do think the guy chose th GPL more because it was to say "my data are open" rather than for the exact terms of the licence. So it costs nothing to ask :) (1000 shanghainese sentences come from a copyrighted books for which the author given me the explicit authorization to use them under CC-BY)
But there still have lots of local word like country name, person name, and nouns.
but it's still not a different language, and it still can be written in both script. If a sentence is specific to a part of China, then it will be tagged accordingly.
Hi, I'm a traditional Chinese user.
There two types in Chinese:
Simple Chinese, use in China, and some south-east Asia country.
Traditional Chinese, use in Taiwan, Hong Kong, Macao.
Could you please add Simple Chinese and Traditional Chinese, not only "Chinese"?
Thank you!
Hi, glad to see a new Chinese user here
If you take a look you will see that each chinese sentences come with both script, and when you add a sentence, the other script version is automatically generated. Moreover as said, it's 2 different scripts, not 2 different languages.
How accurate is the simplified to traditional conversion? As I understand it, every traditional character corresponds to one unique simplified character, but the reverse is not true, as many characters were combined. Does the software make smart guesses based on context? Or does it just always choose the most common one?
I prefer traditional characters and I would like to make sure that any auto-generated conversions are correct so I don't use the wrong characters. ;-)
the software works on "words" y "words" not on characters by characters, and looking to the number of "words" I use (around 100k). I think it's really realiable, as anyway it's the technics used by wikipedia (and their words list is far shorter than mine), and most of possible ambiguity disappear if you see the text as "words" (anyway otherwise it would have been ambigous for human reader too).
But after some "errors" are still possible, for the rare case of single characters "words" and also for errors in the words list itself. So by correcting the words list, and by permit manual edit of the generated "other script" version (which will be possible in a near future) we should be able to reach easily a 99,9% accuracy. (I think we're near 95~98% for the moment)
Just FYI, every traditional character does not correspond to one unique simplified character. Cases in point:
著 (zhe) -> 着
著 (zhù) -> 著
Is there any sort of tagging added to the two sentences to say that one is simplified and the other traditional? I see this as being very useful if people want to use tatoeba as part of other applications.
chinese sentences has a little icon in the icon bar to say if it's simplified or traditional script.
I don't see the little icon any more too, sysko!
i've just fixed it :)
Hmm... I don't see that on this sentence: http://tatoeba.org/eng/sentences/show/692904
It looks fine to me. The sentence appears to be traditional Chinese. Below that is the pinyin. And below that is the simplified Chinese (in light grey).
If one translates a sentence that has a tag, the tag gets "lost" i.e. One doesnt see it next to the translated sentence. I think the tags of a sentence should be displayed just like the sentences themselves: the tags of the sentence youre looking at in colour and the ones of the translations greyed out (or similar). Like this these tags could get shortcuts to add the same tag to the current sentence and (even more important) work as a kind of annotation.
E.g. if the tag next to a translation says "rude" I would rather not use it in a letter.
Would this be a good idea for the upcoming version? What do you think?
I think this makes sense for some tags, but not for others. For example "です。", "6 syllables", and "Spain Spanish" shouldn't be showing up on translations.
On the other hand, something like "weather", "family", or "romance" probably could apply to translations.
Theoretically, the system could keep a list of tags that were okay to apply to translations automatically, but I don't think it's really a necessary feature.
What I would like to see a lot more is a system for adding metadata. It would be like tags, except that the tag would store data. For example rather than having "by Albert Einstein" and "by Mark Twain", we would have a data-tag "by" and sentences could have "by: Albert Einstein" and "by: Mark Twain". A better example might be replacing "8 to 10 words" etc. with "number of words: 9" etc. Then the search function could let you, for example, find all the English sentences with between 12 and 19 words. The system could also be automated so we didn't have to sit there counting words. :-)
Sorry, i didnt point this out clear enough: I wouldnt want to automatically copy the tags, but just have the option to quickly do so. This means that there could be "spain spanish" next to the spanish translation of the current sentence (which wouldnt make much sence to click on unless the current sentence is also spanish). But if there would be written "family" next to the spanish sentnce and the current sentence also refers to something about "family" you might decide to click it and thereby quickly add this tag to the current sentence too.
As far as I see, not much "filter" needed
This would also add an oportunity to allow adding these tags for not-yet "trusted users": They would just click the tag from the oher sentence, without the possibility to mess things up.
Also I like your metadata-idea - lets see how sysko is going to solve this.
As for necessity: i think this might promote the usage of tags.
La frazoj numero 698866 kaj 398316 estas samsignifaj. Kiu povas kunigi ilin?
Vi nu devas aldoni komenton, ke iu admin bonvole kunligu la du frazojn.
Aux vi demadu na Trang cxu sxi vin igus "trusted user". Tiam vi povus ligi iun frazon al viaj frazoj.
Are there any guidelines for sentence length?
No, there aren't strict guidelines, but the input form has limited characters (but I don't know the exact length allowed).
Anyway, remember that sentences should be useful for learners, so maybe you could for example split a paragraph in two sentences and put a link to the other one below, in a comment.
For long texts, like public domain books, we will have a separate section in the future, so it is better to add at least short quotes or small paragraphs you like best, and the entire novel / speech / essay will be translated when this feature will be available.
the technical limitation is 500 characters (due to the way we store sentences). After for the "guideline" part, I agree with what Pahramp said.
I just bring up Ukrainian translations to 15000. Слава Україні! :)
Are we expected to answer „Героям слава“? :)
on your discretion :*)
Great! Thank you very much!