Perfil
Frasas
Vocabulary
Reviews
Lists
Marcapaginas
Comentaris
Comentaris sus las frasas de sysko
Cabinats
Jornals
Audio
Transcriptions
Translate sysko's sentences

I've added Maori to the list of languages.

c'est nettoyé

On a side (and those who follow the discussion on our IRC channel may have already heard me talking about this) , in addition to the new version, I'm also thinking on how to put tatoeba in a Peer to Peer way in order to make it less dependant of one central website and of one administrator , in order to make the service resistant to the crash of one machine or the indisponibility of one guy.

bug in the database due to a crash, actually it was only the number displayed which was wrong, the sentences are still all here :) and now it should be fixed.

=== Concerning the two last days downtime and related bugs ===
Actually it was the result of two factors, me being in business trip and the server to had crashed (due mainly to a nasty bug in the server kernel)
It should not happen in the future, sorry for the unconvenience.

actually it's already in the todo-list :)
I agree for all the other ideas (to have a markdown etc.)

c'est réparé, merci de l'avoir signalé

Note: I actually don't have any opinion myself about cc-y/cc-y-sa as nowadays my contribution to the content is very limited (and even my overall content contribution is now only a very little part of the total content), as soon as it suit the "community" opinion
The licence of the data itself is not really important in a developer point of view, as I can create my "close" code dictionnary feeded by "open" dictionnaries, (search on the android/iphone market how many chinese-english dictionnary software, free or not, do reuse for the data the cc-cedict project which is actually cc-by-sa) and as long as they attribute it, it's perfectly legal, the only things the "share-alike" add is that if they mix for example cc-cedict and handedict (also under cc-by-sa), they need to release somewhere they're resulting dictionnary under cc-by-sa.
in theory what Demetrius said is absolutly right http://wiki.creativecommons.org...ulting_work.3F
but the actual situtation make that most of the current "open" culture ecosystem has chosen cc-by-sa, which make tatoeba a "provider" but that can't be feeded by these projects , so in way it "closes" us from the content of these projects
as said, personnaly I'm both fine with cc-by and cc-by-sa, as anyway I don't consider the content of tatoeba as "mine" (though i will not say the same about the code :P) , this post is mainly to provide more information for those of the community who have never really been interested by these "copyright/copyleft" issues
Note 2 : at first my original post was mainly to also talk about a more important need to swtich licence from cc-by 2.0 to cc-by 3.0

Actually the seemed to not be that "non-willing" to respect the "attribution" part of the licence as there's an obvious tatoeba icon near the sentence, I will contact them if you want in order to put in addition to this the name of the author of the sentence.

Concerning the licence:
A user reported me that he's not able to incorpate the Tatoeba database into a package aimed to be included into the debian project http://en.wikipedia.org/wiki/Debian because of some incompatibilty with the version of the CC-BY licence we use (the 2.0), which is solved with the version 3.0 of it.
The problem is that I highly supposed to make that change I will need the explicit authorization of all the contributors, which may be a quite long process as I will need to email a lot of people.
In the same time I've also been wondering about changing the licence to CC-by-SA, which would make our project compatible in both way with some big projects (wikipedia/wiktionnary, wikia, a lot of 'open' words dictionnaries are also under cc-by-sa)
I know that's a very vast subject so I would like to know the opinion of people on it, as it may influence greatly the future of the projet, especially it's "incorporation" into the "open culture" ecosystem.

Hello, I've seen your email and the fact that you've added the 10 sentences, I try to add it before tomorrow.

[some message deleted]
If you have a problem with a given user please notify me by private message or e-mail
first because when I get busy I may miss some wall messages
second because I don't see the point of putting on the public a quarrel between two user, as at the end I'm the only one to chose or not to make extreme decision such as blocking someone etc.

Actually sometimes ago we decided for transparency reason to make the promotion to the corpus maintainer grade a public process made here on the wall, so that other people can express their support or their reserve against a candidate.

Pardon je n'ai pas vraiment développé le fond de ma pensée,
Je voulais dire qu'en dehors du temps "en classe", surtout comme là si c'est la première fois qu'ils utilisent le système, sans séance d'initiation préalables (vu que certains semblent avoir ajouté sans ponctuation des phrases par exemples), cela diminuerait de beaucoup l'intérêt de la chose, de plus le faire en "groupe" permet de pouvoir en passant "dans les rangs" (ce qui la est facilité par le fait que le processeur a en un clic les contributions de ces élèves, beaucoup plus simple que de devoir regarder chaque réponse papier des élèves derrière le purpitre, pouvoir arrêter la classe si une erreur devient fréquente (par exemple l'oubli de la ponctuation, ou l'erreur propre à l'usage de tatoeba de traduire une langue en pensant en traduire une autre)
Sinon oui je pense qu'un professeur un tant soit peu inventif et impliqués peut arriver à trouver moult autre activités possible (CK m'en avait déjà montré plusieurs). Après je sais que si j'étais en France et que j'avais un peu plus de temps libre, je serais aller voir mon ancien prof de classes européenne (typiquement le genre de section où le prof a une plus grande liberté sur comment il occupe ces heures additionnelles de langues, qui plus est avec des groupes plus restreints que des classes normales)

à voir au vu de l'heure et du fuseau horaire turque. Je regarderai les logs des addresses ip voir si elles sont identiques (ce sera le cas s'ils font en classe) ou dans la même zone géographique (si c'est plus des devoirs à la maison) (bien que personnellement au vu de l'age qu'a mis l'un d'entre eux, ça serait surement des jeunes lycéens, enfin l'équivalent, et autant encadrer de très cela peut-être utile, autant en simple devoir à la maison c'est plus compliqués, sauf si le professeur regarde les phrases le soir, et fais une compilation des fautes pour le prochain cour)

[some message have been removed]

After I don't blame the users who thought in a first time that it was not a big deal to do this. I just hope they understand after why I must to do it.

Ok, I will try to see if that's possible to have this with the current search engine we're using (Sphinx, for those interested, I'll be glad actually if someone already have the solution and want to share it with us)

Concerning copyrighted sentences extracted from textbook
http://tatoeba.org/eng/sentence...comment-165797
I deleted the sentence before seeing the comment so I answer here, and anyway I think it may be informational for other people
Here actually I had a high doubt that the sentence was natural, though it was a very simple sentence, I never heard such a construction. So I've googled it
https://www.google.com/#hl=en&q..."&oq="球被他踢进去了"
3 results (so very very uncommon sentence, especially as two links are non native, and the third one is a translation of an article in English, and knowing Chinese people quite well, it's likely to be a simple copy paste of google translate or so)
Moreover the first link is a "website-ified" version of a textbook I know as my students here use it too to learn Chinese. If you take a look at the other sentences of this user, they are exactly the sentences extracted from the same exercises, and for the others they also give the same kind of results (very few link , less than 10, and pointing to this book, sometimes it require a little more investigation as it's a "put the character in the right order")
So all of this made me pretty sure that the "block" of sentences he added come from that book.
Of course you can't copyright "he's eating an apple", but adding exactly the same sentences in "block" then start to fall under things that can be protected by copyright (otherwise any books is only a set of words that everyone can use)
After I know that people who have been here for a long time know my position on that subject, but I will still precise it for the new users. I also find this situation stupid, and I would also like being in a world where "open culture" project such a wikipedia, tatoeba, universal subtitles, global voices etc. can get rid of this copyright stuff, but unfortunately it's not and some of these project (I mainly think about the handedict project) who didn't pay attention to this needed to be closed during some months (the time to "clean") because some users started to enter too much copyrighted data from other sources
So here I really don't want to take the risk to be sued, even if it's very very low, but the impact on the reputation of Tatoeba, the time it would take to check the database etc. is too huge, so I prefer to be quite strict on this.

yes I understand your request but what if a vietnamese or soemone with a keyboard without possibilities to input these diacritic want to search for something. I do understand your request as even in my own mother tongue (french) it can be problemati, but in an other way I do understand that many people in either a educational or professional context (i.e on computers on which they cannot install their own input method), can not enter these diacritics (even on a cellphone their own themselves)