menu
Tatoeba
language
S'inscriure Connexion
language Occitan
menu
Tatoeba

chevron_right S'inscriure

chevron_right Connexion

Percórrer

chevron_right Afichar la frasa aleatòria

chevron_right Percórrer per lenga

chevron_right Percórrer per lista

chevron_right Percórrer per etiqueta

chevron_right Percórrer los enregistraments àudio

Community

chevron_right Paret

chevron_right Lista de totes los membres

chevron_right Languages of members

chevron_right Native speakers

search
clear
swap_horiz
search
sysko {{ icon }} keyboard_arrow_right

Perfil

keyboard_arrow_right

Frasas

keyboard_arrow_right

Vocabulary

keyboard_arrow_right

Reviews

keyboard_arrow_right

Lists

keyboard_arrow_right

Marcapaginas

keyboard_arrow_right

Comentaris

keyboard_arrow_right

Comentaris sus las frasas de sysko

keyboard_arrow_right

Cabinats

keyboard_arrow_right

Jornals

keyboard_arrow_right

Audio

keyboard_arrow_right

Transcriptions

translate

Translate sysko's sentences

Cabinets de sysko sus la paret (total 1397)

sysko sysko June 3, 2012 June 3, 2012 at 4:13:04 PM UTC link Permalink

I've added Maori to the list of languages.

sysko sysko June 3, 2012 June 3, 2012 at 3:41:12 PM UTC link Permalink

c'est nettoyé

sysko sysko June 1, 2012 June 1, 2012 at 12:12:03 PM UTC link Permalink

On a side (and those who follow the discussion on our IRC channel may have already heard me talking about this) , in addition to the new version, I'm also thinking on how to put tatoeba in a Peer to Peer way in order to make it less dependant of one central website and of one administrator , in order to make the service resistant to the crash of one machine or the indisponibility of one guy.

sysko sysko June 1, 2012 June 1, 2012 at 12:09:54 PM UTC link Permalink

bug in the database due to a crash, actually it was only the number displayed which was wrong, the sentences are still all here :) and now it should be fixed.

sysko sysko June 1, 2012 June 1, 2012 at 12:07:12 PM UTC link Permalink

=== Concerning the two last days downtime and related bugs ===

Actually it was the result of two factors, me being in business trip and the server to had crashed (due mainly to a nasty bug in the server kernel)

It should not happen in the future, sorry for the unconvenience.

sysko sysko May 26, 2012 May 26, 2012 at 1:19:45 PM UTC link Permalink

actually it's already in the todo-list :)

I agree for all the other ideas (to have a markdown etc.)

sysko sysko May 26, 2012 May 26, 2012 at 1:18:33 PM UTC link Permalink

c'est réparé, merci de l'avoir signalé

sysko sysko May 21, 2012 May 21, 2012 at 6:43:41 PM UTC link Permalink

Note: I actually don't have any opinion myself about cc-y/cc-y-sa as nowadays my contribution to the content is very limited (and even my overall content contribution is now only a very little part of the total content), as soon as it suit the "community" opinion

The licence of the data itself is not really important in a developer point of view, as I can create my "close" code dictionnary feeded by "open" dictionnaries, (search on the android/iphone market how many chinese-english dictionnary software, free or not, do reuse for the data the cc-cedict project which is actually cc-by-sa) and as long as they attribute it, it's perfectly legal, the only things the "share-alike" add is that if they mix for example cc-cedict and handedict (also under cc-by-sa), they need to release somewhere they're resulting dictionnary under cc-by-sa.

in theory what Demetrius said is absolutly right http://wiki.creativecommons.org...ulting_work.3F
but the actual situtation make that most of the current "open" culture ecosystem has chosen cc-by-sa, which make tatoeba a "provider" but that can't be feeded by these projects , so in way it "closes" us from the content of these projects

as said, personnaly I'm both fine with cc-by and cc-by-sa, as anyway I don't consider the content of tatoeba as "mine" (though i will not say the same about the code :P) , this post is mainly to provide more information for those of the community who have never really been interested by these "copyright/copyleft" issues

Note 2 : at first my original post was mainly to also talk about a more important need to swtich licence from cc-by 2.0 to cc-by 3.0

sysko sysko May 21, 2012 May 21, 2012 at 6:27:17 PM UTC link Permalink

Actually the seemed to not be that "non-willing" to respect the "attribution" part of the licence as there's an obvious tatoeba icon near the sentence, I will contact them if you want in order to put in addition to this the name of the author of the sentence.

sysko sysko May 21, 2012 May 21, 2012 at 10:22:59 AM UTC link Permalink

Concerning the licence:

A user reported me that he's not able to incorpate the Tatoeba database into a package aimed to be included into the debian project http://en.wikipedia.org/wiki/Debian because of some incompatibilty with the version of the CC-BY licence we use (the 2.0), which is solved with the version 3.0 of it.

The problem is that I highly supposed to make that change I will need the explicit authorization of all the contributors, which may be a quite long process as I will need to email a lot of people.

In the same time I've also been wondering about changing the licence to CC-by-SA, which would make our project compatible in both way with some big projects (wikipedia/wiktionnary, wikia, a lot of 'open' words dictionnaries are also under cc-by-sa)

I know that's a very vast subject so I would like to know the opinion of people on it, as it may influence greatly the future of the projet, especially it's "incorporation" into the "open culture" ecosystem.

sysko sysko May 21, 2012 May 21, 2012 at 9:44:57 AM UTC link Permalink

Hello, I've seen your email and the fact that you've added the 10 sentences, I try to add it before tomorrow.

sysko sysko May 17, 2012 May 17, 2012 at 7:15:24 PM UTC link Permalink

[some message deleted]
If you have a problem with a given user please notify me by private message or e-mail

first because when I get busy I may miss some wall messages
second because I don't see the point of putting on the public a quarrel between two user, as at the end I'm the only one to chose or not to make extreme decision such as blocking someone etc.

sysko sysko May 15, 2012 May 15, 2012 at 5:29:20 AM UTC link Permalink

Actually sometimes ago we decided for transparency reason to make the promotion to the corpus maintainer grade a public process made here on the wall, so that other people can express their support or their reserve against a candidate.

sysko sysko May 3, 2012 May 3, 2012 at 11:28:54 PM UTC link Permalink

Pardon je n'ai pas vraiment développé le fond de ma pensée,

Je voulais dire qu'en dehors du temps "en classe", surtout comme là si c'est la première fois qu'ils utilisent le système, sans séance d'initiation préalables (vu que certains semblent avoir ajouté sans ponctuation des phrases par exemples), cela diminuerait de beaucoup l'intérêt de la chose, de plus le faire en "groupe" permet de pouvoir en passant "dans les rangs" (ce qui la est facilité par le fait que le processeur a en un clic les contributions de ces élèves, beaucoup plus simple que de devoir regarder chaque réponse papier des élèves derrière le purpitre, pouvoir arrêter la classe si une erreur devient fréquente (par exemple l'oubli de la ponctuation, ou l'erreur propre à l'usage de tatoeba de traduire une langue en pensant en traduire une autre)

Sinon oui je pense qu'un professeur un tant soit peu inventif et impliqués peut arriver à trouver moult autre activités possible (CK m'en avait déjà montré plusieurs). Après je sais que si j'étais en France et que j'avais un peu plus de temps libre, je serais aller voir mon ancien prof de classes européenne (typiquement le genre de section où le prof a une plus grande liberté sur comment il occupe ces heures additionnelles de langues, qui plus est avec des groupes plus restreints que des classes normales)

sysko sysko May 3, 2012 May 3, 2012 at 11:10:30 PM UTC link Permalink

à voir au vu de l'heure et du fuseau horaire turque. Je regarderai les logs des addresses ip voir si elles sont identiques (ce sera le cas s'ils font en classe) ou dans la même zone géographique (si c'est plus des devoirs à la maison) (bien que personnellement au vu de l'age qu'a mis l'un d'entre eux, ça serait surement des jeunes lycéens, enfin l'équivalent, et autant encadrer de très cela peut-être utile, autant en simple devoir à la maison c'est plus compliqués, sauf si le professeur regarde les phrases le soir, et fais une compilation des fautes pour le prochain cour)

sysko sysko May 3, 2012 May 3, 2012 at 10:49:06 PM UTC link Permalink

[some message have been removed]

sysko sysko April 29, 2012 April 29, 2012 at 1:48:50 PM UTC link Permalink

After I don't blame the users who thought in a first time that it was not a big deal to do this. I just hope they understand after why I must to do it.

sysko sysko April 29, 2012 April 29, 2012 at 1:44:32 PM UTC link Permalink

Ok, I will try to see if that's possible to have this with the current search engine we're using (Sphinx, for those interested, I'll be glad actually if someone already have the solution and want to share it with us)

sysko sysko April 29, 2012 April 29, 2012 at 1:43:01 PM UTC link Permalink

Concerning copyrighted sentences extracted from textbook

http://tatoeba.org/eng/sentence...comment-165797
I deleted the sentence before seeing the comment so I answer here, and anyway I think it may be informational for other people

Here actually I had a high doubt that the sentence was natural, though it was a very simple sentence, I never heard such a construction. So I've googled it
https://www.google.com/#hl=en&q..."&oq="球被他踢进去了"

3 results (so very very uncommon sentence, especially as two links are non native, and the third one is a translation of an article in English, and knowing Chinese people quite well, it's likely to be a simple copy paste of google translate or so)
Moreover the first link is a "website-ified" version of a textbook I know as my students here use it too to learn Chinese. If you take a look at the other sentences of this user, they are exactly the sentences extracted from the same exercises, and for the others they also give the same kind of results (very few link , less than 10, and pointing to this book, sometimes it require a little more investigation as it's a "put the character in the right order")

So all of this made me pretty sure that the "block" of sentences he added come from that book.

Of course you can't copyright "he's eating an apple", but adding exactly the same sentences in "block" then start to fall under things that can be protected by copyright (otherwise any books is only a set of words that everyone can use)

After I know that people who have been here for a long time know my position on that subject, but I will still precise it for the new users. I also find this situation stupid, and I would also like being in a world where "open culture" project such a wikipedia, tatoeba, universal subtitles, global voices etc. can get rid of this copyright stuff, but unfortunately it's not and some of these project (I mainly think about the handedict project) who didn't pay attention to this needed to be closed during some months (the time to "clean") because some users started to enter too much copyrighted data from other sources

So here I really don't want to take the risk to be sued, even if it's very very low, but the impact on the reputation of Tatoeba, the time it would take to check the database etc. is too huge, so I prefer to be quite strict on this.

sysko sysko April 24, 2012 April 24, 2012 at 10:06:37 PM UTC link Permalink

yes I understand your request but what if a vietnamese or soemone with a keyboard without possibilities to input these diacritic want to search for something. I do understand your request as even in my own mother tongue (french) it can be problemati, but in an other way I do understand that many people in either a educational or professional context (i.e on computers on which they cannot install their own input method), can not enter these diacritics (even on a cellphone their own themselves)