sysko's messages on the Wall (total 1397)

sysko, December 11, 2011 at 6:09:22 PM UTC

I've temporarily removed "Auto detection", since it isn't working right now anyway. In the meantime I've started thinking about coding my own detector, as there seems to be no good open-source tool out there for this. Basically, I plan to use the current Tatoeba database to build some kind of 3-gram statistics table per language, and then write a magic algorithm that calculates a score for each language when you submit a sentence and picks the language with the highest score.

(For reuse purposes it will be independent from Tatoeba: a standalone service, though here on Tatoeba we will still use it transparently.)

While coding this, if I have time and if it's really needed, I may also let users declare the languages they contribute in, so that the system doesn't wrongly detect a sentence as Spanish for someone who contributes only in French, Shanghainese and Chinese.
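
The 3-gram scoring idea in this post could look roughly like the sketch below. It is only an illustration under assumptions: the function names, the add-one smoothing, the `allowed` parameter (standing in for the languages a user has declared) and the toy corpus are all hypothetical and are not taken from the actual Tatoeba code.

```python
import math
from collections import Counter, defaultdict


def trigrams(text):
    """Return the character 3-grams of a sentence, padded with spaces."""
    padded = f"  {text.lower()} "
    return [padded[i:i + 3] for i in range(len(padded) - 2)]


def build_profiles(corpus):
    """Build per-language 3-gram counts from (language, sentence) pairs."""
    profiles = defaultdict(Counter)
    for lang, sentence in corpus:
        profiles[lang].update(trigrams(sentence))
    return profiles


def score(sentence, profile, vocab_size):
    """Sum of log-probabilities of the sentence's 3-grams (add-one smoothing, an assumption)."""
    total = sum(profile.values())
    return sum(
        math.log((profile[gram] + 1) / (total + vocab_size))
        for gram in trigrams(sentence)
    )


def detect(sentence, profiles, allowed=None):
    """Pick the highest-scoring language, optionally restricted to `allowed`."""
    candidates = [lang for lang in profiles if allowed is None or lang in allowed]
    if not candidates:
        return "unknown"
    vocab_size = len({gram for profile in profiles.values() for gram in profile})
    return max(candidates, key=lambda lang: score(sentence, profiles[lang], vocab_size))


# Toy corpus for illustration only; a real table would be built from the sentence database.
corpus = [
    ("eng", "the cat is on the table"),
    ("eng", "this is the house"),
    ("fra", "le chat est sur la table"),
    ("fra", "c'est la maison"),
]
profiles = build_profiles(corpus)
print(detect("the house is big", profiles))
print(detect("the house", profiles, allowed={"fra"}))
```

Passing `allowed` restricts the candidates to a user's declared languages, which is one way to avoid the "detected as Spanish" case mentioned above.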

sysko, December 9, 2011 at 12:11:03 AM UTC

Actually, I did some quick debugging and discovered that this page was making a heavy request on the database which, with the current load on the website, timed out before completing (and in the meantime increased the already high load even more).

So until I put a cache on this page, I will deactivate it.
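
One simple way to put a cache in front of a heavy page query is a time-based cache, sketched below. This is a generic illustration, not the fix that was actually deployed: `TTLCache`, `heavy_stats_query` and the five-minute lifetime are hypothetical names and values.

```python
import time


class TTLCache:
    """Keep computed values for `ttl_seconds`, then recompute on the next request."""

    def __init__(self, ttl_seconds, clock=time.monotonic):
        self.ttl = ttl_seconds
        self.clock = clock
        self._store = {}  # key -> (time stored, value)

    def get_or_compute(self, key, compute):
        now = self.clock()
        entry = self._store.get(key)
        if entry is not None and now - entry[0] < self.ttl:
            return entry[1]  # still fresh: skip the expensive call
        value = compute()
        self._store[key] = (now, value)
        return value


def heavy_stats_query():
    """Hypothetical stand-in for the slow database request behind the page."""
    time.sleep(0.01)
    return {"status": "computed"}


cache = TTLCache(ttl_seconds=300)
first = cache.get_or_compute("stats_page", heavy_stats_query)
second = cache.get_or_compute("stats_page", heavy_stats_query)  # served from the cache
```

With this shape, the database is hit at most once per lifetime window no matter how many visitors load the page.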

sysko, December 7, 2011 at 7:44:59 PM UTC

OK, I've put in a fix. As I said to Sacredceltic, the problem is that language detection relies on an API provided by Google, as very few tools can do this.

It seems that today Google removed this functionality, or at least limited access to it. So for the moment autodetect will set the sentence's language to unknown, until we find something to replace the Google API.

sysko, December 7, 2011 at 4:46:43 PM UTC

But the problem is solved now, no?

sysko, December 7, 2011 at 2:45:36 PM UTC

Yes: set the language to unknown, and on my side make sure the logs record that the Google service went down.
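
The fallback described here (mark the sentence as unknown and log the outage) might be sketched like this. The names `detect_or_unknown` and `remote_detect` are hypothetical, and the remote call is passed in as a plain function so that no real API signature is assumed.

```python
import logging

logger = logging.getLogger("language_detection")
UNKNOWN = "unknown"


def detect_or_unknown(sentence, remote_detect):
    """Call the external detector; on any failure, log it and fall back to UNKNOWN."""
    try:
        lang = remote_detect(sentence)
    except Exception:
        # Broad catch on purpose: any outage should degrade to "unknown"
        # instead of blocking the contribution. The traceback goes to the log.
        logger.exception("Language detection service failed; marking sentence as unknown")
        return UNKNOWN
    return lang or UNKNOWN  # an empty answer also counts as unknown


def broken_service(sentence):
    """Hypothetical stand-in for an external API that has gone away."""
    raise ConnectionError("service unavailable")


print(detect_or_unknown("Bonjour tout le monde", broken_service))
```

The sentence can still be saved and its language corrected later, while the log keeps a trace of when the service went down.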

sysko, December 7, 2011 at 2:39:52 PM UTC

And it's when I come back that the problem disappears... because, well, it would have been too simple if I'd had time to see the exact cause and propose a lasting fix...

It reminds me of when I teach: the students' problems disappear as if by magic when I walk up to their computer, even though they really did have a problem.
It's a bit like the gift of thaumaturgy, Tatoeba style.

sysko, December 7, 2011 at 2:34:46 PM UTC

Actually, in this case it wouldn't be that strange. For automatic language detection there was no (and maybe still is no?) effective free tool that handles so many languages and such short samples, so it goes through a Google API. So if the URL has changed, or if they have shut down the service, that would explain it.

I'm looking into it.

sysko, December 7, 2011 at 2:11:12 PM UTC

Same problem for me.

BUT when I choose the language directly instead of autodetect, I can add sentences.

Can others confirm this?

sysko, December 7, 2011 at 1:56:48 PM UTC

Yep, sorry about this. The problem happened at the beginning of my working day, so I'm only seeing the messages now. And, as Murphy's law would have it, it's during my busiest days that all these problems happen...

I'm starting to investigate this.

sysko, December 4, 2011 at 6:59:53 PM UTC

should be fixed by now.

sysko, November 30, 2011 at 6:06:28 PM UTC

What's interesting is that the most common languages on Tatoeba are proportionally the least translated. Probably, when someone contributes in a "rare" language, they often do so either by only translating, or by adding sentences and then translating them shortly afterwards.
Contributors in languages that are fairly well represented on Tatoeba, on the other hand, take more liberties, surely with the thought "Someone will end up translating me anyway" in mind.

Well, my analysis is rather stating the obvious...

sysko, November 30, 2011 at 7:28:54 AM UTC

I will have time to restore them around Friday, not before, but in the end it will be done.

No, that was not normal, as with any crash. It happened because of the conjunction of the boracasli stuff and (actually mainly this) the fact that Google suddenly decided to crawl our website at a rate of 5 requests per second, so Tatoeba simply said "no I can't, or you should give me a raise" and went on strike.

But don't worry, we have backups of the data, so in the end everything will get back to normal.

sysko, November 29, 2011 at 7:35:12 PM UTC

Yep, I don't think this account is a fake one from bora; my bora-fake-account-detector-3000-next-gen-deluxe-edition(tm) has recognized no "bora"-like pattern in his behaviour.

sysko, November 29, 2011 at 6:44:26 PM UTC

This user seems to leave and answer comments, so maybe drop him a comment about the languages in which he has added sentences but which he hasn't listed in his profile.

sysko, November 28, 2011 at 6:29:55 PM UTC

Don't worry about this :) I actually started working on this issue this weekend, but the internet connection there makes the task much, much slower. In the end it will get fixed; the data is backed up often enough.

sysko, November 26, 2011 at 12:56:52 PM UTC

On the way to fixing everything...

sysko, November 25, 2011 at 6:28:56 AM UTC

The first time, it was due to the small crash that happened 3 or 4 days ago, which left the stats table out of sync. Now it has gone back down because I finally found the time to write a script that cleanly and quickly deletes bora's 238 accounts and their associated sentences.

sysko, November 21, 2011 at 6:15:25 PM UTC

what a coincidence, just before I planned to go to sleep ....

sysko, November 18, 2011 at 6:33:05 PM UTC

Yes, a few days of letting my guard down, and bam. I've blacklisted his new IP address, so we should have a few days of peace. I'll need to write myself a small script to cleanly remove every trace of a user, as if they had never existed, so that I can delete our friend bora's avatars and their harmful contributions more quickly.

sysko, November 16, 2011 at 5:21:28 PM UTC

... as for example I've recorded some of Sacredceltic's sentences