menu
Tatoeba
language
En em enskrivañ Kevreañ
language Brezhoneg
menu
Tatoeba

chevron_right En em enskrivañ

chevron_right Kevreañ

Merdeiñ

chevron_right Diskouez ur frazenn dre zegouezh

chevron_right Diskouez dre yezh

chevron_right Diskouez dre listenn

chevron_right Diskouez dre valiz

chevron_right Diskouez an aodio

Kumuniezh

chevron_right Moger

chevron_right Listenn an holl Izili

chevron_right Yezhoù an Izili

chevron_right Komzerien a-vihanik

search
clear
swap_horiz
search
FeuDRenais {{ icon }} keyboard_arrow_right

Profil

keyboard_arrow_right

Frazennoù

keyboard_arrow_right

Geriaoueg

keyboard_arrow_right

Difaziadennoù

keyboard_arrow_right

Listennoù

keyboard_arrow_right

Sinedoù

keyboard_arrow_right

Evezhiadennoù

keyboard_arrow_right

Evezhiadennoù war frazennoù FeuDRenais

keyboard_arrow_right

Kemennadennoù war ar Voger

keyboard_arrow_right

Roll-istor

keyboard_arrow_right

Aodio

keyboard_arrow_right

Treuskrivadurioù

translate

Treiñ frazennoù FeuDRenais

FeuDRenais's kemennadennoù war ar voger (hollad 401)

FeuDRenais FeuDRenais 15 Mezheven 2011 15 Mezheven 2011 da 05:01:27 UTC link Liamm-peurbadus

Tatoeba should have birthday notifications.

That is all.

FeuDRenais FeuDRenais 13 Mezheven 2011 13 Mezheven 2011 da 17:30:43 UTC link Liamm-peurbadus

That's true too. That's probably the worst case.

FeuDRenais FeuDRenais 13 Mezheven 2011 13 Mezheven 2011 da 17:19:07 UTC link Liamm-peurbadus

I'm not so sure. Quality gets diluted as you get more people, and currently this site is small enough to allow good monitoring because all the active users take time to leave comments, etc.

If the site is to have major growth, I don't think that the system will be able to retain quality by peer-to-peer comment-based feedback.

But whether or not this site will grow is another question.

FeuDRenais FeuDRenais 13 Mezheven 2011 13 Mezheven 2011 da 17:03:53 UTC link Liamm-peurbadus

> Actually, the system is quite smart since it improves itself automatically. When people repeatedly create unnatural translations, they get repeated comments, and at some point they get annoyed and realise that their level is not sufficient, and sometimes they leave the project, or sometimes they switch to creating sentences in their native language or correcting mistakes of others and that is fine.


That's not true for everyone.

FeuDRenais FeuDRenais 13 Mezheven 2011 13 Mezheven 2011 da 16:59:31 UTC link Liamm-peurbadus

Again I would propose having ratings for users. It's unrealistic to expect all the million sentences to be trustworthy, but there need to be guidelines as to which can be trusted.

FeuDRenais FeuDRenais 12 Mezheven 2011 12 Mezheven 2011 da 16:00:22 UTC link Liamm-peurbadus

> if i write "h a y" in english there's a lot of chance that these are the first letters of "how are you"

Taking a page from Google Pinyin there.

FeuDRenais FeuDRenais 8 Mezheven 2011 8 Mezheven 2011 da 18:11:45 UTC link Liamm-peurbadus

Je ne suis pas sûr si on parle toujours de la même chose, mais voilà mon avis honnête:

Tatoeba est splendide, mais elle manque un algorithme pour diriger automatiquement le grand nombre de données qu'elle reçoit quotidiennement, et pour dessiner la ligne entre le bon et le mal.

C'est bien d'avoir des indications et l'unité de la communauté pour encourager le comportement "correct", mais je crois que c'est désespère à la longue, vu, encore, le grand nombre de phrases qu'on collectionne, ainsi que la diversité des contributeurs.

Mais bon, ce n'est que mon avis.

FeuDRenais FeuDRenais 8 Mezheven 2011 8 Mezheven 2011 da 16:54:13 UTC link Liamm-peurbadus

On peut essayer de vérifier une chose: est-ce que le nombre d'étiquettes NNC croise par rapport au nombre de phrases totale ?

FeuDRenais FeuDRenais 8 Mezheven 2011 8 Mezheven 2011 da 16:51:18 UTC link Liamm-peurbadus

Cependant, ce n'est pas automatique. Ça demande qu'on consacre le temps (peut être que t'en as, mais je pense pas que c'est vrai en général). En plus, on peut pas mettre chaque nouveau contributeur en quarantaine - c'est offensant et souvent pas nécessaire. Enfin, ça n'arrête pas un contributeur expérimenté qui décide, un jour, d'essayer d'ajouter des phrases dans une langue qu'il est en train d'apprendre.

Ce que je propose laisse les gens qui veulent traduire pour apprendre traduire pour apprendre. Mais au lieu de masquer ça, ou de naviguer la zone grise qui correspond à "est-ce que c'est naturel ou pas, faut-il que je mets un NNC ici?", on peut traduire sans contrainte (ils vont être, néanmoins, pénalisés si les traductions ne sont pas bonnes, et le score correspondra à ça).

FeuDRenais FeuDRenais 8 Mezheven 2011 8 Mezheven 2011 da 16:16:31 UTC link Liamm-peurbadus

Et non, je ne m'intéresse pas non plus au score personnel. C'est juste une façon de régler tant de problèmes à la fois.

FeuDRenais FeuDRenais 8 Mezheven 2011 8 Mezheven 2011 da 16:14:16 UTC link Liamm-peurbadus

Oui, mais il faut constater qu'il y a à peu près un million de phrases sur Tatoeba, et seulement quelques milles de contributeurs (dont une fraction est responsable pour une grande partie de phrases).

Si tu attaques directement les phrases, tu auras trop de travail à faire, car tu n'attaques pas la source. Disons qu'un nouveau (mauvais) contributeur ajoute tout à coup 100 phrases qui sont soit liées de manière erronée, soit n'ont pas du sens, soit sont mal traduites. Pour réparer tout ça, il faut un grand effort de notre part (juste pour un seul cas). C'est à cause de ça que, par exemple, sysko a limité la liberté de boracasli.

Si tu attaques la source (le contributeur), tu rends inoffensif tout ce qu'il fait désormais, même s'il continue à ajouter des phrases, voire un mille d'autre phrases qui ont des problèmes ou qui sont erronées. C'est beaucoup plus pratique que d'essayer de régler les problèmes phrase-par-phrase.

FeuDRenais FeuDRenais 8 Mezheven 2011 8 Mezheven 2011 da 15:31:52 UTC link Liamm-peurbadus

Also, you have to realize that a rating system is probably the only way to give this website any professional credibility. I mean, how else do you make guarantees about quality if not via feedback/ratings? Trang can't just say "you can trust sacredceltic's translations - believe us on this". It's a much more powerful statement to say that "this user has had his translations rated 10,000 times and has a rating of 98.5%" or something of the sort.

It would also eliminate so many of these other problems - native checks, debates about whether or not non-natives should translate, blocking users who are adding bad translations, orphans, etc. Bad translations would just have bad ratings and no one would trust them - problem solved.

FeuDRenais FeuDRenais 8 Mezheven 2011 8 Mezheven 2011 da 13:31:37 UTC link Liamm-peurbadus

"une catastrophe" - on peut faire pire avec les traductions qu'avec les liens (et tout le monde peut traduire, sans même connaitre les règles)...

Je suis, encore une fois, confondu par comment certains considèrent les liens comme une espèce d'arme nucléaire qu'il faut contrôler à n'importe quel prix.

FeuDRenais FeuDRenais 7 Mezheven 2011 7 Mezheven 2011 da 22:18:56 UTC link Liamm-peurbadus

(I know)

FeuDRenais FeuDRenais 7 Mezheven 2011 7 Mezheven 2011 da 22:00:04 UTC link Liamm-peurbadus

Assuming total chaos amongst members, you have a point. But I wouldn't say there's such a chaos here. It's not very hard for experienced members to know who's good for which language pairs. If we can't trust that, then indeed it's complete chaos and no rating system can ever work. But it's not that hopeless, IMO.

To solve the problem you bring up, you'd have weighted rating (i.e. experienced members, or members with higher ratings, would have much more powerful votes than completely new members who haven't been established). All you need to do is initialize these high ratings for a few people that you trust.

e.g. You are given a very high rating for French-English by default because people (we will assume) trust you in this domain. Then you rate other French-English translations, and the ones who get rated well have their ratings improved from 0 to something better than 0. At first, their own votes won't matter by comparison to yours, but over time they grow and the best rated also become the most powerful raters. It's a quality-breeds-quality system (you just have to start with quality).

FeuDRenais FeuDRenais 7 Mezheven 2011 7 Mezheven 2011 da 20:58:08 UTC link Liamm-peurbadus

Well, here goes a really reckless idea, but...

1) Delete all the orphans on Tatoeba. It won't feel as bad if we call them "weeds" instead of "orphans" and admit the fact that they're causing a lot of problems and are, in many cases, unreliable.

2) Institute ratings for links only. Since links all have owners, users will be rated for their translations and credibility will be established (i.e. sacredceltic will have a very high rating for French-English, fucongcong for Mandarin-French, etc.). That way, a person who doesn't know Tatoeba but wants to know if a certain translation is good will only have to look to the person responsible for the link, and if the person is well-rated, then the translation is trustworthy.

Quick 2-step solution. Any holes here?

FeuDRenais FeuDRenais 7 Mezheven 2011 7 Mezheven 2011 da 20:48:01 UTC link Liamm-peurbadus

Just nuke them all.

FeuDRenais FeuDRenais 7 Mezheven 2011 7 Mezheven 2011 da 20:47:29 UTC link Liamm-peurbadus

Not, I mean... ALL of them. Not just your 10,000.

FeuDRenais FeuDRenais 7 Mezheven 2011 7 Mezheven 2011 da 19:23:25 UTC link Liamm-peurbadus

Hey... why not just delete all the orphan sentences?

Clean slate!

FeuDRenais FeuDRenais 6 Mezheven 2011 6 Mezheven 2011 da 13:34:06 UTC link Liamm-peurbadus

+1