2011-05-16 11:41
Murphy law: it's when you are writting a long wall post about the new version, all it's feature, current limitation etc. that your firefox will crash, and in a way it has lost the message

So well before I get back the motivation to rewrite it:

here is the link , it's quite featureless (I will detail them after), so waiting my post, don't lose time saying "this or that is missing" :)

For the moment it's mainly to show you the improvements
speed improvement
realtime updated searchengine
translations showed up to a depth of 20 (or 5 on some pages) (but it's me who limited it, no more due to performance/architectural limitation)
realtime duplicate detection/merging

the database is an old old one, the user accounts has not been imported so feel free to register new accounts
it's on my own server, not the same as tatoeba, so feel free to torture it and report problems
hide replies
2011-05-16 11:42
PS: the selects doesn't show all the languages supported because I've still a nasty bug to fix with the search engine
2011-05-16 13:10
2011-05-16 13:12
2011-05-22 09:56
* Hope for the problem of duplicates :-) *

The new version is just WONDERFUL!!! Thank you very much, sysko! Certainly much more than a lot of work...

So now, if someone does not want to create a duplicate and wants to know, if a certain sentence has already a translation in a given language - easy game, he has just to go to (with the right number in the end). We will have to ignore the mention "you're not supposed to view this", but we all ignored lots of similar advices in our lives, no problem :-)

On the page it says, we should "Please repport any bugs/suggestions to 'team @t tatoeba dOt org' ". But sysko stated above: "it's quite featureless (I will detail them after), so waiting my post, don't lose time saying "this or that is missing" :)"

Be aware that the database is quite old. So there may be newer translations in the translation graph.

I found two graphs with a maximum distance of eight, the above mentioned (five English versions - or something similar to English, three in Esperanto or something similar...) and . Are there higher distances in the corpus?
hide replies
2011-05-22 10:19
actually the "you're not supposed to view this" is for me, as I use a template system, so if I view this it's to remind me to replace it by the actual right side modules :)

I've started listed all the missing stuff here;tracker_id=2
In the future (and even now) I will use it for managing feature request, bugs etc. If I've configurated correctly, it should be open to everyone, so feel free to add tickets (I will try to organize them later). I will soon list all the tickets which are part of the "first release" of this new version.

For the moment I'm trying to organize my new free time between tatoeba (the new version, discussion on the IRC channel, taking care of the current database) and my real life activities. Hope to have a stable schedule soon :)

I will try to replace the version of the database by a more recent one, so that it will be more useful for those who use it to view linking possibilities.

For the maximun distance, here is one that reach 20
Actually it's certainly more than 20, but for the moment I've fixed to 20 the maximun distance displayed, I will code after something to permit those who wants to view more
hide replies
2011-05-22 14:09
It would be cool to see the translations tree-like, so one can follow where theres a "translation bug" (or an ambigouity) if somewhere the meaning totally changes.

Maybe there could be a button to expand/collapse one more/less hirachy.
hide replies
2011-05-22 14:33
Actually the first version I've made was a tree-like, but as a default view, it's not convenient, but yep I can see to add this as an alternate view.
hide replies
2011-05-22 16:10
It should be relatively easy to have a toggle button and using jquery to switch between the two. In the end its just:
1, 2, 3, 4, 1, 2, 3, 4 <=> 1, 1, 2, 2, 3, 3, 4, 4
right? If the sentences had a different HTML-class than "directTranslation" like "translation_1" etc. the class could be used to rearange the sentences (for now the number could be used, but i dont know if the number will be kept). badly we would need a fallback for non-JS users.
hide replies
2011-05-22 16:21
In the case the starting structure would have been the tree, yep, but for some space optimisation reason, I don't keep this information when getting the sentences from the database. As anyway the informating bringing by the tree are not usefull for 95% of users

So I think i will adapt the following solution

1 - by default it display the "by distance" grouping
there will have a toggle button
the first time it will retrieve the tree structure of that group of sentences, (which is a little more heavy request than the "grouping by distance" one) and it will display it,
and after if you toggle it will be able as you said to do that only on client side, as we will have enough information to switch from one to an other

for people without JS the toggle button will simply redirect to a "show-as-tree" (or any fancy name) page with the tree structure,
the link will be simply overridden by the JS.
hide replies
2011-05-22 16:41
that sounds good. i didnt think of the request load, because i dont know too much about databases or their performance. Im just a user, craving for more features ;)
hide replies
2011-05-22 16:44
don't worry, that's the deal, you torture your sould to find new features and I torture mine to code it to statisfy your endless hungryness for fancy stuff :P
2011-05-22 16:13
and the expand/collapse would do this:
1, 2, 3, 4, 1, 2, 3, 4 <=> 1, 2, 3, 1, 2, 3
or this
1, 1, 2, 2, 3, 3, 4, 4 <=> 1, 1, 2, 2, 3, 3
2011-05-22 17:30
Thank you for the link with a long distance; the low number of your sentence gave me the idea to search in the beginning. has at least a maximum distance of 20, as well and ...

Really funny to see how these Chinese_whispers (Stille Post, téléphone arabe) change the sentences :-)
hide replies
2011-05-22 17:37
By the way: It seems the graph has about one thousand sentences... Some languages must be dozens of times in it...
hide replies
2011-05-22 17:42
I thought about it, but now I rejected the idea of drawing a graph with one thousand sentences :-)
hide replies
2011-05-22 17:51
As the code to manage the database can run without the website, I will try to propose some code example of tools manipulating the database (doing one that output the graph of the group of a sentence in "dot"[1] format that can be after converted into an image, should not be that complicated )
hide replies
2011-05-22 20:38
I've updated the database used on the new version, now it is only an hour old :)
hide replies
2011-05-22 20:45
this way it should be more usefull for those want to use it to help you link
now I think about it, I can try to keep it weekly updated (the script to feed the graph database use the weekly export anyway) and if someone feels ready to make a little user js script that replace for those who wants the link to ... link on by the corresponding one on, maybe that could ease a lot those who wants to help creating links.
What do you think ?
hide replies
2011-05-23 09:49
Merci beaucoup que tu as mis à l'heure la base de donnée!

Excuse-moi, s.t.p., sysko, en lisant ton commentaire ci-dessus j'ai l'impression que ma capacité de comprendre l'anglais est devenue plus faible ces derniers jours ;-)

Est-ce qu'il serait beaucoup de travail de proposer un lien à ton en dessous de chaque liste de traduction? Donc par exemple en bas de (4 traductions) on trouverait le lien (9 traductions).
2011-05-23 11:46
I'm reconfirming that the duplicate detector seems to work, at least in English. Also, Sysko, are you going to blow off this version of the database when you migrate TatoebaCPP to this domain?
2011-05-23 12:23
sysko: "translations showed up to a depth of 20 (or 5 on some pages) (but it's me who limited it, no more due to performance/architectural limitation)"

So it would be able to show even more translations. Would it be possible to put a button to see more than 20 translations?

By the way: Does it make sense to always show 20 translations? Wouldn't it be better - consume less power - just to show a depth of e.g. 5 plus a button to see more? I doubt that a lot of people will read one thousand translations...

(And a very small bug on this wall: There is a triangle on the right hand side below to change the size of the window; when you put it to the right, you can't see the triangle any more.)
hide replies
2011-05-23 12:54
Yep it will be possible to view everything :) with a show more button.

For the moment I always show 20 translations because it's mostly used by advanced users and I haven't implemented a view more button. Once it will be implemented I will turn the default distance to a lower one.