Tatoeba
sysko's messages on the Wall (total 1,397)

sysko, April 27, 2010 at 1:25:08 PM UTC

OK, at least the remove-duplicates script will no longer produce broken links.

sysko, April 27, 2010 at 1:08:05 PM UTC

The remove-duplicates script does the following:

It identifies all the sentences that have both the same language and the same text.
Then it keeps the oldest sentence that is owned by someone (or the oldest one if none of the duplicates belongs to anyone) and relinks everything that pointed to the duplicates to this one
(so comments, translations, lists, etc.).
Finally it removes the duplicates and keeps only one.
So the script will not produce any broken references to removed sentences.
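
The steps described above can be sketched in a few lines of Python. This is only an illustration, not the actual Tatoeba script: the `Sentence` class, its `created` field, the `remove_duplicates` function and the representation of links as pairs of sentence ids are all assumptions made for the example.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Optional


@dataclass
class Sentence:
    # Hypothetical record layout, not the real Tatoeba schema.
    id: int
    lang: str
    text: str
    owner: Optional[str]  # None means the sentence belongs to nobody
    created: int          # smaller value means older


def remove_duplicates(sentences, links):
    """Return (kept sentences, relinked links).

    `links` is a list of (source_id, target_id) pairs standing in for
    translations, comments, list entries and so on.
    """
    # Step 1: group sentences sharing both language and text.
    groups = defaultdict(list)
    for s in sentences:
        groups[(s.lang, s.text)].append(s)

    # Step 2: in each group keep the oldest owned sentence,
    # or the oldest one if nobody owns any of them.
    replacement = {}
    kept = []
    for group in groups.values():
        owned = [s for s in group if s.owner is not None]
        keeper = min(owned or group, key=lambda s: s.created)
        kept.append(keeper)
        for s in group:
            replacement[s.id] = keeper.id

    # Step 3: repoint every link at the kept sentence, dropping
    # self-links and links that became identical.
    relinked = set()
    for source, target in links:
        source = replacement.get(source, source)
        target = replacement.get(target, target)
        if source != target:
            relinked.add((source, target))

    # Step 4: only the kept sentences are returned, so no link can
    # point at a removed id.
    return kept, sorted(relinked)
```

Because the links are rewritten before the duplicates are dropped, every id left in the links refers to a sentence that still exists.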

sysko, April 26, 2010 at 8:48:35 PM UTC

Python is an interpreted language, so you don't need to compile it,
but I think it will not work for the moment on OSes other than Linux, as Eclectus still has some dependencies on KDE :( . Then again, cburgmer knows better than me, since he's Eclectus's author :p

sysko, April 26, 2010 at 8:11:29 PM UTC

Biptaste, but he's not here very often ^^

sysko, April 26, 2010 at 2:17:35 PM UTC

So wouldn't TAB be easier to parse?

sysko, April 25, 2010 at 6:26:37 PM UTC

now it should be displayed :)

sysko, April 25, 2010 at 2:37:41 PM UTC

And now your dream comes true :) Icelandic is officially supported by Tatoeba :)
Have fun ;-)

sysko, April 24, 2010 at 4:43:31 PM UTC

Send the file to our email address, team [at] tatoeba [dot] org, and I will see how to integrate it.
To be honest, I don't really know how it works (A), but at least I will contact the guys of this project to see what we can do :),
and it's already great if you have adapted it to Ukrainian.

sysko, April 18, 2010 at 10:55:00 PM UTC

Yep, this one, http://snowball.tartarus.org/al...em_Unicode.sbl to be more precise :) Thanks.

sysko, April 18, 2010 at 4:48:26 PM UTC

And for profanities, we have some "colorful" sentences (spoiler: "search XXX in the search engine").

sysko, April 18, 2010 at 3:19:34 PM UTC

How the stemmer works for Russian is explained overall here: http://snowball.tartarus.org/al...n/stemmer.html . I admit I haven't read it entirely, as I have no knowledge of Russian (and moreover they provide something that works out of the box for this).

So I don't know how "easy" it is to adapt this to Ukrainian.

sysko, April 18, 2010 at 2:39:06 PM UTC

Stemming should be working again for most languages when using the search engine,
i.e. searching for "think" should also return "thinking", "thought", etc. The same goes for French, Spanish, Italian, Russian, etc.

By the way, it will not work with Ukrainian, but I was wondering whether using the Russian stemmer would produce "better than nothing" results? Demetrius, Dorenda?
Still looking for Arabic and Georgian stemmers.
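
To give a rough idea of what stemming does in a search, here is a toy suffix-stripping stemmer in Python. It is not the Snowball algorithm and not what the search engine actually runs; the `SUFFIXES` tuple and the `toy_stem` and `search` functions are made up for illustration. Note that plain suffix stripping matches "thinking" and "thinks" but not an irregular form like "thought", which needs more than removing an ending.

```python
# Toy English suffixes, checked in this order (an assumption for the example).
SUFFIXES = ("ing", "ed", "s")


def toy_stem(word):
    """Strip one known suffix if at least three letters remain."""
    word = word.lower()
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[:-len(suffix)]
    return word


def search(query, sentences):
    """Return the sentences containing a word with the same stem as the query."""
    stem = toy_stem(query)
    results = []
    for sentence in sentences:
        # Remove surrounding punctuation before stemming each word.
        words = [w.strip('.,!?"') for w in sentence.split()]
        if any(toy_stem(w) == stem for w in words):
            results.append(sentence)
    return results
```

Both the query and the indexed words go through the same stemmer, which is why a search for "think" can find a sentence that only contains "thinking".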

sysko, April 18, 2010 at 1:12:34 AM UTC

The index has been updated. We've switched from Lucene to Sphinx for the search engine, and we will try to make it update in real time soon :)

sysko, April 18, 2010 at 12:22:48 AM UTC

No problem, but it will be added next week (the change of server is keeping us a bit busy ^^)

sysko, April 12, 2010 at 12:59:42 PM UTC

Yep, it has already been requested (first by ourselves :p). We plan to have a panel with advanced search options, to be able to search not only by words but with more options (not translated into X, translated by user Y, etc.).

For the moment we're planning to migrate to another server to make the website faster and more responsive, but as a lot of people seem to want it, we will try to implement it ASAP.

sysko, April 10, 2010 at 11:16:01 PM UTC

MySQL + phpMyAdmin?

sysko, April 10, 2010 at 8:16:03 PM UTC

OK, thanks. The correction will appear in the next Launchpad update.

sysko, April 9, 2010 at 2:17:08 PM UTC

Yep, SENTENCE1\tTRANSLATION1\nSENTENCE2 etc. :)

sysko, April 9, 2010 at 11:26:43 AM UTC

Yep, this way:
sentence1[tab]translation1
sentence2[tab]translation2
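
A file in that layout (one pair per line, sentence and translation separated by a tab) is easy to read back. Here is a minimal Python sketch, assuming UTF-8 text already loaded into a string; the `parse_pairs` function name is made up for the example and is not part of any Tatoeba tool.

```python
def parse_pairs(text):
    """Parse 'sentence<TAB>translation' lines into (sentence, translation) tuples."""
    pairs = []
    for number, line in enumerate(text.splitlines(), start=1):
        if not line.strip():
            continue  # ignore blank lines
        # Split on the first tab only, so a tab inside the translation survives.
        sentence, separator, translation = line.partition("\t")
        if not separator:
            raise ValueError(f"line {number} has no tab separator: {line!r}")
        pairs.append((sentence, translation))
    return pairs
```

Using a tab as the separator means commas and quotes inside the sentences need no escaping.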

sysko, April 8, 2010 at 12:42:32 PM UTC

I think Zipangu has developed some mantra for this :P
Anyway, congrats on reaching 2,000+ sentences in Arabic, that's already amazing :)