menu
Tatoeba
language
Register Log in
language English
menu
Tatoeba

chevron_right Register

chevron_right Log in

Browse

chevron_right Show random sentence

chevron_right Browse by language

chevron_right Browse by list

chevron_right Browse by tag

chevron_right Browse audio

Community

chevron_right Wall

chevron_right List of all members

chevron_right Languages of members

chevron_right Native speakers

search
clear
swap_horiz
search
TRANG TRANG December 17, 2014 December 17, 2014 at 11:43:18 PM UTC link Permalink

** Tatoeba update (December 17th, 2014) **

http://blog.tatoeba.org/2014/12...17th-2014.html


# Small update

* We fixed the problem of the languages not being displayed on the translate page[1], in the list for random sentences.
* We fixed an issue where a sentence was not part of the search results, even though it had been indexed previously. This happened when the sentence was recently translated, or for which the owner or correctness has changed.


# UI translations

I'd like to mention that we now have the website interface translated 100% in 7 languages[2]: Arabic, Esperanto, Finnish, French, German, Italian and Russian.
We have as well Marathi (97%), Japanese (92%) and Polish (90%) not far from being completed.


# Sentences deduplication

We're delaying once more the sentences deduplication. There are still some details we'd like to fix[3]. Even though they are not critical and the deduplication itself is working properly (as far as we know), it's better to fix them sooner than later.

When everything is fixed, there will be another round of deduplication on the dev website. We will leave a few days again for everyone to check that there's indeed no major issue. Then we will run the script finally on the real website.

Thank you for your patience.


-----

[1] http://tatoeba.org/eng/activiti...late_sentences
[2] https://www.transifex.com/proje...toeba_website/
[3] http://tatoeba.org/eng/wall/sho...#message_21231

{{vm.hiddenReplies[21246] ? 'expand_more' : 'expand_less'}} hide replies show replies
gillux gillux December 21, 2014, edited December 21, 2014 December 21, 2014 at 9:44:24 PM UTC, edited December 21, 2014 at 9:44:58 PM UTC link Permalink

> * We fixed an issue where a sentence was not part of the search results, even though it had been indexed previously. This happened when the sentence was recently translated, or for which the owner or correctness has changed.

Just to elaborate on this. The original problem was that even though newly added sentences could be found within an hour (and now 15 minutes) using search, they could actually not be found as a translation. For example, let’s say you translate an existing Arab sentence A into a Bolivian sentence B. Then, you were able to find B by looking up words of B, but looking up “words of A translated into Bolivian” didn’t return A.

The interesting thing is that fixing that bug brought a new feature: changes in terms of authorship and unapproved status are now instantly visible in search results. These criteria affect the search result ranking algorithm (orphan sentences get a lower score, and unapproved sentences an even lower score). It means that for example, orphan sentences will be brought up in the search results as soon as someone adopts them.

The recent improvements on the search (including this) also lays the foundations for a more advanced search functionality with multiple criteria like owner, creation/modification date, tags, unapproved status etc.