menu
Tatoeba
language
Register Log in
language English
menu
Tatoeba

chevron_right Register

chevron_right Log in

Browse

chevron_right Show random sentence

chevron_right Browse by language

chevron_right Browse by list

chevron_right Browse by tag

chevron_right Browse audio

Community

chevron_right Wall

chevron_right List of all members

chevron_right Languages of members

chevron_right Native speakers

search
clear
swap_horiz
search
deniko deniko July 29, 2019 July 29, 2019 at 1:12:50 PM UTC link Permalink

Why am I finding "Yikes!" searching for "perfect"?

https://i.imgur.com/EoSIHeH.png

{{vm.hiddenReplies[32265] ? 'expand_more' : 'expand_less'}} hide replies show replies
AlanF_US AlanF_US July 29, 2019 July 29, 2019 at 4:00:19 PM UTC link Permalink

I see the same thing. Very odd. I don't see anything in the log on the "Yikes!" page that would explain it.

brauchinet brauchinet July 29, 2019 July 29, 2019 at 4:32:38 PM UTC link Permalink

This must be a temporary error in the search engine.
for example: right -> Yikes!
https://tatoeba.org/eng/sentenc...rom=eng&to=und
or darn ->
https://tatoeba.org/eng/sentenc...rom=eng&to=und

{{vm.hiddenReplies[32268] ? 'expand_more' : 'expand_less'}} hide replies show replies
deniko deniko July 29, 2019, edited July 29, 2019 July 29, 2019 at 4:34:14 PM UTC, edited July 29, 2019 at 4:39:49 PM UTC link Permalink

Yeah, I can see it for "right", but I don't see "yikes" in the search results for darn.

In any case, that's bizarre.

> a temporary error

Classifying any error as "temporary" is quite an optimistic outlook :)

{{vm.hiddenReplies[32269] ? 'expand_more' : 'expand_less'}} hide replies show replies
brauchinet brauchinet July 29, 2019, edited July 29, 2019 July 29, 2019 at 4:40:41 PM UTC, edited July 29, 2019 at 4:48:10 PM UTC link Permalink

Yeah, "darn" yields other unexpected results.
Funny, now I tried a lot of simple English words and didn't get false results (at least not on the first page). It seems I was lucky with the first two.

Edit: No, there are many false results.

gillux gillux July 31, 2019 July 31, 2019 at 6:50:40 AM UTC link Permalink

That’s very weird indeed. I am confident that this is caused by the recent update of the search engine. I’m working on it.

gillux gillux July 31, 2019 July 31, 2019 at 7:22:38 AM UTC link Permalink

Problem solved. Apparently English was the only language affected.

{{vm.hiddenReplies[32277] ? 'expand_more' : 'expand_less'}} hide replies show replies
deniko deniko July 31, 2019 July 31, 2019 at 8:37:23 AM UTC link Permalink

Thanks! That was fast, well done! @brauchinet's optimism was well founded then.

Is there a simple explanation of why that was happening, I wonder? It's so bizarre I'm really curious about the why.

{{vm.hiddenReplies[32278] ? 'expand_more' : 'expand_less'}} hide replies show replies
gillux gillux July 31, 2019 July 31, 2019 at 2:47:26 PM UTC link Permalink

I am not 100% sure about the why. The only thing I am sure is that the English index showed some errors when I checked it using the "indextool --check" command. So I recreated it.

When we moved from Manticore 2 to Manticore 3 [1] a few days ago, indexes were converted to a new format using a special tool [2]. That tool doesn’t look very stable yet because it caused some unexpected problems I had to work around at that time. So I suspect it also didn’t convert the English index correctly.

I remember "darn" eng→ita showed 4 results of which 2 were incorrect, and now it shows 4 results of which 4 are corrects. So I suspect the tool messed around with the list of sentences each keywords are associated to.

[1] https://github.com/Tatoeba/tatoeba2/issues/1929
[2] https://manticoresearch.com/201...-to-version-3/

{{vm.hiddenReplies[32279] ? 'expand_more' : 'expand_less'}} hide replies show replies
deniko deniko July 31, 2019 July 31, 2019 at 3:27:40 PM UTC link Permalink

Thank you elaborating. Index corruption seems like a plausible explanation as indexes are definitely used when you do a search.

Guybrush88 Guybrush88 August 1, 2019, edited August 1, 2019 August 1, 2019 at 7:25:50 AM UTC, edited August 1, 2019 at 7:26:08 AM UTC link Permalink

"Apparently English was the only language affected."

Actually, it seems that also other languages are having problems. I tried searching just now sentences in Italian and Esperanto contibuted in a span of time from two days ago to yesterday and they're not indexed yet

{{vm.hiddenReplies[32282] ? 'expand_more' : 'expand_less'}} hide replies show replies
TRANG TRANG August 1, 2019 August 1, 2019 at 10:31:06 AM UTC link Permalink

It's a different problem.

In your case, the issue was that the sentences were not indexed. The scripts to update the indexes were not running because the cron jobs were disabled.

I ran the scripts manually and I think gillux re-enabled the cron jobs, so it should be fine now :)

{{vm.hiddenReplies[32283] ? 'expand_more' : 'expand_less'}} hide replies show replies
Guybrush88 Guybrush88 August 1, 2019 August 1, 2019 at 10:32:57 AM UTC link Permalink

I understand, thanks to both you and gillux :)

gillux gillux August 1, 2019 August 1, 2019 at 11:20:08 AM UTC link Permalink

My bad, I forgot to re-enable reindexation /o\
Should be fine now.