menu
Tatoeba
language
Kaydol Giriş yap
language Türkçe
menu
Tatoeba

chevron_right Kaydol

chevron_right Giriş yap

Göz At

chevron_right Rastgele cümle göster

chevron_right Dile göre ara

chevron_right Listeye göre ara

chevron_right Etikete göre ara

chevron_right Ses ara

Topluluk

chevron_right Duvar

chevron_right Tüm üyelerin listesi

chevron_right Üyelerin dilleri

chevron_right Ana diller

search
clear
swap_horiz
search

Menü

Duvar'a dön

Aiji Aiji 14 Eylül 2017, 14 Eylül 2017 tarihinde düzenlendi 14 Eylül 2017 02:39:37 UTC, 14 Eylül 2017 02:40:29 UTC düzenlendi link Kalıcı bağlantı

I was wondering if a script updating all the sentences that were modified before the crash was coming soon.
I see some people starting adopting sentences that I had adopted in the past after I corrected them and correctly linking/unlinking them. The problem is that adopting without proofreading or prooflinking decrease the actual quality of the corpus.
I don't have the time (nor the will) to re-monitor hundreds of sentences that I corrected in the past so it would be very nice if an adequate script was on his way. Once the sentences are corrected, adoption and untagging can be re-operated without paying attention to the correctness of the sentence and its translations.

{{vm.hiddenReplies[28429] ? 'expand_more' : 'expand_less'}} cevapları gizle cevapları göster
TRANG TRANG 17 Eylül 2017 17 Eylül 2017 16:53:18 UTC link Kalıcı bağlantı

It depends on your definition of soon, but I'm starting to working on it. I don't have a clear solution yet how this will be handled so I can't say for sure yet how soon this can be done.

In any case, related GitHub issue is: https://github.com/Tatoeba/tatoeba2/issues/1484

TRANG TRANG 8 Ekim 2017 8 Ekim 2017 18:21:07 UTC link Kalıcı bağlantı

CK made a script to list sentences that need to be reverted. Could anyone have a quick look and see if there's anything wrong in this file:
https://github.com/Tatoeba/tato...2017-09-11.zip

What I would do is basically replace the text/owner of the sentences by the ones in this file.

{{vm.hiddenReplies[28515] ? 'expand_more' : 'expand_less'}} cevapları gizle cevapları göster
Aiji Aiji 9 Ekim 2017 9 Ekim 2017 01:55:12 UTC link Kalıcı bağlantı

The file looks okay to me. Especially for the change of owner :) I've started to do it again and it was painful ^^
Also, we may not want to replace the text if a modification has occurred since the recovery I guess.

Would there be a possibility to do the linking / unlinking to other sentences too ?
With the file you got, the indexes of the sentences that need to be checked are given, so we could just check what modifications they went through between the critical dates.

In any case, thank you for the work.

deniko deniko 9 Ekim 2017 9 Ekim 2017 08:41:50 UTC link Kalıcı bağlantı

I checked Ukrainian sentences on the list - the file looks good to me. Thanks CK for creating that.

TRANG TRANG 15 Ekim 2017 15 Ekim 2017 18:11:34 UTC link Kalıcı bağlantı

Sentences modifications/ownership should be restored now, based on CK's file. Can you have a quick check if things are okay on your side?

{{vm.hiddenReplies[28562] ? 'expand_more' : 'expand_less'}} cevapları gizle cevapları göster
TRANG TRANG 15 Ekim 2017, 15 Ekim 2017 tarihinde düzenlendi 15 Ekim 2017 18:19:28 UTC, 15 Ekim 2017 18:37:24 UTC düzenlendi link Kalıcı bağlantı

Hmm... Actually never mind, I'm already seeing some inconsistencies.

Edit: So I reverted the replacement of sentences/ownership. We'll have to review the script that generates this file, it probably shouldn't take into account sentences that were modified after June 10th.

TRANG TRANG 22 Ekim 2017 22 Ekim 2017 17:43:20 UTC link Kalıcı bağlantı

Alright, CK fixed the files. I've made the changes in the database.

Can you have a look now? Hopefully it's alright.

{{vm.hiddenReplies[28579] ? 'expand_more' : 'expand_less'}} cevapları gizle cevapları göster
Aiji Aiji 23 Ekim 2017 23 Ekim 2017 02:29:10 UTC link Kalıcı bağlantı

Thank you.
Ownership has been restored.
Linking / unlinking is not restored, which is a problem for hundreds of sentences whose translations had been corrected when adopted. For instance: https://tatoeba.org/eng/sentences/show/127776
A suggestion on how to handle that? Isn't there a way to simply reapply the linking / unlinking operated between March and June on sentences contained in the file? Or to avoid conflicts, reapply what occured from March to now (some may have been reapplied manually already).

{{vm.hiddenReplies[28584] ? 'expand_more' : 'expand_less'}} cevapları gizle cevapları göster
TRANG TRANG 1 Aralık 2018 1 Aralık 2018 15:55:00 UTC link Kalıcı bağlantı

Good news, more than one year later, we made progress on restoring links:
https://github.com/Tatoeba/tatoeba2/issues/1724

One of our members, Yorwba, has found that the following 2949 links should be deleted:
https://raw.githubusercontent.c..._to_delete.csv

And the following 131153 links should be re-added:
https://raw.githubusercontent.c...to_restore.csv

I would greatly appreciate if a few people could look at the lists and check if they see anything wrong in there.

{{vm.hiddenReplies[30911] ? 'expand_more' : 'expand_less'}} cevapları gizle cevapları göster
Aiji Aiji 2 Aralık 2018 2 Aralık 2018 14:17:32 UTC link Kalıcı bağlantı

Good news, good news.

What would you like us to do exactly? Without writing some scripts, looking at those raw files seems a bit difficult.

That being said, do you want us to pick up some links at random, and check their validity?

{{vm.hiddenReplies[30912] ? 'expand_more' : 'expand_less'}} cevapları gizle cevapları göster
TRANG TRANG 2 Aralık 2018 2 Aralık 2018 15:24:58 UTC link Kalıcı bağlantı

Yes, pick up links at random and check if they indeed have to be deleted/re-added.

If someone wants to go further and write a script to check a larger amount of links, that's of course better.

{{vm.hiddenReplies[30913] ? 'expand_more' : 'expand_less'}} cevapları gizle cevapları göster
deniko deniko 3 Aralık 2018 3 Aralık 2018 11:46:08 UTC link Kalıcı bağlantı

I did what you suggested - picked a dozen of random Ukrainian sentences and checked the links, they seem to be all fine.

I also checked in Excel there are no pairs of sentences that are both linked and unlinked in the two files, so, I guess, they should be fine to be processed.