To corpus maintainers and advanced contributors: we have identified 60 sentences that were logged as deleted but are still present in the corpus.
https://github.com/Tatoeba/tato...ment-517555854
I'm just reporting them here and will let you handle them.
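For anyone who wants to reproduce this kind of check, here is a minimal sketch of the idea: intersect the IDs found in a deletion log with the IDs currently in the corpus. The function name and the sample IDs are hypothetical, and this is not necessarily how the list in the linked issue was produced.

```python
def logged_deleted_but_present(deleted_ids, present_ids):
    """Return, in sorted order, the IDs that appear in both collections."""
    return sorted(set(deleted_ids) & set(present_ids))


# Hypothetical sample data, for illustration only (not real sentence IDs).
deleted_log = [101, 102, 103]
corpus_ids = [102, 103, 104]
print(logged_deleted_but_present(deleted_log, corpus_ids))  # [102, 103]
```

In practice the two ID collections would have to come from a log export and a corpus export; how to obtain them is left open here.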
https://tatoeba.org/eng/sentences/show/861250 is a correct sentence. Should I delete it anyway?
When in doubt, just use the @delete tag. Either another corpus maintainer can have a look, or you can come back to it later.
That's good. Thanks! :D
Uhm, interesting.
It was tagged as "delete" because it was a duplicate of #1284790. However, #1284790 was deleted.
#1001614
#1012266
#1103005
#1130232
#1199315
#1236574
#1326
#1395554
#1748110
#20222
#203679
#257711
#322545
#334907
#341800
#3780560
#40917
#436802
#442402
#446043
#454574
#460906
#461060
#4703148
#526503
#549362
#570871
#5893
#5903
#5907
#5908
#5912
#5913
#5917
#5924
#5934
#660695
#69668
#69669
#710678
#743624
#81170
#861250
#871995
I would say that these sentences should be treated like any other. If they're bad, either fix or delete them. If they're good, leave them alone. For what it's worth, the sentences that I looked at, in the languages that I speak, were fine.
A lot of them fit the situation that Ricardo mentioned: at some point (apparently when duplicate-merging was not working), people asked for them to be deleted because they were duplicates. Now that duplicate-merging is working, we know that they will be merged if they're identical to other sentences. So there's no reason to do anything.
The only reason to delete the sentences is if you think that's important for the consistency of the database. But you don't seem to be saying that, and I would be surprised if you did.
There is also the case of copyrighted sentences. Some sentences might have been copied from somewhere and we decided to delete them for safety.
But yes, I should have been a bit clearer that I reported the sentences here on the Wall specifically because I think it's useful to have a second look at them.
I don't think these sentences are duplicates (at least not exact duplicates); otherwise, Horus would have deleted them already. Duplicate-merging happens every 30 minutes or so, and these sentences are not new at all.
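To double-check that none of them are exact duplicates, one could group sentences by language and text and look for groups with more than one ID. The sketch below assumes a hypothetical list of `(id, lang, text)` tuples and compares the text byte for byte; the real merging script may normalize or compare sentences differently.

```python
from collections import defaultdict


def find_exact_duplicates(sentences):
    """Group sentence IDs by (lang, text) and keep groups with more than one ID."""
    groups = defaultdict(list)
    for sentence_id, lang, text in sentences:
        groups[(lang, text)].append(sentence_id)
    return {key: sorted(ids) for key, ids in groups.items() if len(ids) > 1}


# Hypothetical sample data, for illustration only.
sample = [
    (1, "eng", "Hello."),
    (2, "eng", "Hello."),
    (3, "eng", "Hello!"),   # differs by punctuation, so not an exact duplicate
    (4, "fra", "Hello."),   # same text, different language
]
print(find_exact_duplicates(sample))  # {('eng', 'Hello.'): [1, 2]}
```

An empty result for the reported sentences would be consistent with the point above that they are not exact duplicates.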