Profile
Sentences
Vocabulary
Reviews
Lists
Favorites
Comments
Comments on soliloquist's sentences
Wall messages
Logs
Audio
Transcriptions
Translate soliloquist's sentences
.
Thanks. I'm sorry to have bothered you.
The user glavsaltulo has 31,220 sentences (31,193 of them are in their native language, Lithuanian).
https://tatoeba.org/eng/activit...of/glavsaltulo
And only 4 of their sentences are untranslated.
https://tatoeba.org/eng/sentenc...&sort_reverse=
But their tcnt/scnt is 0.50 on the chart when 'Count only native contributions' is checked.
So there must be something that decreases this number other than untranslated and non-native sentences.
Unchecking that checkbox gives a more accurate number as you mentioned.
I don't quite understand how tcnt/scnt is determined.
The user below has 3 sentences all of which are linked to a translation, so I would expect their tcnt/scnt to be 1, but it is 0.3 on the chart.
https://tatoeba.org/eng/activit...ces_of/nnireyh
And this user's ratio is 0.8 on the chart although their all sentences are linked to a translation.
https://tatoeba.org/eng/activit..._of/0sigmaone0
What am I missing?
Thank you. It's been in my mind for a while, too. I wonder if it would be convenient to embed such short videos in the wiki, especially in the quick start guide.
Everything seems fine now. Thank you for implementing that category.
According to your chart, 5948 Hungarian and only 3 Hebrew sentences are OK, but the advanced search shows many more sentences that are tagged OK.
https://tatoeba.org/eng/sentenc...&sort_reverse= (20,613 results)
https://tatoeba.org/eng/sentenc...&sort_reverse= (124 results)
Thank you very much. My OK ratios are a bit higher than yours. I guess yours include the OK-rated (green marked) sentences, but not the OK-tagged ones. I had tried to add them together and remove the duplicates to avoid counting both tagged and rated sentences twice.
Thanks a lot. :-)
> There's a button in the green toolbar which will expand the menu for all sentences and keep it expanded as you go from a page to another.
Thanks. I didn't see that button. It's indeed available on these pages.
https://dev.tatoeba.org/eng/sen...ists/show/8909
https://dev.tatoeba.org/eng/sen...ll_in/eng/none
But it isn't available on this one as there's no green toolbar.
https://dev.tatoeba.org/eng/act...of/soliloquist
It works fine, thank you. I just want to say two things.
- Deleted sentences still show up on reviews/collections page, and it's not possible to remove/unmark them.
https://i.imgur.com/NRyX0aX.png
- The default view of the edit bar is minimized. To rate sentences in the list view, we need to expand the bar on each sentence. If we had an option to change the default view of the edit bar, it would be more practical for proofreading and linking.
https://streamable.com/vg9w4a
And here are the ratios for the top ten languages. (as of May 2nd)
English: 58.14%
Russian: 2.39%
Italian: 5.3%
Turkish: 3.06%
Esperanto: 23.6%
German: 2.81%
French: 1.62%
Portuguese: 3.96%
Spanish: 1.46%
Hungarian: 9.21%
* Sentences that are both tagged and rated OK, and sentences that are rated by multiple users are counted once.
* Sentences that have mixed rate marks (green-yellow or green-red) are not excluded.
* Self-approvals are not excluded.
I previously calculated comment/sentence ratios of some languages and Russian was one of the highest ones, so I assume it's well-maintained and review marks may be trivial for you, but I think it's essential for the Turkish corpus. We have a lot of unnatural sentences added by native speakers. Monitoring the proofreading progress is therefore important and valuable for us, and the same may go for others. Having this information may also encourage users to rate/tag sentences which I think is beneficial for the overall quality of the Tatoeba corpus.
Anyway, I tried to work out the ratio for the Turkish corpus and left it as a comment on #8739923. I may update it from time to time to show the progress.
Thank you.
Could you provide the statistics for proofreading ratios (both cumulative and individual)? I mean the ratio of sentences that are rated or tagged OK to the total number of sentences.
I previously shared my opinion here.
https://tatoeba.org/eng/wall/sh...#message_33311
We're also using the vocabulary feature and the advanced search to detect spelling errors in the Turkish corpus.
https://tatoeba.org/eng/wall/sh...#message_33249
https://tatoeba.org/eng/sentenc...omment-1131503
There has been a discussion recently on the Yiddish Duolingo Facebook page.
https://www.facebook.com/Yiddis...type=3&theater
There is no absolute consensus, but many of the participants seem to favor the Menorah flag which dates back to 2011.
https://en.wikipedia.org/wiki/User:S.fuer
Some Yiddish speakers seem to avoid the stripes though, thinking that they resemble the flag of Israel closely.
> How come there is no support for spell-check in the "Add new sentences" field?
I couldn't reproduce it. Both English and Turkish spellcheckers seem to work fine on Firefox 74.
https://streamable.com/7ov6p
Site sizin mi? Domain sorgusunda Türkiye'den alındığı görünüyor. Mantık güzel, ama tasarım biraz daha geliştirilse fena olmaz. İçeriği zenginleştirmek için buradaki cümlelerden de faydalanılabilir.
Obrigado pelo seu interesse.
We are aware of the issue. I collected the sentences by that account in a proofreading list. A considerable amount of them need improving.
https://tatoeba.org/eng/sentenc.../show/9444/eng
With an average of 100 sentences per day, it will take 12 years to finish proofreading. Slowly but surely, I hope. :-)
This is my personal list, and I'm proofreading from recent to oldest, but there's also another collaborative, hidden list with reverse order. Other Turkish ACs and CMs are dealing with it.
> Has it occurred to anyone to create a translation memory with Tatoeba sentences?
There is a small and useful program created by a Hungarian translator. It lacks some functionality, but it's free and easy-to-use.
http://farkastranslations.com/tmlookup.php
I'm using it with the Tatoeba corpus both for looking up translations of Turkish terms and for corpus maintenance. It's often more practical than searching online.