menu
Tatoeba
language
Register Log in
language English
menu
Tatoeba

chevron_right Register

chevron_right Log in

Browse

chevron_right Show random sentence

chevron_right Browse by language

chevron_right Browse by list

chevron_right Browse by tag

chevron_right Browse audio

Community

chevron_right Wall

chevron_right List of all members

chevron_right Languages of members

chevron_right Native speakers

search
clear
swap_horiz
search
soliloquist {{ icon }} keyboard_arrow_right

Profile

keyboard_arrow_right

Sentences

keyboard_arrow_right

Vocabulary

keyboard_arrow_right

Reviews

keyboard_arrow_right

Lists

keyboard_arrow_right

Favorites

keyboard_arrow_right

Comments

keyboard_arrow_right

Comments on soliloquist's sentences

keyboard_arrow_right

Wall messages

keyboard_arrow_right

Logs

keyboard_arrow_right

Audio

keyboard_arrow_right

Transcriptions

translate

Translate soliloquist's sentences

soliloquist's messages on the Wall (total 151)

soliloquist soliloquist September 16, 2020, edited September 18, 2020 September 16, 2020 at 9:06:52 PM UTC, edited September 18, 2020 at 10:38:33 PM UTC link Permalink

.

soliloquist soliloquist September 16, 2020 September 16, 2020 at 11:27:43 AM UTC link Permalink

Thanks. I'm sorry to have bothered you.

The user glavsaltulo has 31,220 sentences (31,193 of them are in their native language, Lithuanian).

https://tatoeba.org/eng/activit...of/glavsaltulo

And only 4 of their sentences are untranslated.

https://tatoeba.org/eng/sentenc...&sort_reverse=

But their tcnt/scnt is 0.50 on the chart when 'Count only native contributions' is checked.

So there must be something that decreases this number other than untranslated and non-native sentences.

Unchecking that checkbox gives a more accurate number as you mentioned.

soliloquist soliloquist September 15, 2020, edited September 16, 2020 September 15, 2020 at 8:15:59 PM UTC, edited September 16, 2020 at 11:29:12 AM UTC link Permalink

I don't quite understand how tcnt/scnt is determined.

The user below has 3 sentences all of which are linked to a translation, so I would expect their tcnt/scnt to be 1, but it is 0.3 on the chart.

https://tatoeba.org/eng/activit...ces_of/nnireyh


And this user's ratio is 0.8 on the chart although their all sentences are linked to a translation.

https://tatoeba.org/eng/activit..._of/0sigmaone0

What am I missing?

soliloquist soliloquist May 12, 2020 May 12, 2020 at 9:43:28 AM UTC link Permalink

Thank you. It's been in my mind for a while, too. I wonder if it would be convenient to embed such short videos in the wiki, especially in the quick start guide.

soliloquist soliloquist May 8, 2020 May 8, 2020 at 8:03:07 AM UTC link Permalink

Everything seems fine now. Thank you for implementing that category.

soliloquist soliloquist May 7, 2020, edited May 7, 2020 May 7, 2020 at 8:36:20 PM UTC, edited May 7, 2020 at 8:38:41 PM UTC link Permalink

According to your chart, 5948 Hungarian and only 3 Hebrew sentences are OK, but the advanced search shows many more sentences that are tagged OK.

https://tatoeba.org/eng/sentenc...&sort_reverse= (20,613 results)

https://tatoeba.org/eng/sentenc...&sort_reverse= (124 results)

soliloquist soliloquist May 7, 2020, edited May 7, 2020 May 7, 2020 at 10:34:29 AM UTC, edited May 7, 2020 at 10:35:27 AM UTC link Permalink

Thank you very much. My OK ratios are a bit higher than yours. I guess yours include the OK-rated (green marked) sentences, but not the OK-tagged ones. I had tried to add them together and remove the duplicates to avoid counting both tagged and rated sentences twice.

soliloquist soliloquist May 6, 2020 May 6, 2020 at 7:36:58 PM UTC link Permalink

Thanks a lot. :-)

soliloquist soliloquist May 5, 2020 May 5, 2020 at 9:00:20 PM UTC link Permalink

> There's a button in the green toolbar which will expand the menu for all sentences and keep it expanded as you go from a page to another.


Thanks. I didn't see that button. It's indeed available on these pages.

https://dev.tatoeba.org/eng/sen...ists/show/8909

https://dev.tatoeba.org/eng/sen...ll_in/eng/none

But it isn't available on this one as there's no green toolbar.

https://dev.tatoeba.org/eng/act...of/soliloquist

soliloquist soliloquist May 5, 2020 May 5, 2020 at 7:57:29 PM UTC link Permalink

It works fine, thank you. I just want to say two things.

- Deleted sentences still show up on reviews/collections page, and it's not possible to remove/unmark them.

https://i.imgur.com/NRyX0aX.png

- The default view of the edit bar is minimized. To rate sentences in the list view, we need to expand the bar on each sentence. If we had an option to change the default view of the edit bar, it would be more practical for proofreading and linking.

https://streamable.com/vg9w4a

soliloquist soliloquist May 5, 2020, edited May 6, 2020 May 5, 2020 at 7:55:45 PM UTC, edited May 6, 2020 at 4:08:40 AM UTC link Permalink

And here are the ratios for the top ten languages. (as of May 2nd)

English: 58.14%
Russian: 2.39%
Italian: 5.3%
Turkish: 3.06%
Esperanto: 23.6%
German: 2.81%
French: 1.62%
Portuguese: 3.96%
Spanish: 1.46%
Hungarian: 9.21%

* Sentences that are both tagged and rated OK, and sentences that are rated by multiple users are counted once.
* Sentences that have mixed rate marks (green-yellow or green-red) are not excluded.
* Self-approvals are not excluded.

soliloquist soliloquist May 5, 2020 May 5, 2020 at 7:54:17 PM UTC link Permalink

I previously calculated comment/sentence ratios of some languages and Russian was one of the highest ones, so I assume it's well-maintained and review marks may be trivial for you, but I think it's essential for the Turkish corpus. We have a lot of unnatural sentences added by native speakers. Monitoring the proofreading progress is therefore important and valuable for us, and the same may go for others. Having this information may also encourage users to rate/tag sentences which I think is beneficial for the overall quality of the Tatoeba corpus.

Anyway, I tried to work out the ratio for the Turkish corpus and left it as a comment on #8739923. I may update it from time to time to show the progress.

soliloquist soliloquist May 5, 2020 May 5, 2020 at 12:21:45 PM UTC link Permalink

Thank you.

Could you provide the statistics for proofreading ratios (both cumulative and individual)? I mean the ratio of sentences that are rated or tagged OK to the total number of sentences.

soliloquist soliloquist April 14, 2020 April 14, 2020 at 1:40:45 PM UTC link Permalink

I previously shared my opinion here.

https://tatoeba.org/eng/wall/sh...#message_33311

We're also using the vocabulary feature and the advanced search to detect spelling errors in the Turkish corpus.

https://tatoeba.org/eng/wall/sh...#message_33249
https://tatoeba.org/eng/sentenc...omment-1131503

soliloquist soliloquist April 11, 2020 April 11, 2020 at 1:18:10 PM UTC link Permalink

https://tatoeba.org/eng/wall/sh...#message_33993

soliloquist soliloquist April 10, 2020 April 10, 2020 at 8:51:55 AM UTC link Permalink

There has been a discussion recently on the Yiddish Duolingo Facebook page.

https://www.facebook.com/Yiddis...type=3&theater

There is no absolute consensus, but many of the participants seem to favor the Menorah flag which dates back to 2011.

https://en.wikipedia.org/wiki/User:S.fuer

Some Yiddish speakers seem to avoid the stripes though, thinking that they resemble the flag of Israel closely.

soliloquist soliloquist March 15, 2020 March 15, 2020 at 12:21:02 PM UTC link Permalink

> How come there is no support for spell-check in the "Add new sentences" field?

I couldn't reproduce it. Both English and Turkish spellcheckers seem to work fine on Firefox 74.

https://streamable.com/7ov6p

soliloquist soliloquist February 17, 2020 February 17, 2020 at 2:16:46 PM UTC link Permalink

Site sizin mi? Domain sorgusunda Türkiye'den alındığı görünüyor. Mantık güzel, ama tasarım biraz daha geliştirilse fena olmaz. İçeriği zenginleştirmek için buradaki cümlelerden de faydalanılabilir.

soliloquist soliloquist January 18, 2020 January 18, 2020 at 7:19:55 PM UTC link Permalink

Obrigado pelo seu interesse.

We are aware of the issue. I collected the sentences by that account in a proofreading list. A considerable amount of them need improving.

https://tatoeba.org/eng/sentenc.../show/9444/eng

With an average of 100 sentences per day, it will take 12 years to finish proofreading. Slowly but surely, I hope. :-)

This is my personal list, and I'm proofreading from recent to oldest, but there's also another collaborative, hidden list with reverse order. Other Turkish ACs and CMs are dealing with it.

soliloquist soliloquist December 29, 2019 December 29, 2019 at 12:41:03 PM UTC link Permalink

> Has it occurred to anyone to create a translation memory with Tatoeba sentences?

There is a small and useful program created by a Hungarian translator. It lacks some functionality, but it's free and easy-to-use.

http://farkastranslations.com/tmlookup.php

I'm using it with the Tatoeba corpus both for looking up translations of Turkish terms and for corpus maintenance. It's often more practical than searching online.