soliloquist, 17 October 2019 at 7:56:16 pm UTC

**Using the Vocabulary Feature for Corpus Maintenance**

Recently, I listed some common Turkish spelling errors as search links. There are hundreds of incorrect sentences. We have fixed some of them and are still working on the rest.

https://tatoeba.org/eng/sentenc...omment-1131503

I was planning to check those links from time to time (once every few months), but then I realized that we could use the vocabulary feature to monitor all of them continuously and at once, like an alarm system. I created an auxiliary account for this purpose and added those searches as vocabulary items.

https://tatoeba.org/eng/vocabul...shSpellChecker
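
For anyone who would rather run a similar check offline, here is a minimal sketch of the same "alarm" idea in Python. It is only an illustration under stated assumptions: the watch list, the function names, and the sample sentences are all hypothetical, and it works on a plain list of sentences rather than on anything the Tatoeba site itself provides.

```python
import re
from collections import Counter

# Hypothetical watch list of misspellings (illustrative only; not the
# actual list stored in the auxiliary account).
WATCHED_MISSPELLINGS = ["herkez", "yanlız", "birşey"]


def count_matches(sentences, watched=WATCHED_MISSPELLINGS):
    """Count, for each watched item, how many sentences contain it as a whole word.

    Matching is case-sensitive on purpose: Turkish dotted/dotless I casing
    is not handled correctly by a naive case-insensitive comparison.
    """
    counts = Counter()
    for item in watched:
        pattern = re.compile(r"\b" + re.escape(item) + r"\b")
        counts[item] = sum(1 for s in sentences if pattern.search(s))
    return counts


def alarms(sentences, watched=WATCHED_MISSPELLINGS):
    """Return only the watched items that occur in at least one sentence."""
    return {item: n for item, n in count_matches(sentences, watched).items() if n > 0}
```

Calling `alarms(sentences)` on a fresh batch of sentences returns the watched items that need attention, which is roughly what a non-zero sentence count next to a vocabulary item signals.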

This might be considered bending the tools, but since the feature isn't used by many people, I wanted to give it a try. Also, the people who add new sentences for wanted vocabulary items are usually native speakers, so they can easily notice that those items are spelling errors and ignore them. I think the risk-benefit ratio is favorable for now. If a feature showing who added which vocabulary item were implemented, this would be even less of a problem.

TRANG, 17 October 2019 at 9:07:04 pm UTC

It's a creative way to do corpus maintenance :)

While it is not the intended use of the vocabulary feature and does mess up the "sentences wanted" list a bit, your use case is still a valid one, I think. It only means that we need a way to distinguish between vocabulary that was added to request an example sentence and vocabulary that was added just as part of a list of vocabulary.

That's an idea that I already mentioned in another thread:
https://tatoeba.org/eng/wall/sh...#message_31919