menu
Tatoeba
language
Register Log in
language English
menu
Tatoeba

chevron_right Register

chevron_right Log in

Browse

chevron_right Show random sentence

chevron_right Browse by language

chevron_right Browse by list

chevron_right Browse by tag

chevron_right Browse audio

Community

chevron_right Wall

chevron_right List of all members

chevron_right Languages of members

chevron_right Native speakers

search
clear
swap_horiz
search
belkacem77 belkacem77 November 1, 2018, edited November 1, 2018 November 1, 2018 at 8:53:19 PM UTC, edited November 1, 2018 at 8:56:25 PM UTC link Permalink

@Tatoeba contributors

When we launched our locale (language) on Tatoeba, we planned to create bridges between Kabyle and all world languages. We planned to produce a high quality of sentences: Common sense sentences, well written, well recorded. But, Kabyle is a language, and languages have rules: morphology and syntax, grammar and semantics, phonology and phénetics. So, before writing something in Kabyle language, be sure that you have basics in Kabyle and if someone makes error, there are others who can correct him, BUT you have to answer to comments!!!!

We have a chance to get a linguist as a maintainer for our corpus, so I beg you to listen to him, follow his advices.

Of course everybody can make mistakes and there are nice people to remind us, to guide us, to help us and we have to thank them for the job they are doing, BUT they can't monitor everyday, and write comments on every wrong sentence!!! Time is money, we should spend more time on adding and translating sentences rather than watching bad behaviour.

There is plenty of laguages where everyone can take part. If Kabyle do not suit you, you can choose another.

We intend to use this work on other free productions, so do not hurt us because we love every language, every people, every nation, every religion... We are raised to love diversity.

Thanks

{{vm.hiddenReplies[30630] ? 'expand_more' : 'expand_less'}} hide replies show replies
cueyayotl cueyayotl November 9, 2018 November 9, 2018 at 7:56:08 PM UTC link Permalink

Thank you for the message. If there is a mistake in any sentence and you leave a comment, but the author does not reply in 14 days, let a corpus maintainer know, and they can make the appropriate changes for you. (It is one of our rules)

TRANG TRANG November 9, 2018 November 9, 2018 at 8:34:17 PM UTC link Permalink

Note that on our Downloads page, we provide two files for the sentences. One of the files contains more information about the sentence, including the username of the owner of the sentence.

When you extract Kabyle sentences from the CSV file, you can define a whitelist and extract Kabyle sentences only from the whitelisted users, or you can define a blacklist and exclude all sentences from the blacklisted users.

I know it can be frustrating to see bad quality sentences in the corpus, but whitelisting/blacklisting is one good way to filter the corpus to better meet your quality standards.

You can also take into account information from the "User skill per language" to extract only sentences from native speakers, or you can use the "Users' sentence rating" to exclude sentences on an individual basis.

For the sentence rating, this requires you and your peers to enable the feature from your Settings, under "Experimental options" > "Activate the feature to rate sentences...". You can then start marking bad sentences as "not OK".
In your script that extracts the Kabyle sentences, you can then define a list of trusted users, and exclude any sentence that has been rated "not OK" by your trusted users.

This is just to say, Tatoeba provides some basic mechanism for you to filter out bad quality sentences. You'll need of course to spend a little bit more effort on your script, but then won't have to worry about every single sentence that is added into the Kabyle corpus.

{{vm.hiddenReplies[30703] ? 'expand_more' : 'expand_less'}} hide replies show replies
CK CK November 10, 2018, edited October 31, 2019 November 10, 2018 at 1:47:48 AM UTC, edited October 31, 2019 at 3:22:02 AM UTC link Permalink

[not needed anymore- removed by CK]

{{vm.hiddenReplies[30705] ? 'expand_more' : 'expand_less'}} hide replies show replies
belkacem77 belkacem77 November 10, 2018 November 10, 2018 at 7:24:58 PM UTC link Permalink

Thanks CK

belkacem77 belkacem77 November 10, 2018 November 10, 2018 at 7:38:10 PM UTC link Permalink

Thanks Trang. I'm wirting several scripts according to needs. I'm begining with Audio sentences since only natives are recording.