AlanF_US, 31 May 2013 at 11:54:55 pm UTC

(1) Is there a way to search (without downloading the corpus) for sentences of a given language that have a "@needs native check" tag but do not have an "OK" tag?

(2) Is there an easy way for corpus maintainers to search for sentences that have both a "@needs native check" and an "OK" tag so that they can remove both tags (assuming the sentence really is OK now)? Is this something that's generally done?

marcelostockle, 1 June 2013 at 12:18:29 am UTC

(2): I do it once in a while.
I load "tags.csv" in Excel, filter all the OK entries and all the @nnc entries separately, and import both sentence-ID lists into MATLAB as filterOK and filterNNC. There:
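> % logical masks indexed by sentence ID (3000000 is an assumed upper bound on IDs)
> arrayOK = false(1, 3000000);
> arrayOK(filterOK) = true;
> arrayNNC = false(1, 3000000);
> arrayNNC(filterNNC) = true;
>
> % keep only the IDs where both masks are true
> seq = 1:3000000;
> seq = seq(arrayNNC & arrayOK);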

and that's it ^^
If you want, I can put a link to a results file here on the Wall tomorrow.
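
For anyone without Excel or MATLAB, the same intersection can be sketched in Python. This is only an illustration, not the procedure used above: it assumes "tags.csv" is tab-separated with the sentence ID in the first column and the tag name in the second, and the function names `load_tags` and `ids_with_both` are made up for this example.

```python
import csv
from collections import defaultdict


def load_tags(lines):
    """Map each tag name to the set of sentence IDs that carry it.

    Assumption: every row is "sentence_id<TAB>tag_name".
    Rows that do not match this shape are skipped.
    """
    ids_by_tag = defaultdict(set)
    reader = csv.reader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) >= 2 and row[0].isdigit():
            ids_by_tag[row[1]].add(int(row[0]))
    return ids_by_tag


def ids_with_both(ids_by_tag, tag_a="@needs native check", tag_b="OK"):
    """Return the sorted sentence IDs that carry both tags."""
    return sorted(ids_by_tag.get(tag_a, set()) & ids_by_tag.get(tag_b, set()))


# Tiny in-memory sample standing in for the real export.
sample = ["1\tOK", "1\t@needs native check", "2\tOK", "3\t@needs native check"]
print(ids_with_both(load_tags(sample)))  # prints [1]
```

To run it on the real export, pass an open file instead of the sample list, e.g. `load_tags(open("tags.csv", encoding="utf-8", newline=""))`. Sets avoid having to guess an upper bound on sentence IDs.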

alexmarcelo, 1 June 2013 at 12:23:34 am UTC

> so that they can remove both tags
Why would we want to remove an "OK" tag?

al_ex_an_der, 1 June 2013 at 12:34:17 am UTC

> remove both tags

"OK" tags should never be removed.
Their most important function is to say: even if the author isn't a native speaker, the sentence has been checked by a native speaker, and therefore you can trust it as if it were owned by a native speaker.

Of course there are a lot of other sentences tagged "OK" too, but it is for a sentence owned by a non-native speaker that this tag makes the really big difference in the eyes of the user.

Of course, if a sentence has both "@needs native check" and "OK", then "@needs native check" no longer makes sense and should be removed. Usually this is done right away when the sentence is checked and tagged "OK".
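
The complementary set, sentences of one language that still carry "@needs native check" and have no "OK" tag yet, can be derived from the downloaded exports in a similar way. The sketch below does not cover the "without downloading the corpus" case asked about at the top of the thread. It assumes a tab-separated tags file (sentence ID, tag name) and a tab-separated sentences file (sentence ID, language code, text); the function name `needs_check_ids` is hypothetical.

```python
import csv

NNC = "@needs native check"


def needs_check_ids(tag_lines, sentence_lines, lang):
    """Return sorted IDs of sentences in `lang` tagged NNC but not "OK".

    Assumptions: tag rows are "sentence_id<TAB>tag_name" and
    sentence rows are "sentence_id<TAB>lang<TAB>text".
    """
    nnc, ok = set(), set()
    for row in csv.reader(tag_lines, delimiter="\t", quoting=csv.QUOTE_NONE):
        if len(row) < 2 or not row[0].isdigit():
            continue  # skip malformed rows
        if row[1] == NNC:
            nnc.add(int(row[0]))
        elif row[1] == "OK":
            ok.add(int(row[0]))

    pending = nnc - ok  # still waiting for a native check

    in_lang = set()
    for row in csv.reader(sentence_lines, delimiter="\t", quoting=csv.QUOTE_NONE):
        if len(row) >= 2 and row[0].isdigit() and row[1] == lang:
            in_lang.add(int(row[0]))

    return sorted(pending & in_lang)
```

Both arguments can be open file handles or any iterable of lines, and `lang` is whatever code the sentences file uses in its second column (for example "eng").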

alexmarcelo, 1 June 2013 at 12:35:51 am UTC

+1

AlanF_US, 2 June 2013 at 9:23:34 pm UTC

I see. Thanks for the explanation.

marcelostockle, 2 June 2013 at 7:49:11 am UTC

Here's the list I talked about. There were only 22 results:

20337
23054
59378
65213
71100
145969
320988
326866
450824
1065935
1443415
1523131
1763683
2069475
2120582
2180411
2180700
2220721
2220744
2223944
2265117
2468564

You can verify that each of these sentences carries both tags, at least until a maintainer removes the respective @nnc tag.

AlanF_US, 2 June 2013 at 9:24:35 pm UTC

Thanks!