الملف الشخصي
الجُمل
المفردات
Reviews
القوائم
المفضلة
التعليقات
التعليقات على جمل liori
رسائل الحائط
السجلات
تسجيل صوتي
المدوّنات
ترجِم جمل liori
As far as I understand, harshnisar's goal is to actually discover what specific problems (ie. what to focus on) do people have by analyzing the results of this survey—because either he doesn't have access to expert knowledge of the kind you suggest or he wants to compare experts' findings with common people's opinion.
W tej chwili najlepszym sposobem jest użycie programu „Shtooka Recorder”. CK przygotował krótką demonstrację i opis jak to robić: http://a4esl.com/temporary/tatoeba/shtooka/ (po angielsku).
I've uploaded the pre-crash dump of the wall messages here: http://tatoeba.org/files/downlo...e-crash.csv.xz
Probably this one: https://www.assembla.com/spaces...a2/tickets/413
Note, there's difference between FSF itself (the well-known organization with Stallman) and FSF France (which hosts Tatoeba).
And I agree that this would be probably the best option if someone really wants to financially help Tatoeba now. Handling donations has an overhead by itself, FeuDRenais pointed out additional things already—and FSF France should have all the donation-related legal matters already solved.
> Yes, I will definitely leave Tatoeba if there is one more English native Corpus Maintainer […]
That's fine for me.
Sure. The meeting was yesterday. Started at 16 and it seems that took much more than 3 hours… there were at least 40 organizations that wanted to get feedback.
As to Tatoeba, we were told there were no problems with our application, it was quite good. It seems that we have the bad luck of applying while there were many other language/translation-related organization applications, and Google just couldn't accommodate all of us.
We were encouraged to apply next year if the program will be continued.
From my experiences—many people on lang8 tend to accept sentences that are good enough to be understood, but not necessarily correct. But if your journal entry has been reviewed by several people, chances for getting a thorough review are quite good. You know this happens when you get an overly nitpicking review, possibly from a person who actually uses grammar terms to describe mistakes.
So, if you'll be careful to add only sentences after a careful review, I don't see why not. But even then, please add a comment to each such sentence that the sentence was checked by a native. The usual policy here is that non-native speakers are discouraged from adding sentences due to bad experiences with non-natives adding lots of incorrect sentences, and therefore your sentences could be seen in bad light.
Ah, finally you expressed your concerns in a clear way.
I don't know how did you associate “being a programmer” with “controlling” anyone. It's just an additional skill that might be useful.
Let me give an example. Using qdii's tatoparser and its regexp engine we found on IRC that there are sentences that start with a space character. This problem is not visible in HTML, but it may potentially hinder the deduplication process. All this thanks to programming skills of a contributor.
Do you still think that “Programming doesn't help maintaining corpora”?
Do you want to say that people who programmed Tatoeba do not care about Tatoeba contributors?
An architect is certainly in position of making the life of inhabitants easier… somehow I think though this wasn't what you had in mind.
Hello, nice to meet you.
Ah, btw, could you provide examples of him being “an aggressive person” and “intolerant”? That would add quite a lot of weight to your currently unjustified argument.
I fail to see an analogy here, could you explain?
That action shows that AlanF_US cares about the whole project, not only about the corpus.
Also, actually, programming does help with corpus maintenance. We couldn't cooperate on creating such a huge database without computer software, and Tatoeba website is software. If you deny that, you should probably start creating new corpus in your notepad.
That's why I don't add my own supposed-to-be-English sentences to the database ^^
Thank you.
I cannot judge his contributions, but I'd like to remind that AlanF_US contributed to our GSoC application. Just my 3 cents.
Ah, I forgot about that, thank you for remindering.
English also has a problem of unusually high ratio of bad sentences due to the Tanaka corpus. It seems that he wants to take care of them. That's a good reason to have more English corpus maintainers.
What would you want to see on his profile page?
Update on Google Summer of Code: Our application has been rejected. The details are not known yet; I will try to attend the feedback session on IRC, where we hopefully get justification on the result (channel #gsoc on Freenode on Friday, April 19, 2013 at 16:00 UTC).
I'd like to thank all the people who helped preparing the application: AlanF_US, CK, gillux, LinkMauve, Pepe, qdii and sysko. I hope next year we will have more luck!