وال (ہک تند)
گُر
Before asking a question, make sure to read the FAQ.
We aim to maintain a healthy atmosphere for civilized discussions. Please read our rules against bad behavior.
CK
ہک گھنٹہ پہلے
LeviHighway
ہک گھنٹہ پہلے
frpzzd
ہک گھنٹہ پہلے
doemaar14
کل
gillux
کل
sharptoothed
کل
Babelball
کل
TATAR1
کل
LeviHighway
کل
AlanF_US
کل
Is adding a traditional Chinese entry discouraged if a simplified version lready exists and is correctly converted?
✹✹ Stats & Graphs ✹✹
Tatoeba Stats, Graphs & Charts have been updated:
https://tatoeba.j-langtools.com/allstats/
The content of this message goes against our rules and was therefore hidden. It is displayed only to admins and to the author of the message.
Hello y'all,
I just wanted to ask for help contacting user tommg whose websites Linguno and ListeningPractice were based on the Tatoeba corpus. I've seen him mentioning a couple of times these projects here, so I thought that maybe someone may know him here.
The thing is that the first page has been unreachable for the past couple of days + the support e-mail address doesn't really respond to any message.
Obviously, if the post is against any rule in here, I'll delete it immediately.
I'm happy to say that Linguno is back in operation today.
The content of this message goes against our rules and was therefore hidden. It is displayed only to admins and to the author of the message.
The content of this message goes against our rules and was therefore hidden. It is displayed only to admins and to the author of the message.
The content of this message goes against our rules and was therefore hidden. It is displayed only to admins and to the author of the message.
Plej varmajn paskajn bondezirojn al vi.
Ĝojan Paskon!
Ĝojan Paskon ankaŭ al vi!!
Hello, do we have any of the admins around? Some people have been posting unrelated things?
The content of this message goes against our rules and was therefore hidden. It is displayed only to admins and to the author of the message.
If you want to report a spammer, please send a private message to TatoebaAdmins (or, if you can't remember that username, any individual admin). Please do not write Wall posts with links to spammers's profiles, messages, or sentences, since this will bring them more attention and encourage them to write more spam.
Oh, I used team@tatoeba.org and didn't get an answer. So I used the wrong address. Thanks for letting me know!
That address works, too. I see that you wrote an e-mail three days ago. The messages were hidden pretty soon after that. Thanks for reporting the problem.
When you search for a Korean word on Tatoeba, such as 오늘, sentences including 오늘은 won't appear. Korean is a language that usually uses many kinds of suffixes. Is it possible for the search engine to recognize words with different suffixes?
You can use a * symbol to represent any number of characters. E.g. 오늘* will find 오늘 followed by any suffix: https://tatoeba.org/en/sentence...%EB%8A%98*&to=
You can also use it at the beginning, e.g. *십시오 for polite requests: https://tatoeba.org/en/sentence...C%EC%98%A4&to=
Or somewhere in the middle of a word if you want.
This and other search engine features are explained on the wiki: https://en.wiki.tatoeba.org/art...ow/text-search
Well, I wish more that Tatoeba treats Korean like Chinese and Japanese... Korean just has too many compounds and suffixes, so considering the parts between each space as independent words is impractical. Also, this makes the 'required vocabulary' have to include all forms of same words...
I understand, and I wish we had better support for Korean, too. But I would like to point out that the handling of Chinese and Japanese is different but not that great neither. Chinese and Japanese characters are all considered independently and the search engine does not recognize word boundaries. This leads to the limitation described here: https://en.wiki.tatoeba.org/art...ord-boundaries
Now we could perfectly enable the same behavior for Korean characters too, if you think it would be overall beneficial despite that limitation. If you'd like to help evaluating such change, we could enable it on our testing server and get your feedback.
Yes, that would be nice! Then "학생이에요." (I am a student) could be either found by "학생" (student) and "에요" (to be)? "대한민국" (Republic of Korea) could be either found with "민국" (republic) and "대한" (Korea) right? Also, if I add "월" (month) or "술래잡기" (tag) in required vocabularies, and if a sentence includes "5월" (May) or "술래잡기와" (와 means "and"), it will also be considered to include that word? Also I think it's good for Korean than other languages because Korean commonly uses 2,500+ seperate Unicode characters in their language. This would ensure the accuracy (it's not like it would include "apple" when you search "a", if you enabled it for English)
I have temporarily configured Korean to be treated like Chinese and Japanese on the testing server: https://dev.tatoeba.org/fr/sent...C%EA%B5%AD&to=
Note that the testing server only contains a subset of what is on tatoeba.org, and it is separate. Feel free to add whatever Korean sentences you want on the testing server, so you can test out how the search behave. (You’ll have to create a new account there.) Newly added sentences should appear in search results within 15 minutes.
Once it is confirmed that this change overall improves search in Korean, we can bring it to tatoeba.org, too.