Wall (7120 threads)
Astúcias
Before asking a question, make sure to read the FAQ.
We aim to maintain a healthy atmosphere for civilized discussions. Please read our rules against bad behavior.
sharptoothed
3 days ago
sharptoothed
3 days ago
TATAR1
3 days ago
AlanF_US
4 days ago
sharptoothed
5 days ago
Shanaz
8 days ago
Qaztat
8 days ago
TATAR1
8 days ago
Tartar
8 days ago
menaud
11 days ago

fell in love

[not needed anymore- removed by CK]

:O what a good thing!
I wanted to do it for Italian long time ago, but I don't know how to do it :/
Can someone tell me how to create such a list or do it for me? ;)

I tried frequency list. Not only in Italian. But I like more concise pocket dictionaries. The list of 50000 words not yet in Tatoeba (Italian only):
http://similar.uw.hu/taitadict_a.htm
An Italian speaker can decide what are useful ones.
(I posted the German to Hungarian sentence links a week ago, what contains untranslated sentences/words, but I did not get any reply :P) Maybe this one will be more "useful".
Opinions please. >:D (I made a verb conjugator only for filter words, bah).

Oh thank you Debian, you've been so kind!
I'm going to start working on these words tomorrow.
Good night!!

http://en.wiktionary.org/wiki/W...requency_lists
there you can found some frequency list for a bunch of languages.

http://corpus.leeds.ac.uk/list.html
In addition to sysko's link, this page has some much larger frequency lists for a few languages that might be easier to programmatically extract.

to ease the navigation can you add an anchor to these 3 categories on your page (otherwise we need to scroll again and again :) )
by the way great idea (as ever :p)
I will try to do the same for Chinese and French.

Great idea!

Can you add wildcards to the search? It is frustrating to try to find affix translations as used in different sentences.

[not needed anymore- removed by CK]

Yes, it looks like the relevant setting (on the server side) is handled here as I've pasted here (from that link). But it's not a change I can make. That's why I decided to ask you guys to add it.
11.2.19. enable_star
Enables star-syntax (or wildcard syntax) when searching through prefix/infix indexes. Optional, default is is 0 (do not use wildcard syntax), for compatibility with 0.9.7. Known values are 0 and 1.
This feature enables "star-syntax", or wildcard syntax, when searching through indexes which were created with prefix or infix indexing enabled. It only affects searching; so it can be changed without reindexing by simply restarting searchd.
The default value is 0, that means to disable star-syntax and treat all keywords as prefixes or infixes respectively, depending on indexing-time min_prefix_len and min_infix_len settings. The value of 1 means that star ('*') can be used at the start and/or the end of the keyword. The star will match zero or more characters.
For example, assume that the index was built with infixes and that enable_star is 1. Searching should work as follows:
"abcdef" query will match only those documents that contain the exact "abcdef" word in them.
"abc*" query will match those documents that contain any words starting with "abc" (including the documents which contain the exact "abc" word only);
"*cde*" query will match those documents that contain any words which have "cde" characters in any part of the word (including the documents which contain the exact "cde" word only).
"*def" query will match those documents that contain any words ending with "def" (including the documents that contain the exact "def" word only).

Yep I know about this, but unfortunately the server is already used to the maximun of its capacity (the current load average is 1.36 1.60 1.84) so we can't enable this option without degrading the overall performance, but for sure we will activate it as soon as we have the new version.

why font of persian is changed?

Bei der Suche ist der Satz: "HINWEIS: Sätze, die eine direkte Übersetzung haben, werden angezeigt. " falsch. Müsste lauten: "HINWEIS: Sätze, die eine INdirekte Übersetzung haben, werden angezeigt. "

all raymond from shenzhen china come here,how are you.

Krom la vjetnama kiu estas mia denaska lingvo, mi parolas Esperanton.

I made a video tutorial for Shtooka. I'll probably redo it later for 1080p full HD :)
But for now, http://www.youtube.com/watch?v=AcJoLBjUOaY

AWESOME!!! THANK YOOOUUUU!!!
It's pretty good ^^ I'll add the link to your tutorial in the FAQ, if you're okay with that.

No problem, think of it as my apology for not getting the Lojban audio done yet :)
And as I said, I'll probably redo it in a higher resolution, and with more zooming.
So go ahead and add it. But be prepared to switch it out when the newer version comes out! ^___^

You're doing audio for Lojban? Cool. I can't wait to have some of my sentences recorded. :-)
You might want to read the comments on the sentences before recording them, because some of them are somewhat controversial or may contain errors. I'm not really sure about several of my sentences. Later I might make a list of all the ones I think are correct, and of all the ones that might not be.

Hmm, okay, I might do that then.
Or I might justs do the list and if people need me to re-record something because the sentence got altered, then I can do that.
Thanks for the tip though :)

I created two lists:
http://tatoeba.org/spa/sentences_lists/show/294 has sentences that I believe to be correct.
http://tatoeba.org/spa/sentences_lists/show/295 has sentences that I know are incorrect or about which I have doubts.
I have only had time to go through some of my sentences. Later I will finish mine and go through the sentences written by others. If you want to start recording, you can start with the ones in this list and any of my other sentences that look simple and reliable. (The things that I am most likely to get wrong are complex tenses and complex negation logic). Also, tijlan's sentences are very good and you can use those. Just make sure to read the comments. Any weird looking sentences should be run through jbofi'e to see if they parse. If not, don't record them. (In particular, a number of arihato's sentences don't parse.)

Hmm. Well, I might go through those /after/ I record. Because I already have an official list with the sentence number and everything nicely set up. But I can go through afterwards and let the officials know which sentences not to add because they're invalid.

** Tag cleanup I **
Now that we've semi-hardened the set of tags that we have available, it's time to start sifting through these and get some order in place. As a first step, I've been going through the quotes. Mainly because they're easy to identify and constitute around a fifth of the tags. Also because there are a lot of duplicate tags that will allow us to substantially cut down the number of tags.
In the first batch, I've sent Trang a list of tags that need translating or standardising (with the "by " and "from " prefixes discussed earlier here on the wall and have already become a de facto standard). Non-English tags have furthermore been translated or transcribed into English/ASCII.
To pre-empt natural concerns: Yes, this is only temporary. No, none of the original tag names are lost. I'm keeping track of these. Once we get a system in place to handle the translations, we'll reinstate them. Those interested in translating tags can contact me via a private message or email.
Please refer to:
http://martin.swift.is/tatoeba/tags-cleanup.html
for a list of tags that are being renamed.
The next step is to merge duplicate quote tags. After that, I plan to continue to organise the tags at
http://martin.swift.is/tatoeba/tags.html
and try to figure out which tags are particularly useful, which ones useless. CK has also done considerable work on tags (beside the mountain of work he's contributed to Tatoeba on various other fronts) and those interested should have a look at his work at:
http://a4esl.com/temporary/tatoeba/
Thoughts and comments on this batch and other issues related to tags are welcome, either in this thread, a new one or private message -- whichever best fits the topic.

About my 'dual' tag. I think it's useful, because it marks sentences where subject is used in the dual number. Although it's not a widespread feature of world's languages, there are still certain languages which use it. I use it for Old Estern Slavic, and you know better than me that it is used in Icelandic. It is also mandatory in the standard Arabic and in Slovene. So I think we should keep this tag.

Thanks for that clarification. I've added annotation to the tag in my list and moved it to the "Grammatical aspect, mood and tense (and number)" section.
PS. For those curious, I'd placed the 'dual' tag under the ??? section on that page. Near the bottom are a bunch of things I haven't sorted out for one reason or another.
PPS. For the super-curious, the dual number is actually very rare in Icelandic. It was in use for first and second person personal pronouns but not any more. The only real remnants are in the distinction between which of two and which of many (hvor and hver). A very useful distinction, that.

please delete the account "seegii"
because he/she/it behaves like a robot
and adding sentences that not compatible with Tatoeba standards

I've sent the user a message asking them to respond to the comments raised on the sentences. If nothing happens, we can safely delete these soon.