Dinding (7.146 topik)
Tips
Sebelum mengajukan sebuah pertanyaan, pastikan Anda membaca daftar Pertanyaan Umum terlebih dahulu.
Kami berusaha untuk menjaga lingkungan yang sehat untuk diskusi bersahaja. Silakan baca aturan kami mengenai perilaku buruk.
frpzzd
2 hari yang lalu
EugeneGS
2 hari yang lalu
frpzzd
2 hari yang lalu
EugeneGS
3 hari yang lalu
frpzzd
3 hari yang lalu
gillux
3 hari yang lalu
frpzzd
5 hari yang lalu
sharptoothed
6 hari yang lalu
marafon
7 hari yang lalu
Pfirsichbaeumchen
7 hari yang lalu

** Bug? **
When I go to http://tatoeba.org/eng/favorites/of_user/1005 and click "show user" with any username, I get a mostly blank page with the title "Errors". Blank because only the first 51 lines are received.

Fixed. The form wasn't supposed to be there anymore ^^

[not needed anymore- removed by CK]

It will be a long one:
Javascript template Italian (I removed some noise from text):
A http://similar.uw.hu/ita_1.htm
B http://similar.uw.hu/ita_2.htm
C http://similar.uw.hu/ita_3.htm
D http://similar.uw.hu/ita_4.htm
E http://similar.uw.hu/ita_5.htm
F http://similar.uw.hu/ita_6.htm
G http://similar.uw.hu/ita_7.htm
H http://similar.uw.hu/ita_8.htm
I http://similar.uw.hu/ita_9.htm
J http://similar.uw.hu/ita_10.htm
K http://similar.uw.hu/ita_11.htm
L http://similar.uw.hu/ita_12.htm
M http://similar.uw.hu/ita_13.htm
N http://similar.uw.hu/ita_14.htm
O http://similar.uw.hu/ita_15.htm
P http://similar.uw.hu/ita_16.htm
Q http://similar.uw.hu/ita_17.htm
R http://similar.uw.hu/ita_18.htm
S http://similar.uw.hu/ita_19.htm
T http://similar.uw.hu/ita_20.htm
U http://similar.uw.hu/ita_21.htm
V http://similar.uw.hu/ita_22.htm
W http://similar.uw.hu/ita_23.htm
X http://similar.uw.hu/ita_24.htm
Y http://similar.uw.hu/ita_25.htm
Z http://similar.uw.hu/ita_26.htm
Javascript template Russian (Yandex search engine links, the last few characters of alphabet was missing from my source wordlist file):
a http://similar.uw.hu/rus_1.htm
б http://similar.uw.hu/rus_2.htm
в http://similar.uw.hu/rus_3.htm
г http://similar.uw.hu/rus_4.htm
д http://similar.uw.hu/rus_5.htm
е http://similar.uw.hu/rus_6.htm
ё http://similar.uw.hu/rus_7.htm
ж http://similar.uw.hu/rus_8.htm
з http://similar.uw.hu/rus_9.htm
и http://similar.uw.hu/rus_10.htm
й http://similar.uw.hu/rus_11.htm
к http://similar.uw.hu/rus_12.htm
л http://similar.uw.hu/rus_13.htm
м http://similar.uw.hu/rus_14.htm
н http://similar.uw.hu/rus_15.htm
о http://similar.uw.hu/rus_16.htm
п http://similar.uw.hu/rus_17.htm
р http://similar.uw.hu/rus_18.htm
с http://similar.uw.hu/rus_19.htm
т http://similar.uw.hu/rus_20.htm
у http://similar.uw.hu/rus_21.htm
ф http://similar.uw.hu/rus_22.htm
х http://similar.uw.hu/rus_23.htm
If a made mistakes, I will delete this wall post, and will provide new links (or I overwrite current ones).

Thank you, I believe it may be useful. :)
The «ъ», «ы» and «ь» missing is perfectly OK: they aren’t allowed in the beginning of the word by Russian phonology.[1]
The «э» missing isn't a big deal since it's a rather rare letter.
But I’d like to warn against using the Russian wordlist. It is simply too big!
It contains, in particular:
· old-fashioned spellings (азбест/azbest instead of асбест/asbest ‘asbestos’)
· old-fashioned words (алкалический/alkalicheskij ‘alkaline (adj.)’ instead of щелочной/shhelochnoj)
· some names that have been obsolete for centuries like Агриппина/Agrippina or Андроник/Andronik
· two forms of patronymics for each male name: Андроникович/Andronikovich ‘son of Andronik’, Андрониковна/Andronikovna ‘daughter of Andronik’
· some words that can be used only in poetic speech, and sound very strange elsewhere (e.g. аметистово/ametistovo ‘amethystively, in an amethyst way’[2])
I doubt all Russian contributors will be able to cover this vocabulary in any reasonable period of time. I think we need something more concise.
[1] Ы/y` might occur at the beginning of some Turkish, Eskimo etc. placenames, but even there it is usually supstituted by И.
[2] Yes, it is very weird. Although it can be used as a part of compound words, e.g. аметистово-фиолетовый/ametistovo-fioletovy`j ‘purple like an amethyst, amethyst-purple’, but in such a case it’s not a word on its own.

By the way, ритуално is a mere mistake in the wordlist.

Demetrius, I can improve this list, but:
1) I used a big wordlist, so native speakers can decide which words are archaic, or not used at all.
1a) If you or anyone have suggestions, what wordlist/dictionary should I use, please tell me. The most recent monolingual dictionary (Толковый словарь) what I found is from 1950'. All those dictionaries available from slovopedia. (Modern ones contains many words from "old" dictionaries)
2) I do not like use any frequency lists, because they contains many words not used in colloquial speech. They even contain slangs.
3) What kind of list do you want? Even ruscorpora does not contains all necessary words. I can compile a wordlist from any documents. If I would use movie subtitles, it may contains too much slangs. If I would use contemporary literature, it may contains too much words, used only by writers.
4) There is no russian wordnet available yet (or not as big as it should be).
5) I could use a wordlist from open source spell checkers. It seems a good idea. But that list is huge too.
-) So I can not provide a concise and useful wordlist from materials, what I know about. Send me PM and links if you have some useful bookmarks. Or if it is not available in electronically form, I would still interested in it.

Very nice idea :) Once these lists start to shrink, it might be useful to do the same thing for words that only show up below a certain threshold -- for example, in fewer than ten sentences, or something like that.
Also -- translators, keep in mind that if you know the translation of a word in another language, you can search for existing sentences which you can translate in order to fill the gap. For example, if you want to add a sentence for the Russian word "зять" and you know that in English it can mean "son in law", then you can search for existing English sentences to translate:
http://tatoeba.org/eng/sentence...rom=und&to=und
At the same time, you can certainly always add new sentences :)
(By the way, CK, right now, the links for eng-2 and eng-awl still reference eng-1.html)

** Tag cleanup II **
We now continue with the clean up of the tags quotes. There were multiple tags that referred to the same person. I've identified these, transliterated others. There are a few still to do, but once these are done, almost a fifth of the tags will have been sorted and taken care of.
Please refer to:
http://martin.swift.is/tatoeba/tags-cleanup.html
for a list of tags that are being merged (there are also a couple of renames that don't seem to have gone through last time).
After that I'm hoping to finish sorting the tags before merging and renaming those. With that out of the way, we can start looking at categorisation schemes for the tags, to make them easier to find and use. I've been chatting with CK about that, but we'll try to have a little brain storm session at some point in the beginning of next year.
I just want to bring that to people's attention as it may help to start thinking about this. Send your ideas (even the half-baked ones) my way if you like.

So far, for me, searching for any word in Japanese spelled in kana will return unrelated results. It appears that each kana character is being treated as an individual word.

Yes, each character seems to be searched separately, but if you only get unrelated results, it's because there are no better matches.
Searching for "飛行機" will give you results for "飛行機" first, but sentences with "飛行" and "機" separately last.
Searching kana is a bit more difficult, because it will only turn up results where the sentence is written with the kana. Searching for "ひこうき" will, for example not turn up any meaningful results, but "そんなことないよ" will (though not that exact phrase, though).
飛行機: http://tatoeba.org/eng/sentence...rom=jpn&to=und
ひこうき: http://tatoeba.org/eng/sentence...rom=jpn&to=und
そんなことないよ: http://tatoeba.org/eng/sentence...rom=jpn&to=und

Though you can, of course, search for exact phrases with quotes (e.g. "そんなことないよ").

Tajperareto:
http://tatoeba.org/epo/sentence...ndom_sentences
Ĉu traduki frrazojn... > frazojn

Mi korektis tion ĉe https://translations.launchpad.net/tatoeba

fell in love

[not needed anymore- removed by CK]

:O what a good thing!
I wanted to do it for Italian long time ago, but I don't know how to do it :/
Can someone tell me how to create such a list or do it for me? ;)

I tried frequency list. Not only in Italian. But I like more concise pocket dictionaries. The list of 50000 words not yet in Tatoeba (Italian only):
http://similar.uw.hu/taitadict_a.htm
An Italian speaker can decide what are useful ones.
(I posted the German to Hungarian sentence links a week ago, what contains untranslated sentences/words, but I did not get any reply :P) Maybe this one will be more "useful".
Opinions please. >:D (I made a verb conjugator only for filter words, bah).

Oh thank you Debian, you've been so kind!
I'm going to start working on these words tomorrow.
Good night!!

http://en.wiktionary.org/wiki/W...requency_lists
there you can found some frequency list for a bunch of languages.

http://corpus.leeds.ac.uk/list.html
In addition to sysko's link, this page has some much larger frequency lists for a few languages that might be easier to programmatically extract.

to ease the navigation can you add an anchor to these 3 categories on your page (otherwise we need to scroll again and again :) )
by the way great idea (as ever :p)
I will try to do the same for Chinese and French.

Great idea!

Can you add wildcards to the search? It is frustrating to try to find affix translations as used in different sentences.

[not needed anymore- removed by CK]

Yes, it looks like the relevant setting (on the server side) is handled here as I've pasted here (from that link). But it's not a change I can make. That's why I decided to ask you guys to add it.
11.2.19. enable_star
Enables star-syntax (or wildcard syntax) when searching through prefix/infix indexes. Optional, default is is 0 (do not use wildcard syntax), for compatibility with 0.9.7. Known values are 0 and 1.
This feature enables "star-syntax", or wildcard syntax, when searching through indexes which were created with prefix or infix indexing enabled. It only affects searching; so it can be changed without reindexing by simply restarting searchd.
The default value is 0, that means to disable star-syntax and treat all keywords as prefixes or infixes respectively, depending on indexing-time min_prefix_len and min_infix_len settings. The value of 1 means that star ('*') can be used at the start and/or the end of the keyword. The star will match zero or more characters.
For example, assume that the index was built with infixes and that enable_star is 1. Searching should work as follows:
"abcdef" query will match only those documents that contain the exact "abcdef" word in them.
"abc*" query will match those documents that contain any words starting with "abc" (including the documents which contain the exact "abc" word only);
"*cde*" query will match those documents that contain any words which have "cde" characters in any part of the word (including the documents which contain the exact "cde" word only).
"*def" query will match those documents that contain any words ending with "def" (including the documents that contain the exact "def" word only).

Yep I know about this, but unfortunately the server is already used to the maximun of its capacity (the current load average is 1.36 1.60 1.84) so we can't enable this option without degrading the overall performance, but for sure we will activate it as soon as we have the new version.

why font of persian is changed?

Bei der Suche ist der Satz: "HINWEIS: Sätze, die eine direkte Übersetzung haben, werden angezeigt. " falsch. Müsste lauten: "HINWEIS: Sätze, die eine INdirekte Übersetzung haben, werden angezeigt. "