menu
Tatoeba
language
Registreren Anmellen
language Plattdüütsch
menu
Tatoeba

chevron_right Registreren

chevron_right Anmellen

Dörkieken

chevron_right Show random sentence

chevron_right Na Spraak dörkieken

chevron_right Na List dörkieken

chevron_right Dörkieken na Tag

chevron_right Audiodatein dörkieken

Community

chevron_right Pinnwand

chevron_right List of all members

chevron_right Languages of members

chevron_right Native speakers

search
clear
swap_horiz
search

Wall (7,146 threads)

Tipps

Ehrdat du en Fraag stellst, lees man de Faken stellt Fragen (FAQ).

We aim to maintain a healthy atmosphere for civilized discussions. Please read our rules against bad behavior.

Ne’este Narichten subdirectory_arrow_right

frpzzd

8 days ago

subdirectory_arrow_right

EugeneGS

8 days ago

subdirectory_arrow_right

frpzzd

8 days ago

subdirectory_arrow_right

EugeneGS

9 days ago

subdirectory_arrow_right

frpzzd

9 days ago

subdirectory_arrow_right

gillux

10 days ago

feedback

frpzzd

11 days ago

feedback

sharptoothed

13 days ago

subdirectory_arrow_right

marafon

13 days ago

subdirectory_arrow_right

Pfirsichbaeumchen

13 days ago

Swift Swift 2010 M12 11 2010 M12 11 23:01:13 UTC flag Report link Permalink

** Bug? **

When I go to http://tatoeba.org/eng/favorites/of_user/1005 and click "show user" with any username, I get a mostly blank page with the title "Errors". Blank because only the first 51 lines are received.

{{vm.hiddenReplies[4431] ? 'expand_more' : 'expand_less'}} hide replies show replies
TRANG TRANG 2010 M12 19 2010 M12 19 18:47:51 UTC flag Report link Permalink

Fixed. The form wasn't supposed to be there anymore ^^

CK CK 2010 M12 17, edited 2019 M10 30 2010 M12 17 11:53:27 UTC, edited 2019 M10 30 01:23:55 UTC flag Report link Permalink

[not needed anymore- removed by CK]

{{vm.hiddenReplies[4479] ? 'expand_more' : 'expand_less'}} hide replies show replies
debian2007 debian2007 2010 M12 17 2010 M12 17 15:36:11 UTC flag Report link Permalink

It will be a long one:
Javascript template Italian (I removed some noise from text):
A http://similar.uw.hu/ita_1.htm
B http://similar.uw.hu/ita_2.htm
C http://similar.uw.hu/ita_3.htm
D http://similar.uw.hu/ita_4.htm
E http://similar.uw.hu/ita_5.htm
F http://similar.uw.hu/ita_6.htm
G http://similar.uw.hu/ita_7.htm
H http://similar.uw.hu/ita_8.htm
I http://similar.uw.hu/ita_9.htm
J http://similar.uw.hu/ita_10.htm
K http://similar.uw.hu/ita_11.htm
L http://similar.uw.hu/ita_12.htm
M http://similar.uw.hu/ita_13.htm
N http://similar.uw.hu/ita_14.htm
O http://similar.uw.hu/ita_15.htm
P http://similar.uw.hu/ita_16.htm
Q http://similar.uw.hu/ita_17.htm
R http://similar.uw.hu/ita_18.htm
S http://similar.uw.hu/ita_19.htm
T http://similar.uw.hu/ita_20.htm
U http://similar.uw.hu/ita_21.htm
V http://similar.uw.hu/ita_22.htm
W http://similar.uw.hu/ita_23.htm
X http://similar.uw.hu/ita_24.htm
Y http://similar.uw.hu/ita_25.htm
Z http://similar.uw.hu/ita_26.htm
Javascript template Russian (Yandex search engine links, the last few characters of alphabet was missing from my source wordlist file):
a http://similar.uw.hu/rus_1.htm
б http://similar.uw.hu/rus_2.htm
в http://similar.uw.hu/rus_3.htm
г http://similar.uw.hu/rus_4.htm
д http://similar.uw.hu/rus_5.htm
е http://similar.uw.hu/rus_6.htm
ё http://similar.uw.hu/rus_7.htm
ж http://similar.uw.hu/rus_8.htm
з http://similar.uw.hu/rus_9.htm
и http://similar.uw.hu/rus_10.htm
й http://similar.uw.hu/rus_11.htm
к http://similar.uw.hu/rus_12.htm
л http://similar.uw.hu/rus_13.htm
м http://similar.uw.hu/rus_14.htm
н http://similar.uw.hu/rus_15.htm
о http://similar.uw.hu/rus_16.htm
п http://similar.uw.hu/rus_17.htm
р http://similar.uw.hu/rus_18.htm
с http://similar.uw.hu/rus_19.htm
т http://similar.uw.hu/rus_20.htm
у http://similar.uw.hu/rus_21.htm
ф http://similar.uw.hu/rus_22.htm
х http://similar.uw.hu/rus_23.htm

If a made mistakes, I will delete this wall post, and will provide new links (or I overwrite current ones).

{{vm.hiddenReplies[4485] ? 'expand_more' : 'expand_less'}} hide replies show replies
Demetrius Demetrius 2010 M12 20 2010 M12 20 17:54:55 UTC flag Report link Permalink

Thank you, I believe it may be useful. :)

The «ъ», «ы» and «ь» missing is perfectly OK: they aren’t allowed in the beginning of the word by Russian phonology.[1]
The «э» missing isn't a big deal since it's a rather rare letter.


But I’d like to warn against using the Russian wordlist. It is simply too big!

It contains, in particular:
· old-fashioned spellings (азбест/azbest instead of асбест/asbest ‘asbestos’)
· old-fashioned words (алкалический/alkalicheskij ‘alkaline (adj.)’ instead of щелочной/shhelochnoj)
· some names that have been obsolete for centuries like Агриппина/Agrippina or Андроник/Andronik
· two forms of patronymics for each male name: Андроникович/Andronikovich ‘son of Andronik’, Андрониковна/Andronikovna ‘daughter of Andronik’
· some words that can be used only in poetic speech, and sound very strange elsewhere (e.g. аметистово/ametistovo ‘amethystively, in an amethyst way’[2])

I doubt all Russian contributors will be able to cover this vocabulary in any reasonable period of time. I think we need something more concise.


[1] Ы/y` might occur at the beginning of some Turkish, Eskimo etc. placenames, but even there it is usually supstituted by И.
[2] Yes, it is very weird. Although it can be used as a part of compound words, e.g. аметистово-фиолетовый/ametistovo-fioletovy`j ‘purple like an amethyst, amethyst-purple’, but in such a case it’s not a word on its own.

{{vm.hiddenReplies[4511] ? 'expand_more' : 'expand_less'}} hide replies show replies
Demetrius Demetrius 2010 M12 20 2010 M12 20 19:58:05 UTC flag Report link Permalink

By the way, ритуално is a mere mistake in the wordlist.

debian2007 debian2007 2010 M12 21 2010 M12 21 12:48:33 UTC flag Report link Permalink

Demetrius, I can improve this list, but:
1) I used a big wordlist, so native speakers can decide which words are archaic, or not used at all.
1a) If you or anyone have suggestions, what wordlist/dictionary should I use, please tell me. The most recent monolingual dictionary (Толковый словарь) what I found is from 1950'. All those dictionaries available from slovopedia. (Modern ones contains many words from "old" dictionaries)
2) I do not like use any frequency lists, because they contains many words not used in colloquial speech. They even contain slangs.
3) What kind of list do you want? Even ruscorpora does not contains all necessary words. I can compile a wordlist from any documents. If I would use movie subtitles, it may contains too much slangs. If I would use contemporary literature, it may contains too much words, used only by writers.
4) There is no russian wordnet available yet (or not as big as it should be).
5) I could use a wordlist from open source spell checkers. It seems a good idea. But that list is huge too.
-) So I can not provide a concise and useful wordlist from materials, what I know about. Send me PM and links if you have some useful bookmarks. Or if it is not available in electronically form, I would still interested in it.

kebukebu kebukebu 2010 M12 18 2010 M12 18 03:47:52 UTC flag Report link Permalink

Very nice idea :) Once these lists start to shrink, it might be useful to do the same thing for words that only show up below a certain threshold -- for example, in fewer than ten sentences, or something like that.

Also -- translators, keep in mind that if you know the translation of a word in another language, you can search for existing sentences which you can translate in order to fill the gap. For example, if you want to add a sentence for the Russian word "зять" and you know that in English it can mean "son in law", then you can search for existing English sentences to translate:

http://tatoeba.org/eng/sentence...rom=und&to=und

At the same time, you can certainly always add new sentences :)

(By the way, CK, right now, the links for eng-2 and eng-awl still reference eng-1.html)

Swift Swift 2010 M12 17 2010 M12 17 21:44:00 UTC flag Report link Permalink

** Tag cleanup II **

We now continue with the clean up of the tags quotes. There were multiple tags that referred to the same person. I've identified these, transliterated others. There are a few still to do, but once these are done, almost a fifth of the tags will have been sorted and taken care of.

Please refer to:
http://martin.swift.is/tatoeba/tags-cleanup.html
for a list of tags that are being merged (there are also a couple of renames that don't seem to have gone through last time).

After that I'm hoping to finish sorting the tags before merging and renaming those. With that out of the way, we can start looking at categorisation schemes for the tags, to make them easier to find and use. I've been chatting with CK about that, but we'll try to have a little brain storm session at some point in the beginning of next year.

I just want to bring that to people's attention as it may help to start thinking about this. Send your ideas (even the half-baked ones) my way if you like.

DancingHorses DancingHorses 2010 M12 17 2010 M12 17 06:42:43 UTC flag Report link Permalink

So far, for me, searching for any word in Japanese spelled in kana will return unrelated results. It appears that each kana character is being treated as an individual word.

{{vm.hiddenReplies[4473] ? 'expand_more' : 'expand_less'}} hide replies show replies
Swift Swift 2010 M12 17 2010 M12 17 13:37:28 UTC flag Report link Permalink

Yes, each character seems to be searched separately, but if you only get unrelated results, it's because there are no better matches.

Searching for "飛行機" will give you results for "飛行機" first, but sentences with "飛行" and "機" separately last.

Searching kana is a bit more difficult, because it will only turn up results where the sentence is written with the kana. Searching for "ひこうき" will, for example not turn up any meaningful results, but "そんなことないよ" will (though not that exact phrase, though).

飛行機: http://tatoeba.org/eng/sentence...rom=jpn&to=und
ひこうき: http://tatoeba.org/eng/sentence...rom=jpn&to=und
そんなことないよ: http://tatoeba.org/eng/sentence...rom=jpn&to=und

{{vm.hiddenReplies[4483] ? 'expand_more' : 'expand_less'}} hide replies show replies
Swift Swift 2010 M12 17 2010 M12 17 15:55:22 UTC flag Report link Permalink

Though you can, of course, search for exact phrases with quotes (e.g. "そんなことないよ").

Gyuri Gyuri 2010 M12 16 2010 M12 16 14:21:22 UTC flag Report link Permalink

Tajperareto:

http://tatoeba.org/epo/sentence...ndom_sentences

Ĉu traduki frrazojn... > frazojn

{{vm.hiddenReplies[4465] ? 'expand_more' : 'expand_less'}} hide replies show replies
Aleksej Aleksej 2010 M12 17 2010 M12 17 13:29:23 UTC flag Report link Permalink

Mi korektis tion ĉe https://translations.launchpad.net/tatoeba

natanael22vs natanael22vs 2010 M12 17 2010 M12 17 11:57:24 UTC flag Report link Permalink

fell in love

CK CK 2010 M12 16, edited 2019 M10 30 2010 M12 16 08:42:11 UTC, edited 2019 M10 30 01:24:08 UTC flag Report link Permalink

[not needed anymore- removed by CK]

{{vm.hiddenReplies[4463] ? 'expand_more' : 'expand_less'}} hide replies show replies
Pharamp Pharamp 2010 M12 16 2010 M12 16 15:19:00 UTC flag Report link Permalink

:O what a good thing!
I wanted to do it for Italian long time ago, but I don't know how to do it :/

Can someone tell me how to create such a list or do it for me? ;)

{{vm.hiddenReplies[4467] ? 'expand_more' : 'expand_less'}} hide replies show replies
debian2007 debian2007 2010 M12 16 2010 M12 16 23:17:07 UTC flag Report link Permalink

I tried frequency list. Not only in Italian. But I like more concise pocket dictionaries. The list of 50000 words not yet in Tatoeba (Italian only):

http://similar.uw.hu/taitadict_a.htm

An Italian speaker can decide what are useful ones.
(I posted the German to Hungarian sentence links a week ago, what contains untranslated sentences/words, but I did not get any reply :P) Maybe this one will be more "useful".

Opinions please. >:D (I made a verb conjugator only for filter words, bah).

{{vm.hiddenReplies[4471] ? 'expand_more' : 'expand_less'}} hide replies show replies
Pharamp Pharamp 2010 M12 16 2010 M12 16 23:48:56 UTC flag Report link Permalink

Oh thank you Debian, you've been so kind!
I'm going to start working on these words tomorrow.

Good night!!

sysko sysko 2010 M12 16 2010 M12 16 20:10:42 UTC flag Report link Permalink

http://en.wiktionary.org/wiki/W...requency_lists
there you can found some frequency list for a bunch of languages.

Zifre Zifre 2010 M12 16 2010 M12 16 22:07:01 UTC flag Report link Permalink

http://corpus.leeds.ac.uk/list.html

In addition to sysko's link, this page has some much larger frequency lists for a few languages that might be easier to programmatically extract.

sysko sysko 2010 M12 16 2010 M12 16 08:44:18 UTC flag Report link Permalink

to ease the navigation can you add an anchor to these 3 categories on your page (otherwise we need to scroll again and again :) )
by the way great idea (as ever :p)
I will try to do the same for Chinese and French.

Swift Swift 2010 M12 16 2010 M12 16 15:14:30 UTC flag Report link Permalink

Great idea!

Disputulo Disputulo 2010 M12 16 2010 M12 16 01:57:34 UTC flag Report link Permalink

Can you add wildcards to the search? It is frustrating to try to find affix translations as used in different sentences.

{{vm.hiddenReplies[4456] ? 'expand_more' : 'expand_less'}} hide replies show replies
CK CK 2010 M12 16, edited 2019 M10 30 2010 M12 16 04:15:42 UTC, edited 2019 M10 30 01:24:17 UTC flag Report link Permalink

[not needed anymore- removed by CK]

{{vm.hiddenReplies[4457] ? 'expand_more' : 'expand_less'}} hide replies show replies
Disputulo Disputulo 2010 M12 16 2010 M12 16 04:43:29 UTC flag Report link Permalink

Yes, it looks like the relevant setting (on the server side) is handled here as I've pasted here (from that link). But it's not a change I can make. That's why I decided to ask you guys to add it.

11.2.19. enable_star

Enables star-syntax (or wildcard syntax) when searching through prefix/infix indexes. Optional, default is is 0 (do not use wildcard syntax), for compatibility with 0.9.7. Known values are 0 and 1.

This feature enables "star-syntax", or wildcard syntax, when searching through indexes which were created with prefix or infix indexing enabled. It only affects searching; so it can be changed without reindexing by simply restarting searchd.

The default value is 0, that means to disable star-syntax and treat all keywords as prefixes or infixes respectively, depending on indexing-time min_prefix_len and min_infix_len settings. The value of 1 means that star ('*') can be used at the start and/or the end of the keyword. The star will match zero or more characters.

For example, assume that the index was built with infixes and that enable_star is 1. Searching should work as follows:

"abcdef" query will match only those documents that contain the exact "abcdef" word in them.
"abc*" query will match those documents that contain any words starting with "abc" (including the documents which contain the exact "abc" word only);
"*cde*" query will match those documents that contain any words which have "cde" characters in any part of the word (including the documents which contain the exact "cde" word only).
"*def" query will match those documents that contain any words ending with "def" (including the documents that contain the exact "def" word only).

{{vm.hiddenReplies[4458] ? 'expand_more' : 'expand_less'}} hide replies show replies
sysko sysko 2010 M12 16 2010 M12 16 07:21:46 UTC flag Report link Permalink

Yep I know about this, but unfortunately the server is already used to the maximun of its capacity (the current load average is 1.36 1.60 1.84) so we can't enable this option without degrading the overall performance, but for sure we will activate it as soon as we have the new version.

mahdiye mahdiye 2010 M12 15 2010 M12 15 12:26:14 UTC flag Report link Permalink

why font of persian is changed?

whosnick whosnick 2010 M12 15 2010 M12 15 06:54:24 UTC flag Report link Permalink

Bei der Suche ist der Satz: "HINWEIS: Sätze, die eine direkte Übersetzung haben, werden angezeigt. " falsch. Müsste lauten: "HINWEIS: Sätze, die eine INdirekte Übersetzung haben, werden angezeigt. "