Profile
Sentences
Vocabulary
Reviews
Lists
Favorites
Comments
Comments on debian2007's sentences
Wall messages
Logs
Audio
Transcriptions
Translate debian2007's sentences
(重な) !(貴重) "重な"
This working for me. Only 重な, some results are 慎重.
If I use only
(重な) !(貴重)
it would search for "重な" "な", so lots of results with "な", but without 重 kanji.
On Russian interface of main menu the text of equivalent translations of "Adopt sentences" and "Discuss sentences" are messed up ("Присваивайте предложения", "Обсуждайте предложения"). They should occupy one single line, but they are two-line text, or the previous menu item covers the next menu item. So the text is too long, and does not fit in one line. This makes the menu item unreadable. I tried it with different web engines. I am using Linux fonts, maybe you do not experience the same in MS environment.
It means every tickets like optimalization and "the small things" should be fixed in the c++ framework; is it worth send a patch to CakePHP framework? Where will be the SVN of the new framework? >:D I like to learn new things, that is why I am asking.
Down. I can not connect.
Both Eo-Rus:
tatoeba.org/eng/sentences/show/704547
http://tatoeba.org/eng/sentences/show/704545
http://tatoeba.org/eng/sentences/show/704543
http://tatoeba.org/eng/sentences/show/704541
http://tatoeba.org/eng/sentences/show/704537
http://tatoeba.org/eng/sentences/show/704533
http://tatoeba.org/eng/sentences/show/704531
http://tatoeba.org/eng/sentences/show/701871
http://tatoeba.org/eng/sentences/show/701869
http://tatoeba.org/eng/sentences/show/701867
http://tatoeba.org/eng/sentences/show/698064
http://tatoeba.org/eng/sentences/show/698061
http://tatoeba.org/eng/sentences/show/698058
http://tatoeba.org/eng/sentences/show/698034
http://tatoeba.org/eng/sentences/show/698026
http://tatoeba.org/eng/sentences/show/697673
http://tatoeba.org/eng/sentences/show/697671
http://tatoeba.org/eng/sentences/show/697669
http://tatoeba.org/eng/sentences/show/697667
http://tatoeba.org/eng/sentences/show/697665
http://tatoeba.org/eng/sentences/show/697662
http://tatoeba.org/eng/sentences/show/697660
http://tatoeba.org/eng/sentences/show/697658
http://tatoeba.org/eng/sentences/show/697656
http://tatoeba.org/eng/sentences/show/697654
These are form the last five days.
I saw one (or more) sentences copy-pasted from lernu.net forum. They are translations of daily esperanto sentences. If you are the author of those forum comments, it is allowed to do that, but otherwise? Or lernu.net is somewhat special, because it is voluntary?
I read the FAQ, I know we should not copy sentences; but when I see something fishy, I usually ask someone.
Thank you. I did not saw the feature in the comment of revisions. Maybe I mistaken.
(Delete wall post function is gone? I used a few times before. ... or I have a sort of amnesia.)
Can I add parentheses in my sentences as a comment? Like:
Original: Én elmentem a vásárba fél pénzzel.
Comment with parentheses: (Én) (elmentem a vásárba) (fél pénzzel).
Little clarification: Something like (Subject) (verb of motion && destination) (instrumental). I think it can be valuable, but I do not want to waste your precious SQL database with my sentence analysis. So I would like to post comments to my already written sentences. (I saw something like this in arihato's comments, and maybe I can do the same). Is it allowed or not? oO
Demetrius, I can improve this list, but:
1) I used a big wordlist, so native speakers can decide which words are archaic, or not used at all.
1a) If you or anyone have suggestions, what wordlist/dictionary should I use, please tell me. The most recent monolingual dictionary (Толковый словарь) what I found is from 1950'. All those dictionaries available from slovopedia. (Modern ones contains many words from "old" dictionaries)
2) I do not like use any frequency lists, because they contains many words not used in colloquial speech. They even contain slangs.
3) What kind of list do you want? Even ruscorpora does not contains all necessary words. I can compile a wordlist from any documents. If I would use movie subtitles, it may contains too much slangs. If I would use contemporary literature, it may contains too much words, used only by writers.
4) There is no russian wordnet available yet (or not as big as it should be).
5) I could use a wordlist from open source spell checkers. It seems a good idea. But that list is huge too.
-) So I can not provide a concise and useful wordlist from materials, what I know about. Send me PM and links if you have some useful bookmarks. Or if it is not available in electronically form, I would still interested in it.
It will be a long one:
Javascript template Italian (I removed some noise from text):
A http://similar.uw.hu/ita_1.htm
B http://similar.uw.hu/ita_2.htm
C http://similar.uw.hu/ita_3.htm
D http://similar.uw.hu/ita_4.htm
E http://similar.uw.hu/ita_5.htm
F http://similar.uw.hu/ita_6.htm
G http://similar.uw.hu/ita_7.htm
H http://similar.uw.hu/ita_8.htm
I http://similar.uw.hu/ita_9.htm
J http://similar.uw.hu/ita_10.htm
K http://similar.uw.hu/ita_11.htm
L http://similar.uw.hu/ita_12.htm
M http://similar.uw.hu/ita_13.htm
N http://similar.uw.hu/ita_14.htm
O http://similar.uw.hu/ita_15.htm
P http://similar.uw.hu/ita_16.htm
Q http://similar.uw.hu/ita_17.htm
R http://similar.uw.hu/ita_18.htm
S http://similar.uw.hu/ita_19.htm
T http://similar.uw.hu/ita_20.htm
U http://similar.uw.hu/ita_21.htm
V http://similar.uw.hu/ita_22.htm
W http://similar.uw.hu/ita_23.htm
X http://similar.uw.hu/ita_24.htm
Y http://similar.uw.hu/ita_25.htm
Z http://similar.uw.hu/ita_26.htm
Javascript template Russian (Yandex search engine links, the last few characters of alphabet was missing from my source wordlist file):
a http://similar.uw.hu/rus_1.htm
б http://similar.uw.hu/rus_2.htm
в http://similar.uw.hu/rus_3.htm
г http://similar.uw.hu/rus_4.htm
д http://similar.uw.hu/rus_5.htm
е http://similar.uw.hu/rus_6.htm
ё http://similar.uw.hu/rus_7.htm
ж http://similar.uw.hu/rus_8.htm
з http://similar.uw.hu/rus_9.htm
и http://similar.uw.hu/rus_10.htm
й http://similar.uw.hu/rus_11.htm
к http://similar.uw.hu/rus_12.htm
л http://similar.uw.hu/rus_13.htm
м http://similar.uw.hu/rus_14.htm
н http://similar.uw.hu/rus_15.htm
о http://similar.uw.hu/rus_16.htm
п http://similar.uw.hu/rus_17.htm
р http://similar.uw.hu/rus_18.htm
с http://similar.uw.hu/rus_19.htm
т http://similar.uw.hu/rus_20.htm
у http://similar.uw.hu/rus_21.htm
ф http://similar.uw.hu/rus_22.htm
х http://similar.uw.hu/rus_23.htm
If a made mistakes, I will delete this wall post, and will provide new links (or I overwrite current ones).
I tried frequency list. Not only in Italian. But I like more concise pocket dictionaries. The list of 50000 words not yet in Tatoeba (Italian only):
http://similar.uw.hu/taitadict_a.htm
An Italian speaker can decide what are useful ones.
(I posted the German to Hungarian sentence links a week ago, what contains untranslated sentences/words, but I did not get any reply :P) Maybe this one will be more "useful".
Opinions please. >:D (I made a verb conjugator only for filter words, bah).
Bug?
Home page http://tatoeba.org/eng/home
More latest comments (show more...) http://tatoeba.org/eng/sentence_comments/index
Filter by language http://tatoeba.org/eng/sentence_comments/index/hun
(2) second page on the top link http://tatoeba.org/eng/sentence...dex/hun/page:2
Press End key/Go down, (3) third page or any
http://tatoeba.org/eng/sentence...s/index/page:3
...Language filter now missing. The bottom links are not updated according to the language filter.
Sry if it is already posted, or the Wall is not the best place to submit this.
More about stats. Maybe I should not return to 4 days old discussion, but anyway.. When I hear the word "quality", I always think about creativity. I mean, err, I am lazy to translate sentences like: "I am a dentist." "She can not speak *." "Today/Tomorrow he is going to *.". Eh, maybe you get it, no offense.
So what if I could somehow measure, what sentences may appeared already in some form in Tatoeba. But it is impossible?! Well, because I do not have 50MB SQL free space (Tatoeba sentences), you can "measure" only one sentence here, only availabe for top 5 languages (but not Japanese, it is complex):
http://similar.uw.hu/
For every word there are some statistics: how many times written in Tatoeba, frequent words in the corpus, not so frequent word. So give it a try! Maybe I am the only one who is missing this feature :_)
(This is not an advertisement! It is a demonstration site! I hope Unicode will work well in your browser.)
It is my first try, I am sure it has mistakes. Still >:D.
Anyone need a perl/php/clojure script? (Mine was written in perl).
Orphan sentences based on downloadable *.csv files:
1) 5616
2) If you do not find a language, maybe it does not have orphans:
acm 1
ara 57
arz 1
bul 6
cat 3
ces 3
cmn 99
dan 4
deu 123
eng 664
epo 2641
est 21
eus 26
fin 5
fra 601
fry 5
heb 15
hin 4
hun 20
ind 13
isl 6
ita 176
jpn 151
kat 9
kor 6
mon 3
nld 182
nob 15
orv 3
pes 36
pol 17
por 47
ron 3
rus 283
slk 1
spa 63
sqi 2
swe 3
tur 12
uig 7
ukr 235
urd 1
vie 32
yue 1
zsm 1
3) How can you increase quality of tatoeba project, if someone able to provide the longest chain stats? (I am not good at graph theory, I do not see the reason of creating the abovementioned graph.)
Sometimes I make mistakes (well, all the time). Other Hungarian contributors may translate the same sentence in the future. It was the second time, when I saw, they translated well. And my translation was crappy. But I do not think, they "can" comment to my sentence, when I translated for example from Japanese, and they will translate it from German/Slovak. They may do not know where was it misinterpreted. They can not advise a translation if they understand the German one, and I understand a Japanese. Result: New Hungarian sentence will be created, I mean duplicated (but their last one is the correct translation). Solution: When I saw, they made a second (Hungarian) version, linked from another language (German) than mine (Japanese), I would check my sentence if I feel I did something wrong in the past. But I can not check up on myself, if I do not receive a message: "New translation added on one of your sentences in Hungarian" or something like that. That was my complain. Hah, maybe there is nothing wrong with the system (tatoeba, contributions, translation links), it was my fault. Maybe I miss a system what is more fault-tolerant, I mean if I faulted, made mistake. Maybe I need a private "ban me" button. Sorry :D. But I really do not know how can I fight against my clumsy sentences.