Wall (7,124 threads)

I'd like to download all French audio files from Tatoeba. Is that possible, and if so, how?


I just noticed that this partially answers a recent question I asked. Are there any plans to make audio downloadable in general? I am writing a free software (GPL) tool for language study and I would like to find the best way to make this audio available.

http://tatoeba.org/fra/sentences/with_audio/fra lists the French sentences that have audio. You can use a Firefox plugin that downloads all URLs of a certain type (mp3) from a page. There should surely be one, just search for it. Then you have 40 pages to grab, a quite manageable amount of manual work.
I can write a simple script for that, but only for Linux or Mac. PM me if you need one.
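
For reference, here is a minimal sketch of the link-collecting step such a script might perform. It assumes (this is not confirmed anywhere in the thread) that each listing page exposes its audio as `href` or `src` attributes ending in `.mp3`. The names `Mp3LinkCollector` and `extract_mp3_links` are made up for illustration, and fetching the 40 pages is deliberately left out.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class Mp3LinkCollector(HTMLParser):
    """Collects every href/src attribute value that ends in .mp3."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url  # used to resolve relative links
        self.links = []

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if name in ("href", "src") and value and value.lower().endswith(".mp3"):
                absolute = urljoin(self.base_url, value)
                if absolute not in self.links:  # skip duplicates, keep page order
                    self.links.append(absolute)


def extract_mp3_links(html, base_url):
    """Return the absolute URLs of all .mp3 links found in one page of HTML."""
    collector = Mp3LinkCollector(base_url)
    collector.feed(html)
    return collector.links
```

Feeding it the HTML of each listing page in turn and pausing between requests would keep the load on the server low.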

Or I can make an archive of them, which would avoid sending a massive number of requests to the server.

http://tatoeba.org/sentences/with_audio/fra should work.

I recommend the Firefox add-on "DownThemAll".

[not needed anymore- removed by CK]

*** Warning ***
This list is based on declarations by contributors themselves.
Numerous contributors claim to be native speakers when they are not, whether for fun, out of perversity, or simply to test their skills (which they generally overestimate by far...)
Be careful about taking their contributions at face value...

A feature request.
There are a lot of comments that look strange after the sentence was changed. It would be nice to see the sentence as it was at the moment of commenting, alongside the comment. E.g.:
== Sentence #1234 ==
Some sentence.
== Comments ==
> "Same sentence". <-- The original sentence to be displayed.
Tom: Please change "Same" -> "Some".
----
Mary: Thanks. Done.
----
This would make old comments easier to understand.
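
One possible way to look up "the sentence as it was when the comment was posted" is sketched below. It assumes the site keeps a per-sentence edit history as `(timestamp, text)` pairs sorted by time. That data layout and the name `text_at` are hypothetical, not Tatoeba's actual schema.

```python
from bisect import bisect_right


def text_at(history, when):
    """Return the sentence text in effect at time `when`.

    `history` is a list of (timestamp, text) pairs sorted by timestamp,
    one pair per edit. Returns None if `when` predates the first entry.
    """
    times = [timestamp for timestamp, _ in history]
    index = bisect_right(times, when)  # number of edits made at or before `when`
    return history[index - 1][1] if index else None


# Hypothetical history for sentence #1234, using plain integers as timestamps
history = [(1, "Same sentence."), (5, "Some sentence.")]
original = text_at(history, 3)  # a comment posted at time 3 sees the first version
```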

On the right side you can see a history with the original sentence and all changes.
It's a little manual, though.

Do you mean in the [latest comments][1], or do you mean on the sentence pages themselves?
If you mean the latter, may I refer you to my userscript, which puts the changelog into the comment section: [Tatoeba Sentence Timeline][2].
I'm going to work together with @sysko to integrate some of my userscripts, as until now only a few advanced users seem to use them (at least I didn't get a lot of feedback, except from those).
However, I do support your request, as it would be easier to see what's happening if the original sentence were also displayed on the home page and in the [latest comments][1].
[1]: http://tatoeba.org/deu/sentence_comments/index
[2]: http://userscripts.org/scripts/show/121207

Warning! New spammer: Victoriababy
I've just been spammed by her...

Oh ok, I was just about to ask where I can report her.
But maybe it's still good to ask for future reference, so... how and where do I report spammers?

Here, or by sending me a private message or an email.

Me too.
http://tatoeba.org/spa/user/profile/victoriababy
I thought I was someone special, and she says it to everyone :(
XD

+1

Hello again. My question is about "vocabulary diversity".
There are different modules through which people can contribute to Tatoeba: [Add, Translate, Adopt, Improve and Discuss Sentences]. Although the other four depend on already available sentences, the "Add Sentence" module is the most independent one. Its fertile nature makes everything sprout from it, just as vocabulary diversity does. So the core is "Add sentences", am I right?
At this point, shouldn't we add some auxiliary tools to that module (add sentence) that would help contributors decide which words or phrases to use? That way the content of the Corpus would get richer and cover a broader spectrum.
For that I can think of two resources:
1) Reference vocabularies: There are dictionaries already available for almost every language. When a contributor wants to add a sentence, the system can offer a word which is not used (or barely used) in any sentence.
2) Search results: When a user searches for a word to find related sentences and the search returns nothing, the log files can be used to guide contributors to a virgin word to use in a sentence.
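
As a rough illustration of the first idea, the sketch below lists the words of a reference vocabulary that appear at most `max_count` times in a set of sentences. It is a naive exact-match approach: the function name `rare_words`, the `\w+` tokenization and the lowercasing are all assumptions made for the example, and inflected forms are not recognised as the same word.

```python
import re
from collections import Counter


def rare_words(reference_vocabulary, sentences, max_count=0):
    """Return vocabulary words used at most `max_count` times in `sentences`.

    Matching is exact and case-insensitive; no stemming or morphological
    analysis is attempted.
    """
    counts = Counter()
    for sentence in sentences:
        counts.update(re.findall(r"\w+", sentence.lower()))
    return [word for word in reference_vocabulary if counts[word.lower()] <= max_count]


corpus = ["The cat sat.", "A cat and a dog."]
unused = rare_words(["cat", "dog", "bird"], corpus)
barely_used = rare_words(["cat", "dog", "bird"], corpus, max_count=1)
```

With `max_count=0` only completely unused words are returned; raising it also surfaces barely used ones.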

+1
I had some vague ideas about vocabulary diversity (in fact, I recall using a script by some contributor, whose name I'm sorry to have forgotten, to detect unused words to write into new Italian sentences the last time I added some).
The question alexander asked is a good one though: the current developers of the project don't seem to have much time to work on all the proposals, however good they may be.
Perhaps if someone else coded them, so that the developers only had to integrate them, it might work without taking too much of their time.

I've modestly come to the conclusion that implementing an integrated system for this would be a lot easier with metadata in place. Without metadata, filtering virgin words may be a real pain, depending on the language's morphology. For example, in the case of Turkish, determining unused words is not a simple "match and filter" operation. I am not a programmer, but one seems to have to use a kind of algorithm like the one used in proofing software.

Indeed, agglutinative languages would suffer from this, so I understand your objection. Luckily I'm in no position to try to program this feature myself, because I would have no idea how to proceed. Nevertheless, it's clear that it would require a very long and complicated script.

This would be very useful, but who will implement this?

I don't know. :)) Site maintainers-developers?

Is it possible to search for other users according to their language?

I was just going to ask if users could be searched based on their language profile. :P Not just their native language, but all languages listed on their profile. I now realize this is not possible. Maybe if there was a standardized way of listing the languages, like in some language exchange communities... Oh well.
I just wanted to find out if there were any members here capable of translating Finnish into their own mother tongue. There are a lot of words that I'd like to add here but I sense that there is not a very wide market for such a small language.

Add them anyway, we'll be really grateful if you do so. Even if now there are not many members who learn Finnish, we'd rather have a lot of sentences in your language so that one day a (future) member can translate them. :)

I agree, adding new words in any language is always useful for the quality of the corpus.

Finland is the 12th largest contributor country on Tatoeba, tied with Turkey and the Netherlands, with 14 contributors indicating on their profile that Finland is their country: https://docs.google.com/spreads...c&pli=1#gid=15

Finland has a population of 5.3 million, out of which roughly 95% speak Finnish, so outside of Tatoeba my language would be considered "small". However, it seems a number of Finns are interested in languages - Japanese has been especially popular lately (well, Japanese culture in general). Practically everyone is expected to have a good command of English, and on the west coast (where I live) there is a concentrated Finnish-Swedish speaking population which means many people are bilingual. You need to know Swedish if you want to deal with people because quite a few of them tend to be extremely particular about getting service in their own language. :D
Well, I'm happy to see there are so many people spreading word about my language. Of course, more people wouldn't hurt... :) I'll go add some sentences now, you can all thank me later when Finnish replaces English as the new worldwide business language. ;D

In the 15th century, nobody would have bet that English, which was hardly standardised and spoken by a minority of Europeans on a remote island, would ever come to be "spoken" by so many people.
Besides, there's no such thing as a "worldwide business language". Earthlings conduct their business in various languages around the world as they always did. Most Finns don't do business with Swedes in English...

Hmm, maybe I phrased that poorly. What I meant was that in many companies English has become the common standard for internal communication. I don't know if that is the case in all countries, but at least here it is like that, especially in technical and engineering businesses. Bands and artists sing in English because it makes their music accessible to a wider audience than other languages do. Scientific reports are for a large part written in English, so much so that there has been some concern (in my country, don't know about others) regarding the future of smaller languages as English has become the common standard in conducting research and reporting the findings.
Of course it's better to deal with other countries in their own language if possible. Swedish is the second official language anyway so everyone already studies it in school, it would be stupid not to utilize that knowledge after all the long hours that were put into pounding it into your head. :) But in many cases when there is a need for a common language for communicating on an international scale, the choice tends to be English.
Sorry if I came off as arrogant, that wasn't my intention.

Hi all. I'd like to be an advanced user (corpus administrator?).
I really need tags, and as a normal user I cannot tag anything.
Sometimes I would link or unlink sentences, but the first thing is tagging sentences.
Thanks for reading this.

I agree with jakov, alexmarcelo and Shishir and will welcome you as an advanced contributor.

Thanks to all.
Yes, I mean advanced contributor.
http://blog.tatoeba.org/2011/05...rs-status.html
I write languages I know in my profile.

Hello
I've changed your status, you may need to log out and back in for it to take effect.

Many thanks, Sysko.
I'll be able to work much more comfortably.
A question.
Is there a log for tag and link activity?
New links and sentences added to a tag.
I think it is important for discovering mistakes, and with tags also for discovering useful tags.
A log like the one on the home page for sentences and comments.

Not yet. That's the reason these features are reserved for advanced contributors: as we have no way to track this yet, I prefer to let only "trustworthy" people use them.

Thanks, I'll try not to break anything. :)

I think you mean "advanced contributor" btw.

Let him be a corpus administrator.

Let's do it step by step. First of all, I think he wants to be able to tag sentences.

wallebot has my approval

Please add some profile information and a picture. Other than that, I think you are suitable.

Do you mean "advanced contributor"? You have my support, too.

You've got all my support, but it's true, you should write the languages you speak and your country :)

This may be a stupid question, but what do the "contributions" listed on the user profile mean? I have under 300 sentences and have written two comments, but the number of my total contributions is over 800. Can someone explain this to me? I tried looking at the FAQ and help pages but didn't find anything.

To make a long story short, you could see them as the number of entries in your log
http://tatoeba.org/deu/contributions/of_user/Henu/

Ah, it makes sense now. Thank you. :)

I've added Maori to the list of languages.

And also Berber and Ladin.

Welcome to the 3 new languages!

Aren't we too short on active moderators at present?

I volunteered, but they refused me.

hahahaha, but I meant trustworthy ones xD
no offense : )

In fact I think corvard IS not unreliable... He's too direct sometimes, but I guess he is well-meaning.

yes, I quite agree
I just intended it as a joke : )

Sure, let's have more insulting teenagers as corpus moderators... we can't have enough of them, can we?