clear
swap_horiz
search

Muro (4640 fadenoj)

keyboard_arrow_left 1234567...464
Ricardo14
antaŭ 2 tagoj
** Adding languages **

This has been intriguing me a little bit

For now, whenever someone wants Tatoeba to support a certain language, this user has to
1) get the
-ISO 639-3 code
-an icon that would represent this language better
-a link on wikipedia that talks about this language

2) send the information above to me, Nuno and cueyayotl

3) wait

in three single steps, we might have lost some members that would could have helped us by posting sentences - even more in endangered languages. This can be even harmful - why .*** language is supported and mine isn't? / it's taking too long to have my language on Tatoeba,etc

Suggestion

During the next weeks, I will find people who speaks these languages and ask them to
1 - create an account at tatoeba and start posting sentences (I've already done that). However, if it doesn't work,
2 - I'd post his/her sentences under my accounts after checking in foruns, another websites, etc

All that to Tatoeba support as many languages as possible.

Suggestions
kaŝi la respondojn
Selena777
antaŭ 2 tagoj
Maybe, it's better to create a special account for every "indirect" contributor? I.e. "All sentences added from this account was created by Mr./Mrs N, who is a native speaker of X language. As long as Mr./Mrs N refused to create his/her own account and post sentences by his/herself (lack of time, bad internet connection and so on), this account will be maintained by Ricardo14. If there is a few such "indirect" contributors in the same language it will be much easier to separate their contribution.
kaŝi la respondojn
Ricardo14
hieraŭ
It's a good idea but I wonder if we can "speed up" this process by alread having such sentences on Tatoeba already.
kaŝi la respondojn
Selena777
antaŭ 16 horoj
It's a question for those, who write the program code for Tatoeba. Could they add all languages with ISO codes at once? It will be great.
kaŝi la respondojn
kaŝi la respondojn
AlanF_US
antaŭ 13 horoj
Even if you're capable of doing it, this is a major change that should not happen without discussion with, at a minimum, Trang. I cannot imagine that she would be in favor of this, at least with our current interface.
kaŝi la respondojn
Ricardo14
antaŭ 12 horoj - antaŭ 12 horoj
@AlanF_US I didn't mean that I would do such thing alone and sorry if it seemed so. I just pointed out that I have the "techinical knowledgement" to do so.
I don't know if you remember the change of the Portuguese flag (cf https://github.com/Tatoeba/tatoeba2/issues/1054 // https://github.com/Tatoeba/tatoeba2/pull/1058 ). Some people asked me to change and I asked everyone who is native in that language and also members that translate sentences from/into. As far as the most of them were in favor of changing, I've changed that. It just shows that I never do (or I'll do) things without the consentment of anyone.
Again, *sorry* if it seemed like I'd do such thing without anyone's consetement but it never was my intention. That's why I posted my idea to the Wall.
kaŝi la respondojn
AlanF_US
antaŭ 9 horoj
Ricardo, I understand that you were putting this idea up for discussion. I knew that you would never go beyond submitting a pull request (which Trang would have to approve anyway), and when I think about it, I realize that you wouldn't even go that far on your own. I was just taken aback by how quickly the conversation seemed to be moving from "Here's an idea" to "Okay, so who's going to take care of implementing it, and how do we do it?" (Admittedly, that was not precisely what was happening.)

We have talked about the idea of mass addition of languages before. I've never been convinced that it addresses a real problem, and it's guaranteed to introduce problems at many, many levels (using the interface, working with the codebase) even beyond the problems of monitoring content contributed in languages that no one ever requested and hence no one is responsible for.

I agree that language extinction is a massive problem, but I think it's unrealistic and actually counterproductive to imagine a scenario in which all that stands between a language and oblivion is a two-week wait for it to be added to Tatoeba. If the only person who might contribute in a language is too impatient to wait that long to see its name listed in the drop-down list, they'll never have the perseverance it takes to learn how the site works, make high-quality contributions, and respond to comments.
Aiji
hieraŭ
Isn't that the purpose of the "other language" flag?
I think it already happened in the past, a user added sentences with the question mark flag, and in his profile it was written he was waiting for approval of his language.
kaŝi la respondojn
Ricardo14
hieraŭ
@Aiji

Yes, you're right. My point is: Can't we "speed up" this process by adding some languages before a new user requests it?
brauchinet
hieraŭ - hieraŭ
I'm not convinced that it's really a good idea to speed up the process.
We have now more than 100 languages with less than 30 entries (many of doubtful qualitiy, we may guess).
I find this is even more deterring for potential contributors than not having a language at all.
What would you think of a website when your language is represented by only 10 sentences, one of them perhaps something like Good day dear landwoman?

Maybe, on the contrary, it would be better to require at least say 100-200 sentences, before a new language is added.
This would ensure that the contributor (ideally a native speaker of course) has a serious interest in the language. Remember that the possibility to add sentences in "exotic" languages will likely - as it did in the past - attract non-natives. Their sentences remain uncorrected, and when resources of that language on the internet are scarce, these low quality sentences will find their way into all kinds of other language websites. So maybe we are not even doing those languages a favour by just providing a platform without any requirements in quantity (and quality).
kaŝi la respondojn
Ricardo14
hieraŭ
> We have now more than 100 languages with less than 30 entries (many of doubtful qualitiy, we may guess).

They're checked by either me or cueyayotl. I ask my friends (most of them, natives), if those sentences are correct or not and even on foruns.

> What would you think of a website when your language is represented by only 10 sentences, one of them perhaps something like Good day dear landwoman?

I don't think that happens here. Never seen that before (or cueyayotl, who "controls" which language will be added or not. Many weren't because they were either not sentences or don't have an ISO 639-3 code - https://tatoeba.org/eng/sentenc...ne/indifferent

> Remember that the possibility to add sentences in "exotic" languages will likely - as it did in the past - attract non-natives.

Is it too risky like that? I'm sure that we can bring people who speaks that language. Some of my friends will join Tatoeba in the following days and their languages were not added on Tatoeba

> Their sentences remain uncorrected, and when resources of that language on the internet are scarce, these low quality sentences will find their way into all kinds of other language websites.

We've been cleaning up that. I've asked people to adopt/delete sentences from people who have their accounts set as inactive or were banned. It's a small effort yet, I know, but we're taking care of that.

for sure I want a sentence like "I don't have a car" instead of" I have car not" or "I haven't haven't have car not" or "I don't not have car" but I believe we're alreading doing that every day, every time.

Just thoughts...
kaŝi la respondojn
AlanF_US
antaŭ 9 horoj - antaŭ 9 horoj
>> We have now more than 100 languages with less than 30 entries (many of doubtful qualitiy, we may guess).

> They're checked by either me or cueyayotl. I ask my friends (most of them, natives), if those sentences are correct or not and even on foruns.

I know that you and cueyayotl put in a lot of effort and are willing to put in even more, and we appreciate that. However, it's hard for me to imagine that two people have a network of accessible friends who speak more than 100 minority languages at a native level, or access to active forums that cover those languages, or even time to correct on the order of 3000 entries (100 times 30). But even if you did, you're talking about massively scaling up the number of languages. So your proposal would need to explain not only how you can take care of the current load, but how could deal with ten times as many. It would also need to explain why, if we have people ready to contribute and proofread sentences in all these languages, they haven't already joined. It would have to involve something more concrete than speculation that they're put off by the fact that their languages don't appear in a drop-down list. If you can attract lots of contributors, all the more power to you. But the time to put forward a proposal like this is after they join, not before.
beggi
antaŭ 5 tagoj
Arapça kelimelere bütün okutucu işaretlerin eklenmesi, bu dili benim gibi buradan öğrenmek istiyenler için elzem görünüyor. Bu konuda birşeyler yapılabilir mi?
kaŝi la respondojn
Gulo_Luscus
antaŭ 5 tagoj
Öncelikle hoş geldin. Bu konularda İngilizce yazarsan yöneticilerden veya Arapçayla ilgilenen diğer üyelerden daha net ve kesin sonuçlar alırsın. Bu dille ilgili bir bilgim olmadığından yardımcı olamayacağım ama sorununu İngilizce yazabilirim.


beggi asks if it is possible to add all Arabic diacritics to words, for they are needed to learn the language.

kaŝi la respondojn
odexed
antaŭ 5 tagoj - antaŭ 5 tagoj
> add all Arabic diacritics to words

This is not how native speakers write in Arabic. Newspapers, books (except the Holy Quran) are written without diacritics.
kaŝi la respondojn
beggi
antaŭ 5 tagoj
Sure they don't add them since they already know them. So I am not a native arabic speaker who wants to know how to write ...
kaŝi la respondojn
odexed
antaŭ 5 tagoj
The point here is that Tatoeba prefers the most natural form for the sentences.

> We want sentences that a native speaker would actually use.
http://en.wiki.tatoeba.org/arti...how/guidelines
kaŝi la respondojn
beggi
antaŭ 5 tagoj
Not a practical preference for Arabic learners lIke me...Thank you for the answers
OsoHombre
hieraŭ
I wish diacritics were systematically added. They are systematically used in the Quran to indicate how to precisely read the Holy Book, because any error in the pronunciation could change the meaning of a word, and misinterpretation of the meanings of sacred texts might be a very bad thing. However, diacritics aren't systematically used in normal writing (media, books, etc.) and this is one of the major obstacles for the promotion of the learning of Arabic by non-natives.
OsoHombre
hieraŭ
It is true that Arabic diacritics are very helpful for Arabic-language beginners. However, it is difficult to oblige ordinary contributors to systematically 'diacritize' (add tashkil = add diacritics) every single word they write. It's time-consuming. Yet if we want to make Tatoeba a really helpful website for Arabic-language learners, we should think about a system that I've observed is used for Chinese: the feature that lets people read a sentence in Chinese pinyin (Latin phonetic alphabet for Chinese) in addition to reading it in Chinese characters. It would be great if Tatoeba adopted the same system for Arabic with a feature that shows the same sentence with Arabic diacritics, but I think that, contrary to the feature used for Chinese pinyin, this cannot be automatically done for Arabic, because Arabic diacritization represents vowels (and sometimes tense consonants), but the occurrence of many of these vowels (especially at the end of an Arabic word) is governed by grammatical rules. Therefore, developing a program that automatically diacritizes Arabic sentences means that the program has to know Arabic grammatical rules, and this is not only very complex to prepare, but its complexity would also give way to many potential errors that the program would be making, and this would need years of continuous improvement before we see that program perform its function properly on Tatoeba, whew!!! Just talking about that makes me tired. It has to be done by someone, somewhere, some time, but I think that Tatoeba's volunteers aren't prepared for carrying out this from start to finish. Another solution would be having Arabic sentences diacritized by Tatoeba contributors. But before a sentence is diacritized, it needs to be really correct, because this diacritization involves literally re-writing the whole sentence by adding the diacritics manually. In this case, a sentence has really to be correct to be worth re-typing. One last solution that has just come across my mind is the transliteration of Arabic sentences in the Latin alphabet. There is already a Latin alphabet that's used by specialists to transliterate Arabic words and names (especially proper nouns), similar to the one used by Encyclopaedia Britannica (look at the map on this https://www.britannica.com/place/Saudi-Arabia). This is a partial, quick, and practical solution that would give people an idea on how to read an Arabic sentence or phrase, but the accuracy of such a transliteration system also needs to be checked by a native speaker (especially the word endings that are determined by grammatical rules).


kaŝi la respondojn
gillux
antaŭ 19 horoj
> I think that, contrary to the feature used for Chinese pinyin, this cannot be automatically done for Arabic, because Arabic diacritization represents vowels (and sometimes tense consonants), but the occurrence of many of these vowels (especially at the end of an Arabic word) is governed by grammatical rules.

What you’re describing is exactly the same as Japanese and furigana (readings of characters). Yet, some people have developed a software, Mecab, that performs a morphological analysis of Japanese and finds the right readings most of the time. And Tatoeba makes use of Mecab, shows the readings and allows contributors to manually edit them when Mecab gets it wrong.

I don’t think any of us is going to develop such a software for Arabic diacritics, but if you can find out one that is free of use, it should be rather easy to integrate with Tatoeba, because we already have this system called transcriptions. More about that: http://en.wiki.tatoeba.org/arti...iption-request

The rationale for furigana was helping learners of Japanese, so I don’t see why we wouldn’t have a similar thing for Arabic, as long as the technology allows it.
Pfirsichbaeumchen
antaŭ 5 tagoj
[Suggestion]

I agree it is a hindrance for Arabic beginners, who need diacritics to learn. Japanese and Chinese come with Furigana and Pīnyīn, respectively. It would certainly help to have a feature to display (or not display) full diacritics for Arabic.
kaŝi la respondojn
odexed
antaŭ 5 tagoj - antaŭ 5 tagoj
The problem here is that it's not possible to put diacritics right automatically. And I don't think anybody would contribute sentences with full set of diacritics (the most Arabic sentences here except for those from our experienced contributors don't even have the proper punctuation.
kaŝi la respondojn
Pfirsichbaeumchen
antaŭ 5 tagoj
Naturally people would have to go to the trouble of adding them manually. It is the same problem with Japanese. No machine can do that job reliably, yet the feature exists and is used by some.

I think the same was suggested for Hebrew a while ago, and there was the same resistance.
kaŝi la respondojn
odexed
antaŭ 5 tagoj - antaŭ 5 tagoj
I don't resist, I'm just pondering your suggestion. I myself am a student of Arabic and I know how hard it is to read it when you don't know the words yet. But I think it's bad to learn something wrong (I wouldn't trust this feature). In Arabic diacritics differ for the words depending on their position in the sentence and on the previous words so it's not very useful to only know how to pronounce standalone words.
I'd rather suggest to make the recording of the audio more simple (just by clicking a button without having to use any programs). This way people could contribute audio as fast as they translate so we won't have this problem.
kaŝi la respondojn
OsoHombre
hieraŭ
@odexed

Yes, I'm with you regarding making audio recordings simpler. I've read about that and it's a real conundrum (peppered with some bureaucracy).
odexed
antaŭ 5 tagoj
By the way, I think your suggestion would make more sense for Russian (if somebody needs the feature for displaying the accents).
kaŝi la respondojn
AlanF_US
antaŭ 4 tagoj
Another issue worth thinking about is whether characters with diacritics are treated like characters without them for the purpose of search, the same way that uppercase characters are treated like lowercase characters. This has to be set up manually. I did this for Russian with the stress mark (acute sign), and I also did it for Hebrew with the vowels and final consonants. Currently, there's nothing like this in place for Arabic.
kaŝi la respondojn
odexed
antaŭ 4 tagoj
I wouldn't change the current settings because sometimes diacritics in Arabic are indispensable because they could change the meaning.
For example, دَرَسَ means to study something and دَرَّسَ - to teach someone
kaŝi la respondojn
OsoHombre
hieraŭ
@odexed:

Another example:

نَزَلَ - to descend, to go down

and:

نَزْل - hotel

OsoHombre
hieraŭ
@AlanF_US

This is another problem, indeed. I user might be frustrated if they can't find an Arabic word in a search engine simply because the word isn't diacritized properly, however, there is another risk. The simplest solution would be 'turning off' the diacritics captor of a search engine, but in this case, if a search engine doesn't take into consideration diacritics, the search results might yield many irrelevant results. For example, the word 'درس' may either mean:

دَرَسَ - to study

or:

دَرْس - lesson


kaŝi la respondojn
odexed
hieraŭ - hieraŭ
If I don't know how to put diacritics on a word I can always look it up in my dictionary. On the other hand, I'd get frustrated while looking for examples with "دَرَّسَ" if there are thousands sentences with "دَرْس" and "دَرَسَ"
Ricardo14
antaŭ 4 tagoj
+1

Tatoeba is also a big tool for learners. There would be a way to do so, I think.
OsoHombre
hieraŭ
@Pfirsichbauemchen:

If there are any concrete suggestions, then I volunteer to take part in them. After all, I'm here for the promotion of the Arabic language (although I don't have much free time at any moment of the year and I may be absent for prolonged periods at a time).
kaŝi la respondojn
cueyayotl
antaŭ 18 horoj
I fully support such a system, where an alternate reading (in gray) for an Arabic sentence can be seen if and only if someone has manually submitted a version of the sentence with diacritics. In other words, if only a normal sentence is submitted, we will only see that sentence (in black), but if that person (or another user) submits the same sentence with diacritics (not as a new sentence, but through an extra button that can be activated for Arabic), then we will see the "normal" sentence in black and the sentence with diacritics in gray.
I fully support this kind of interface for all Arabic, all Chinese, Cyrillic-written Slavic, Philippine, etc. languages as well.
Ricardo14
antaŭ 2 tagoj
** A reflection about posting sentences on Tatoeba **

As everybody knows, Tatoeba's mission is to get sentences and translations from as much languages as possible. For now, 300+ sentences are supported here - and this is the point I'd like to cover now.

Take for example, countries. In English we'd say "the United States (of America)", "Russia", "France", "Germany", "Portugal", "Japan", etc. (there are over 200 countries). Are we "allowed" to post one sentence - and translate it - for each country? I mean

I'm from Japan.
I'm from Portugal.
I'm from Germany.
I'm from France.
I'm from Russia etc

They would be near-duplicate sentences in English but for most of languages, it would help - A LOT - to study them

Take Portuguese, for example:

(Eu) sou **dO** Japão.
(Eu) sou **dE** Portugal.
(Eu) sou **dA** Alemanha.
(Eu) sou **dA** França.
(Eu) sou **dOS** Estados Unidos. etc

"de" is a preposition and as in many languages, we "link" them with the article (o,a, os) - de+o=do, de+a=da, d+os=dos.

I'm also asking that because many people - me included - wouldn't be able to think on a sentence to use the words given.

kaŝi la respondojn
OsoHombre
hieraŭ
I agree with you, Ricardo. There are approximately 190 independent sovereign countries, each with its own name. If I write 190 sentences following the grammatical structure of 'I am from {country name}', I might have 190 sentences with exactly the same structure but with valuable linguistic information about the country names mentioned in the sentences, as this would help the reader-learner-translator learn how each country is called in a given language and how these country names are affected by the language's grammatical rules.

And, I'd like to remind everybody here that that issue of nearly-identical sentences is dead and buried. I no longer bother about that and I'd like to ask any normal human being that's a member of Tatoeba not to worry about that, too. I think that we're pretty free to write any sentence that's grammatically correct and meaningful as long as our intentions are to help the project, not to intentionally destroy it or affect its quality. Therefore, people contributing nearly-identical sentences for more linguistic information are not dumb and should not feel they're doing a bad thing.
Aiji
hieraŭ
We've already been through that several times.
I'll just quickly make a point. Again, please do not confuse the purpose of the tool with how to use the tool. Tatoeba is suppose to be a corpus of sentences, not a website to learn a language. If you think that it is helping your learning of languages, that is good, but please do not forget that other people share the same tool, and therefore may have different goals. Claiming a full liberty on the name of learning a language is doing to others what you don't want others to do to you.

Now, specifically for the countries name, again, a corpus of sentences and a corpus of words are two different things. Dictionaries are here to help finding the translation of a specific word. Tatoeba is not a dictionary. I'm pretty much sure that no language in the world has 200 grammatical rules, one for each country name... Therefore, for a corpus of sentences, the value to have the same sentences with 100 country names over having them with ten names is pretty much zero. However the value of having 10 country names over only 2 is huge, I join you on this point.
kaŝi la respondojn
Ricardo14
hieraŭ
> Now, specifically for the countries name, again, a corpus of sentences and a corpus of words are two different things. Dictionaries are here to help finding the translation of a specific word. Tatoeba is not a dictionary.

Haven't seen that this way. Thanks for that

> However the value of having 10 country names over only 2 is huge, I join you on this point.

Thanks.
AlanF_US
hieraŭ
I agree with Aiji. Some additional points of my own:

Tatoeba should try to do what it is best at, not what it is not suited for.
Individuals should use Tatoeba for what it is best at.
Individuals should use whichever tool is best suited for whatever immediate task is at hand.

People learned languages long before Tatoeba, and indeed long before the Internet Age. In the old days, if you wanted to learn French, you used a textbook. Somewhere in that book, there was a lesson on how to write phrases with country names. It presented an explanation (the fact that country names have gender and number, etc.), lists of countries with masculine, feminine, and plural names, sample sentences, and exercises. The good ones even contained pictures. The fact that a textbook was written by a small number of knowledgable people, proofread, and published, helped ensure that the explanation and the sentences were coherent and of high quality.

Nowadays, there are resources on the Internet that do pieces of what textbooks have always done, although in a more scattered form, so you can find the equivalent of the textbook country name explanation online (usually together with some advertising), at least for major languages.

Tatoeba allows people all over the world to collaborate on the "sample sentence" part of a textbook, and furthermore builds links between hundreds of different languages. I don't need to tell anyone here all the wonderful advantages that that brings. However, Tatoeba cannot and should not try to replace textbooks (printed or electronic), or electronic references. Unless you're a very young child, whose brain is wired to construct rules entirely from examples, you need to learn the concepts. There's only so far that tags and comments on individual sentences can take you if you don't start with an explanation.

>> I'm pretty much sure that no language in the world has 200 grammatical rules, one for each country name... Therefore, for a corpus of sentences, the value to have the same sentences with 100 country names over having them with ten names is pretty much zero.

Well put. Furthermore, I would say that in many ways, Tatoeba would decrease its value by covering 100 country names rather than 10, because it depends on human readers and translators. Let's say that I want to find sentences in language X that contain the phrase "I live in". Or let's say I want to translate the most recent sentences, or the most recent sentences in language X, or the most recent sentences by contributor Y. If contributor Y has just contributed 100 languages of the form "I live in" in language X, every one of those lists will be watered down by the same mind-deadening, content-poor material. While it's true that you increase the chances that someone looking for a sentence with specific content will find it, you will also increase the chances that someone who is looking for something broader or deeper will need to filter out all the unwanted noise.
kaŝi la respondojn
OsoHombre
antaŭ 21 horoj
AlanF_US: But there are hundreds of languages and dialects that would still not have dictionaries and grammar books for many years to come.
kaŝi la respondojn
AlanF_US
antaŭ 20 horoj
I would think that the priority for such a language or dialect would be a dictionary, or grammar book, or set of recordings, or academic paper, or even a description on a blog, not a random set of sentences on Tatoeba.
sharptoothed
antaŭ 22 horoj
** Tatoeba Stats **

Tatoeba stats, graphs & charts have been updated.

http://tatoeba.j-langtools.com/allstats/
kaŝi la respondojn
Guybrush88
antaŭ 22 horoj
thanks ^^
Ricardo14
antaŭ 21 horoj - antaŭ 21 horoj
Thank you so really much, bro!
(Edit - whoa... 21st (▼ 11)... gotta change that)
Tirifto
antaŭ 30 tagoj
Saluton! Mi legis en kelkaj frazoj Esperantaj je «preni la buson.» Tiu signifo de «preni» ŝajnas idioma de kelkaj lingvoj; nek PIV nek ReVo ĝin mencias. Ĉu ĝi vere taŭgas?
kaŝi la respondojn
PaulP
antaŭ 3 tagoj
Mi nur nun vidis vian komenton. Mi kredas, ke pli taŭga loko por la komento estus sub unu el la koncernaj frazoj. Ankaŭ mi persone taksas la esprimon „preni la buson” evitinda kaj uzas „veturi per buso”, „uzi la buson”.
arh
arh
antaŭ 4 tagoj
En respuesta a algunas consultas recibidas sobre la grabación de frases, he incorporado una pequeña sección de resolución de problemas a la guía.
Os recuerdo que podéis acceder a la última versión mediante este enlace:
https://mega.nz/#F!9cEixIAL!A5Al-5OWAzB_VTh0fBl0Ig
También se encuentra en el formato original (.odg), por si alguien desea traducirla o adaptarla a su propia lengua.
Os animo nuevamente a que pongáis voz a todas vuestras oraciones, para que, además de leerlas, los usuarios puedan escucharlas.
----
As I got some requests regarding problems with the recording of sentences, I have just added a short troubleshooting section to the guide.
Please remember that you may access the latest version using this link:
https://mega.nz/#F!9cEixIAL!A5Al-5OWAzB_VTh0fBl0Ig
I also left the original document (in .odg format) in case that anyone would like to translate or adapt it to his/her native language.
May I once again encourage you to record all your sentences, so that the users can not only read them, but also listen to them.
kaŝi la respondojn
odexed
antaŭ 4 tagoj - antaŭ 3 tagoj
Con todo respeto quiero hacerle una pregunta. ¿Usted realmente cree que las frases incompletas como
#5967339 - Porque lo quieres (Because you want it)
#5963710 - Quiero regalar (I want to give)
#5965211 - Comprar verduras y carne (To buy vegetables and meat)
#5965212 - Echar sal (To rub salt)
pueden ser de ayuda?
---
With all due respect, I would like to ask you a question. Do you really think that such incomplete sentences as
#5967339 - Porque lo quieres (Because you want it)
#5963710 - Quiero regalar (I want to give)
#5965211 - Comprar verduras y carne. (To buy vegetables and meat)
#5965212 - Echar sal (To rub salt)
may be helpful?
kaŝi la respondojn
arh
arh
antaŭ 3 tagoj
With all due respect, I do.
And, with all due respect, I don't feel this particular thread was the appropriate one for your comment.
But you know what? I'm too fed up and busy to keep on arguing with you about the quality of my work.
So, you won't have to worry any longer about my contributions.
I am done.
------
Con el debido respeto, así lo creo.
Y, con el debido respeto, este hilo no me parece el apropiado para tu comentario.
Pero, ¿sabes qué? Estoy demasiado harto y ocupado para seguir discutiendo contigo sobre la calidad de mi trabajo.
Así que, en adelante, no tendrás que preocuparte más de mis contribuciones.
Se acabó.
deniko
antaŭ 3 tagoj
Thanks arh, that's a very good guide. Maybe I'll start recording audio one day too.
Hybrid
antaŭ 19 tagoj
Firefox tells me that Tatoeba's login is unsafe.
kaŝi la respondojn
AlanF_US
antaŭ 17 tagoj
I tried logging out and in again in Firefox, but I didn't see any warning. Could you elaborate? I suppose that one of your Firefox settings could be responsible, but my security settings are pretty high, and I'm not seeing any problem.
kaŝi la respondojn
Hybrid
antaŭ 12 tagoj
kaŝi la respondojn
AlanF_US
antaŭ 12 tagoj
That message should be displayed if you're trying to log in from an address that does not start with "https:". The login page that I use has this address:

https://tatoeba.org/eng/users/login

Which address are you using?
kaŝi la respondojn
Hybrid
antaŭ 6 tagoj - antaŭ 6 tagoj
Thanks. I use tatoeba.org . It doesn't give the warning if I use https://tatoeba.org , but that takes much longer to type.
kaŝi la respondojn
AlanF_US
antaŭ 5 tagoj
I wrote a ticket about this:

https://github.com/Tatoeba/tatoeba2/issues/1457

Is it possible that Firefox is pulling up an old instance of the http address when you type "tatoeba.org" (no quotes) into the address bar? What happens if you clear all the instances of addresses that contain the string "tatoeba.org" from the list of suggestions? (You would use the down arrow to select a suggestion, then press the delete button to delete it.)
kaŝi la respondojn
Hybrid
antaŭ 4 tagoj
Thanks. It seems to use https now. So maybe that was the problem.
baudelaire
antaŭ 5 tagoj
Does tatoeba has an export option? Checking the downloads link, it looks like only pre-made exports are available. Is it possible to create custom exports?
kaŝi la respondojn
odexed
antaŭ 4 tagoj
Just curious, what kind of exports are you interested in? Could you please elaborate it?
kaŝi la respondojn
baudelaire
antaŭ 4 tagoj
Just make an arbitrary search and export it in the detailed format.
kaŝi la respondojn
odexed
antaŭ 4 tagoj
What do you mean by a detailed format?
kaŝi la respondojn
baudelaire
antaŭ 4 tagoj
https://tatoeba.org/eng/downloads

Fields and structure
1. Sentence id [tab] Lang [tab] Text
2. Sentence id [tab] Lang [tab] Text [tab] Username [tab] Date added [tab] Date last modified

kaŝi la respondojn
AlanF_US
antaŭ 4 tagoj
You can create a list (via "Browse/Browse by list") and then manually add items that you find during a search to it. Then you can use the "DOWNLOAD THIS LIST" button in the right sidebar to export the list (provided it contains no more than 100 sentences). Optionally, you can include the sentence ids and translations into a selected language. The format is similar to the the first one you mentioned. Specifically, it looks like this:

Sentence id [tab] Text [tab] Translation

where "Sentence id" and "Translation" are optional.

There is no option to export a custom list in detailed format, but you can always download the entire list of sentences in the corpus in detailed format and use that along with your custom list. You can load the full detailed list (~560 MB) into a text editor on your computer (provided you have enough memory). You can also extract information from it via utilities or programs, depending on how comfortable you feel with them. I have been experimenting with a Python package called csvkit that includes both a ready-made program that can be called from the command line and a set of utilities that can be called from Python. It works fine with the nondetailed format, but after some processing, it often produces an error related to character encoding when trying to read our detailed format. Thus, when I need information from the detailed download, I load it into a text editor.
Joutsentaika
antaŭ 10 tagoj
I have just created an account, and now want to add languages to my profile. I keep getting internal errors. Is this a common thing, a transient thing, an annoying occasional thing . . .? Is there anything I can do about it?
kaŝi la respondojn
AlanF_US
antaŭ 10 tagoj
Could you please give an example of what you're doing and the errors you're seeing?
kaŝi la respondojn
Joutsentaika
antaŭ 10 tagoj
My profile > Languages > ADD A LANGUAGE
Select Albanian from dropdown menu.
Click radiobutton 1: Beginner.
Click ADD LANGUAGE

Result:

An Internal Error Has Occurred.
Error: An Internal Error Has Occurred
kaŝi la respondojn
AlanF_US
antaŭ 10 tagoj
I was able to do it successfully three times in my browser (Firefox on Windows 7). What browser and operating system are you using?
kaŝi la respondojn
Ricardo14
antaŭ 9 tagoj
It happened in my Chrome on Windows 7

http://prntscr.com/elgdi7
kaŝi la respondojn
AlanF_US
antaŭ 8 tagoj - antaŭ 8 tagoj
Thanks, Ricardo. I find that if I go to the URL in your screenshot (https://tatoeba.org/eng/users_languages/save), I see the issue on Windows 7 in both Firefox and Chrome. However, I don't see it when I go through the steps that Joutsentaika described on Windows 7 in either Firefox or Chrome.
kaŝi la respondojn
Ricardo14
antaŭ 8 tagoj - antaŭ 8 tagoj
I followed the same steps, basically

1> Click over "My Profile";
2> From there, click over "Add a language"
3> Choose a language (I got "Scottish Gaelic") and the level ("0 - Almost no knowledge)
4> Click over "Add language" http://prntscr.com/em04ty
Error - http://prntscr.com/em052b - http://tatoeba.org/eng/users_languages/save

"An Internal Error Has Occurred.

Error: An Internal Error Has Occurred."
kaŝi la respondojn
AlanF_US
antaŭ 5 tagoj
Sometimes I get that error, but once I got a more detailed error message, namely:

:::
Not Found

Error: The requested address '/eng/users_languages/save)' was not found on this server.
:::

I added an issue ticket:

https://github.com/Tatoeba/tatoeba2/issues/1458
kaŝi la respondojn
Ricardo14
antaŭ 5 tagoj
Thank you for that!
Joutsentaika
antaŭ 7 tagoj
I'm using Internet Explorer 11 under Windows 10, and as I'm at work I've got no choice about that.
sharptoothed
antaŭ 7 tagoj
** Sentences & Translations Stats **

Tatoeba stats, graphs & charts have been updated.
Everything is accessible from one single page now:

http://tatoeba.j-langtools.com/allstats/
kaŝi la respondojn
raggione
antaŭ 7 tagoj
Thanks for the good work.
Guybrush88
antaŭ 7 tagoj
thanks!
Aiji
antaŭ 7 tagoj
Russian got over 500K! Congratulations :)

The new one is a pretty detailed chart.
kaŝi la respondojn
OsoHombre
antaŭ 6 tagoj
I would like to congratulate my dear Russian and Russian-speaking friends for the fact that now Russian has over half a million sentences on Tatoeba. I would also to thank the Russian friends who always translate my sentences into Russian together with those who supported me during my battle for name diversity and name-choice freedom on this website. Congratulations.
kaŝi la respondojn
odexed
antaŭ 6 tagoj
Thanks a lot. شكرا جزيلا
Selena777
antaŭ 6 tagoj
Thanks, Oso!
keyboard_arrow_left 1234567...464