Bacheca (7.355 discussioni)
Consigli
Prima di fare una domanda, assicurati di aver letto le domande frequenti.
Noi puntiamo a mantenere un'atmosfera sana per avere delle discussioni civili. Per favore, leggi le nostre regole contro i cattivi comportamenti.
TATAR1
7 ore fa
AlanF_US
ieri
AlanF_US
ieri
Thanuir
ieri
brauchinet
2 giorni fa
LeviHighway
2 giorni fa
Thanuir
2 giorni fa
Rovo
3 giorni fa
gillux
3 giorni fa
LeviHighway
5 giorni fa
Tatoeba Stats, Graphs & Charts?!
Tatoeba m'indique dans un premier temps que la phrase "N'interrompez jamais un ennemi qui est en train de faire une erreur." (citation de Napoléon Bonaparte) n'existe pas encore dans son corpus mais quand je veux l'introduire, il prétend que ma phrase ne peut être ajouté au prétexte qu'elle existe déjà (introduite par Voltaire (robot du Projet Voltaire).
Que faire parce qu"on n'a pas accès à cette phrase ?
Lause on tässä, niin muiden ei tarvitse etsiä: https://tatoeba.org/eo/sentences/show/7709519
Das könnte ein Vorschlag für eine Anpassung sein:
Wenn ein Satz eingegeben wird, der schon existiert, wird dieser Satz im Regelfall abgelehnt.
Wenn aber der existierende Satz "rot" ist, könnte er durch den neuen ersetzt werden.
Das wäre eine einfache Möglichkeit, rote aber korrekte Sätze zu "entsperren".
Man müsste sich überlegen, ob diese Vorgehensweise die Rechte des gesperrten Autors des Satzes verletzt. Wahrscheinlich nicht. Man könnte auch nur den Autor austauschen und das entsprechend in den Logs dokumentieren.
Realistically speaking, in this particular situation, if you think it's really important that the corpus have this specific sentence in a version that is not marked red, your best option is to add a slightly modified version such as:
Comme Napoléon Bonaparte a dit, "N'interrompez jamais un ennemi qui est en train de faire une erreur."
Since there are many reasons a sentence can be marked red (copyright issues related to the specific sentence, copyright issues relating in general to sentences added by the same author, violations of other Tatoeba standards), and since those reasons are not tracked by the system, it's not feasible for a new sentence that exactly matches an existing red one to automatically be let through.
I had a discussion at #13684027 , and found some problem that is universal to the corpus, so I want to discuss here.
An inactive non-native speaker wrote this sentence #12888798: "Do you want a electric gaming machine?"
I personally think that sentence is odd, not used, but it's completely grammatically correct. But we cannot infer that if the author of the sentence meant "consoles" or "video games". How should we deal with it? Should we keep it as "Do you want a electric gaming machine?", or should we change it into "consoles" or "video games", or should we delete it?
Jos lause on oikein, vaikka kenties epätodennäköinen, saa se yleensä jäädä.
Jos sillä on useita mahdollisia käännöksiä, voit kääntää sen millä tahansa tai usealla tavalla, tai tietenkin jättää kääntämättä.
Yksi vaihtoehto, joka tulee mieleen, on että sen kääntää nimenomaan sähköiseksi pelilaitteeksi, joka lienee samalla tavalla outo käännöskielessä kuin alkuperäinenkin lause on. Tällöin molemmat lauseet toivottavasti ilmaisisivat, että kirjoittaja tai puhuja ei oikein ole mukana nykyteknologian kehityksessä, tai käyttää tarpeettoman teknistä, tarkkaa ja hankalaa käsitettä, tms.
The best thing to do with a problematic sentence, whether it's incorrect or just odd, is to add a comment to the sentence. This serves a number of purposes:
- it tells the author of the sentence (who may still be active) and other members of the community that there is something questionable about it and that they should consider modifying, unlinking, retranslating, or deleting it
- it serves as a starting point for other members of the community to contribute their insights, which may be valuable
- it warns people who come across the sentence at Tatoeba or other sites that point to it that there is a problem with the sentence and that they shouldn't use it as a model
- it starts the customary two-week grace period after which a corpus maintainer can edit the sentence
You can also add a rating of "unsure" or "not OK", and (if you are an advanced contributor) a tag. The three tags "@change", "@needs native check", and "@check" are checked regularly by corpus maintainers, so they help ensure that the sentence will be reviewed.
The important thing is that you don't have to decide immediately and unilaterally what to do with the sentence. You just have to open a discussion and we can see where things go from there.
As for the general question of whether odd but correct sentences should be changed, the answer is that it depends on many factors, such as whether there is a replacement that is obviously better and whether the sentence is a problematic translation of another sentence.
As a minor point, apart from not being natural, "Do you want a electric gaming machine?" is not correct. The word "a" should be "an".
建議在 [詞彙請求] 中優先顯示例句數多的詞彙項,後顯示例句數少的詞彙項。
比如,包含「學生」的句子可能有 5 個,包含「大學生」的句子可能有 0 個。由於 [現有的詞彙請求] 中優先顯示例句數少的詞彙項,所以大部分人會優先新增「大學生」的句子。當包含「大學生」的句子達到了 10 個,包含「學生」的句子就達到了 15 個。但包含「學生」的句子大多數都是「大學生」。這樣對語料庫的詞彙豐富度是有害的。
I suggest that [Vocabulary Requests] prioritize displaying vocabulary items with more sentences, and show items with fewer sentences later.
For example, there might be 5 sentences containing "學生" and 0 sentences containing "大學生". Since the [Existing Vocabulary Requests] prioritizes items with fewer sentences, most people will add sentences for "大學生" first. By the time sentences containing "大學生" reach 10, sentences containing "學生" reach 15 — but most of those "學生" sentences actually contain "大學生". This is harmful to the lexical richness of the corpus.
Thanks, that is really a problem. We are keeping track of this issue here https://github.com/Tatoeba/tatoeba2/issues/3275 and I answered you there.
Il contenuto di questo messaggio infrange le nostre regole e dunque è stato nascosto. È visibile solamente agli amministratori e all'autore del messaggio.
In another post - https://tatoeba.org/de/wall/sho...#message_41768 - I offered to @LeviHighway to publish some small programs I'm using to extract greek sentences and roughly sort them by difficulty, based on word counts.
Meanwhile they are online. You can find them at https://github.com/kumakyoo42/tatoeba_tools
The sorting algorithm is mainly designed for languages with spaces between the words. I added support for counting symbols instead of words. With this, I hope at least, sentences of other languages might also be sorted roughly by difficulty. On a whim I checked this with Japanese sentences. At first glance it looks like it might work. The easiest one was "はい" (yes). But I don't know enough about Japanese language to judge the more complicated sentences.
Thank you very much, kumakyoo. Your tool is helpful. I'm always grateful for your programming works. They benefit everyone.
Il contenuto di questo messaggio infrange le nostre regole e dunque è stato nascosto. È visibile solamente agli amministratori e all'autore del messaggio.
Il contenuto di questo messaggio infrange le nostre regole e dunque è stato nascosto. È visibile solamente agli amministratori e all'autore del messaggio.
Corpus Maintainer Candidate for Chinese (Mandarin);
Kandidato por la bontenado de la ĉina korpuso;
Candidat pour la maintenance du corpus chinois (mandarin) ;
Кандидат в ответственные за корпус для китайского (мандаринского) языка;
Korpuspflegerkandidat für das Chinesische (Mandarin).
LeviHighway: https://tatoeba.org/user/profile/LeviHighway
🇬🇧 As usual, please feel free to send us a private message to share your opinion (click on the link below).
[epo] Kiel kutime, ne hezitu sendi al ni privatan mesaĝon pri via ĉi-rilata opinio (alklaku la ĉi-suban ligilon).
🇫🇷 Comme d’habitude, n’hésitez pas à nous envoyer un message privé pour nous faire part de votre opinion si vous le souhaitez (cliquez sur le lien ci-dessous).
🇷🇺 Вы можете, как обычно, отправить нам личное сообщение, чтобы поделиться своим мнением (кликните по ссылке ниже).
🇩🇪 Schickt uns wie immer gerne eine Privatnachricht, um uns eure Meinung mitzuteilen (auf den Link unten klicken).
https://tatoeba.org/private_mes...sichbaeumchen.
Il contenuto di questo messaggio infrange le nostre regole e dunque è stato nascosto. È visibile solamente agli amministratori e all'autore del messaggio.