Vägg - Tatoeba

Vägg (7 343 trådar)

Tips

Before asking a question, make sure to read the FAQ.

We aim to maintain a healthy atmosphere for civilized discussions. Please read our rules against bad behavior.

Senaste meddelandena

subdirectory_arrow_right

kumakyoo

igår

subdirectory_arrow_right

AlanF_US

igår

subdirectory_arrow_right

gillux

för 2 dagar sedan

subdirectory_arrow_right

kumakyoo

för 2 dagar sedan

feedback

CK

för 3 dagar sedan

subdirectory_arrow_right

kumakyoo

för 4 dagar sedan

subdirectory_arrow_right

frpzzd

för 4 dagar sedan

feedback

kumakyoo

för 4 dagar sedan

subdirectory_arrow_right

TATAR1

för 6 dagar sedan

feedback

TATAR1

för 6 dagar sedan

kumakyoo för 4 dagar sedan 16 mars 2026 16:55:26 UTC

flag

Report

link

Permalänk

Zwei Monate Tatoeba

So lange ist es schon her, dass ich diese Webseite gefunden habe. Inzwischen habe ich viele liebe Menschen kennengelernt. Vor allem gefällt mir der Humor, der hier immer mal wieder durchschimmert. Ich hatte beispielsweise bei einem Dutzend griechischen Sätzen von carlosalberto geschrieben, dass da das falsche Fragezeichen verwendet wurde. Er hat sich da die Mühe gemacht und bei jedem eine Antwort drunter geschrieben, ich gebe das einfach mal so wieder, weil ich es so lustig fand: Done - Yes - OK - Sim - Ναι - Oui - Incrível - Impressionante - Dezenas de casos - Quanta distração - Faltam poucas frases - Até o Mandarim usa o ponto de interrogação latino. Mas o Grego... - Está acabando - Só falta uma - Espero que esta tenha sido a última. Que trabalho lhe dei! Mil perdões. E muito obrigado.

Abgesehen davon bin ich aber immer noch etwas auf der Suche nach meinem Platz hier. Inzwischen kristallisieren sich zwei Ecken heraus, die da in Frage kommen: An der Software mithelfen und mich um den griechischen Korpus kümmern. Ersteres ist etwas, was ich eigentlich ziemlich gut kann - ich programmiere immerhin schon seit 43 Jahren - und dennoch komme ich hier immer wieder an meine Grenzen.

Das Problem ist ein Framework namens CakePHP, das wir hier verwenden. Ich bin ohnehin kein Fan von Frameworks. Die versprechen immer, alles einfacher zu machen, aber in Wahrheit fügen sie einfach nur eine zusätzliche Komplexitätsebene hinzu, ohne dass man viel davon hat. CakePHP ist da (meiner Meinung nach) keine Ausnahme.

Ein Beispiel: Ich versuche seit einiger Zeit dafür zu sorgen, dass normale Mitarbeiter die Verbindungen zwischen zwei Sätzen wieder aufheben können, wenn sie diese Verbindung selbst erstellt haben. Dafür braucht man eine Datenbankanfrage, die nicht ganz trivial ist. Ich habe etwa 15 Minuten dafür gebraucht, diese Datenbankanfrage zu formulieren. Ohne CakePHP könnte man diese Anfrage einfach so verwenden, da wären vielleicht noch weitere 5 Minuten für nötig; für CakePHP muss man sie aber übersetzen in eine Reihe von Funktionsaufrufen. Dafür habe ich letztendlich fast zwei Wochen und zweimal auch noch die Hilfe von gillux benötigt. Und das Lächerliche an der Geschichte ist, dass die Arbeit, die CakePHP einem jetzt abnimmt, die ist, alles wieder in die Datenbankanfrage zurück zu übersetzen, mit der alles begonnen hat. Das fühlt sich für mich sehr frustrierend an.

Nur leider ist halt CakePHP da und selbst wenn man wollte, wäre das aus der Tatoeba-Software nicht einfach rauszukriegen. Also muss ich da jetzt mit klarkommen, wenn ich weiter an der Software mithelfen will. Was mir helfen würde, wäre eine vernünftige Dokumentation (sowas wie die Java Language Spezification, die wirklich alle Details auflistet und nicht nur an Hand einiger Beispiel erklärt, wie man bestimmte Dinge erreichen kann), aber da habe ich bislang nichts Brauchbares gefunden.

Tja, und die andere Ecke wäre der griechische Korpus, aber ich glaube, da schreibe ich ein andermal was dazu. Der Text hier ist auch so schon lang genug. :-)

dölj svar visa svar

frpzzd för 4 dagar sedan, redigerad för 4 dagar sedan 16 mars 2026 18:12:52 UTC, redigerad 16 mars 2026 18:13:13 UTC

flag

Report

link

Permalänk

Kumakyoo, ich freue mich über deine Zusammenfassung hier und über die Mühe, die du dich gegeben hast, den Korpus und die Codebasis zu verbessern! :-)

Leider habe ich das Gleiche mit CakePHP erfahren (obwohl du hast wohl schon mehr Erfahrung damit als ich): ein kleines Stückchen Funktionalität, das als eine SQL-Query ganz einfach zu formulieren wäre, muss trotzdem als CakePHP-Funktion umformuliert werden und dadurch manchmal fünfmal komplizierter werden. Vor allem, weil CakePHP (im Vergleich zu anderen Sprachen/Frameworks) nicht so ausführlich dokumentiert ist, oder weil die Dokumentation nicht sehr navigierbar ist.

Dafür sehe ich keine gute Lösung. Idealerweise würde man das Ganze von Tatoeba in eine andere Framework (oder gar keine) oder sogar in eine andere Programmiersprache (vielleicht lieber etwas anderes als PHP?) umschreiben, wobei man SQL-Querys direkt benutzen kann, aber da ich selbst nicht in der Lage bin, eine solche Riesenaufgabe zu übernehmen, und die existierende Oberfläche schon sehr gut funktioniert, kann ich mich nicht viel beschweren.

dölj svar visa svar

kumakyoo för 4 dagar sedan 16 mars 2026 18:32:06 UTC

flag

Report

link

Permalänk

Ja, das hattest du an anderer Stelle ja auch schon mal geschrieben. So weiß ich immerhin, dass ich nicht der einzige bin, der damit Probleme hat. :-) Danke dir!

kumakyoo för 2 dagar sedan 19 mars 2026 09:08:49 UTC

flag

Report

link

Permalänk

So, ich hab' ja versprochen, auch noch was zum griechischen Korpus zu sagen. Zum einen habe ich mit einigen kleinen Programmen Sätze gesucht, die offensichtliche Fehler enthalten, also ein fehlendes Satzzeichen, ein falsches Symbol etc. Da hatte ich ja schon mal was dazu geschrieben.

Ich habe bei all diesen Sätzen einen entsprechenden Kommentar hinterlassen und @change gesetzt (ich bin ja inzwischen fortgeschrittener Mitarbeiter). Bei vielen stand aber schon ein solcher Kommentar und oft war @change auch schon gesetzt. Die meisten Kommentare wurden um 2015 rum geschrieben. Geändert hat sich an den Sätzen seither nichts.

Das liegt daran, dass hier keine griechischen Muttersprachler aktiv sind. Einen Korpus Maintainer gibt es dafür nicht. Und die meisten Besitzer der Sätze sind verschollen. Also ändert sich auch nichts und ich habe die Befürchtung, dass auch meine Aktivitäten ohne Resultat verpuffen werden, denn ich kann auch nichts mehr weiter tun; der Ball ist jetzt bei den Korpus Maintainern und Administratoren.

Darüber habe ich länger nachgedacht. Bei der Wikipedia könnte sowas nicht passieren. Das liegt einfach daran, dass dort jeder alles ändern kann, sogar nicht angemeldete Benutzer. Erst wenn es Probleme gibt, werden Artikel gesperrt und sind dann nur noch einigen Benutzern zugänglich. (Und es gibt seit einiger Zeit den Sichtungs-Mechanismus, dass Änderungen von Benutzern, denen nicht vertraut wird, erst noch gesichtet werden müssen, bevor sie der Allgemeinheit zugänglich gemacht werden.) Überträgt man das auf Tatoeba, dann kann man sagen, hier sind alle Sätze in dem Gesperrt-Zustand, was bedeutet, dass eine kleine Menge an Leuten die ganze Arbeit machen muss. Und zudem gehören die Sätze einzelnen Personen und sind nicht Allgemeingut, was wahrscheinlich die Ursache für diesen Unterschied ist.

Naja, da ich bei den kaputten Sätzen nichts mehr machen kann, habe ich mir was anderes gesucht, nämlich griechische Sätze ins Deutsche zu übersetzen. Dafür habe ich mir alle griechischen Sätze ohne deutsche Übersetzung (mit einem Algorithmus der auf Worthäufigkeiten basiert) grob nach Schwierigkeit sortiert und gehe diese jetzt der Reihe nach durch.

Inzwischen habe ich schon etwa 3000 Sätze übersetzt. Meist ist es einfach, aber manchmal muss ich auch ganz gut recherchieren, meist, weil mir nicht klar ist, ob der Satz eine übertragene Bedeutung hat oder ob er direkt übersetzt werden kann. In wenigen Fällen musste ich auf eine Übersetzung verzichten, weil ich zu gar keinem Ergebnis kommen konnte.

Jetzt habe ich noch 33800 Sätze zum Übersetzen übrig. Wenn ich im bisherigen Tempo weitermache, habe ich ausgerechnet, dann bin ich Ende 2027 damit fertig. Aber die Sätze werden ja schwerer und meine Motivation dürfte auch im Laufe der Zeit etwas zurückgehen und: Es kommen ja auch immer mal wieder neue Sätze dazu… Realistisch dürfte also eher 2040 oder so sein.

Naja, damit habe ich zumindest erst mal was zu tun. :-)

dölj svar visa svar

AlanF_US igår, redigerad igår 19 mars 2026 11:46:37 UTC, redigerad 19 mars 2026 11:47:42 UTC

flag

Report

link

Permalänk

Many thanks for tagging and adding comments to Greek sentences that need to be changed. I see 313 that are tagged "@change". While I don't know Greek, the comments indicate that most of the problems involve punctuation. I can see that there are some differences between punctuation in Greek and other European languages, but nothing conceptually difficult that would make me feel unqualified to perform these changes, given that you've described what needs to be done.

In general, corpus maintainers (including administrators) focus on fixing sentences in languages they know well, but if given enough information, they're also able to do it in other languages.

dölj svar visa svar

kumakyoo igår 19 mars 2026 19:17:15 UTC

flag

Report

link

Permalänk

Punctuation is almost identical to other European languages. The question mark looks like a semicolon (;) and is also that ASCII character, and the semicolon is a special symbol, a central dot (·). That's all. There is one special case, namely "ό,τι", where the comma is part of the word (to distinguish it from "ότι" which has a different meaning) and therefore in this case there is no space after the comma.

And I would be very happy, if one of the corpus maintainers could fix these sentences. :-)

gillux för 2 dagar sedan 19 mars 2026 10:10:11 UTC

flag

Report

link

Permalänk

Thank you for sharing your journey with us, it is always a pleasure to read your diary. It reminded me of similar happy moments with other Tatoebians I had in the past when I focused on translating Japanese sentences.

About CakePHP, I agree that its documentation is not always very good. What I usually do to find my way around is to directly browse the source code of CakePHP itself. I find it rather easy to get around and understand. For example, when you asked me how to use the NOT EXISTS clause, I just ran some `grep -r EXISTS vendor/cakephp/cakephp/src/` and quickly found the method. Hope this helps.

CK för 3 dagar sedan 18 mars 2026 05:25:08 UTC

flag

Report

link

Permalänk

🍎 I've been working on creating a non-stop bilingual TTS player for all the selected bilingual pairs I've been using for my /bilingual/ and /anki/ pages.

If you have time to help me, by watching videos for the languages you know, I would appreciate some feedback, especially feedback on whether the "wildcard" substitutions function is working in those languages.

► Please send feedback to me via https://tatoeba.org/en/private_messages/write/CK instead of cluttering this public wall.

This is an "unlisted" playlist of "unlisted" demo videos.
https://www.youtube.com/playlis...vCCwMU3aicHXQW
Average video length : 1 minute, 11 seconds
Total length : 10 minutes, 40 seconds

English-Dutch
English-French
English-German
English-Hebrew
English-Italian
English-Japanese
English-Korean
English-Portuguese
English-Spanish

► There is a link to the menu page for "alpha" version of this Bilingual TTS Player in the YouTube description of each of the videos.

The quality of the TTS voices will depend on what you have installed on your device.

For some of the languages, TTS may not be available, and you will only hear the English TTS.

för 4 dagar sedan 17 mars 2026 07:38:56 UTC

link

Permalänk

warning

Meddelandets innehåll strider mot våra regler och är därför dolt. Endast administratörer och meddelandets upphovsman kan se det.

för 4 dagar sedan 17 mars 2026 07:32:05 UTC

link

Permalänk

warning

Meddelandets innehåll strider mot våra regler och är därför dolt. Endast administratörer och meddelandets upphovsman kan se det.

TATAR1 för 6 dagar sedan 15 mars 2026 04:09:31 UTC

flag

Report

link

Permalänk

Tatoeba Stats, Graphs & Charts have been updated??

dölj svar visa svar

TATAR1 för 6 dagar sedan 15 mars 2026 05:58:36 UTC

flag

Report

link

Permalänk

Thank you very much. Now this information has appeared on the wall.

för 6 dagar sedan, redigerad för 6 dagar sedan 14 mars 2026 17:44:53 UTC, redigerad 14 mars 2026 17:47:30 UTC

link

Permalänk

warning

Meddelandets innehåll strider mot våra regler och är därför dolt. Endast administratörer och meddelandets upphovsman kan se det.

LeviHighway för 11 dagar sedan 10 mars 2026 06:17:23 UTC

flag

Report

link

Permalänk

How should we handle "Grammatically Correct but Unnatural" sentences?

For example: https://tatoeba.org/zh-tw/sentences/show/9987065

This sentence says: "In the dead of night, I am always as if I released a heavy burden."

This is completely grammatically correct, but to me personally it's semantically awkward.

Should we keep this sentence as the way it is, or propose a more natural rephrasing?

dölj svar visa svar

Ooneykcall för 11 dagar sedan 10 mars 2026 10:42:04 UTC

flag

Report

link

Permalänk

It probably felt natural to the person who posted it, who is also a native speaker?
People can and do regularly disagree about which phrasing is (the most) natural. I think you should like to have a consensus of several independent native speakers before it may be proper to modify a phrase that isn't outright wrong, especially if it was posted by a native speaker.

dölj svar visa svar

LeviHighway för 10 dagar sedan 10 mars 2026 16:02:10 UTC

flag

Report

link

Permalänk

To be honest, the controversy stems from the fact that Chinese is a paratactic language, whereas English and most Western languages are hypotactic. In Chinese, as long as a sentence has a clear predicate (sometimes even without a formal verb) most structural combinations are technically "grammatically correct."
However, in the pedagogical and professional reality of Chinese speakers, especially in formal writing, a sentence is considered wrong if it is semantically imprecise. Even in colloquial speech, where semantic rigor is often relaxed, I still maintain that "semantic awkwardness" constitutes a fundamental error.

dölj svar visa svar

DaoSeng för 10 dagar sedan 11 mars 2026 00:12:06 UTC

flag

Report

link

Permalänk

I think what you are trying to get at is that Mandarin is a topic-prominent language. Topic-prominent languages—independent of the syntactic rules that revolve around subject, verb, object—have a separate set of rules for how topics and comments should be treated. Thus, I would agree that some of the issues you point out (also with regard to other sentences you commented on) are far more serious than mere awkwardness.

The problem with Tatoeba—presenting a single sentence in isolation—is that it strips away the context that makes these topic-prominent languages tick. For some of these sentences you have qualms with, I can think of elaborate and specific discourse contexts that would render them natural. I would think, for many more of these, the contributor had very specific contexts in mind.

I do not see a way of getting around this issue. Only allowing sentences that sound good as stand-alone utterances is obviously no good. So, while I agree that this is a real problem, I am not sure if there is a way out.

kumakyoo för 19 dagar sedan 1 mars 2026 16:58:31 UTC

flag

Report

link

Permalänk

Im Chat (https://chat.tatoeba.org/) hatte ich mit @gillux mal kurz darüber nachgedacht, was Tatoeba am meisten nützen würde. Spontan hatte ich dort geschrieben, dass mir Feedback fehlt.

Ich habe seither viel darüber nachgedacht, wie solches Feedback aussehen könnte. Hier ein paar Ideen:

a) Hier bei der Pinnwand und bei den Kommentaren zu den Sätzen könnte man ein Thumbs-Up als Rückmeldung einbauen (aber bitte kein Thumbs-Down, das hat das Potential, die Stimmung zu vergiften). Vieles, was ich gelesen habe, finde ich gut und würde das gerne dem Autor zurückmelden, aber einen extra Kommentar dazu zu schreiben, fände ich überzogen. Ein Thumps-Up wäre genau das, was mir hier fehlt.

b) Wenn ich nach einem Tag Abwesenheit (ich muss ja auch manchmal arbeiten) zu Tatoeba zurückkehre, würde ich gerne wissen, was in der Zwischenzeit passiert ist. Derzeit beschränkt sich die Rückmeldung auf der Webseite auf "Jemand hat dir eine private Nachricht geschrieben." Per Mail erhält man zusätzlich noch Nachricht, wenn jemand zu einem Satz von mir was geschrieben hat, oder zu einem Satz, bei dem ich einen Kommentar geschrieben habe. Ersteres sind meist Rückmeldungen, dass ich was falsch gemacht habe. Zweiteres oft einfach nur ein "Danke."

Hier würde ich deutlich mehr wissen wollen: Hat jemand einen meiner Sätze übersetzt? Hat jemand einen neuen griechischen Satz erstellt? Hat jemand ein Etikett bei einem meiner Sätze (oder bei einem griechischen Satz) geändert? Und so weiter. Das kann man natürlich alles irgendwie mit diversen Suchen herausfinden, aber ich fände es schöner, wenn man das auf einen Blick serviert bekommen würde, so, wie beispielsweise bei GitHub.

c) Persönliche Statistiken. Es gibt zwar ein bisschen was dazu, aber was mir vor allem fehlt sind zeitliche Statistiken, beispielsweise ein Digramm, das für jeden Tag auflistet, wie viele Sätze ich hinzugefügt habe, wie viele ich übersetzt habe und wie viel ich insgesamt an dem Tag beigetragen habe. (Und mich würde das auch für andere Benutzer interessieren, dann könnte man beispielsweise auf einen Blick feststellen, dass jemand schon seit langem inaktiv ist.)

d) Ich weiß nicht, ob darüber hier schon viel diskutiert wurde. Ich denke, es würde Tatoeba gut tun, wenn man hier direkt eine recht einfache Lernsoftware haben würde. Ich stelle mir das z.B. so vor, dass man eine Liste an Übersetzungen auswählt und dann von dieser Liste einen Satz angezeigt bekommt, sowie drei potentielle Übersetzungen und man einfach nur wählen muss, welche der drei die korrekte Übersetzung ist.

Warum liste ich sowas bei Feedback auf? Nun, wenn man sich mit den Sätzen beschäftigt, fallen Fehler auf, und wenn man hier schon ein Konto hat, kann man den ja auch gleich beheben, oder? Wenn jemand sowas mit einer anderen Lernsoftware macht, dann ärgert er sich zwar vielleicht über den Fehler, aber die Wahrscheinlichkeit, dass diese Info hier ankommt, ist gering.

Tja, soweit meine Ideen. Was meint ihr dazu?

dölj svar visa svar

araneo för 19 dagar sedan, redigerad för 19 dagar sedan 1 mars 2026 19:46:20 UTC, redigerad 1 mars 2026 19:48:18 UTC

flag

Report

link

Permalänk

I agree a thumbs-up or heart react would be great, I was thinking about making a post about that too. That way we can show we agree with something or show our appreciation without spamming the wall.

For C, have you seen the statistics that sharptoothed makes? You can see them here (https://tatoeba.j-langtools.com/allstats/) if not. They're not what you asking for, but they are very interesting, and I agree even more statistics would be great.

For D, I think Tatoeba is intended to be purely for the sentences, and then other projects use the sentences to make learning tools. You can see them here if you haven't already: http://a4esl.org/temporary/tatoeba/links.html

dölj svar visa svar

Cezrun64 för 19 dagar sedan 2 mars 2026 06:41:46 UTC

flag

Report

link

Permalänk

Hmm... What does it say about Tatoeba: there are no Like icons, but there are Report/Snitch icons?

dölj svar visa svar

kumakyoo för 18 dagar sedan 2 mars 2026 17:09:42 UTC

flag

Report

link

Permalänk

*thumbsup* ;-)

kumakyoo för 18 dagar sedan 2 mars 2026 17:05:57 UTC

flag

Report

link

Permalänk

Danke, araneo, für deine Antwort. Ich kenne die Statistiken, die du verlinkt hast (die schließende Klammer muss man da übrigens aus der URL entfernen, das ist ein bekannte Bug dieser Webseite) und schaue sie mir sehr gerne an. Sie sind aber halt nicht auf dieser Webseite und das führt dazu, dass viele Benutzer sie nicht finden. Was ich suche, sind Möglichkeiten, die Leute bei der Stange zu halten, also zu verhindern, dass sie nach ein paar hundert Sätzen die Lust verlieren. Und dafür braucht es irgendeine möglichst direkte Rückmeldung, denke ich.

Das Problem ist ja, dass wir alle in unserer eigenen Sprache Experten sind, aber das genau die Sprache ist, an der wir am wenigsten interessiert sind. Das heißt, wenn man hier Sätze in seiner eigenen Sprache eingibt, nutzt einem das selber nichts. Man provitiert halt indirekt davon, dass andere das in ihrer Sprache auch machen. Ich denke, solche Statistiken könnten eine zusätzliche Motivation bilden.

Ähnlich sieht es auch mit den Lerntools aus. Auch die sind zu weit weg, als dass sie als Feedback noch geeignet sind. Inbesondere kann man die in der Regel auch nutzen, ohne dass man hier auch nur einen Satz eingibt. Ich habe schon vermutet (oder vielleicht auch irgendwo gelesen), dass Tatoeba ein reines Satz-Sammel-Tool sein soll. Ich denke aber schon, dass man ein Beispiel-Lerntool integrieren könnte ohne diesen Charakter aufzugeben. Man kann ja dort dazuschreiben, dass das primär zu Testzwecken da ist und dass es weitaus bessere Tools auf anderen Seiten gibt.

dölj svar visa svar

LeviHighway för 14 dagar sedan 7 mars 2026 01:42:34 UTC

flag

Report

link

Permalänk

我覺得學習工具很難實現。您提到引入「選擇正確的譯文」這類練習題，這類練習題應該是人工編寫的還是自動產生的呢？

如果是人工編寫，有沒有這麼多人手是個問題，練習題的製作規範也是個問題。

如果是自動產生，您希望是怎麼實現？隨機顯示一個句子，再隨機產生幾個選項嗎？如此一來，系統會產出很多沒有學習價值的練習題（比如過於晦澀的字詞或過於簡單且重複的句子），而且自動產生的選項也很有可能是明顯錯誤的，這種練習題對學習並沒有幫助。

dölj svar visa svar

kumakyoo för 14 dagar sedan 7 mars 2026 06:40:05 UTC

flag

Report

link

Permalänk

Ich würde auf alle Fälle den automatischen Ansatz verfolgen. Ziel ist ja nicht unbedingt, ein richtig gutes Lerntool zur Verfügung zu stellen, das kann auf externen Seiten und mit Apps passieren.

Ich fände es halt wichtig, dass die Sätze auch hier genutzt werden. Dann findet man Fehler und kann sie direkt beheben. Die Probleme, die du beschreibst, sehe ich schon auch, würde sie aber in Kauf nehmen. Ich denke, man könnte im Laufe der Zeit auch Algorithmen entwickeln, die die größten Probleme aussortieren. Zudem würde ich das auch eher spielerisch ansehen und weniger als Tool zum professionellen Lernen.

dölj svar visa svar

LeviHighway för 12 dagar sedan, redigerad för 12 dagar sedan 9 mars 2026 02:54:41 UTC, redigerad 9 mars 2026 02:55:08 UTC

flag

Report

link

Permalänk

我認為可以參考 Duolingo 的機制，透過選詞的方式來讓使用者組織出譯文。Tatoeba 有一些明顯的優勢：無刷題限制、無廣告、使用者可以隨時提出錯誤回饋並受到母語人士驗證。這也是我十分希望 Tatoeba 不只是句子的翻譯資料庫，也應該是詞彙及片語的翻譯資料庫的原因，目前沒有任何字典以 Tatoeba 的機制運營（除了 Glosbe，但 Glosbe 是無人管理品質的狀態）。

dölj svar visa svar

kumakyoo för 10 dagar sedan 10 mars 2026 16:05:19 UTC

flag

Report

link

Permalänk

Ja, das klingt alles gut. Bin ich auch dafür! :-)

marafon för 17 dagar sedan 3 mars 2026 13:13:05 UTC

flag

Report

link

Permalänk

Is there a way to make new members read the rules before contributing?

dölj svar visa svar

doemaar14 för 17 dagar sedan 3 mars 2026 16:30:29 UTC

flag

Report

link

Permalänk

+1 A simple line of code could be written that detects a user's first contribution on the website and automatically displays a message containing the rules.

dölj svar visa svar

marafon för 17 dagar sedan 3 mars 2026 17:02:37 UTC

flag

Report

link

Permalänk

I was thinking about something like that, yes.

dölj svar visa svar

CK för 17 dagar sedan 4 mars 2026 06:50:31 UTC

flag

Report

link

Permalänk

Something like the following text would be short enough that they might likely read it.

https://en.wiki.tatoeba.org/art...rt-quick-start

I think it would be better, though, not to link to the wiki page which takes people off the main tatoeba.org site.

dölj svar visa svar

AlanF_US för 16 dagar sedan, redigerad för 16 dagar sedan 4 mars 2026 13:29:30 UTC, redigerad 4 mars 2026 13:30:10 UTC

flag

Report

link

Permalänk

As an alternative (or in addition) to displaying information when a user first adds a sentence, we could add this information to whatever they see when they're signing up for an account.

dölj svar visa svar

LeviHighway för 16 dagar sedan 4 mars 2026 15:35:26 UTC

flag

Report

link

Permalänk

I think most people skip the "terms" while signing up for accounts.

dölj svar visa svar

AlanF_US för 15 dagar sedan 5 mars 2026 13:24:51 UTC

flag

Report

link

Permalänk

Terms of use are typically very long and often include information that seems obvious or irrelevant. We're talking about something much shorter, like:
- Write complete sentences, with correct capitalization and punctuation, in a language that you know well. Don't take sentences from elsewhere unless they're in the public domain, and don't use AI-generated or machine-translated sentences.

dölj svar visa svar

marafon för 15 dagar sedan 5 mars 2026 13:53:28 UTC

flag

Report

link

Permalänk

...and make sure you're adding your translation to the right sentence.

dölj svar visa svar

AlanF_US för 13 dagar sedan 7 mars 2026 15:42:44 UTC

flag

Report

link

Permalänk

I'd like to understand this better. Are you frequently seeing people adding translations that have nothing to do with the originals? Does it seem as though they intended to add translations to other sentences that they saw recently? Do you have any examples?

dölj svar visa svar

marafon för 13 dagar sedan, redigerad för 13 dagar sedan 7 mars 2026 19:43:07 UTC, redigerad 7 mars 2026 22:01:09 UTC

flag

Report

link

Permalänk

No. I'm seeing people adding translations to any language they see meaning to translate the indirect English or other language they know.

They see, for instance, this Russian sentence:
https://tatoeba.org/ru/sentences/show/13788899
Russian isn't listed in their profile among the languages they know, and they obviously don't understand it. However, they understand the indirect English (I want Tom to read this.) and they add their translation to the Russian sentence instead of adding it to the English one.
Then they translate from Finnish, Marathi, Berber, Chinese etc.

See e.g.
https://tatoeba.org/ru/activiti...Tigre?page=200
or
https://tatoeba.org/ru/sentence...omment-1570172
https://tatoeba.org/ru/sentence...omment-1570171
or
https://tatoeba.org/ru/sentence...omment-1567660

I see it happen once a month or so. Sometimes, by the time I notice this, hundreds of sentences have already been added.

kumakyoo för 15 dagar sedan 5 mars 2026 15:50:59 UTC

flag

Report

link

Permalänk

Ich glaube, dass es nicht viel bringt, wenn man versucht, die Leute zu zwingen, die Regeln zu lesen. Es wird genügend Leute geben, die das einfach wegklicken, ohne es richtig durchgelesen zu haben. Und andererseits besteht die Gefahr, dass man Leute, die sinnvoll beitragen wollen damit vergrault.

Was war denn der Auslöser für deine Frage? Welche Fehler machen die Leute?

Ich könnte mir vorstellen, dass eine Revert-Funktion sinnvoll wäre. Soweit ich weiß, gibt es sowas bislang nicht: Administratoren (und vielleicht auch Korpus-Maintainer) könnten sich anzeigen lassen, welche Aktionen ein anderer Benutzer vorgenommen hat und dann bei jeder anklicken, dass diese revertiert werden soll. Wichtig dabei ist, dass man viele Änderungen auf einmal rückgängig machen kann, und nicht jeden Satz einzeln anfassen muss.

Oder man könnte sich auch Kriterien überlegen, die automatisiert bestimmt werden können (kein Satzzeichen am Ende, sehr kurz, etc.) Wenn nach 5 Sätzen viele solcher Kriterien erfüllt sind, wird der Benutzer erst mal temporär gesperrt und ihm mitgeteilt, dass es so aussieht, als hätte er die Regeln nicht gelesen. Sollte man natürlich netter formulieren.

dölj svar visa svar

marafon för 14 dagar sedan, redigerad för 14 dagar sedan 6 mars 2026 13:32:40 UTC, redigerad 6 mars 2026 20:52:45 UTC

flag

Report

link

Permalänk

I think it's worth the risk.

Once a month a new user comes and starts adding hundreds of non-sentences without paying attention to comments nor (sometimes) to PMs. What’s worse, other users immediately start translating them, and thus the situation is snowballing.

Every other new user keeps adding sentences without proper capitalization and/or punctuation.

Now and then, a new user starts translating from every language they see, not understanding what direct and indirect translations are.
Some of them do that on purpose, using machine translation or AI.
Let alone those who add sentences in the languages they don't know.

There’s no need to mention that it ruins others' work.

So maybe we should take a risk and do something to make them read a few lines that Alan mentioned.

I'm not mean, just fair.

Regarding your suggestions, I'm not sure whether we have resources to carry out the first one, and whether it will be transparent enough. As for the second one, it’s really worth considering, if it’s not too hard to implement.

dölj svar visa svar

kumakyoo för 14 dagar sedan 6 mars 2026 16:06:57 UTC

flag

Report

link

Permalänk

I think, the implementation shouldn't be too difficult (*). But first of all we need a list of such criteria; they need to be language specific, of course.

I try to collect some criteria which should give a penalty:

* Sentence is very short. Maybe extra penalty if only one word.
* Wrong capitalization.
* Wrong punctuation.
* Contains a symbol that's unusual in that language.

(*) Actually I'm struggeling a lot with implementing something in Tatoeba. Stuff that I thought would be implemented in a few minutes takes weeks and I already had had to give up on one project. So maybe my estimate here is wrong too.

dölj svar visa svar

araneo för 14 dagar sedan 6 mars 2026 20:26:09 UTC

flag

Report

link

Permalänk

I do not agree about the short sentences, or think they should be at least dependent on the language. E.g. in Spanish they are more common than in English, because there is more conjugation of verbs, and pronouns can therefore be dropped. New users may feel more comfortable translating shorter sentences, and they're also what comes up first when you search by relevance, so this could lead to them being unfairly blocked. Maybe in this case instead of blocking them outright, it could just flag them to be checked by a moderator.

For the other suggestions, I agree they would be helpful, but I also wonder how difficult it would be to create an accurate set of characters, punctuation, and capitalization rules for all the languages on Tatoeba, especially as some languages use multiple scripts.

dölj svar visa svar

marafon för 14 dagar sedan, redigerad för 14 dagar sedan 6 mars 2026 20:42:30 UTC, redigerad 6 mars 2026 20:45:56 UTC

flag

Report

link

Permalänk

I agree. Actually, a simple message for new users would already be of great help.

kumakyoo för 14 dagar sedan, redigerad för 14 dagar sedan 7 mars 2026 09:03:32 UTC, redigerad 7 mars 2026 09:56:29 UTC

flag

Report

link

Permalänk

I meanwhile think it would be a good idea to issue a warning, whenever a user adds a sentence with some abnormalities. This would be useful for more experienced users as well.

For example, the last symbol of the sentence can be rather easily checked. I counted them for all languages with at least 10,000 sentences (top 69). This is my result:

jpn, cmn, yue: 。？！」…）：
ara, pes, ckb: .؟!")
hin, ben, asm: ।?!."
tig: ።:?.!፧፥፡
hye: ։:
ell: .;!"'”“»:…)
yid: .?!״”‟":
zhg: .?!"
jbo: aeiou.
all others: .?!"”':…)»“

I'm not familiar with all of these languages. If you are familiar with one of these languages and you think an other symbol should be added, please tell me. And if you think, some of them shouldn't be included, please also tell me (for example I'm quite unsure if the dot is considered correct in jbo).

This can be implemented in an easy algorithm, skipping all the languages that are not in the top 69. Of course, this check isn't perfect, because, for example french sentences should not end with ohne of those: "”'“ This list might be refined of course, but for a start, I think it would be sufficient, especially if we do not talk about stopping someone completely, but only about issuing a warning.

PS: I added an issue at GitHub: https://github.com/Tatoeba/tatoeba2/issues/3264

EugeneGS för 14 dagar sedan 7 mars 2026 09:06:41 UTC

flag

Report

link

Permalänk

> Maybe in this case instead of blocking them outright, it could just flag them to be checked by a moderator.

I think it would be very useful to have some sort of menu or autotags that would allow moderators to easily double-check new sentences (maybe not just those written by new contributors, but by anyone; perhaps with a different tag or something). I suppose adding an autotag system wouldn't be too difficult, but it would be very beneficial since it would make it easier to double-check new sentences and show what has and hasn't been reviewed by a moderator.

dölj svar visa svar

AlanF_US för 13 dagar sedan 7 mars 2026 22:23:45 UTC

flag

Report

link

Permalänk

It's not practical to have every sentence should go through a moderator before it is considered okay. It's not even practical to have every sentence contributed by a new user go through a moderator.

dölj svar visa svar

EugeneGS för 13 dagar sedan 7 mars 2026 22:49:24 UTC

flag

Report

link

Permalänk

I'm not saying it should go through a moderator before appearing in the corpus or something. I meant a tag or something that would make it easier to find unreviewed sentences, like removing marked sentences from a search. I'd love to have something like this for reviewing Toki Pona sentences while excluding previously reviewed ones.

frpzzd för 14 dagar sedan 7 mars 2026 08:00:39 UTC

flag

Report

link

Permalänk

I would suggest the following additional "warning sign" criteria: if a brand-new user translates into/from many different languages within a very short time of joining the site, or translates from languages that are not added to their profile. (Users are already prevented from translating INTO languages absent from their profile, but a user must have some skill level in a language in order to translate FROM that language as well.)

As far as how easy these are to implement - that's another question entirely. I've had the same experience as kumakyoo with developing on Tatoeba. (I have not done anything recently, but I added some things a few months ago.) Gillux is a great help when it comes to getting familiar with the code base and workflow, but it still takes a long time to implement very simple things.

dölj svar visa svar

AlanF_US för 13 dagar sedan, redigerad för 13 dagar sedan 7 mars 2026 23:17:23 UTC, redigerad 8 mars 2026 02:27:39 UTC

flag

Report

link

Permalänk

You make a good point about the asymmetry in the fact that a user cannot add a sentence in a language they don't know, but they're free to translate a sentence in a language they don't know. I think that's an oversight, and we could get a lot of advantage, without obvious disadvantages, by not just warning users who do this, but preventing them from doing it in the first place. I wrote an issue ticket (3266) about this on GitHub.

dölj svar visa svar

marafon för 13 dagar sedan, redigerad för 13 dagar sedan 8 mars 2026 00:13:58 UTC, redigerad 8 mars 2026 00:57:12 UTC

flag

Report

link

Permalänk

Thanks, Alan!

(BTW, I think the link should be: https://github.com/Tatoeba/tatoeba2/issues/3266 )

dölj svar visa svar

AlanF_US för 13 dagar sedan 8 mars 2026 02:27:25 UTC

flag

Report

link

Permalänk

When I wrote "#3266", I wasn't intending it to be a link, and it didn't occur to me that Tatoeba would treat it as a sentence link. Thanks for providing the actual link. I couldn't access it on a computer where I wasn't logged into GitHub, but hopefully the people who want to see it will be able to.:)

NappeunKkum för 15 dagar sedan 5 mars 2026 13:45:34 UTC

flag

Report

link

Permalänk

I'd love to add furigana to some japanese sentences, how can I do it?

dölj svar visa svar

araneo för 14 dagar sedan 6 mars 2026 14:54:29 UTC

flag

Report

link

Permalänk

You press on the two arrows button on the top right of the sentence, then the edit button on the left, and type the furigana in the {} brackets. I think you have to be an advanced contributor to be able to do it though. To become an advanced contributor you first need to contribute 1000 sentences.

Vägg (7 343 trådar)

Tips

kumakyoo

AlanF_US

gillux

kumakyoo

CK

kumakyoo

frpzzd

kumakyoo

TATAR1

TATAR1

Behöver du hjälp?

Utvecklare

Om