Wall (6,253 threads)
Before asking a question, make sure to read the FAQ.
We aim to maintain a healthy atmosphere for civilized discussions. Please read our rules against bad behavior.
20 hours ago
22 hours ago
2 days ago
2 days ago
2 days ago
5 days ago
Edit: Resolved (thanks CK)
If you don't know how to solve this, you need to add an equals sign before the word, like "=expectations", clicking the "Help" button on top of the search bar gives you more functions
One thing that annoys me sometimes is when I look up something, for example "expectations", and it gives me all the forms of the verb as well, so I can't find the noun, even if I put the keyword in quotes.
Understating one's knowledge of a language on the profile page (such as rating your English as 1 star when you're obviously better than that and can communicate nicely) in a show of modesty or just for giggles may be funny but probably confusing for those not in the know. lol
I'm just always genuinely confused about having to rate anything on a scale of 1 to 5. When it comes to languages, I don't want to overstate my knowledge; I'd rather understate it.
If I could choose, I'd rather it be on a scale of 10 because I don't want to settle for one point or the other, and I round down to be safe (I think my Japanese is 1.5⭐ so I set it at 1⭐ since I think 2⭐ is overshooting, and my English is 4.5⭐, and it's only at 5⭐ because other natives asked me to keep it like that) although that can easily be explained in the description
But honestly I don't really have a need to understate my knowledge because I already know I'm bad at all of them :)
Makes sense. Point was, if you can have a non-basic conversation in a language (and discussing sentences isn't basic), then you're definitely better than 1 star and rating your skills as such is kinda silly and possibly misleading for those not in the know, i.e. rare visitors or newcombers. Just wondering quietly, not like it bothers me personally.
That's probably just how it is when dealing with self-rating.
You've got a range of people from those who are humble and under-value their skills, and those that are not so humble and proclaim full fluency in 5+ languages.
I found a problem. Some dialects are added to the language list, but it is just a dialect of a language. In other words, it does not have a separate iso-3. For example, in Shanghai dialect, I see Wuu is marked on its icon, and Wuu is the iso-3 code of Wu. Wu includes not only Shanghainese, but also other dialects, such as Zhejiang dialect and Suzhou dialect. Similar problems exist in other languages. For example, Kölsch is considered a dialect of Ripuarian,they share iso-3.
The solution is to change their names? Or keep it as it is, but give in other dialect a tag?
Colognian (Kölsch) can be added because it has an ISO 639-3 code (ksh) - https://en.wikipedia.org/wiki/Colognian, However, Zheijiang doesn't have one - https://en.wikipedia.org/wiki/Zhenjiang_dialect. You can request SIL to add a code for this language - https://iso639-3.sil.org/code_tables/639/data.
Hope I have helped you.
Thanks! But the iso-3 codes for Kölsch and Ripuarian are both ksh?
This problem (and especially wuu/shanghainese) have been discussed and partially solved in that thread https://github.com/Tatoeba/tatoeba2/issues/1670 but there is still some work to do. As much as we’d like to fix all the language names, each renaming needs to be studied with insight from native speakers and contributors. I suggest contacting sabretou if you want to help or inquire about it.
I’d like to help, but how can I help?
As Ricardo mentions, the decisive factor is the ISO 639-3 code. While Colognian is grouped within Ripuarian, only Colognian has an ISO 639-3 code, so someone who speaks non-Colognian Ripuarian will either need to petition SIL for a separate code assignment or assimilate with the Colognians or they're SOL.
That means that, in theory, the language corresponding to ISO 639-3 "wuu" should be named "Wu Chinese" on Tatoeba. However, as one of the languages added to Tatoeba before we had a sane "not our problem, go bother those bureaucrats instead" solution to the language/dialect distinction problem, Shanghainese gets some special treatment. There's a very old TODO comment about this in the source code: https://github.com/Tatoeba/tato...esLib.php#L305
This could be solved by getting individual ISO 639-3 codes assigned for various subvarieties of Wu, but the only person so far who requested SIL to make such a change did a really bad job fulfilling the formal requirements and the request was rejected. You can read the change request and the rejection notice here: https://iso639-3.sil.org/request/2019-063
So until someone makes a successful second attempt, we'll have to use tags instead. There's a tag for the Suzhou dialect with 14 sentences: https://tatoeba.org/zh-cn/tags/..._with_tag/1864 There may be others.
If it bothers you that there are sentences in Suzhou dialect listed as "Shanghainese", we could rename it to "Wu Chinese".
I think it can be named Wu. Because Shanghainese does not seem to have a separate iso-3 code, and there is more than one dialect in Wu.
In the Spanish interface, some of the logs are missing the date.
En el interfaz de español, a algunos de los registros les faltan la fecha.
For example / Por ejemplo
#7942751 Mary fue atacada por un leopardo en Tanzania.
The English logs say / Los registros en inglés dicen:
added by Seael, May 28, 2019
But the Spanish logs just say / Pero los registros en español solo dicen:
añadida por Seael,
Without the date / Sin la fecha
Same goes for the total number of sentences.
Lo mismo pasa con el número total de oraciones.
When I look at my sentences, the number of total sentences doesn't show.
Cuando veo mis oraciones, no se ve el número total de oraciones.
I looked into this problem. It happens because of a mistranslation of the Spanish UI. The date is translated here: https://www.transifex.com/tatoe...urce/186665868 (this link requires a Transifex account). The total number of sentences is here: https://www.transifex.com/tatoe...urce/163036559
You are welcome to improve the Spanish translation if you’d like. See the instructions here: https://en.wiki.tatoeba.org/art...ce-translation
Thanks, I sent my application to join.
Thanks again, I think I was able to fix it, at least on DevTatoeba
I went through the entire script and replaced the wildcards that were translated with the original wildcards, along with adding a couple translations myself, so hopefully this is smoothed out now
There are still a couple translations left before the entire interface is completed, and I'd rather a true native translated the rest
Thank you for helping us with this! I updated tatoeba.org today so your fixes are now online.
» Baluchi has been enabled as a User Interface language - https://tatoeba.org/bal
» 2 languages have been added
- Zeelandic - https://tatoeba.org/en/sentence...ne/indifferent
- Limburgish - https://tatoeba.org/en/sentence...ne/indifferent
Hello, I've just had an error. The code was ''60983a67df34e''. I was searching for the word ''запас". The issue turned out to be consecutive and did not seem to be disappearing. Thank you
This was a temporary error. Thank you for letting us know!
I also have this error at the moment.
This was also a temporary error. Thank you for letting us know!
** Stats **
1,186,073 sentences have OK ratings.
That's about 12% of our sentences.
These 30 members have made most of these ratings.
I did this for another purpose and thought that other members might like to know this information.
I'd be curious to know how the people on this list choose the sentences that they review and then mark OK. Do they consciously or unconsciously focus on any of the following? (Notice that here I'm using "native" as shorthand for "self-identified native-level".)
- sentences by non-native speakers who have explicitly requested reviews of their sentences in general
- sentences that non-native speakers have specifically asked to have reviewed
- sentences by non-native speakers in general
- sentences by particular speakers
- shorter sentences
- longer sentences
- sentences of interest to beginning learners (simpler structure/vocabulary)
- sentences of interest to advanced learners (more advanced structure/vocabulary)
- sentences with audio
- sentences that they themselves have written
A related question: do you use any means to try to randomize the sentences you consider, such as using a random sort criterion rather than a default ("relevance") or "shortest sentences first" criterion?
I worry that people inside and outside Tatoeba will use OK tags or ratings as criteria for deciding that sentences are particularly worthy of being selected/studied, or the absence of such tags/ratings for deciding that they are not worthy. There is already a high correlation between shorter, simpler sentences, and sentences that are selected for audio recording, and if these are also the sentences that preferentially receive OK tags/ratings, I am concerned that the diversity of sentences and coverage of vocabulary, here and in projects that use our corpus, will suffer.
In my case, I generally mark sentences OK only when the author has requested that they be checked. However, this does not mean that I think they're more suitable for use than other sentences that I think are OK but have not been singled out for review.
Please understand that I salute people for caring enough about the quality of our sentences to take the time to review them. I just would like to know how people are selecting the sentences that they are considering.
I don’t give many reviews, most of my OK’d sentences are from non-native owners when I come across the sentence, but I usually don’t search for sentences to review, and I do this for English and Spanish (the two languages I can judge somewhat accurately)
A good portion of my reviews are either “unsure” or “not OK” which usually mean that I want someone else to look at it, but those are sentences I come across as well and not ones I look up explicitly to review, mostly since I don’t trust my grammar knowledge to give accurate corrections a lot of the time
I mainly put the OK tag in sentences that had the @needs native check tag. I check other sentences too (at least from time to time) but don't tag them as OK unless the user is very prone to errors.
I put OK ratings on German sentences owned by non-natives that I consider natural, but I don't systematically look for sentences to rate. I often check sentences that have been added since my last visit. I never ever rate my own sentences. I don't rate sentences owned by other natives. I don't give negative ratings. After I correct a sentence, I always put an OK rating and an OK tag on it. I give ratings so people know they can trust those sentences and the authors also know they are correct.
Most natives also have weaknesses with some rules of the language, so in my opinion, it's important to check all of them. So I'm always reviewing randomly.
I'm reviewing all the Dutch sentences, except sentences that have recently been added by myself. When I'm rating a sentence, I'm also linking new sentences and checking the existing links. So later, when I'm coming across the rated sentence again, I only have to check the links again and make links with other sentences which have been added in the meanwhile.
If many native speakers happen to break a certain established rule, perhaps the rule should be quashed and actual usage granted legitimacy :p (of course clear errors exist as well)
Personally speaking, I mark as OK Italian sentences from other users to sort of say that I find them correct, and doing so helps me to proofread, so that I don't have to go through the same sentences over and over; but I don't like marking my own sentences, since it may be obvious that i find them correct enough when I submit them (althought I may miss some accidental mistakes)
I don't mark my own sentences OK either, but I sometimes mark them "unsure" if I'm especially uncertain
Merkitsen lauseita jos satun avaamaan niitä. Jossain vaiheessa kävin läpi satunnaisia suomenkielisiä lauseita ja arvostelin ja kirjoitin tunnisteita niille. Nykyään olen tehnyt sitä paljon vähemmän, mutta jos sattumalta avaan jonkin lauseen omassa näkymässään, laitan yleensä tunnisteet ja muut merkinnät sille.
Regarding the value of OK-ratings, I’m a bit on Cabo’s side. You have to be an insider to know why some sentences have them and others don’t. I would like to assume that sentences are correct normally, (though they might not always be perfect and natural). Judging „naturalness“ is often rather arbitrary.
I do use OK-ratings when I explicitely want to state a sentence that might be doubtful for other natives speakers is good in my opinion. - for example when it features a regionalism I am familiar with. (I admit that’s maybe quite the opposite of what OK means to most people.)
I use the ?-rating more often. To me, it means „Not really a good sentence“ and I imagine it will be understood as „There are native speakers who think that’s not really a good sentence“. So maybe I can prevent people from putting too much thought in such sentences.
I don't care about these ratings, only the 'unsure' and 'not OK' marks say something to me.
CK, you mark your own sentences as OK, and because of it you can't filter out such mistakes like extra words, letters in the wrong place... the same rate like a real double checker could.
** Stats - 2021-05-15 - Native Speakers with Contributions **
Cool I'm exactly 100th
** Tatominer **
Thanks to MarijnKp, Guybrush88, shekitten, Shishir, Thanuir, Johannes_S, AlanF_US, GlossaMatik, Pfirsichbaeumchen, manufrutos, Micsmithel, Ricardo14, Ergulis, GrizaLeono, Mtargem, felix63 and morbrorper for their contributions that helped move the project forward this week.
Check out the most searched words that lack sentences or translations in your language at https://tatominer.netlify.app.
What exactly can we talk about on the Wall? Are we allowed to just talk about languages and get answers from native speakers? If so, then...
English doesn't have any grammatical gender, but as a native English speaker myself, I can still say that I (and likely others I hope) do connect some nouns to a gender. For example, I still consider "butterfly" and "flower," or "la mariposa" and "la flor" in Spanish, as "feminine nouns," and "volcano" and "alligator," or "el volcán" and "el caimán," as "masculine nouns." But there are a few nouns I don't think fit the gender assigned to them in Spanish. For example, "color," "rainbow," "bikini," and "heart" are all masculine nouns in Spanish, and "lobster," "meat/beef/sausage," "beard," and "snake" are all feminine nouns in Spanish. For me, cows kind of look masculine, but the ones that give milk are female, so I guess I can see why it's feminine in Spanish. But with that logic, "mosquito" should also be considered feminine, since only the female ones bite. As for "el adjetivo" or "la oración," I really can't say much about either.
Anyway, my actual question is, are there any nouns that even native Spanish speakers think don't match their grammatical gender? Or does the gender always automatically give the speaker an idea of the noun?
Even as someone who's spoken Spanish from birth and still speaks it now and so should've theoretically mastered it, I've mixed up my genders plenty of times
But then again I sometimes mess up pronouns and names in English with the people around me (calling my brother "she" and my mom "Dad") so maybe it's just that my head stuffing needs changing
Plus my language development was odd so I'm sorta English-dominant now
I can see what you mean about some words matching the stereotypes of masculine and feminine. Volcanos are strong and powerful, while butterflies are delicate and aesthetically pleasing. I personally don't think a lot like that since I'm not a huge fan of gender roles or stereotypes, and linguistically I've gotten used to the singular definite article "the" that fits every word. I mostly guess what gender each word is based on what it ends in, and obviously it doesn't work all the time (el día, el problema, la radio, la canción)
If everything would go my way in my Spanglish speaking world, I'd use "the" for everything and not worry about what gender each word is. I think that if some people adhere strongly to classifying each word to a gender, it's promoting a rigid system of everything being predefined to the standards of the past and the status quo, with not much wiggle room. Granted, that's a bit of a philosophical stretch, but that's how I see it.
Also, it's too much work for me to memorize the gender of each and every word.
So no, in my case at least, the gender doesn't consciously give me an idea of the noun, most of the time. Subconsciously it's still there sometimes, but that might tone down in a few years.
I do wonder what true Spanish-speaking natives would say about this.
As far as I know, the Wall is OK for discussion as long as it's productive and especially if it involves several people, if it turns into a longer, more personal convo between two users it might be better to switch it over to the DMs. But I'm a relatively newer user too.
In most cases I'm personally just used to using one article or the other and don't think about it. In others, however, it's true that I consider some things male or female due to this influence of the language (such as the sun or the moon). In the cases of animals we tend to use their biological gender to refer to them and as a cow will never be male (that would be a bull - el toro) it has the feminine gender, in the cases where the word refers to the animal without taking the gender into account such as el mosquito and la hormiga, there are el mosquito macho and el mosquito hembra, la hormiga macho and la hormiga hembra if you want to refer only to one particular biological gender, so seeing an ant doesn't make me think it should be masculine because I know there are both.
If I were to give every single letter a gender, instead of the all-feminine letters Spanish has, it might be: la a, la b, la c, el d, la e, la f, el g, la h, el i, el j, el k, el l, la m, el n, el o, la p, el q, la r, la s, el t, la u, la v, la w, el x, el y, el z.