Perfil
Frasas
Vocabulary
Reviews
Lists
Marcapaginas
Comentaris
Comentaris sus las frasas de cueyayotl
Cabinats
Jornals
Audio
Transcriptions
Translate cueyayotl's sentences

They are probably as far apart as Galician and Portuguese (which HAVE two different ISO 639-3 codes) and probably more dissimilar, than say, varieties of English.
You are right, though. We would get plenty of duplicates and near duplicates (they use different spelling conventions). But, in the present state, we would definitely be repelling potential Valencian-speaking users who are not comfortable with the "Catalan" classification of their language. We would be throwing out a big chunk of our corpus... a shame, as Valencian is quickly being spoken less and less.
I wonder if there is something more clever that we could do... maybe have both languages (and flags)... but catalog all sentences under the code "CAT" as usual. There MUST be something...

I wonder if we couldn't add Valencian as a separate language, distinct from Catalan, due to political reasons? Valencian speakers are usually very insistent that their language is distinct from Catalan, and I wonder how many potential members we may have lost due to our current classification scheme, an issue brought to my attention by our new Valencian user (isnamar).
The main issue is that they share the same ISO 639-3 code: CAT.

It is OK if they are vulgar, but if they don't sound Polish at all, that is a big problem.
You can send a private message to me with the Sentence #s, or to any user with an 'admin' or 'corpus maintainer' status here:
https://tatoeba.org/eng/users/all
Thank you for your interest. We still don't appear to have a corpus maintainer for the Polish language, so maybe after you have been here on Tatoeba for a while, you could become our first Polish language corpus maintainer? :)
Welcome to Tatoeba!

Tatar is usually written in Cyrillic, though it has been written with Latin and Arabic scripts. I wonder if, like Uzbek, it could have the alternate scripts generated automatically. The Cyrillic and Latin can be transcribed character by character, so it shouldn't be TOO difficult to implement (though the Arabic is another matter).
We now have Cyrillic-Latin pairs (like #1285837 - #4925667) as well as Latin-Arabic pairs (like #4813918 - #4813914)

-- I don't know what you're talking about with your "logic".
>> PS: what would you say about voting for mathematical results ? Let's make a test.
>> 2+2 = ?
>> a) 4
>> b) 2
>> c) 0
>> d) 5
>> I vote for c)
-- I just provided a good exemple of crowd idiocy
>> It [Urbandictionary] refers you to another word ('cunt') and gives you an example sentence (which should give one the idea of what the word means already). When you click on the word 'cunt' it sends you to its page, and after seeing that the definition 'woman' doesn't make sense in the original example sentence, we have the definition: "A synonym for a woman's genitalia, vagina, pussy, etc." Now we have learned the definition of the word, as we wanted.
Maybe there are examples of crowd idiocy on the site, but there just aren't any in this particular entry. Could you please provide another example? Thank you in advance.

>> Sure, here is a typical example http://www.urbandictionary.com/...hp?term=snatch
It refers you to another word ('cunt') and gives you an example sentence (which should give one the idea of what the word means already). When you click on the word 'cunt' it sends you to its page, and after seeing that the definition 'woman' doesn't make sense in the original example sentence, we have the definition: "A synonym for a woman's genitalia, vagina, pussy, etc." Now we have learned the definition of the word, as we wanted.
'Wisdom of the crowd' works fine here. Language is not an exact science, nor does it necessarily follow logic; a sentence being correct in a language cannot be compared to "1+1=2" or "1+1≠2".

I just upgraded to Windows 10, and with the Internet Edge, the editing icons are too large and don't 'fit'.
Image here: http://imgur.com/916gBTr
How do I get around this?
Edit: It looks like other browsers are OK, just Internet Edge. I'm not able to find any setting on the browser that will correct the icons, though.

Question: on my quest for "wrong flags", I realized that it would be very helpful if I could just search for sentences of users not in any language declared on their profile. Is there a way to do this?

Issue: Recently, when I try to send a link (usually by message; I haven't tried by comment) it gets shortened to something like this: https://tatoeba.org/por/stats/s...es_by_language
Then, when the receiver tries to open the link, it tries to forward them to the shortened link (which, of course, does not exist), instead of the full link. Is there a way around this?

I would also like to add that not having a language added could deter any native-speaking visitors. We have a process for adding new languages, but it is not always clear to new users/visitors to this site. If someone visits the site, and doesn't see their language here, more probably than not, they will not create an account here and not come back. Obviously, speakers of rarer languages will not flock to the site, but all it takes is one, in order to have a decent corpus of a language (as per Amastan with Berber, 123xyz with Macedonian, and sabretou with Marathi). Having more languages improves our odds.
I WOULD (in theory) like to see all six thousand odd languages here, though, in all honesty, it would be impossible to verify all activity in the lesser known languages, and easier for trolls to troll on this site. As it is, all the languages we have have plenty of resources. Though it is often impossible to know if a particular sentence is what a native speaker would say, we can definitely know if that sentence is indeed in the language it is supposed to be in and that, at the very least, makes grammatical/syntactical sense. I and several other dedicated users will continue to see to that.

No, Impersonator is right. The sentences on this site are INDEED in Hmong Daw [MWW] (White Hmong). It must have slipped my mind when I asked for Hmong Njua (Green Hmong) to be added (I think I may have been talking to a Green Hmong friend that day)... but to be fair, I did request Hmong [HMN] to be added instead. Especially in the USA (but also in Laos), many White Hmongs mix Green Hmong words into their language, so I would have preferred to have a general Hmong [HMN] language added. Also, the flag is a general Hmong Flag, and not of a particular Hmong language, so the flag probably need to be changed as well (at least to add [MWW] on the side, or ON the flag)
I believe someone claimed that [HMN] included more than just Hmong, but call it Miao or Hmong, they are all part of the Hmong language family. My recommendation is not to change these sentences into Hmong Daw [MWW] (White Hmong), but rather to just Hmong [HMN]. What do you guys think?

Would it be possible to add such a feature to other languages such as Hakka Chinese, Sumerian, or Khmer? Khmer might be harder as not even Google Translate does it well, though there is a site that gives good transliterations (though not really IPA): dictionary.tovnah.com
As for Sumerian, there is a system we use, and you can see it on the Wikipedia article: https://en.wikipedia.org/wiki/C...dian_Cuneiform
Though the "Primary" transliterations are not always accurate (à la Japanese-style)

There's been a couple of times where I thought the 'Auto detect' was misidentifying the language of my sentences, but it turned out that it was just the flag that had changed.
I DO think this is the better choice, though it does feel strange to use the new flag (being as used to the old flag as I was). The shade of the Spanish flag SHOULD be dark (It's correct now.) It'll take a couple of months to get used to, but it is something I believe we WILL get used to.
As for the Ladino flag, I don't really know if it SHOULD be changed. Though CK's suggestion above is not bad at all, nor is this unofficial flag:
http://i.imgur.com/LthAIHq.png
@odexed, I don't have a GitHub account either :) I just look there and comment here.
@Ricardo14, no way! Ever since you took over, things have really taken off! Now, we have people contributing in languages that some of us have never even heard of! :)

The Kashubian sentences just started to fly in! It's only a matter of time for the other languages :)
P.S. 'Unknown language' sentence #4678214 was changed to 'Kamba' for some reason. Everybody check the sentences in your languages to make sure there are no other unwelcome newcomers :)

But, we WERE able to see the flag. This change was requested by user sabretou, Ricardo14 posted a link to the ticket where CK even posted what the new flag would look like, but nobody commented.

But there is no central organization, or standard symbol. The only other possibilities are adding unofficial flags such as:
http://i.imgur.com/LthAIHq.png
or
https://s-media-cache-ak0.pinim...2a1dad1c8a.jpg
or even
http://2.bp.blogspot.com/-5uRAg...d%C3%ADa+4.png

How about an Israeli flag edited with the letters LAD on the side? (à la Afrikaans, Xhosa, Coastal Kadazan, etc.?).
[From the 30x20 pixel flag] Maybe the flag can be shifted 5 pixels to the left (cropping the 5 leftmost pixels), a 10x20 rectangle added on the right side (cropping the 5 rightmost pixels of the original flag) with the letters LAD written in 16x7 within the rectangle (1 pixel from the Israeli flag, 2 pixels from the right edge, and 2 pixels from either top or bottom).

+1
I'm all for it! I prefer using parentheses as well, but either is fine.
I suggest we do this with Chinese and Nahuatl (possibly Mari [Hill/Meadow Mari]) as well.

We have Arabic (MSA), Algerian Arabic, Egyptian Arabic, Iraqi Arabic, Moroccan Arabic, North Levantine Arabic. If you are looking for Egyptian Arabic, make sure to look through the letter 'E'.

These are all by user @carlosalberto
Maybe he is the right person to ask.