Wall (5,915 threads)
Before asking a question, make sure to read the FAQ.
We aim to maintain a healthy atmosphere for civilized discussions. Please read our rules against bad behavior.
13 hours ago
15 hours ago
2 days ago
3 days ago
3 days ago
3 days ago
3 days ago
3 days ago
3 days ago
Hi everyone, I've just noticed that a lot (but not all) sentences labelled as Slovak are in fact not Slovak, and a couple of sentences labelled as Russian are in fact not Russian.
Some "Slovak" examples:
I know they're not Slovak because, as a native speaker of Czech, I am able to understand Slovak. I don't understand these, though. They appear to be in some Southern Slavic language, possibly Serbian/Croatian or perhaps Slovenian.
Some "Russian" examples:
These can't be Russian because, obviously, they're not in the Cyrillic script. Again they seem to be Serbian or Croatian, or perhaps Slovenian.
I wonder, what's going on here? Are these all language-detection errors?
It should be good now :)
There's just the mystery of there being 71 sentences in one and 69 in the other... I was sure I always added them in pairs. I wonder what happened.
Nevermind, it's just a flag display error on the main page.
Indeed, I updated it ^^
Thank you for bringing this up on the wall. I've gotten numerous comments on my sentences because of this, and so maybe it's better to just put the explanation here for everyone to see.
Yes, there are numerous sentences under "wrong flags". This is not a mistake. This is intentional. All the sentences you have mentioned are either in Bosnian or Croatian. Because those languages are not yet available on Tatoeba, there is no correct flag to put them under. If you look at the sentence tags, you'll notice that all of the sentences you highlighted are already marked "Bosnian" or "Croatian". They will be fixed and put under the proper flags once these languages become available.
You're right. Some languages aren't listed at Tatoeba yet, despite sentences having been added in them. These are sometimed incorrectly labelled until the language can be added.
These have, however, been tagged as sentences in another language. You should see the tag to the right of the sentence. If you come across a sentence with the wrong language and isn't tagged, you can comment on that sentence (or, in particular if you find a large batch of these post them here as you've just done).
The system doesn't say when the tag was added so there's a chance the tags were added after you posted this here. In that case; thanks for bringing it to the attention of the community.
I see. These sentences are work in progress. Nothing to be alarmed about, then.
Saluton al ĉiuj, mi estas 78 aĝa emerita pensiulo, Mi esperantistiĝis en 1957. Mi estas kompilanta Esperanto-Vjetnama vortaron. Krome Esperanto, mi parolas la vjetnaman (mia denaska linvo), la francan (tre bone), la anglan (sufiĉe bone). Pezuras al mi respondi9 al ĉiuj viaj demandoj. Thu (aŭtuno)
Ni estas feliĉa vi estas tie ĉi!
(my Esperanto is not good ^^')
[not needed anymore- removed by CK]
We will do this in the new version, (yeah it's my new "How can I make people wait sentence :p" )
To be honnest the code which manage the user profile is a bit crappy, and the time it will take is too long compare of the use "I'm talking for the do not show transliteration"
For the point 2, the new version will use a totaly new architecture, which will permit really easily this kind of "only what I need" filtering.
Tatoeba 'Back door' request.
I would like a url like the
ones but where you can use parameters to determine what is shown.
e.g. something like ...
* Show sentence 311760
* Show translations (trans=1)
* Do not show indirect translations (indirect=0)
* Do not show transliteration (or furigana) (translit=0)
I think this would be useful when using this site as a learners reference from third party sites (or programs).
it's typically what will be the purpose of the API :)
Hi, I'd like to share two of my ideas.
1. Export to csv is a great feature! However, I had a little problem with that csv - when I opened it in MS Excel, it was not recognized as a Unicode file, so non-Latin letters were displayed incorrectly.
After I opened and re-saved it in Notepad, Excel correctly recognized the encoding of the file.
I compared the original CSV and the re-saved one, and I noticed that the latter contained extra three bytes at the beginning with hex codes 0xEF, 0xBB, 0xBF. I'm not very skilled in the technical stuff, but I believe this is some attribute that indicates the encoding of the file.
If you could add these three bytes by default that would be great! If not, it's also ok with me - notepad is always handy.
2. Could you please implement the notification for new personal messages? We've already discussed this with Demetrius. He's told me that this hadn't been implemented because of potential spam threat, but maybe you could make this checkmark unchecked by default, and only those who are fully aware of what they are doing will check it?
It's really convenient to know immediately that you have a new message.
That's all for now :)
Many thanks to the Tatoeba team for such a great project!
> three bytes at the beginning with hex codes 0xEF, 0xBB, 0xBF.
This is all about the BOM. Microsoft programs loves the BOM, but many Linux, Unix and Mac OS programs hate the BOM.
Technically the BOM is not required for 8-bit Unicode (which Tatoeba uses) but it is still valid.
There are a bunch of free utilities around on the web to add/remove BOMs.
(P.S. BOM = Byte Order Marker).
> when I opened it in MS Excel, it was not recognized as a
> Unicode file
This is a known Excel bug (= Excel sucks ;-). Here is how to import a file in Excel 2007 (and later).
1. Create a new, blank Excel workbook.
2. Click 'Data'
3. Click 'From text'
4. Select file and follow instructions.
I've just tried it - and it works, thanks for another way around.
But for most of the users (including me) it's way faster and easier to click "open" after the file is downloaded, than opening Excel and opening it through the Import feature. With these three bytes that I mentioned Excel opens the file fine.
However, as I've already mentioned, I'm not a technician, maybe they work only for Windows or maybe there are other potential problems.
this site can be so useful for the new learners,,,but it still has some empties,,,i really need such a site,,but i wish the establisher of this site,,make this site perfect,as its name tatoeba,,,,thanks alot,all the best ...............
Just about everything on this site is open source and available for use. You can take a copy for yourself and do what you want with it (within the terms of the license).
But the best way to make this site perfect is to take part in improving it. For example, by helping the Tatoeba interface support more languages.
what is your goal?
To have examples of sentences for every word, in all languages in the world. ^^
It will help:
- language learners (and teachers -- they can create exercises based on this data)
- program developers (you can download all the sentences from http://tatoeba.org/eng/download...mple_sentences )
It's also good because rare languages are supported, like Uyghur or Shanghainese. It's hard to find examples in these languages.
dear sysko!i meant TATOEBA project!
dear Dametious!if i translate one sentence to Persian wrongly,do you check whether it's true?
I can’t check Persian. :( I don’t know it.
But other people can.
I can check only Russian and Ukrainian translations.
*I can’t check Persian _yet_.
*I don’t know it _yet_.
*But other people can _for now_.
No, this is not going to happen. If I were to learn Iranian languages, I would take up Tajik, because it’s written in Cyrillic. ^^
It's actually a brilliant idea to add Tajik sentences to here, we merely need to find a Tajik native :)
Language of Tajikestan is Persian!
Persian of Iranian is different from Tajik,but the difference isn't great,
we call their language FARSI-E-DARI,
but our language is FARSI
Farsi, Tajik, Dari.
Dari is a branch of Farsi!
Dari and Farsi are separate languages. What you are saying is like saying that Dutch is a branch of English.
No!do you know?i'm from the north east of Iran where is near to Afghanestan,Tajikestan,Uzbakestan,i have heard how Tajik,Afghan people talk.
i have heard their language.in scientific text,it is written,that Dari & Tajiki are a kind of Persian(or Farsi).
i understand their(tajik & afghan) word!!!!!
Wiki gives an indicative list of differences between Farsi and Dari here:
perhaps,i have too much patriotic emotion!:)
Well, understanding does not mean the same language. Ukrainians understand 99% of Belarusian and vice versa, but it does not mean they are the same language.
I'm not sure whether it's relative or not, but we and Iranians might have different understanding of what language really is.
I used to study Modern Standard Arabic and I communicated a lot with Arabs from different countries. They were all damned sure, that Standard Arabic, Egyptian Arabic, Saudi Arabian Arabic and all other kind of Arabics are the language. They are just different dialects. That's what they are taught at schools. Any Western linguist, however, would classify Egyptian Arabic and say MSA as two different though closely related languages.
you took the words from my mouth...exactly my view of how it should work ;D
Yes, but I can read Tajik letters, and I can’t read Persian. :)
So do I. In fact, Cyrillic or Latin characters are much more precise than Arab ones in rendering the real pronounciation, because they do show vowels, which the Arab alphabet does not show.
it shows some vowel such as:a,ou
in addition to,in Farsi,we don't have much vowels,
any way,Arab script became usual,when Arabians attacked to Iran,to extend their religion,about 1300 years ago
after this war,First Persian poet was written by Rodaki,he lived in where we call Uzbakestan,this happend about 2centuri later than
The Arabic letters are not at all suitable for Indo-European languages such as Farsi or Tajik simply because they do not allow writing words with the same consonant profiles and different vowels (homography), which leads to misunderstanding.
yes,your word is right,before 1300 years ago(before arab attack to Iran)
irainian write by a script that is likes sanskcrit!
Well, it depends on the language.
Arabic script for Uyghur shows all the vowels.
That would be my decision as well :)
Transliteration seems to be up for Uzbek.
Question: do you know why е is transliterated as "ye"?
i try to check their sentences!
yeah!this site is good for learners.i learned some grammar structure.
you mean, what is the goal of Tatoeba project ?:)
Is it just me, or has there been a sudden flood of new users here today?
In fact seems tonight something has made tatoeba famous in Iran, if someone can explain why ? Anyway I hope they will find tatoeba usefull, tough we only have 31 sentences in Persian yet :(
do you know?your site introduced by BBC Persian channel!
i think the reason is this!!!!!!!!!!!
Have you a link or so ?:)
what do you mean?
i am not bilingual,i speak English difficulty,ha ha ha
however,i register;i join your community
I mean, do you have a youtube link, or in BBC Persian website, where we can see this ?:) I'm really curious ^^
No problem for not being bilingual, I'm also struggling with English hehe
i can't hyperlink ,because Iranian don't acsess
this site+Youtube.they are filtered
in addition to if previous session of CLICK program(this program introduce you)exist their site,you can't understand Persian
http://www.bbc.co.uk/persian/tv...tv_click.shtml Saeb finds a link :)
I have a Iranian friend, so when I will see him, I can ask him to help me understand :)
We do need a safe search!
Now I'm not sure I should have added 398986 ;)
I was really happy to know you Demetrius.
Help me to find the meaning of words.
i don't need interpret scentences!
As blay_paul said, the goal of tatoeba is to do what dictionnaries don't, so to be used in parallel with them, not to replace them. The main reason is that it's no use to create duplicate works, and we prefer to make only one things and make it good, rather than too much things of poor quality/quantity.
Thanks,i already knew some of this sites!
i didn't express my mean correctly1
i mean smart site,these sites are designed bad!
it doesn't matter for me online dic is to mother tongue!
if a dictionary exist free to download,let me know!
Tried a dictionary? It would probably help if we knew which language you were interested in as well.
thank you for reply!
i need a prefect dictionary
my mother tongue is Persian.
I don't know of any perfect dictionary.
Are any of the following any good?
Thursday WWWJDIC examples update summary.
36 records deleted.
17 new records.