Wall (6,005 threads)
Before asking a question, make sure to read the FAQ.
We aim to maintain a healthy atmosphere for civilized discussions. Please read our rules against bad behavior.
23 hours ago
2 days ago
3 days ago
3 days ago
3 days ago
4 days ago
4 days ago
4 days ago
By the way, about the XXX tag...
How it should be used? Consider the following groups:
a) sentence describes a sexual intercourse in rude words,
b) sentence uses the rude words with an indirect meaning, to describe something other,
c) sentence describes a sexual intercourse with euphemisms
In which cases XXX tag should be used?
I would say a) and c) and possibly b) (but depending on circumstances).
We'll probably need to fine tune that sort of thing later. It's not really a priority at the moment as the tags don't actually do much yet.
I don't think these should should be tagged under a single label. I think a) and b) could be tagged with something like "rude" or "obscene" and a) and c) with something like "sex" or "pornographic". c) should furthermore be tagged with "euphemism".
I this is actually a great time to think about how we're using and how we'd like to use the tags, seeing how it's not a priority and we can play around with them.
Should we keep proverbs offending other nations in the database? And how should these be tagged?
We already have some Ukrainian proverbs about Russians in the database (sth like “You can ward off the devil crossing yourself, but you can’t ward off a Moskal”). ^^
You could tag them as "stereotype", "prejudice" or even "bigotry".
I'm all for freedom of expression, but I think that clear lines are more useful for a project such as this (oh, and trying to steer around hypocritical stances would be nice). Seeing how the aim of this project is to gather sentences and link them to translations -- not to disseminate facts -- and the web is open to anyone wanting to espouse their hate, I'm not overly concerned about losing offensive content.
I do, however, prefer the idea of simply filtering out the filth one doesn't like.
There are already quite a few labelled as 'Lie'. Filters would be nice (filtering out XXX sentences should be a priority if we want to be 'school friendly'), but I think that sort of content is still needed (in moderation) to give a full coverage of language usage.
> full coverage of language usage.
Talking about which, I'm reminded of a Japanese speaker who was certain that a certain body part was referred to as "pussy cat". Nothing I could say would persuade him that the 'cat' wasn't needed; he just said "I use it all the time with my girlfriend so it must be right."
Feature request - download all in current web.
Suppose you have sentence A, B, C, D, E, and F like below:
.A -+- B --- C ---E.
....+- D --- F......
Then I'd like to be able to download all of those sentences as a list regardless of what sentence I'm looking at at the moment.
maybe in a near future but not now looking to the way they're stored, and when it will be possible to do so, then you will be able to see A B C D E when looking to whichever sentence, it will not be restricted to indirect translation anymore.
When that time comes, it would be useful to be able to increase and decrease the depth from which sentences are shown. I've sometimes been translating a sentence and known that it is a translation of a third one, a couple of nodes away. Since it isn't shown, I've had to do a bit of gymnastics to link those. It would be great to be able to show progressively more distant sentences until one finds the one one's looking for.
Also, it would be great to be able to search for sentences in one particular language that are only indirectly translated into another particular language.
Hi,I've just found Tatoeba and would like to contribute.
I could do it with the Esperanto version. I am a teacher of Esperanto at language schools and teached it for two years at the University of Santiago de Compostela as well.
Also I could help with the Galician version, if you want to make a separate section for Galician. Galician is a variety or Portuguese and there are two ways of writing it: with Spanish ortography and with Portuguese ortography (commonly with the other main varieties of Portuguese: Brazilian and Portuguese from Portugal). I have a degree in Galician Philology and teaches Galician for the Autonomous Government of Galicia and at language schools.
I'm a translator and interpreter, too. I work with both varieties of Galician, Spanish, standard Portuguese of Portugal, Romanian and occasionally Esperanto.
welcome aboard :)
Welcome to Tatoeba, Shazzan :)
If you were talking about translating the website into Esperanto, then you will need to register on a website called Launchpad and then translate from this page:
If you were talking about translating the sentences in the corpus, then I suppose you already figured out.
Galician is not yet an officially supported language, but you can simply follow the instructions in the FAQ to have it added: http://tatoeba.org/eng/faq (cf. "I'd like to request a new language. What do I have to do?")
In general, if you have trouble understanding how Tatoeba works don't hesitate to ask.
I often switch between several languages, and it's always somewhat painful having to locate them in the total list of languages. It happens regularly that I inadvertently select the wrong language. Something like a list of favourite or least recently used languages for quicker selection would be very helpful.
I agree that the language list is becoming a bit too crowded but I can't say for sure when we can improve this... We will though, don't worry ^^
But in the meantime, if you want to avoid switching too often between the mouse and the keyboard, you can do the following after you have entered your translation:
- Press tab to set the focus on the languages list
- Enter the first letter(s) of the language of you want to select. For instance "ger" for German (or you can simply enter "g" and then navigate press down until you reach German).
- Press tab to set the focus on the "submit translation" button.
- Press enter to submit the translation.
This way, at least, you don't need to switch to the mouse to change the language.
In the future, we'll try to set up an interface where you won't need to use the mouse at all. It will be less cumbersome.
I made a mock-up version of the Tatoeba search interface moving the search bar in front of the language selection menus and adding the accesskey "4".
That would simplify the process into:
2. Type search query
3a. If language settings needn't be set, hit <Enter>
3b. If language settings need to be set, tab and select before tabbing to the submit button.
All with the keyboard. :-)
Any chance something like this could find its way into the interface soon?
Mock up: http://martin.swift.is/tatoeba/home.html
Accesskey combinations for a number of user agents: http://en.wikipedia.org/wiki/Ac...erent_Browsers
Right, right, sysko had told me about this. Thanks for reminding me!
And yes it can find its way into the interface soon :) Next update.
In certain cases where entering text into a form is the only purpose to going to that particular page, setting the focus to that form does make sense. So on the "Contribute" page it might make sense to set the focus to #SentenceText.
Secondly, could you add an accesskey ("4" seems to be as standard as it gets for search fields) for the search box? It's fairly unobtrusive as it has virtually no side effects.
Great! Thanks. :-)
Ok, I'm looking forward to the day this gets some improvements. Until then I can live with the current solution, just wanted to get my thoughts across. Thanks.
Do you mean the drop down lists of languages? Most user agents let you jump to the item by pressing the first few characters.
For example, for English you can select the list (by clicking on it with your pointer, tabbing to it or however you select things) and then hit the "e" key twice (the first gets you to Egyptian Arabic and the second key press selects the entry below; English.
You can also type the first few characters. In the case of Italian, you'd be quicker to type "it" than hit "i" four times.
Hope that helped.
I'm aware of that quick navigation feature with the keyboard, but in my eyes that's still too cumbersome. Too many clicks, too many changes from mouse to keyboard and back to mouse. A handful of configurable language buttons somwhere near or some often used entries at the top of the drop down list would be much better.
[@sysko and anyone interested]
Uzbek script-switching code is here:
The Cyrillic>Latin should work fine, Latin>Cyrillic may have problems with Russian loanwords. Also, it may be slow, since it’s a simple PHP str_replace w/ 2 arrays.
A simple page for anyone to toy with the transliteration:
There are about 26 sentences with the 'Bosnian' tag. Is that enough for a flag?
Yes. Especially because there are going to be lots, lots more...
Are there any Bosnian and Croatian sentences these NOT matching Serbian?.. :o
I add all three versions together when I add Bosnian/Croatian.
We really need the flags before the duplicate-script removal starts eliminating Bosnian sentences as duplicates of the Croatian... ;-)
(just kidding, I differentiated all the flags)
(Croatian as well, please)
Could you add a link from the sentence annotations page of an example to the example itself?
should have a link to
It seems silly that this hasn't been brought up before, but why not add a note about the need for periods and capitalization? I think a lot of new users are just not aware of this point until they get commented on.
We'll be processing sentences to automatically add missing periods, capital letters and fix other typography issues. Sacredceltic had actually brought this up already.
But it's true we can add a note about punctuation and capital letters. I'll append it to the red warning message below the translation form.
BTW, we should be careful when adding sentences with periods.
Sometimes there are non-standard 1337-like ‘punctuation’ like "!!1" in the end.
I’m not sure whether we should keep such sentences though...
I think non-standard punctuation is appropriate when the rest of the example is of the same style. (This applies to the ♪'s used to end some Japanese sentences).
I've got ~3k sentences with the period at the beginning -_-' (help, plz?)
I think looking how long we know each other that I can do it for you :p
can you give an id of a correct position period and a non correct one, because in the database I'm a bit confused about how they display arabic script.
It sounds like a good idea to me.
Thursday WWWJDIC examples status update.
30 records deleted.
13 records added.