Wall (7,275 threads)
Tips
Before asking a question, make sure to read the FAQ.
We aim to maintain a healthy atmosphere for civilized discussions. Please read our rules against bad behavior.
small_snow
2 hours ago
frpzzd
yesterday
LeviHighway
2 days ago
frpzzd
2 days ago
sharptoothed
2 days ago
LeviHighway
2 days ago
lingomaxim
2 days ago
frpzzd
2 days ago
LeviHighway
3 days ago
alt
3 days ago
Minor list glitch.
If you look at the list page it says 'DELETE ME!' has 34 entries, but it actually only has seven. I think that deleting a sentence does not remove it from the 'count' in lists. There are probably lost of 'null entries' against IDs in lists and possibly favourites.
I noticed something similar yesterday with my "Dutch sentences to be translated into any language" list. It now says it has 102 sentences, when actually there are only 20. There aren't over a hundred sentences that are or have been in the list, though, so I don't think it has something to do with deleting sentences.
I have updated the lists counts. We don't know why your list ended up with 102, dorenda... But if it happens again, let us know, and try to give as many details as possible ^^
Yes indeed, deleting sentences doesn't remove from list. We know this but since it's not extremely important, we didn't fix it yet.
There's a white? (invisible?) flag in the lang stats that has 4 sentences and gives no description when I hover over it.
white flag ? wha ..what? where ? I see nothing ? Are you sure Saeb ? :angel face:
more seriously, it was due to a little bugs, but it's fix :)
yes in the search engine. I can see example sentences in many languages (also languages i am not interested in). Therefore I want to choose only German, English, Japanese and Russian. :) Thanks for the fast reply.
(can you use the reply button rather than create a new thread :) )
Ok I see, you would like to be able to filter on a set of languages you would defined, for the moment it's not possible, but it will be implemented in a "advance search" page, but to be honnest, I don't know when we will find time to implement this.
Sorry, for now it's not possible.
The best you can do is specify a source language and a target language (for instance from German to English). You would then only have results in this pair of languages.
But there's nothing like searching from German and display translations only in English, Japanese and Russian. You'd have to search German->English, then German->Japanese, then German->Russian, if you really need to filter out unwanted languages.
How can I select three languages for example sentences? This is very important. :)
you mean in the search engine, if you explain us what you want, we can try to find a way to do this for you even if its not possible from the interface
Missing sentences?
The following sentence pairs used to be in the Tanaka corpus, but do not appear to be in Tatoeba now. (They probably where there a week or two ago). I don't know of any reason they would have been deleted - would it be possible to track down what happened to them?
お前らの憐れみなんぞ願い下げだ。
I don't want your pity.
このホールは2000人収用できる。
This hall holds 2000 people.
心停止の患者を応急処置で生かした。
We used emergency measures to revive the cardiac arrest patient.
I'll dig in the logs tonight, to see if I can find them.
Anyway, if they were present in Tatoeba a couple of weeks ago, then you should be able to track down what happened to them if you can somehow figure out their id.
Although, if the deletion never appeared in the "latest contributions", then it must have resulted from either a dysfunction of Tatoeba, or of the duplicate deletion script, or a human mistake...
[EDIT] This is starting to look ominously like something might have gone wrong on my side ... but I'm still not sure of that.
If you could, say, do an output of all Japanese sentences missing from the current state compared with three weeks ago I should be able to pick out the suspicious ones. I will continue to try to track down what happened and what's gone missing in the meantime.
Latest ones -
ジャスティン班長、亜空間ライダーに反応出ました! [Manga]
Squad Leader Justin, a blip's appeared on the hyperspace radar!
母親は息子に言い聞かせようとしていた。
The mother tried to reason with her son.
> If you could, say, do an output of all Japanese
> sentences missing from the current state compared with
> three weeks ago I should be able to pick out the
> suspicious ones.
Yes, I was going to do this for all sentences anyway ^^ Just to make sure nothing's wrong... I wouldn't sleep in peace knowing that sentences can disappear for no apparent reason :S
Found one more.
外れた。相手に着弾していない。
Off-target. My opponent hadn't been hit.
I think I may have found out where, and when, the problem might have originated (if not why or who ;-).
It seems like they might have gone missing about the time I stopped maintaining the examples myself - so possibly the last update was missed. That doesn't explain why I didn't notice them missing before though. It's all a little mysterious.
Small(?) request - could you check that carriage returns (line feeds) accidentally left on the end of index data get filtered out?
Excel tends to add one every time I copy to the text box and I'm not sure I've always remembered to delete them.
normaly it should, we remove all kind of blank characters at the end and beginning of sentences.
OK, that's a relief.
any chance we can get a list(s) with all (english) sentences having 10 words or less?
I'm guessing this could speed up translation for everyone...I've been getting only complex ones lately.
[not needed anymore- removed by CK]
[not needed anymore- removed by CK]
sweet! I'm guessing you used the indices files? genius!
Tatoeba has been updated.
http://blog.tatoeba.org/2010/05...22nd-2010.html
But... We didn't get to test a lot before updating, so please let us know if you find any bug ^^
[not needed anymore- removed by CK]
Yes, you can still remove items from lists, but only from the "Edit" page.
Same thing for changing the name. Also, you can only rename lists you have created (I should remove this "tip" message from the index page...).
Add Yiddish? That's what I'd like to work on.
It's noted :)
In the meantime, you can always add sentences in Yiddish, even if it's not in the "official" list of supported languages. They will (should) appear with an "unknown" flag. And you can set the language later, once it's Yiddish added.
It also actually helps us if you can add at least 5 sentences, so that we have something to test the language detection with.
Oh and, we need an icon for this language. What would you recommand us to use?
I'm not sure how detection works for languages, but if there is some sort of orthographic convention analysis, perhaps I could give some rules that would help to detect Yiddish.
for the moment we use the google API to detect languages (we plan to have our own system but to be honnest it's not in our priority list), so I don't know if they support Yiddish, but anyway you can directly specify the language when adding a translation/new sentence, then the website will remember which language you've just contributed in.
Well, I know that one can translate to and from Yiddish with Google Translate. However, it's pretty terrible. :D
I'll just keep them all in my list until I can set them as Yiddish.
Thanks for the help sysko :)
Ok cool I'll give that a go. One trouble with Yiddish is that it's never been a territorial language (well, maybe 1,000 years ago).
The largest group of speakers are in America, then probably in Israel. So, neither of those flags can really be used. This is part of what I don't like about using flags for language. It's a narrow categorization which doesn't always work.
Maybe a white background with Yiddish (in Hebrew characters - easily found on wikipedia) written over it? Since there is nothing official and won't be.
I added a sentence in Yiddish (392224). It simply detected it as Yiddish, which I guessed it would.
One problem, the full stops go the beginning. Is there really no way of fixing this?
> I added a sentence in Yiddish (392224). It simply
> detected it as Yiddish, which I guessed it would.
Actually it's been detected as Hebrew rather than Yiddish. (Hebrew has been supported in Tatoeba for a long time.)
I don't know these languages, and how different they are. But it doesn't look like we can consider them the same language ^^
So if you if were adding sentences in Yiddish and not Hebrew, you will have to remember to change the language of your sentences once we add Yiddish (hopefully next weekend).
> I don't know these languages, and how different
> they are. But it doesn't look like we can consider
> them the same language ^^
Hebrew is a Semitic language, from the Afroasiatic family, and Yiddish is a Germanic language, from the Indoeuropean family. So they shouldn't be too similar. :D
By the way, it would be nice to have some kind of transliteration if you ever have some time left to implement it. :)
Sorry, yes, I meant Hebrew. :P
They are totally different languages. It's like Swahili and English. Same script, completely different languages.
Even the ways that the common script is used, is different. Not like French and English, where the scripts is used alphabetically. Yiddish uses the Hebrew script alphabetically.
f there is some way to implement some sort of transliteration, I'll happily do that. Right now I'll just wait for the language to be added and other features can be added afterwards. Even if I just have to add it as a comment. (Which I already do for the few irregularly spelt Germanic words, and the Semitic components (which use a Hebrew style spelling and are not phonetic)).
Hope that makes some sense... lol
Yep the problem is well known, and it comes from our side, so basically you don't need to change your sentences, it's us who need to change someting to handle right to left languages ^^
Ok, thanks for that sysko. I'll wait until support is ready.
Hey Yeshua! Nice to see you here. :)
(Sorry, I have nothing to add to the topic.)
Hi Dorenda :)
Browsing
Sorry for flooding the wall like this, but how about adding "unknown" to the language list on the browse page? There are currently some 59 sentences with an unknown language and I haven't seen a way to list them.
Also, when a language is set, it could be useful if the "previous" and "next" links would bring one to the previous and next sentence in that language.
> Sorry for flooding the wall like this, but how about adding
> "unknown" to the language list on the browse page? There
> are currently some 59 sentences with an unknown language
> and I haven't seen a way to list them
Adding "unknown" language to the language list may not really be worth it. I'd rather simply have a page that lists all the sentences with an unknown language.
Anyway I made a list if you are interested.
http://tatoeba.org/eng/sentences_lists/show/91
A lot of them need to be deleted though.
> Also, when a language is set, it could be useful if the
> "previous" and "next" links would bring one to the
> previous and next sentence in that language.
sysko has added this feature a few days ago actually ^^ It will be available soon :)
I ran through the list and most of these sentences are owned so not much can be done to clean them up by non-admins anyway.
Nice to hear about the previous-and-next feature. No-one can call you guys lazy, pumping out features like this. Well done. That won't be said often enough. :-)