Wall (6,960 threads)
Tips
Before asking a question, make sure to read the FAQ.
We aim to maintain a healthy atmosphere for civilized discussions. Please read our rules against bad behavior.
marafon
4 days ago
CK
4 days ago
sharptoothed
9 days ago
Cangarejo
9 days ago
Cangarejo
13 days ago
Thanuir
13 days ago
ondo
13 days ago
ddnktr
14 days ago
ondo
14 days ago
AlanF_US
17 days ago
Could we have the duplicate removal script run?
Also I sent in an email with 600+ sentence IDs that can have "British English" as a tag - what's the status of that?
I'm taking care of the British tag right now.
Got your email - thanks for taking care of that for me. :-)
in fact I will need to readapt the duplicate removal script to now handle tags and also to handle audio, which is not the case yet
and now we have more and more audio, I really need to take care of this before re-running it, and as I've said I've really few free times yet :(
Is it possible to download the lists as a tab separated file or xml file rather than a CSV?
Perhaps I'm misunderstanding the question, but aren't the lists already tab separated...?
If you click on download, it won't download right away, it will first lead to a page where you can choose what data to include in your list, and there's a short explanation about the structure.
=> sentence_id [tab] sentence_text [tab] translation_text
Unless there's a bug we missed, you will definitely be able to import them into Anki :)
you're faster than me ^^
no we only export in csv yet as it's the format which is the more commonly used, and it can easily import in excel etc.
(and anyway a csv can be a "whatyouwant separated file" so even with tab, it's still a csv, it's just the separator which change, so with a little script, you can easily from one kind of csv to another)
if you plan to reuse the data for a website/application or so, don't forget the CC-BY licence obliged you to notice us of the use (except for personnal use-only of course) and to say where the data come from etc.
The reason I ask, is because I have issues with the csv if the source sentences contain commas themselves, as it appears that there are more columns in that row then there should be.
yep sure, it's a known issue, but it was not the priority yet, so yep I had in mind to replace coma by tab to avoid the problem of coma in sentences
the export was coded really quickly, as this was not something really vital for normal user
we will warn you once it will be corrected (just to warn you, we're more or less only 2 students to make/maintain/promote/debug this website on our freetime, so lack of time/not in priority list is the main reason for most missing features/bugs here)
How is the data for the linking of the sentences stored? Would I be able to access the source files used by the project's website itself to be able make my own export code into a format that's much more useful for my purposes?
For the note, my own purposes are creating lists while browsing and being able to download and import said lists into my Anki deck.
we use a database, and our server is already to small to support much more feature, so no, till we don't have an other server, we will not permit "total" export of the database,
but what you want (create list and export them in anki) is already possible, create a list in the "list" section, and while browsing tatoeba each sentences has an icon "add" too list, and when you want to export it, you go in the list section, you click on your list, and then on the big green download button
moreover if you check the anki plugin list, there's a tatoeba's plugin for japanese learner (made by one of our user)
hope this help
[not needed - removed by CK]
I already do ;), keep up the great work!
in the long long todo list, it's planned to have the "all sentences of user X" to be displayed like list, with possibility to directly translate etc.
Just noticed that on the Contribute page, it says: reporting mitakes.
(if you're not logged in.)
Thank you, it's corrected ^^
@Trang / Sysko
Could you make the list
http://tatoeba.org/eng/sentences_lists/show/26
public so I can remove sentences when I'm finished with them?
Trang will certainly make a post on the blog, but I tell it here now
now it's possible to filter to view only sentences with audio
http://tatoeba.org/eng/sentence...nly-with-audio
and now those who wants to record by their own, it will be possible to submit us your recording (we will explain this more in detail soon, but more or less it will be the following
* you tell us you want to record audio
* we give you a list to use with the software "swac-recorder"
* you give us back the zip with all the audio
* I will upload them in the server
(Yeah I know, in the future we will improve this a lot, but this is a first step as ever) )
it's also possible now to send us list of sentences, (for massive adding) trough our email address
we accept the following format
text
and
text[tab]translation
and "show all in" is by default "reverse sorted" so you will see last sentences added first
I've also fix some problem in Japanese romanization ( ? which were transcribed as ?[?] etc.)
except bug fixing, and little request, this should be my last "normal" update from me before a long time, from a "coding" point of view, I will focus on an other project, and on a new version of tatoeba.
It's not a big issue, and I don't know if it is easy to solve, but can an admin disable the transcription of punctuation marks?
これ 何[なに] ?[?]
=>
これ 何[なに] ?
The transcriptions get unnecessarily longer.
done
? and ! seem to be fine, but the "thought fullstops" "..." are annyoing, e.g. nº75468.
There are a number of problems with the transcription system, but I think everybody is a little overworked right now.
I've just commit something, it should fix (only) the problem with ? and !, it will be available this weekend
I can't find sentences with typing keywords.
Yes, sorry, that's because the server was down yesterday, and we need to re-launch indexation of the sentences.
It won't be available for another 6 hours at least as I have a train to catch right now...
It should be reworking now (I'm back from holidays)
Translating sentences is not working? :(
Huh, my bad... just a geek using the latest Opera on Linux.
[not needed - removed by CK]
Thanks! Could come in very useful.
Note how all the "By-foo" tags clump together. ;-)