Muro- Tatoeba

Muro (7.277 hilos de mensajes)

Consejos

Antes de preguntar, asegúrese de leer FAQ.

Nuestro objetivo es mantener una atmósfera saludable para las discusiones civilizadas. Por favor, lea nuestras reglas contra el mal comportamiento.

Últimos mensajes

subdirectory_arrow_right

sharptoothed

hace 2 días

subdirectory_arrow_right

small_snow

hace 2 días

subdirectory_arrow_right

frpzzd

hace 3 días

subdirectory_arrow_right

LeviHighway

hace 4 días

subdirectory_arrow_right

frpzzd

hace 4 días

feedback

sharptoothed

hace 4 días

subdirectory_arrow_right

LeviHighway

hace 4 días

subdirectory_arrow_right

lingomaxim

hace 5 días

subdirectory_arrow_right

frpzzd

hace 5 días

feedback

LeviHighway

hace 5 días

JimBreen 2 de junio de 2010 2 de junio de 2010, 3:26:09 UTC

flag

Report

link

Enlace permanente

The WWWJDIC interface to Tatoeba now has links to both the Japanese & English sentences (previously it was just to the English half of a pair.)

ocultar respuestas mostrar respuestas

blay_paul 2 de junio de 2010 2 de junio de 2010, 9:51:49 UTC

flag

Report

link

Enlace permanente

Thanks - I'm sure that will come in handy for me.

brauliobezerra 1 de junio de 2010 1 de junio de 2010, 17:13:01 UTC

flag

Report

link

Enlace permanente

I have a game for you: fill the blanks!

http://dl.dropbox.com/u/6060033...raduzidas.html

Warning: BIIIG HTML table.
Warning2: it's not updated as you add the sentences...

ocultar respuestas mostrar respuestas

TRANG 1 de junio de 2010 1 de junio de 2010, 18:34:25 UTC

flag

Report

link

Enlace permanente

Awesome!!! :D Thanks for making this page!

Can I request something? =]

I'd love to also have the sentences themselves, not just the id's. This way you can see right away what are the most translated sentences in Tatoeba. You can also see which sentences you may feel like translating first.

ocultar respuestas mostrar respuestas

brauliobezerra 2 de junio de 2010 2 de junio de 2010, 3:17:38 UTC

flag

Report

link

Enlace permanente

Originally I put the sentences, but some took too much space. But I could cut them at some point to solve this problem.

Can I request something also?
Is there any possibility to host this script (and possibly others) on Tatoeba's server? This way they could be more up to date and thus useful. I can change them to PHP if needed (I'm using Ruby).

ocultar respuestas mostrar respuestas

TRANG 2 de junio de 2010 2 de junio de 2010, 20:23:32 UTC

flag

Report

link

Enlace permanente

Yes, there is always a possibility :)

But I can't tell how soon your scripts can be integrated... As far as I'm concerned, I know I will not have time until at least another two weeks =/

Demetrius 1 de junio de 2010 1 de junio de 2010, 18:54:08 UTC

flag

Report

link

Enlace permanente

There is no sentence with id 242883.

ocultar respuestas mostrar respuestas

sysko 1 de junio de 2010 1 de junio de 2010, 19:07:19 UTC

flag

Report

link

Enlace permanente

maybe because the sentence has been merged between the last export (which has used our braulio) and now :)

brauliobezerra 1 de junio de 2010 1 de junio de 2010, 19:12:36 UTC

flag

Report

link

Enlace permanente

Since this sentence has some historic, I guess it was deleted (for being a duplicate, for example) since I downloaded the list last Saturday.

Demetrius 1 de junio de 2010 1 de junio de 2010, 19:08:03 UTC

flag

Report

link

Enlace permanente

Something is wrong here.
E.g. 51700 already has a Belarusian translation.

ocultar respuestas mostrar respuestas

sysko 1 de junio de 2010 1 de junio de 2010, 19:09:22 UTC

flag

Report

link

Enlace permanente

same reason I think (change between the export and now)

blay_paul 1 de junio de 2010 1 de junio de 2010, 7:34:42 UTC

flag

Report

link

Enlace permanente

wwwjdic format export.

Hi Trang, and Sysko, would it be possible to do a list export on the fly with the same fields as WWWJDIC.CSV ?

I've been annoying Jim with partially indexed records because it's more convenient for me to do them after the weekly download update. If I could export the 'INDEXING UNDERWAY' records a couple of days _before_ the weekly update they'd be done by the time Jim starts doing his stuff.

ocultar respuestas mostrar respuestas

TRANG 1 de junio de 2010 1 de junio de 2010, 9:48:14 UTC

flag

Report

link

Enlace permanente

Yes, it's possible. I will take the time in my very busy schedule for this :P

By the way, if one week is to short, you could perhaps tell Jim to only download and process the file once every other week, or once a month?

Just because there's a weekly export doesn't mean you, Jim and whoever else, have to work on this basis. Having for instance two weeks would allow you to use the "halfway" export to check if there's any kind of error, and correct them before the "final" export.

And it should give you less pressure on getting your things done :)

ocultar respuestas mostrar respuestas

JimBreen 1 de junio de 2010 1 de junio de 2010, 9:56:03 UTC

flag

Report

link

Enlace permanente

I could certainly download and install in WWWJDIC less frequently (after all, I went a year without an update). Each week I spend quite a lot of time fixing things that I find messed up; usually changes in the Japanese without matching changes in the indices.
I wouldn't want it to build up too long as this checking process could become quite large.

Maybe the best thing would be for Paul to tell me before a Saturday if he has unfinished indices, and I will skip that week's download.

ocultar respuestas mostrar respuestas

blay_paul 1 de junio de 2010 1 de junio de 2010, 10:08:30 UTC

flag

Report

link

Enlace permanente

Every other week is one possible approach. However I wouldn't really want to go more than a week between updates as too much builds up to check / fix.

Are the exports an automated process? Would it be annoying / awkward to do it twice a week?

I think the main problem is that me, and Jim, both end up fixing much of the same things at the same time (duplicating effort). If you had one export on a Thursday and one on a Saturday I would be able to get my checks done _before_ Jim gets it.

Or maybe you could do it once every 5 days and I could take
5th, 15th, and 25th
while Jim updates on
10th, 20th and 30th

ocultar respuestas mostrar respuestas

TRANG 1 de junio de 2010 1 de junio de 2010, 18:59:02 UTC

flag

Report

link

Enlace permanente

It's automated. I can set it up for twice a week. Let's say Thursday 9AM?

ocultar respuestas mostrar respuestas

blay_paul 1 de junio de 2010 1 de junio de 2010, 19:19:59 UTC

flag

Report

link

Enlace permanente

That sounds good to me. I'll download on Thursdays and Jim can download on Saturdays.

I'll only need wwwjdic.csv on Thursday, by the way.

ocultar respuestas mostrar respuestas

JimBreen 2 de junio de 2010 2 de junio de 2010, 3:23:16 UTC

flag

Report

link

Enlace permanente

It's Sunday for me (Saturday in backward countries like the UK and France...)

TRANG 2 de junio de 2010 2 de junio de 2010, 20:38:38 UTC

flag

Report

link

Enlace permanente

Okay, I set it up so that the wwwjdic.csv file gets updated also on Thursdays, at 9AM (France time).

JimBreen 1 de junio de 2010 1 de junio de 2010, 3:17:59 UTC

flag

Report

link

Enlace permanente

Sentence 155946 has two sets of indices, which point to different English sentences. Odd.

ocultar respuestas mostrar respuestas

JimBreen 1 de junio de 2010 1 de junio de 2010, 4:17:38 UTC

flag

Report

link

Enlace permanente

164243 is the same. Is this because duplicates are being deleted?

ocultar respuestas mostrar respuestas

blay_paul 1 de junio de 2010 1 de junio de 2010, 6:49:03 UTC

flag

Report

link

Enlace permanente

Yes, if they're ones I set up to be deleted I try to delete one of them in advance, but one or two slip out.

CK 30 de mayo de 2010, modificado 25 de octubre de 2019 30 de mayo de 2010, 3:35:41 UTC, modificado 25 de octubre de 2019, 8:10:43 UTC

flag

Report

link

Enlace permanente

[not needed anymore- removed by CK]

ocultar respuestas mostrar respuestas

blay_paul 30 de mayo de 2010 30 de mayo de 2010, 5:00:12 UTC

flag

Report

link

Enlace permanente

1324 is bad. I missed that one.

It probably should be one of

That's /my/ line!
That's MY line!

(as there is a valid need of represented emphasis in that sentence).

I don't think 73507 and 73508 are likely to cause any trouble, but they are in violation of the 'no annotations guideline' and aren't even in the (grandfathered in) wwwjdic meta information format.

CK 30 de mayo de 2010, modificado 25 de octubre de 2019 30 de mayo de 2010, 4:15:04 UTC, modificado 25 de octubre de 2019, 8:10:36 UTC

flag

Report

link

Enlace permanente

[not needed anymore- removed by CK]

ocultar respuestas mostrar respuestas

blay_paul 30 de mayo de 2010 30 de mayo de 2010, 4:54:18 UTC

flag

Report

link

Enlace permanente

1) Separate handling for Meta information in Tatoeba sentences is already in the todo list.

2) [M] and [F] tags were removed from the English sentences and applied to the Japanese in the last update done to the Tanaka Corpus before control was turned over to Tatoeba. Unfortunately recent events suggest the last update didn't make it to Tatoeba. This hasn't been fixed because of 1)

3) When the meta information system is redone it is planned to re-evaluate the basis of the [M] and [F] tags. 僕 alone will almost certainly not be worth an [M] tag (due to developments in modern Japanese). Although I would disagree that beginners of Japanese necessarily know about 'boku' and 'kimi' being used in masculine speech.

blay_paul 29 de mayo de 2010 29 de mayo de 2010, 21:02:51 UTC

flag

Report

link

Enlace permanente

Spurious line feeds not removed from Index data input.

See the Index data for sentence 101622.

ocultar respuestas mostrar respuestas

blay_paul 29 de mayo de 2010 29 de mayo de 2010, 21:03:54 UTC

flag

Report

link

Enlace permanente

Also, could records with a 'meaning' field of -1 be excluded from the wwwjdic.csv file?

ocultar respuestas mostrar respuestas

TRANG 29 de mayo de 2010 29 de mayo de 2010, 23:29:53 UTC

flag

Report

link

Enlace permanente

Yes.

ocultar respuestas mostrar respuestas

JimBreen 1 de junio de 2010 1 de junio de 2010, 1:44:01 UTC

flag

Report

link

Enlace permanente

Yes, there were a heap of them in the last wwwjdic.csv. There were also two blank lines. The first time that has happened. (Spurious line feeds?)

ocultar respuestas mostrar respuestas

JimBreen 1 de junio de 2010 1 de junio de 2010, 1:52:02 UTC

flag

Report

link

Enlace permanente

Also 315382 came through with "\N" as the Japanese and English. (Just an index...)

ocultar respuestas mostrar respuestas

JimBreen 1 de junio de 2010 1 de junio de 2010, 3:00:27 UTC

flag

Report

link

Enlace permanente

Actually, there were about 20 with \N for the English, Japanese or both.

ocultar respuestas mostrar respuestas

blay_paul 1 de junio de 2010 1 de junio de 2010, 6:51:47 UTC

flag

Report

link

Enlace permanente

I think they related to manual deletions or something. I think they were fixed in Tatoeba shortly after the index data download was updated.

TRANG 29 de mayo de 2010 29 de mayo de 2010, 23:29:42 UTC

flag

Report

link

Enlace permanente

Indeed... I'll trim the input before it gets saved.

Other than that there was another index with an extra new line. I corrected both.

blay_paul 30 de mayo de 2010 30 de mayo de 2010, 0:21:42 UTC

flag

Report

link

Enlace permanente

Great - thanks for both of those. They'll make my life easier, once a week. ;-)

blay_paul 29 de mayo de 2010 29 de mayo de 2010, 23:06:51 UTC

flag

Report

link

Enlace permanente

Could we have a duplicate removal script run soon to, please?

ocultar respuestas mostrar respuestas

TRANG 29 de mayo de 2010 29 de mayo de 2010, 23:47:33 UTC

flag

Report

link

Enlace permanente

Okay, it's done.

By the way, is there any reason why you add these "For duplicate removal script" comments?

When the sentences are merged, all the comments of the deleted sentence are moved to the remaining sentence...

If you are posting these comments to keep track, it is best to also indicate the id's of the sentences that have to be merged, not just that it has to be deleted ^^

ocultar respuestas mostrar respuestas

blay_paul 30 de mayo de 2010 30 de mayo de 2010, 0:21:04 UTC

flag

Report

link

Enlace permanente

> By the way, is there any reason why you add these
> "For duplicate removal script" comments?

Only so that Jim (and other users) can see what I have changed on the basis that it is a 'near duplicate' before the merge happens. Basically it's so people have a chance to complain.

I delete the comments post merge when I come across them (which is quite often, for technical reasons).

saeb 29 de mayo de 2010 29 de mayo de 2010, 17:22:10 UTC

flag

Report

link

Enlace permanente

I thought you guys checked your facebook group (ok I'll shutup :P). well, 2 issues I wanna raise (I know...others already brought it up...he he plaigiarism):
http://bit.ly/cO4t8E
http://bit.ly/cg3rXJ

ocultar respuestas mostrar respuestas

sysko 29 de mayo de 2010 29 de mayo de 2010, 17:35:02 UTC

flag

Report

link

Enlace permanente

I proposed a long time ago to implement the possibility in sentence comments to write something like @saeb, and it will warn you (trough a private message, or a dedicated section) this way asking for someone helps on a sentence will be easier / or to involve someone when chatting about "how to correct this sentence" ? (by the way nice picts :p)

ocultar respuestas mostrar respuestas

Demetrius 31 de mayo de 2010 31 de mayo de 2010, 14:31:23 UTC

flag

Report

link

Enlace permanente

BTW personal messages don't attract much attention. Is it possible to change design somehow when you have unread ones?

ocultar respuestas mostrar respuestas

JimBreen 1 de junio de 2010 1 de junio de 2010, 1:27:28 UTC

flag

Report

link

Enlace permanente

I agree. An email alert that there are messages would be good. And
of course one for "@JimBreen" in a comment too.

ocultar respuestas mostrar respuestas

sysko 1 de junio de 2010 1 de junio de 2010, 1:44:41 UTC

flag

Report

link

Enlace permanente

it's planned for this release (I will try to do both), for other users who fear "tatoeba spam" you have an option in your profile do desactivate email (though we need to make it more precise to be able to desactivate email notification for each kind, PM, comments, etc.)

this way, as Pharamp would like this (the others tell us what you think) she would like to warned when a translation is added to one of the sentences she likes, so maybe when precise filtering will be possible, have the possible to activate email sending when someone translate a favorite sentences ? (that will give a reason for the existence of favorites ^^)

saeb 29 de mayo de 2010 29 de mayo de 2010, 22:04:08 UTC

flag

Report

link

Enlace permanente

Tribute to sysko:
http://bit.ly/aJ13uT

ocultar respuestas mostrar respuestas

sysko 29 de mayo de 2010 29 de mayo de 2010, 22:11:40 UTC

flag

Report

link

Enlace permanente

yeah

saeb 29 de mayo de 2010 29 de mayo de 2010, 18:06:23 UTC

flag

Report

link

Enlace permanente

glad to know you have plans :)

saeb 29 de mayo de 2010 29 de mayo de 2010, 16:51:04 UTC

flag

Report

link

Enlace permanente

tribute to Trang:
http://bit.ly/9rtoRS

ocultar respuestas mostrar respuestas

saeb 29 de mayo de 2010 29 de mayo de 2010, 16:52:32 UTC

flag

Report

link

Enlace permanente

still need to think of one for sysko...and baptiste :P

Muro (7.277 hilos de mensajes)

Consejos

sharptoothed

small_snow

frpzzd

LeviHighway

frpzzd

sharptoothed

LeviHighway

lingomaxim

frpzzd

LeviHighway

¿Necesita ayuda?

Desarrolladores

Acerca del sitio