Veggskilaboð frá sharptoothed

sharptoothed {{ icon }}

keyboard_arrow_right

Notandasíða

keyboard_arrow_right

Setningar

keyboard_arrow_right

Orðaforði

keyboard_arrow_right

Dómar

keyboard_arrow_right

Listar

keyboard_arrow_right

Eftirlæti

keyboard_arrow_right

Ummæli

keyboard_arrow_right

Ummæli á setningum frá sharptoothed

keyboard_arrow_right

Veggskilaboð

keyboard_arrow_right

Saga

keyboard_arrow_right

Upptökur

keyboard_arrow_right

Umritunir

translate

Þýða setningar frá sharptoothed

sharptoothed 25. september 2015 25. september 2015 kl. 12:51:50 UTC

link

Tengill

> transcription and stress marks - are extra information which is not trivial to machine-generate

Indeed, and this is true for Japanese furigana as well. So, it would be ideal if we could edit this data just like we edit sentences.

sharptoothed 25. september 2015 25. september 2015 kl. 08:24:40 UTC

link

Tengill

> I think it shouldn't be too nice to language learners

I think, we should distinguish two types of learners: those who use Tatoeba to learn languages and to improve or extend their knowledge but avoid contributing translations they are not certain in (or avoid contributing translations at all), and those who confuse Tatoeba with sites like Lang-8 and similar or consider Tatoeba as a good place to raise their self-esteem, etc. Tatoeba can't be too friendly for the former, I believe. The more tools and additional information on sentences Tatoeba provides, the better. As for the latter, I don't really think that lack of those tools and info would stop much of them. They still have Google Translate and stuff, after all.

By the way, I'm pretty sure that all Russian language learners here, no matter how proficient they are, would be happy to have, for example, a phonetic transcription for every Russian sentence or at least to see stress marks placed on every word. :-)

sharptoothed 14. september 2015 — breytt 14. september 2015 14. september 2015 kl. 11:40:03 UTC — breytt 14. september 2015 kl. 11:55:36 UTC

link

Tengill

5.
I don't think we need different kinds of "unsure". The fact that someone feels unsure about some sentence is negative enough by itself, I think, no matter what makes him feel so. And I don't think we should make "unsure" marks anonymous. The author deserves to know who marked his sentence that way and has a right to request explanations, I believe.

6.
Ooneykcall's rating system, as I see it, effectively introduces "OK", "not OK" and 3 levels of "unsure". As I mentioned above, I'm not really sure if it worth having several "unsure" marks.

7.
The idea of distinguishing between sentences I'd use myself and those I wouldn't seems highly subjective to me. I think it often has nothing to do with the real quality/value of a sentence but rather refers to personal preferences. That is, it maybe helpful for personal use but it as well maybe useless in the project-wide scope.

8.
I think we shouldn't take translation quality into consideration. Bad translations can be good sentences and vice versa.

9.
I like the idea of having ""I've read this, but I don't want to rate it" mark. Indeed, there are sentences I don't want to deal with. On the other hand there are also sentences that, in my opinion, don't make up good examples that are worth being kept in Tatoeba.

10.
I like this idea, too. I think I could use it to mark foreign sentences I'd like to translate but have doubts if they are good enough.

sharptoothed 6. september 2015 — breytt 14. september 2015 6. september 2015 kl. 13:11:12 UTC — breytt 14. september 2015 kl. 07:31:43 UTC

link

Tengill

1. What kind of sentences do you mark as "unsure"?

I mark as "unsure" the sentences that seem awkward, illogical or somewhat unnatural to me as well as the sentences with questionable punctuation.

2. Have you ever thought about rating a sentence and then decided not to? If so, why?

Yes, it happens sometimes. Mostly when I want to mark a sentence as "not OK" but then realize that I'm ether not so sure about that or don't have enough arguments to mark it that way. I usually mark such sentences as "unsure" or just skip them.

3. How do you rate sentences that you know are correct but which you wouldn't use yourself?

In most cases I skip them and sometimes I use "unsure" mark. I think it's better not marking a sentence at all than giving it wrong or biased mark.

4. In your real life, do you have any kind of collection that include things that you think are bad?

No, I don't have such kind of collection. I don't think they are of any use for me. Bad examples are only good when corresponding good examples exist, I believe.

sharptoothed 29. ágúst 2015 — breytt 29. ágúst 2015 29. ágúst 2015 kl. 18:39:39 UTC — breytt 29. ágúst 2015 kl. 19:56:08 UTC

link

Tengill

Thanks, Trang! Everything seems to work just fine.

sharptoothed 29. ágúst 2015 29. ágúst 2015 kl. 07:23:27 UTC

link

Tengill

Wrong sentence count displaying sentences in "Browse by language" feature when both "Not directly translated into:" and "Only sentences with audio:" filters used and, as a consequence, broken paging.

How to reproduce.
1. Open https://tatoeba.org/eng/sentenc...nly-with-audio
Japanese sentences with audio will be found and displayed. Remember the sentence count in the result (474 at the moment).
2. Press ">>" button and make sure that paging is working correctly. You will be taken to the page 48.
3. Now select 'English' in the "Not directly translated into:" drop-down menu. The page will be reloaded and you'll notice that the sentence count remained unchanged.
3. Press ">>" and see the page 48 with blank sentence list. Currently, only 10 pages of the Japanese sentences with audio not translated into English exist.

sharptoothed 28. ágúst 2015 28. ágúst 2015 kl. 07:03:31 UTC

link

Tengill

I think we should have more mercy for our admins and not make them do such a boring job. :-) Instead a robot that sends e-mails to potentially inactive members and processes responds from them could be developed.

sharptoothed 18. ágúst 2015 — breytt 18. ágúst 2015 18. ágúst 2015 kl. 07:10:37 UTC — breytt 18. ágúst 2015 kl. 07:12:27 UTC

link

Tengill

> * Contributors can now adopt sentences from deactivated accounts.

It would be nice if we could see those sentences in the "Orphan sentences" list (accessible via "Contribute >> Adopt sentences" menu item).

sharptoothed 13. ágúst 2015 13. ágúst 2015 kl. 09:09:06 UTC

link

Tengill

Horus is our deduplication bot. It finds and merges duplicate sentences in Tatoeba database.

sharptoothed 29. júlí 2015 29. júlí 2015 kl. 09:24:06 UTC

link

Tengill

Похоже на то.

sharptoothed 29. júlí 2015 29. júlí 2015 kl. 07:02:37 UTC

link

Tengill

https://tatoeba.org/rus/sentences_lists/show/992
This list rather looks like a random sentences collection to me. I think it would be better to delete it.

sharptoothed 22. júlí 2015 22. júlí 2015 kl. 15:18:54 UTC

link

Tengill

Indeed, it can solve most of "non-canonical punctuation marks" problems if we come to an agreement.

sharptoothed 22. júlí 2015 — breytt 22. júlí 2015 22. júlí 2015 kl. 13:07:49 UTC — breytt 22. júlí 2015 kl. 14:48:09 UTC

link

Tengill

** To whom it may concern **

[RUS] Список русских предложений, имеющих дубликаты или "почти дубликаты" по состоянию на 18 июля 2015 г.
[ENG] List of Russian sentences that have duplicates or near-duplicates as of July 18, 2015.

http://j-langtools.com/tatoeba/rus-dups.html

sharptoothed 21. júlí 2015 21. júlí 2015 kl. 20:52:02 UTC

link

Tengill

I suspect that somehow the word processor used to write the sentence just made an error doing character substitution.

sharptoothed 21. júlí 2015 — breytt 21. júlí 2015 21. júlí 2015 kl. 19:42:18 UTC — breytt 21. júlí 2015 kl. 21:22:03 UTC

link

Tengill

To tell the truth, I have no idea why no-break space was used in the 2nd sentence in that position. No-break space unlike regular one doesn't allow line break at its position and this is the only difference. In Russian typography it's recommended to put no-break space before the dash between two words or after the dash if it starts the line (in dialogues, for example). It seems that the sentence was originally typed in some text editor which performs automatic character substitution (it worked wrongly in our case) and then copied to Tatoeba.

> Let's have a Github issue to have Horus programmed to treat the two spaces identically.

It would be nice but what sentence Horus should prefer during merging: the one that contains special space or the one that don't? This is not a trivial question since in addition to different spaces there are different dashes, hyphens, quotation marks, etc. and different languages have different typographical rules and, ideally, Horus should be aware of all (or the most common at least) of them. The plus is that "more clever" Horus would be able to improve sentences brining them to standard typeset. But it would take time and efforts of our busy developers.

sharptoothed 21. júlí 2015 21. júlí 2015 kl. 18:47:22 UTC

link

Tengill

There are different kinds of spaces. In this particular case we can find two of them: regular space that we normally use (all spaces in #2735348) and no-break space (the space after the dash in #4340200). It's impossible to distinguish them by eye but they are completely different symbols for computers.
Of course, we can make those sentences identical manually and let Horus merge. But this won't eliminate the problem.

sharptoothed 21. júlí 2015 — breytt 21. júlí 2015 21. júlí 2015 kl. 16:16:08 UTC — breytt 21. júlí 2015 kl. 16:21:57 UTC

link

Tengill

oops! It seems it was replaced with regular white space when I copy-pasted those sentences. Thanks, CK!

sharptoothed 21. júlí 2015 — breytt 21. júlí 2015 21. júlí 2015 kl. 15:50:41 UTC — breytt 21. júlí 2015 kl. 15:58:48 UTC

link

Tengill

It seems that Horus is incapable of merging identical sentences with em-dashes. I've just found two identical sentences (I don't see any difference at least):
https://tatoeba.org/rus/sentences/show/2735348
https://tatoeba.org/rus/sentences/show/4340200

sharptoothed 20. júlí 2015 20. júlí 2015 kl. 08:57:32 UTC

link

Tengill

I see this:
https://dl.dropboxusercontent.c...87/2099363.png
Not bad at all to my slipshod taste. :-)

sharptoothed 20. júlí 2015 20. júlí 2015 kl. 08:07:59 UTC

link

Tengill

I see much better picture:
https://dl.dropboxusercontent.c...87/4355771.png
But, yes, the problem exists.

Vantar þér aðstoð?

Forritarar

Um

Veggskilaboð frá sharptoothed (samtals 670)

Vantar þér aðstoð?

Forritarar

Um