menu
Tatoeba
language
Register Log in
language English
menu
Tatoeba

chevron_right Register

chevron_right Log in

Browse

chevron_right Show random sentence

chevron_right Browse by language

chevron_right Browse by list

chevron_right Browse by tag

chevron_right Browse audio

Community

chevron_right Wall

chevron_right List of all members

chevron_right Languages of members

chevron_right Native speakers

search
clear
swap_horiz
search

Sentence #2481838

info_outline Metadata
warning
Your sentence was not added because the following already exists.
Sentence #{{vm.sentence.id}} — belongs to {{vm.sentence.user.username}} Sentence #{{vm.sentence.id}}
{{vm.sentence.furigana.info_message}} {{vm.sentence.text}}
star This sentence belongs to a native speaker.
warning This sentence is not reliable.
content_copy Copy sentence info Go to sentence page
subdirectory_arrow_right
warning
{{transcription.info_message}}
Translations
Unlink this translation link Make into direct translation chevron_right
{{translation.furigana.info_message}} {{translation.text}} Existing sentence #{{::translation.id}} has been added as a translation.
edit Edit this translation
warning This sentence is not reliable.
content_copy Copy sentence info Go to sentence page
subdirectory_arrow_right
warning
{{transcription.info_message}}
Translations of translations
Unlink this translation link Make into direct translation chevron_right
{{translation.furigana.info_message}} {{translation.text}} Existing sentence #{{::translation.id}} has been added as a translation.
edit Edit this translation
warning This sentence is not reliable.
content_copy Copy sentence info Go to sentence page
subdirectory_arrow_right
warning
{{transcription.info_message}}
{{vm.expandableIcon}} {{vm.sentence.expandLabel}} Fewer translations

Comments

alvations alvations June 6, 2013 June 6, 2013 at 1:51:29 PM UTC link Permalink

@CK, Thanks for the note on the @possible copyrights violation. However, it isn't a copyrights violation because delph-in resources are open source. http://www.delph-in.net/

The purpose of me putting up the sentences in tatoeba is to extend the number of language pairs for these test suite sentences so as to quicken the grammarian's task of writing computational grammar.

This is my userpage in the delph-in group http://moin.delph-in.net/LilingTan.

arademaker arademaker May 8, 2020 May 8, 2020 at 9:47:09 PM UTC link Permalink

Yes, I agree this sentence is from the opensource project. I am also part of the initiative, http://moin.delph-in.net/AlexandreRademaker. How can unlock the sentence?

arademaker arademaker May 31, 2020 May 31, 2020 at 8:53:34 PM UTC link Permalink

What does it mean the red mark `this sentence is not reliable`?

Yorwba Yorwba June 1, 2020 June 1, 2020 at 1:54:12 PM UTC link Permalink

It means that there's an issue with the sentence and it should not be included in the exported data. In this case, the issue is that the sentence was apparently copied from somewhere else, so it might be a copyright violation.

alvations, who copied it, and you, arademaker, argue that it isn't a copyright violation because it was copied from an open-source project. However, "open source" doesn't mean "you can do with it whatever you want". There are many different open-source licenses, and they are not always compatible with each other, because one license may forbid something that another license explicitly allows.

In this case, the sentence was apparently copied from the [incr tsdb()] test data, which according to http://www.delph-in.net/itsdb/ is licensed under the following terms: "The software (in source code) and data are made available to the general public, free of royalties, for academic or other non-commercial use, including deployment in corporate environments." This seems to imply that the original authors do not wish for their data to be used commercially.

On the other hand, Tatoeba data is licensed under the CC BY 2.0 FR license, which does allow commercial usage. If we were include it in the exported data, someone might use it e.g. in a commercial dictionary, going against the wishes of the [incr tsdb()] authors. That's why this sentence is marked as "not reliable".

If you want to have this sentence included, you'd need to show that it is not a copyright violation. Either by showing that the data in [incr tsdb()] was collected from somewhere else and the [incr tdsb()] license terms against commercial use don't apply, or by convincing the copyright owner(s) of [incr tsdb()] that they should relicense their data under a license compatible with CC BY 2.0 FR.

arademaker arademaker June 1, 2020 June 1, 2020 at 6:44:11 PM UTC link Permalink

No itsdb is a tool, its license is not applicable to the data. This dataset came from https://w3.ual.es/~nperdu/hpsuite.htm and later was incorporated as a simple test suite for MRS semantic representation documented http://moin.delph-in.net/MatrixMrsTestSuite.

In https://delphinqa.ling.washingt...t-suite/484/2, another user from the DELPHI-IN community argues that its license is probably MIT.


fcbond fcbond June 2, 2020 June 2, 2020 at 2:25:25 AM UTC link Permalink

Hi,

The source of this sentence is the English Resource Grammar <http://moin.delph-in.net/ErgTop>, specifically the MRS test suite
<http://svn.delph-in.net/erg/tru...tons/mrs/item>

The licence is MIT:
http://svn.delph-in.net/erg/trunk/LICENSE

Other languages are available here:
http://moin.delph-in.net/MatrixMrsTestSuite

MIT is compatible with CC BY, so no problems!

itsdb is a tool used for handling linguistic testsuites, and is licensed under the lgpl, but this is not relevant to the actual data.

I hope this clears things up, please let me know if you need any more information.

Yorwba Yorwba June 2, 2020 June 2, 2020 at 6:53:58 PM UTC link Permalink

I had a hard time finding a license file for the data on the DELPH-IN website. Thank you for providing the link. Since the MIT license requires the license statement to be retained, it should be added as a comment for all sentences from the MRS test suite:

***

Permission is hereby granted, free of charge, to any
person obtaining a copy of this software and associated
documentation files (the "Software"), to deal in the
Software without restriction, including without
limitation the rights to use, copy, modify, merge,
publish, distribute, sublicense, and/or sell copies of
the Software, and to permit persons to whom the Software
is furnished to do so, subject to the following
conditions:

The above copyright notice and this permission notice
shall be included in all copies or substantial portions
of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF
ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED
TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A
PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT
SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY
CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION
OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR
IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER
DEALINGS IN THE SOFTWARE.

***

Given that the attribution requirements are satisfied, I think this sentence and others from the MRS test suite can be added to Tatoeba without causing a legal issue. But let's see what @TRANG thinks.

arademaker arademaker June 2, 2020 June 2, 2020 at 7:34:13 PM UTC link Permalink

Is there any easier way to associate comments to a set of sentences? Copy/paste the MIT license in all sentences can be very tedious.

TRANG TRANG June 3, 2020 June 3, 2020 at 5:02:01 AM UTC link Permalink

Yes, I think it's no problem to keep them.

As for how to deal with the attribution:

- If the sentences are gathered under the same icon, the attribution could be added in the profile description.
- The sentences could be tagged "License: MIT".
- The sentences could be added to a list. The list title could be used to mention essential information (source and license).

Maybe there are other ways to deal with it. Personally, I'm fine with any solution that would allow a script/bot to do whatever is needed if one day someone isn't happy about how the license terms aren't being strictly respected.

It's understandable that manually copy-pasting license statements as comments on every sentence isn't practical (and a bit spammy too if there's a lot of sentences).

arademaker arademaker June 3, 2020, edited June 3, 2020 June 3, 2020 at 11:58:54 AM UTC, edited June 3, 2020 at 12:26:14 PM UTC link Permalink

Hi @TRANG, I have created a list

https://tatoeba.org/eng/sentenc...ts/show/166576

I didn't know how to make `sentences are gathered under the same icon`. It would be nice to be able to add comments to lists, so the text of the license and more information about the list could be added.

mramosch mramosch June 3, 2020 June 3, 2020 at 12:12:01 PM UTC link Permalink

Should be @TRANG... ;-)

TRANG TRANG June 3, 2020 June 3, 2020 at 1:48:11 PM UTC link Permalink

> https://tatoeba.org/eng/sentenc...ts/show/166576

I will remove the red warnings from the sentences in that list.

> I didn't know how to make `sentences are gathered under the same icon`

Sorry I meant to say "under the same account", not same icon :)

For instance if all the English sentences created under the account alvations were from http://moin.delph-in.net/ErgTop%3E, then the source and license statement could be written in the profile description (https://tatoeba.org/eng/user/profile/alvations).

> It would be nice to be able to add comments to lists, so the text of the license
> and more information about the list could be added.

Yes, definitely. I'm quite sure this has been requested before.

Metadata

close

Lists

Sentence text

License: CC BY 2.0 FR

Logs

This sentence is original and was not derived from translation.

Abrams barked.

added by alvations, June 6, 2013

linked by alvations, June 6, 2013

linked by alvations, June 6, 2013

linked by alvations, June 6, 2013

linked by alvations, June 6, 2013

linked by purple_ice, July 2, 2023