License: CC BY 2.0 FR
This sentence is original and was not derived from translation.
added by alvations, June 6, 2013 at 11:27 AM
@CK, thanks for the note on the @possible copyrights violation. However, it isn't a copyright violation, because DELPH-IN resources are open source. http://www.delph-in.net/
My purpose in adding these sentences to Tatoeba is to extend the number of language pairs for these test suite sentences, which should speed up the grammarian's task of writing computational grammars.
This is my user page in the DELPH-IN group: http://moin.delph-in.net/LilingTan
Yes, I agree this sentence is from the open-source project. I am also part of the initiative, http://moin.delph-in.net/AlexandreRademaker. How can I unlock the sentence?
What does the red mark `this sentence is not reliable` mean?
It means that there's an issue with the sentence and it should not be included in the exported data. In this case, the issue is that the sentence was apparently copied from somewhere else, so it might be a copyright violation.
alvations, who copied it, and you, arademaker, argue that it isn't a copyright violation because it was copied from an open-source project. However, "open source" doesn't mean "you can do with it whatever you want". There are many different open-source licenses, and they are not always compatible with each other, because one license may forbid something that another license explicitly allows.
In this case, the sentence was apparently copied from the [incr tsdb()] test data, which according to http://www.delph-in.net/itsdb/ is licensed under the following terms: "The software (in source code) and data are made available to the general public, free of royalties, for academic or other non-commercial use, including deployment in corporate environments." This seems to imply that the original authors do not wish for their data to be used commercially.
On the other hand, Tatoeba data is licensed under the CC BY 2.0 FR license, which does allow commercial usage. If we were to include it in the exported data, someone might use it e.g. in a commercial dictionary, going against the wishes of the [incr tsdb()] authors. That's why this sentence is marked as "not reliable".
If you want to have this sentence included, you'd need to show that it is not a copyright violation, either by showing that the data in [incr tsdb()] was collected from somewhere else and the [incr tsdb()] license terms against commercial use don't apply, or by convincing the copyright owner(s) of [incr tsdb()] that they should relicense their data under a license compatible with CC BY 2.0 FR.
No, itsdb is a tool; its license does not apply to the data. This dataset came from https://w3.ual.es/~nperdu/hpsuite.htm and was later incorporated as a simple test suite for MRS semantic representation, documented at http://moin.delph-in.net/MatrixMrsTestSuite.
In https://delphinqa.ling.washingt...t-suite/484/2, another user from the DELPH-IN community argues that its license is probably MIT.
The source of this sentence is the English Resource Grammar <http://moin.delph-in.net/ErgTop>, specifically the MRS test suite.
The licence is MIT:
Other languages are available here:
MIT is compatible with CC BY, so no problems!
itsdb is a tool used for handling linguistic test suites and is licensed under the LGPL, but this is not relevant to the actual data.
I hope this clears things up. Please let me know if you need any more information.
I had a hard time finding a license file for the data on the DELPH-IN website. Thank you for providing the link. Since the MIT license requires the license statement to be retained, it should be added as a comment for all sentences from the MRS test suite:
Permission is hereby granted, free of charge, to any
person obtaining a copy of this software and associated
documentation files (the "Software"), to deal in the
Software without restriction, including without
limitation the rights to use, copy, modify, merge,
publish, distribute, sublicense, and/or sell copies of
the Software, and to permit persons to whom the Software
is furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice
shall be included in all copies or substantial portions
of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF
ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED
TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A
PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT
SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY
CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION
OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR
IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER
DEALINGS IN THE SOFTWARE.
Given that the attribution requirements are satisfied, I think this sentence and others from the MRS test suite can be added to Tatoeba without causing a legal issue. But let's see what @TRANG thinks.
Is there an easier way to associate comments with a set of sentences? Copying and pasting the MIT license into every sentence would be very tedious.
Yes, I think it's no problem to keep them.
As for how to deal with the attribution:
- If the sentences are gathered under the same icon, the attribution could be added in the profile description.
- The sentences could be tagged "License: MIT".
- The sentences could be added to a list. The list title could be used to mention essential information (source and license).
Maybe there are other ways to deal with it. Personally, I'm fine with any solution that would allow a script/bot to do whatever is needed if one day someone isn't happy about how the license terms aren't being strictly respected.
It's understandable that manually copy-pasting license statements as comments on every sentence isn't practical (and a bit spammy too if there are a lot of sentences).
Hi @TRANG, I have created a list
I didn't know how to make `sentences are gathered under the same icon`. It would be nice to be able to add comments to lists, so the text of the license and more information about the list could be added.
Should be @TRANG... ;-)
I will remove the red warnings from the sentences in that list.
> I didn't know how to make `sentences are gathered under the same icon`
Sorry I meant to say "under the same account", not same icon :)
For instance, if all the English sentences created under the account alvations were from http://moin.delph-in.net/ErgTop, then the source and license statement could be written in the profile description (https://tatoeba.org/eng/user/profile/alvations).
> It would be nice to be able to add comments to lists, so the text of the license
> and more information about the list could be added.
Yes, definitely. I'm quite sure this has been requested before.