menu
Tatoeba
language
Register Log in
language English
menu
Tatoeba

chevron_right Register

chevron_right Log in

Browse

chevron_right Show random sentence

chevron_right Browse by language

chevron_right Browse by list

chevron_right Browse by tag

chevron_right Browse audio

Community

chevron_right Wall

chevron_right List of all members

chevron_right Languages of members

chevron_right Native speakers

search
clear
swap_horiz
search
TRANG TRANG October 29, 2017 October 29, 2017 at 8:18:10 PM UTC link Permalink

**Tatoeba & Mozilla Common Voice**

A couple of weeks ago I mentioned that we've been contacted by Mozilla to explore ways we could partner up with their project Common Voice.
https://tatoeba.org/eng/wall/sh...#message_28540

I'd like to carry on the discussion, especially on two topics.


1) The license.

CK quickly pointed out that Common Voice is using CC-0, and license was actually one of the first things we discussed about with Common Voice. They are very well aware it's a topic we need to solve before we can establish any further collaboration.

CC-0 is a pretty strict requirement on their side and even if they'll definitely be happy to give attribution to Tatoeba, at the end of the day, they still want to be able to publish their data under CC-0. They cannot do this when using our data, unless Tatoeba's data would be CC-0 too.
On our side, we obviously cannot just migrate our content from CC-BY to CC-0 out of the blue...

So our projects are currently incompatible, but the way I see it is that Tatoeba will need, sooner or later, to upgrade Tatoeba to allow people to contribute under other licenses than CC-BY. Similarly to what we've done with audio.
We would then have part of our content under CC-0, and this part can be used by Common Voice. We would also be able to handle sentences that are under more restrictive licenses, that we currently cannot accept in Tatoeba (CC-BY-SA, CC-NC, etc).

That's one possibility, but again, nothing's decided yet. If anyone has another point of view on this, feel free to share.

I'd be interested to know as well how you feel about CC-0. Is is a license you'd consider using for your contributions?


2) What if one day Tatoeba stopped collecting audio?

Just to be clear, I don't mean removing audio from Tatoeba. Listening to the pronounciation of a sentence is an invaluable feature, and there's no reason to remove it. But what if we're delegating the responsiblity of collecting audio to another project, namely Common Voice?

This is of course most relevant for those of you who have contributed audio, or would like to contribute audio someday. How would you feel about contributing audio for another project? Would you be happy about it? Would you be bothered by it? Do you have any concerns about it?

If you feel like discussing this topic more in private, feel free to send me a private message instead of replying on the Wall.