[OSM-legal-talk] ODbL: Where do we stand regarding collective/derivative databases
Frederik Ramm
frederik at remote.org
Mon Jul 27 20:51:47 BST 2009
Hi,
generally good progress on ODbL; many things have been cleared up
and we will soon be at a point where the proposal for a license change
is not some cloudy abstract thing any longer but a very concrete
proposal that people can evaluate.
After the LWG has made an effort to resolve the questions about what is
substantial and what is a derived work, in my eyes there's one big issue
that remains, and that is "what is a derivative database".
To recap, my understanding is that if I produce (+publish) works based
on a derivative database that I have created then I have to make that
database available, fully, under ODbL. If I, on the other hand, produce
works based on a collective database that is half ODbL and half
proprietary, then I only have to make the ODbL part available. Is that
everyone else's reading as well?
Let us look at someone who mixes OpenStreetMap and Navteq data. Say I
produce map tiles (clearly a produced work, no?) where all the streets
come from Navteq, but all the footways come from OpenStreetMap. There
are a number of ways to do this, all leading to the exact same result,
and nobody from the outside can see which of 1,2,3 I am using:
1. Configure my Mapnik tile generator so that it accesses two different
postgis databases - one containing Navteq and one containing OSM - to
produce merged map tiles.
2. Pour OSM and Navteq data into the same postgis instance but have
different tables (e.g. planet_osm_roads and navteq_roads) which are
joined by Mapnik's SELECT statement.
3. Extract all footway geometries from OSM and insert them into my
postgis database containing Navteq street data, then run Mapnik on the
resulting database.
The way I read the license, option 1 would be definitely ok, option 3
would definitely lead to my having to release the Navteq data, and
option 2 would be somewhere in between (probably ok until unknown to me,
Matt comes along and makes Mapnik internally create temporary tables on
the fly for better performance in which case I'd be creating temporary
derivative databases without even noticing...)
Evil business genius that I am, I would of course claim to be doing 1
even when doing 3 and nobody would have the right to challenge me,
right? Which would ultimately mean that:
"If there is any conceivable way that a produced work could have been
created by using a collective rather than a derivative database, then
only the ODbL licensed part of the data source has to be released."
This is becoming interesting, we're very much into real-world business
scenarios now. There are lots of people who'd shy away from using OSM
outright but if they could use a Navteq basemap and sprinkle that with
any additional detail that OSM might have that would be just great for
them.
Let us look at someone who has a Navteq and an OSM data base, and runs a
comparing analysis which results in *removing* all features from the OSM
database which were also in Navteq. He clearly creates a derivative
database but one which has no data added, just data deleted. He now
employs technique #1 from above to merge the Navteq data set and the
reduced OSM data set into one that contains the "best of both worlds".
Since he is clearly operating on a collective database, he only has to
release the derived OSM database under ODbL - the value of which is
almost zero to the community since it has no data added (the only thing
you can do with it is find out which of OSM's features are present in
Navteq as well).
Is everything I write here correct and compatible with what others are
thinking? Is there some lawyer opinion on cases like this documented
somewhere in the vast depths of our Wiki and LWG minutes?
(I'm just trying to determine what exactly ODbL mandates - not trying to
find out what would be desirable in an ideal world.)
Bye
Frederik
--
Frederik Ramm ## eMail frederik at remote.org ## N49°00'09" E008°23'33"
More information about the legal-talk
mailing list