[OSM-talk] Attribution guideline status update

Christoph Hormann osm at imagico.de
Fri Aug 9 11:06:43 UTC 2019


I am strongly against this in the current form because it addresses none 
of the major issues about corporate attribution of OSM (or lack 
thereof).

1) It does not in any way address the problem of second rate 
attribution, i.e. someone else (usually the service provider of the map 
service or the media outlet publishing the map) being attributed more 
prominently than OSM.  The '50 percent rule' you invented:

"If OpenStreetMap data accounts for a minority (less than 50%) part of 
the visible map rendering, attribution with other sources on a separate 
page that is visible after user interaction is acceptable."

is ridiculous because 50 percent of the map area being functionally 
empty is essentially a property of most maps, in particular at large 
scales or high zoom levels.  There is no basis in the ODbL for allowing, 
in a case where attribution is required, attribution that is 
not "reasonably calculated to make any person [...] aware".  Therefore 
i would consider that rule in clear violation of the license.

And frankly it also contradicts the fundamental self-image of the mapper 
community.  As has been discussed plenty of times the way geodata is 
generated in OSM is fundamentally different from other geodata sources.  
While elsewhere people generating geodata are almost always rewarded 
for their work in other forms as well (like salary), in OSM the only 
recognition mappers receive from external data users is the attribution 
required by the license.  Putting OSM on the same level as other data 
providers like you do above is totally inappropriate.

As previously said my suggestion for regulating this is:

"If anyone else is attributed in the context of a work based on OSM data 
(like other data providers, designers, service providers or publicists) 
the OpenStreetMap attribution needs to be at least on the same level of 
prominence and visibility as those."

2) Also beyond that you formulate more exceptions than actual 
requirements and where you formulate requirements they are put in 
obviously weaselly terms or are tightly limited to very specific 
situations:

* "you may omit the word "contributors" if space is limited" - since 
space is always limited obviously this is a bogus requirement with no 
practical effect.  So you essentially say "© OpenStreetMap" is always 
sufficient.

A suitable rule would be:

"if space is so limited that printing '© OpenStreetMap contributors' at 
a legible text size would take an unreasonable amount of space you can 
shorten this to '© OpenStreetMap'"

* "Except for small images, attribution must be visible [...]" - being 
vague here while being precise with the 480 pixel in case of mobile 
applications is remarkable.  But even more remarkable is that there is 
no attribution requirement given for these "small images" - which can 
be interpreted as if no attribution is required for small images at 
all!

* Naturally the section on "Geocoding - Search" should apply generically 
to any non-visual interactive application using OSM data.  Limiting 
these requirements strictly to geocoding is questionable.

* Declaring printing the URL as the only and a sufficient method "to 
make any Person [...] aware that [...] is available under this License" 
in non-digital/non-interactive applications does not seem a good way to 
implement the idea of the license.  Mentioning the license directly (© 
OpenStreetMap - source data available under ODbL) seems more suitable 
and should at least be an equally allowable method of attribution in 
such cases.

3) Your paragraph about "Machine learning models" is essentially out of 
place in an attribution guideline.  The whole idea of a produced work 
becoming a derivative database is extremely delicate and with various 
issues.  The concept of derivative databases and produced works depends 
on an uninterrupted chain of responsibility from the original database 
via derivative database to produced work.  Interrupting this chain by 
allowing a produced work to be turned back into a derivative database 
essentially breaks the license.

The very purpose of a machine learning system is to generate semantic 
data and a common property of such systems is that when run on the 
training scenario they more or less reproduce the training data.  
Considering this an exceptional use case is highly questionable.

Sneaking this into an attribution guideline is ill-advised IMO.  
It seems this has been looked at purely from the perspective of 
corporate OSM data users and not from the perspective of hobby mappers.  
I see no reason other than corporate greed why machine learning models 
trained with OSM data should not be considered derivative databases.

4) The most obvious practical guideline to fulfill the "reasonably 
calculated" requirement would be that the attribution needs to be 
designed in 
a way that at least 50 percent of the map users could, when asked about 
the origin of the map they are looking at, quickly and without much 
difficulty point to the attribution.  But you don't say anything in 
that direction.

Overall i think this is totally unacceptable and looks pretty much like 
being written by corporate representatives as how they would like 
attribution to be handled with very little regard to the interests of 
the hobby mapper community and the mission of the OSMF.  I formulate 
this so strongly because i have on many occasions in the past pointed 
out that we have to formulate clear requirements to data users for what 
we expect from them - yet i can find hardly any of this in the draft.  
This is very disappointing.  As i have shown above with various 
formulation suggestions, it is not actually that difficult to put clear 
requirements into words, which makes me think this draft deliberately 
avoided doing so.

If the OSMF is not able to create an attribution guideline that 
safeguards the interests of the OSM community we will have to create 
our own guideline that lives up to the promise of being a 
real "community guideline".

-- 
Christoph Hormann
http://www.imagico.de/


