[Imports] Import of 14, 020 United Nations Mission in Liberia place nodes

Rafael Avila Coya ravilacoya at gmail.com
Mon Sep 15 16:25:49 UTC 2014


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Hi, Christoph:

I got your point. In fact, I added place=unknown as the tag you have
to put for those nodes whose place type cannot be verified, like when
lowres imagery, clouds, etc.

What I don't see is your concern on giving all nodes the default value
of place=hamlet (80-90% of the nodes fall in this type for Liberia)
and not place=unknown, that would make mappers to change almost 100%
of the values (quite annoying). This import is manual, and done
through the HOT Tasking Manager, so we can keep very easily trace of
what every mapper involved is doing. Every task can be validated by a
second mapper, and hopefully I will do some follow up on the first
tasks done by every mapper, so checking if they are doing well, not
only with the place type but with all the steps of the process.

Moreover, if you pay attention to the workflow, all nodes are checked
by the importing user using the TODO List plugin. Chances that they
miss checking the place type are really low, as they have to mainly do
2 things:

1. Check if there is any duplicate. In case there is, conflate them.
2. Check that the place=hamlet is correct, and change it if it is not.

Do 2 or 3 tasks and you find yourself doing a mechanical job that
makes it really difficult to miss that point, if not impossible. I've
participated in several jobs like this, and I find it really hard to
miss any step when trained. And everyone makes a last review of the
task before uploading, whether for this job of for any other one.
Plus, the validation process.

The aim of giving place=hamlet instead of place=unknown as default is
clear: make easier and faster the import.

Cheers,

Rafael.

On 15/09/14 20:44, Christoph Hormann wrote:
> On Saturday 13 September 2014, Rafael Avila Coya wrote:
>> 
>> Changing the place value is really fast (+/- 5 seconds).
> 
> You are missing my point here i think - giving nodes a tag based
> on the vague assumption that for the majority of them this will be 
> correct is inconsistent with the idea of a responsible import. 
> There is no information in this and you cannot determine for a
> node in the database afterwards if its tag has been diligently 
> determined from reliable information or if it is just the
> automatic default.  As a result even the hard work of those doing
> actual verification on the data is devalued since a place=hamlet
> does not tell if it has been verified against images/local
> knowledge or if it has just been pushed as it is into the
> database.
> 
> If you want to responsibly perform this import and appreciate the 
> work of those participating in it you should convert the data with 
> reasonable defaults that do not imply any knowledge of the places 
> you do not have (like place=unknown) and clearly instruct those 
> doing the import to only change this after individual assessment 
> using reliable information from other sources.  That way you can 
> later assume for a node tagged place=hamlet that this has been 
> verified (unless it has been uploaded as part of a changeset 
> containing 1000 nodes all tagged place=hamlet in which case the 
> mapper involved has probably taken the shortcut... ;-)
> 

- -- 
Twitter: http://twitter.com/ravilacoya

- --------------------------------

Por favor, non me envíe documentos con extensións .doc, .docx, .xls,
.xlsx, .ppt, .pptx, aínda podendoo facer,  non os abro.

Atendendo á lexislación vixente, empregue formatos estándares e abertos.

http://es.wikipedia.org/wiki/OpenDocument#Tipos_de_ficheros
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.10 (GNU/Linux)
Comment: Using GnuPG with undefined - http://www.enigmail.net/

iQIcBAEBAgAGBQJUFxMAAAoJEB3niTly2pPQC38P/0GMBWHVRErDd3abmtC/cipE
uD5U3pEi6rN2UhTDZN2Ib14kyAaUzlV70TTN/f+EKc2CYdtYb/0NsWmZK/YC9fV1
vsovmy9cOrzh6CSkDlU7rvS7H/oVeNjJFjpShCp3EpcAAgdmuUKaGZIusnMCA2Pm
yZUAtYhucm9kzHVFdZCXCxK8Nf+nDztG8HbB80LBnelQ3Oft4EJyFf10em/VP/i4
UErRmzXgHLvy7HnWjgS/EhyK1bWYtCgZXOW9Ez+weVTVZweSB5Ekw63CUwgfWvSf
vtSWyQJHRM6Xd0z47nBf8Zj+EZ52hXfCM1sMcX3YnUypoj6RilywXZuaLI5qLLGW
2wPo+PETmuqd1wgZeBkxYceFah3OXg6oBUqImXIOn8z/UOyIUxJUeN3Zipg3m1zf
fhmTyteRnfzSQSstaexQga3DPIesuyj99wtFk1eopHgFFRDWVvmzsK8XNvFUkuo+
bnh/LqPXNhLEx4U2AIs1SBeedGN1yBhDyhfvFao7C4EwWmLkKi18zjHtaxT5044t
zP3CgLfWNKabQCJWmLokiNyKNS3SrEiU5EZ39Fu8kqlZrLgB8SmVOvoCeWVVNgH8
tvKqtolf/A2VwBoszDDTAECCmUXzGFO4OG3o1yUCWKVP5fCpFPAnzK4KZBIyaGOy
ZtuNN9vOBKNFuWHhbMdj
=ue08
-----END PGP SIGNATURE-----



More information about the Imports mailing list