I applaud your effort to update the addresses. I'm guessing the size of the change set would make the current MapRoulette connection task seem trivial. <div><br></div><div>I did look at your tiger.py script. I think "br" might stand for branch as well as bridge. Also, I've seen "mtn" used for Mountain. </div>
<div><br></div><div>Clifford</div><div class="gmail_extra"><br><br><div class="gmail_quote">On Mon, Nov 26, 2012 at 6:29 PM, Serge Wroclawski <span dir="ltr">&lt;<a href="mailto:emacsen@gmail.com" target="_blank">emacsen@gmail.com</a>&gt;</span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Hello all,<br>
<br>
After the OSM US Google Hangout two weeks ago, there was talk of<br>
bringing back the effort I started six months ago to create a TIGER<br>
expansion bot to run against the roads in the US.<br>
<br>
I've brushed off the code and made several improvements to it (more on<br>
this later in the mail).<br>
<br>
In order to facilitate community involvement, I've talked with the OSM<br>
US board and we're going to have a process by which the code is<br>
officially vetted.<br>
<br>
That process begins with this email. I'm making the most recent<br>
version of the code available at:<br>
<br>
<a href="https://github.com/emacsen/tiger-expansion" target="_blank">https://github.com/emacsen/tiger-expansion</a><br>
(there was a URL for the previous version, but this is where the<br>
current, up to date code will live).<br>
<br>
I encourage people to review the code.<br>
<br>
In addition, on Thursday, November 29th at 8pm EST on Google Plus,<br>
we'll have another public hangout where I'll do a code walkthrough.<br>
This will be an opportunity for people to bring up questions or<br>
concerns they have about specific code issues.<br>
<br>
From there, barring any major issues, I'll send a follow-up email to<br>
this email where I'll make a final request for comment. People are<br>
encouraged to send in any specific code-related issues, and we'll keep<br>
that review period open for one week.<br>
<br>
After that, the code will be executed, and that execution period will<br>
probably be several days, as I'll be manually supervising the<br>
execution myself.<br>
<br>
In anticipation of the code walkthrough on Thursday, I'll give a high<br>
level overview of the code, as well as the changes from the version<br>
six months ago.<br>
<br>
The code is written in Python, and it uses a simple XML parser to<br>
parse OSM XML. I have a simple framework for handling this in the<br>
pyxbot.py file, which handles the parsing and selection process.<br>
<br>
The tiger.py file contains TIGER specific expansion code, and the<br>
selection process is quite simple. The selector looks for ways which<br>
have both a "highway" key and a "name" key among their tags.<br>
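<br>
As a rough illustration of that selection rule (a sketch only, not the<br>
code in pyxbot.py or tiger.py; the helper names make_way, way_tags and<br>
is_selected and the sample ways are made up for this example):<br>
<pre>
```python
import xml.etree.ElementTree as ET


def make_way(way_id, tags):
    """Build a minimal OSM way element (hypothetical test data)."""
    way = ET.Element("way", id=str(way_id))
    for key, value in tags.items():
        ET.SubElement(way, "tag", k=key, v=value)
    return way


def way_tags(way):
    """Collect the tag children of a way into a plain dict."""
    return {tag.get("k"): tag.get("v") for tag in way.findall("tag")}


def is_selected(way):
    """Select ways that carry both a highway key and a name key."""
    tags = way_tags(way)
    return "highway" in tags and "name" in tags


ways = [
    make_way(1, {"highway": "residential", "name": "Main St"}),
    make_way(2, {"highway": "service"}),
    make_way(3, {"name": "Mill Pond", "natural": "water"}),
]
selected_ids = [way.get("id") for way in ways if is_selected(way)]
print(selected_ids)
```
</pre>
Only the first sample way has both keys, so it is the only one selected.<br>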
<br>
The selected ways then go through a transformation, which looks for<br>
name, name_1, name_2, etc. and for the corresponding tiger tags<br>
(tiger:name_base, etc.). It then pieces apart the name from the<br>
existing name and reconstructs it using the expanded tiger tags. If<br>
the new name is different, then it is stored.<br>
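<br>
To make the idea concrete, here is a minimal sketch of that kind of<br>
transformation. It is not the logic in tiger.py: the two lookup tables<br>
are tiny stand-ins, the function names are hypothetical, and I am<br>
assuming tags named tiger:name_type and tiger:name_direction_prefix<br>
(with _1, _2 variants) alongside tiger:name_base:<br>
<pre>
```python
# Hypothetical, tiny expansion tables; the real table is much larger.
ROAD_TYPES = {"St": "Street", "Ave": "Avenue", "Rd": "Road"}
DIRECTIONS = {"N": "North", "S": "South", "E": "East", "W": "West"}


def expand_name(tags, suffix=""):
    """Rebuild one name from its TIGER parts, or return None if it cannot be checked.

    suffix is "" for name, "_1" for name_1, and so on.
    """
    name = tags.get("name" + suffix)
    base = tags.get("tiger:name_base" + suffix)
    if name is None or base is None:
        return None
    prefix = tags.get("tiger:name_direction_prefix" + suffix, "")
    road_type = tags.get("tiger:name_type" + suffix, "")
    # Only trust the TIGER parts if they still spell out the current name.
    abbreviated = " ".join(part for part in (prefix, base, road_type) if part)
    if abbreviated != name:
        return None
    parts = (DIRECTIONS.get(prefix, prefix), base, ROAD_TYPES.get(road_type, road_type))
    return " ".join(part for part in parts if part)


def changed_names(tags):
    """Return only the name keys whose expanded value differs from the current one."""
    changes = {}
    for key in tags:
        if key == "name" or key.startswith("name_"):
            expanded = expand_name(tags, key[len("name"):])
            if expanded is not None and expanded != tags[key]:
                changes[key] = expanded
    return changes


example_tags = {
    "highway": "residential",
    "name": "N Main St",
    "tiger:name_direction_prefix": "N",
    "tiger:name_base": "Main",
    "tiger:name_type": "St",
    "name_1": "Old Mill Rd",
    "tiger:name_base_1": "Old Mill",
    "tiger:name_type_1": "Rd",
}
print(changed_names(example_tags))
```
</pre>
In this sketch a name that no longer matches its TIGER parts is simply<br>
skipped; telling "already expanded" apart from "needs review" is a<br>
separate step.<br>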
<br>
If the name is already properly expanded, then the way is ignored, but<br>
if there's a problem with the tag expansion, then that way information<br>
is stored elsewhere for review.<br>
<br>
The review file (a CSV file) contains information about all the ways<br>
that didn't process properly, such as the way ID, the (primary) name,<br>
and the reason for the failure.<br>
<br>
This file can then be reviewed later, or fed into a MapRoulette<br>
challenge [1].<br>
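<br>
For a sense of what such a review file could look like, here is a small<br>
sketch using Python's csv module. The column names, the way IDs and the<br>
failure reasons are all invented for the example and are not the bot's<br>
actual output format:<br>
<pre>
```python
import csv
import io

# Hypothetical records for ways that failed expansion; IDs and reasons are made up.
failures = [
    {"way_id": 101, "name": "Oak Br Rd", "reason": "ambiguous abbreviation: Br"},
    {"way_id": 102, "name": "Elm St", "reason": "name does not match tiger:name_base"},
]


def write_review_file(rows, handle):
    """Write one CSV row per failed way: ID, primary name, failure reason."""
    writer = csv.DictWriter(handle, fieldnames=["way_id", "name", "reason"])
    writer.writeheader()
    writer.writerows(rows)


# An in-memory buffer stands in for the real file on disk.
buffer = io.StringIO()
write_review_file(failures, buffer)
review_csv = buffer.getvalue()
print(review_csv)
```
</pre>
Passing an open file instead of the StringIO buffer would write the same<br>
rows to disk.<br>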
<br>
Now, for those folks who looked at the code six months ago, these are<br>
the major changes:<br>
<br>
1. I've expanded the expansion table quite a bit, through extensive testing.<br>
<br>
2. I've added the review file functionality<br>
<br>
3. I've added name_1, etc. functionality.<br>
<br>
4. The code is more modular than it was<br>
<br>
5. The code is easier to run from the command line<br>
<br>
<br>
So, the code is out there. If you have technical questions, I'll go<br>
into more depth Thursday.<br>
<br>
- Serge<br>
<br>
[1] I'm hacking on the MapRoulette code to make it easier to add new<br>
challenges, such as this.<br>
<br>
_______________________________________________<br>
Talk-us mailing list<br>
<a href="mailto:Talk-us@openstreetmap.org">Talk-us@openstreetmap.org</a><br>
<a href="http://lists.openstreetmap.org/listinfo/talk-us" target="_blank">http://lists.openstreetmap.org/listinfo/talk-us</a><br>
</blockquote></div><br><br clear="all"><div><br></div><br>
</div>