<br><br><div class="gmail_quote">On Fri, Feb 17, 2012 at 12:02 AM, Alan Mintz <span dir="ltr"><<a href="mailto:Alan_Mintz%2BOSM@earthlink.net">Alan_Mintz+OSM@earthlink.net</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
At 2012-01-15 05:35, Mike N wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
On 1/15/2012 8:28 AM, Nathan Edgars II wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
Actually the script also expanded the W to West. But my point is that it<br>
is a TIGER entry error, and any future script needs to take into account<br>
that these exist and people may have already fixed them to the correct<br>
names.<br>
</blockquote>
<br>
Agreed- if we're thinking of a bot that periodically fixes everything, we need a special tag that says "abbreviation_bot=back_off" (but perhaps not so verbose) - something that tells the bot not to touch the name because it is unusual and has been manually checked.<br>
</blockquote>
<br>
I hope there is no such bot being contemplated again. The last one created lots of issues.</blockquote><div><br></div><div>Sounds like it would be better to come up with a more comprehensive algorithm for the bot, not outright deny the need for it altogether. Granted, it did make minor messes in Oregon (where names with "St." meaning "Saint," "Santa/o" or "Sainte" are slightly more common) and Oklahoma (where single-letter street names are slightly more common), but overall, the automation saved countless hours of manual name expansion for the minor cost of having to deal with a very small number of largely regionally-isolated edge cases.</div>
</div>