Lots of weird ones from Florida Many should not give you an issue due to how your processing, but it is best to test them anyhow. Also it might be a good reference when looking at other expansions after this runs.<br>way id="10761946" "name" v="E 10th Ct E"<br>
way id="10763539" "name" v="E 10th St E"<br>way id="10759486" "name" v="E 14th Pl E"<br>way id="11018453" "name" v="E 1st Avenue Pl" <-- not really a problem, just... odd<br>
way id="10763214" "name" v="E 40th Pz E" <-- Note the double space before E<br>way id="10966845" "name" v="E Camp N Comfort Ln" <-- Non directional N<br>
way id="11210989" "name" v="E Canal St N"<br>way id="10967404" "name" v="E Dr"<br>way id="10974755" "name" v="E Dr Martin Luther King Jr Blvd"<br>
way id="11278916" "name" v="E H St E"<br>way id="10965707" "name" v="E Ln"<br>way id="11242732" "name" v="E Martin Luther King Jr Dr"<br>
way id="11102139" "name" v="E Pl"<br>way id="10959109" "name" v="E St Andrews Dr"<br>way id="10827576" "name" v="E St James Loop" <-- I guess Tiger did not abbreviate loop<br>
way id="11272826" "name" v="E St Johns St"<br>way id="11021472" "name" v="E St Louis Ave"<br>way id="11065801" "name" v="E W Reeves Rd"<br>
way id="103599461" "name" v="E. Watson Road" <-- Not a tiger import<br>way id="10983188" "name" v="East North Street" <-- already expanded tiger<br>way id="11270447" "name" v="East North St" <-- not expanded<br>
way id="11274418" "name" v="Edwin St N E"<br>way id="10851149" "name" v="Egret's Walk Cir S" <-- In case the 's causes problems<br>way id="10808177" "name" v="Ellesmere E"<br>
way id="10951424" "name" v="Ave del Ctr"<br>way id="11288799" "name" v="Avenue E N"<br>way id="10939680" "name" v="Avenue N"<br>
way id="11285084" "name" v="Avenue N NW"<br>way id="11097378" "name" v="Dr"<br>way id="10812824" "name" v="Dr Faruqui Dr"<br>way id="11358527" "name" v="Dr Joe Abal Dr"<br>
way id="10919692" "name" v="Dr Martin L King Jr Dr"<br>way id="11128816" "name" v="N 14th St Pl"<br>way id="10982651" "name" v="N 19th Cir SW"<br>
way id="39488514" "name" v="N 22nd St." <-- non tiger<br>way id="10885972" "name" v="N 3rd Street Cir"<br>way id="10993673" "name" v="N Blvd"<br>
way id="10807124" "name" v="N Cortez Dr Cir C"<br>way id="11371860" "name_1" v="N Cswy" <-- "name" v="N Causway"<br>way id="11090351" "name" v="N E 144th Avenue Rd"<br>
way id="11080981" "name" v="N E 238 Ave Rd"<br>way id="11089629" "name" v="N E 62nd Ct Rd"<br>way id="10927659" "name" v="N E St"<br>
way id="11013343" "name" v="N F S 595-2"<br>way id="10925619" "name" v="N N St"<br>way id="11359562" "name" v="N N Road"<br>way id="10921209" "name" v="N S St"<br>
way id="10880720" "name" v="N St Andrews St"<br>way id="10765917" "name" v="N St Clair St"<br>way id="10979914" "name" v="N St Peter St"<br>
way id="11302478" "name" v="N Swan Ct NE"<br>way id="10243562" "name" v="N W 34th St R"<br>way id="11092219" "name" v="N W 51 St Ct"<br>
way id="10927760" "name" v="N W Ave F North"<br>way id="10763701" "name" v="N de Gama Ave N"<br>way id="26630760" "name" v="N orth22nd Street" <--bad manual edit<br>
way id="27354570" "name" v="N orthGarcia Avenue" <--bad manual edit<br>way id="10754189" "name" v="N-Yellow Pine Cir" <-- "name_1" v="Yellow Pine North Cir"<br>
way id="119723334" "name" v="N. Shingle Lane" <-- non tiger<br>way id="10983026" "name" v="N19th Ave" <-- "tiger:name_base" v="111th" Probably due to edits<br>
way id="11058140" "name" v="NE 40 Ln" <-- "name_1" v="NE 1 St Ave" Version 1 tiger<br>way id="10806770" "name" v="NE 16th Ter; NE 17th Ave" <-- double name possibly from edits<br>
way id="11079312" "name" v="NE 172 Ave Rd"<br>way id="11089303" "name" v="NE 18th Ave; NE 9th St" <-- double name possibly from edits<br>way id="10800930" "name" v="NE 19th Ter; NE 25th St" <-- double name possibly from edits<br>
way id="11100990" "name" v="NE 196 Ter Rd"<br>way id="11099492" "name" v="NE 21st Ter W"<br>way id="11088248" "name" v="NE 220th Ave Rd"<br>
way id="11062349" "name" v="NE 3 Rd Ave"<br>way id="11081124" "name" v="NE 36th Av Rd"<br>way id="11070763" "name" v="NE Mt Zion A M E Church Ave"<br>
way id="11081908" "name" v="NE226 Ter"<br>way id="28931406" "name" v="NE31st Ave" <-- non tiger<br>way id="10789444" "name" v="NE<br>
way id="10788734" "name" v="NW 10th St Access Rd"<br>way id="10788581" "name" v="NW 126th Ave; NW 126th Way"<br>way id="10242655" "name" v="NW 141st"<br>
way id="10242241" "name" v="NW 181 St"<br>way id="11128828" "name" v="NW 181st St"<br>way id="11085308" "name" v="NW 21st Street"<br>
way id="11082282" "name" v="NW 221st Street Rd"<br>way id="10765627" "name" v="NW 231 St"<br>way id="11151648" "name" v="NW 4th Avenue Cir E"<br>
way id="10792992" "name" v="NW 6th Ave; Blanch Ely Ave"<br>way id="10809778" "name" v="NW 71st Pl; NW 71st St"<br>way id="10928777" "name" v="NW Avenue G; Avenue G North; NW Avenue G"<br>
way id="11273744" "name" v="NW Dr"<br>way id="107757877" "name" v="NW NW 125th Avenue" <-- non tiger<br>way id="10246730" "name" v="NW30Ln" <-- name1 has spaces<br>
way id="11065133" "name" v="National Forest Rd 141A"<br>way id="11060010" "name" v="Nf Rd 354"<br>way id="11083729" "name" v="Nfr 75B"<br>
way id="11034257" "name" v="Nfs 572 B"<br>way id="10237516" "name" v="Nnw 141 St"<br>way id="10803531" "name" v="Nmw 49th Ave"<br>
way id="83737572" "name" v="North 46th Streeet" <--manual expansion typo<br>way id="10874252" "name" v="Northern Pacific Dr N"<br>way id="11124503" "name" v="Northwest 38th Court; NW 38th Ct"<br>
way id="11213490" "name" v="Norwich O" <-- "tiger:name_direction_suffix" v="O"<br>way id="57732753" "name" v="Nw 35th Ave" <-- name case<br>
way id="9059279" "name" v="S St"<br>way id="11058256" "name" v="S W Cr 347"<br>way id="11030290" "name" v="S and S Ln"<br>way id="34939098" "name" v="S.W. Sundance Trail" <-- non tiger<br>
way id="10927892" "name_1" v="SE Ave E" <-- "name" v="Avenue E South" <br>way id="11234533" "name" v="SE Ave K Pl" <-- "tiger:name_base" v="SE" "tiger:name_type" v="Ave"<br>
way id="11200345" "name" v="SE Ave F Pl"<br>way id="11100255" "name" v="SE Summerfield Way; Summerfield Way"<br>way id="11298967" "name" v="SE W Snow Rd"<br>
way id="10244663" "name" v="SE182 Ave"<br>way id="11121266" "name" v="SW 108th Stcr" <-- "name_1" v="SW 108th St"<br>way id="11156351" "name" v="SW 108th Stcr N"<br>
way id="11167562" "name" v="SW 112th Cir Ln S"<br>way id="11032640" "name" v="SW 28th Ter; SE 28th Ter"<br>way id="11107394" "name" v="SW Dr Martin L King Jr Dr"<br>
way id="11101573" "name" v="SW St George St"<br>way id="10767432" "name" v="Saint St SE"<br>way id="10777261" "name" v="Scallop Dr; George J King Blvd; Glen Cheek Dr"<br>
way id="10762071" "name" v="W 33rd St W"<br>way id="10763371" "name" v="W 30th St W"<br><br><br><br><br><div class="gmail_quote">On Fri, May 11, 2012 at 7:38 PM, Serge Wroclawski <span dir="ltr"><<a href="mailto:emacsen@gmail.com" target="_blank">emacsen@gmail.com</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div class="im">On Fri, May 11, 2012 at 4:17 PM, Dale Puch <<a href="mailto:dale.puch@gmail.com">dale.puch@gmail.com</a>> wrote:<br>
> I understand the script checks for only one instance of the abbreviation.<br>
<br>
> My point was what is someone manually expanded ONE of the abbreviations,<br>
> leaving "st something street"? Is that checked for?<br>
<br>
</div>I have a number of thoughts here:<br>
<br>
1. Real world examples.<br>
<br>
Many of the examples I've seen are contrived. I'm glad we're testing,<br>
but testing needs to be based on actual data seen in the US dataset.<br>
<br>
That said:<br>
<br>
2. There are a couple of ways to handle this:<br>
<br>
* One way (the most conservative way) would be to test for untouched<br>
TIGER ways. That is ways in which they're still at version 1. This<br>
would be a real problem, though, since there are lots of examples were<br>
someone may have fixed the geometry without touching the tags.<br>
<br>
* The other way is a method I'm using in an experimental branch of the<br>
code on my machine, which is to try to be a bit more selective about<br>
the expansions of road types. If we assume that the road type always<br>
appears after the base name, we can be handle examples like (real<br>
world example) "St Marys St". The same would hold true for direction<br>
tags, so we'd be able to expand "E E St" confidently as well.<br>
<br>
But there's a catch. If someone would have edited the name of the<br>
above street from the original "St Marys St" to "St. Marys St" then<br>
that test would fail, and the expansion would never occur, where as in<br>
the current version, it would.<br>
<br>
So:<br>
<br>
3. Any method used is going to produce some number of potential either<br>
false positives or false negatives. I contend that the number of<br>
errors in either case will be so tiny that it will be lost in the<br>
noise, but there's no way to promise it will always be 0. The best we<br>
can do is toss out uncertain expansions and have them handled manually<br>
(which is something I'm working to make better in the next version of<br>
the code as well).<br>
<br>
But:<br>
<br>
4. I don't want us to rely on cleverness. I'd much rather rely on<br>
people testing the code with real world inputs and checking the<br>
outputs.<br>
<br>
<br>
I should have a new version of the code either tonight or tomorrow,<br>
with the new expansion rules.<br>
<span class="HOEnZb"><font color="#888888"><br>
- Serge<br>
</font></span></blockquote></div><br><br clear="all"><br>-- <br>Dale Puch<br>