[OSM-dev] Automatic Spell check service

Stefan de Konink skinkie at xs4all.nl
Wed Jan 2 18:56:09 GMT 2008


 From my mail box I found some related messages in 'Spell check data to 
a file'.

I wonder if it could be accepted that every day an automatic spell check 
script would operate on the database and checks for less common 'name' 
tags for automatic spelling corrections.


select name, count(name) AS count from way_tags group by name order by 
count, name;

...or more advanced:
select way, name from way_tags where name = ( select name from way_tags 
group by name having count(name) = 1);

If anyone interested in some plane query results of different tags 
ordered by (non-)presence:

http://www.mediafire.com/?6uwdsgnuc9z



So basically:
- skip stuff with the tiger namespace
- check the untrimmed results:
select way, name from way_tags where name <> TRIM(name); (around
  107 results)

- check the count(name) results less than 10, lowercase, trim them and 
apply something like Levenshtein distance to the results and the higher 
numbers.




Comments?



Stefan




More information about the dev mailing list