[OSM-dev] Minute Diffs Broken

Matt Amos zerebubuth at gmail.com
Tue Oct 6 17:09:54 BST 2009


On Tue, Oct 6, 2009 at 3:03 PM, Florian Lohoff <flo at rfc822.org> wrote:
>
> Hi,
>
> On Tue, May 05, 2009 at 08:32:44AM +1000, Brett Henderson wrote:
>> Hi Everybody,
>>
>> Unfortunately the minute diffs appear to be regularly missing data.  In
>> the last 8 hours at least 3 changesets have been missed.  The ones I've
>> noticed are 1076325, 1076998, 1077469.  These have been detected by
>> comparing the normal minute diffs against another minute diff process
>> running half an hour later.  I don't know what is causing these
>> changesets to be applied to the database so slowly, whether it's just
>> their size or some other factor I don't know.  I don't know if this is
>> something that can be fixed, or whether the current osmosis extraction
>> method is too time-sensitive and simply broken.
>>
>> At some stage over the next day or so I'll try to publish the audit
>> results automatically so that the problems are at least visible.
>>
>> The hourly and daily diffs should be more reliable because they run with
>> a 30 and 40 minute delay respectively although theoretically there's no
>> guarantee that they're correct either.
>>
>> So, any suggestions on how to fix this?
>>
>> I've been trying to avoid requiring any changes to the main database in
>> order to keep things simple but perhaps it's unavoidable.  One way
>> around the problem would be to introduce delta table(s) in the main
>> database populated by triggers on the existing history tables and
>> containing the ids and timestamps of changes.  Osmosis could read those
>> tables and delete records as it processes them.  It's a major change though.
>>
>> This isn't an ideal forum for coming up with solutions, but I thought it
>> was important to ensure people are aware of the problem.  I'll try to
>> spend some time on IRC over the next few days.  Whatever the solution, I
>> won't have the time (or skills) to do it on my own.
>
> Is this fixed or at least somebody has an idea how to fix this? Its
> been 6 Months since i switched to hourly updates which did not suffer
> from this problem although i'd like my applications to be more current.

we hope so. there are (currently experimental) "replication" diffs
here http://planet.openstreetmap.org/minute-replicate/ in a three-deep
numeric directory structure. these diffs are produced from the
transactions in the database, so each doesn't strictly cover a minute
of edits, but they are produced with almost no delay and shouldn't be
missing any elements.

cheers,

matt




More information about the dev mailing list