[OSM-dev] scaling
Frederik Ramm
frederik at remote.org
Sun Jan 9 22:35:02 GMT 2011
Steve,
SteveC wrote:
> I don't mean just another big import. Sadly I can't be public
> but I had a conversation with a large company over a year ago (no,
> it's not MS or CM) who speculated about putting OSM on the front
> page of their maps product, which would approximately turn all of our
> yearly statistics to daily or weekly numbers. We went through a
> decision tree about how that could happen. Every leaf node on that
> tree came back as basically we couldn't do it.
This was not purely a techical issue. If we were set up, technically, to
handle something like what you're describing here, the "eternal
september" effect would kill off the community for good.
So in a way, we must be glad that we have this technical limitation,
because it protects us against someone dumping 1000s of new users every
week onto our mailing lists, IRC channels, forums, onto help.osm.org,
the Wiki, and int our regular pub meets.
> If you think through it,
> within any reasonable time frame (like 6-12 months) it's very hard to
> make that happen, and so you may as well go build your own things.
> Which I think sucks and is a loss for OSM.
I honestly think that the project would degrade tremendously were
something like this to happen. We can handle growth, and I also believe
we can handle super-linear growth, but something that "turns all our
yearly statistics into daily or weekly numbers" would be the end of
everything that is sane about OSM.
> Now this conversation has come up a few more times recently with
> other large mapping companies. And I feel like I'm rehashing those
> conversations above. I'd love to be public about it, but those
> companies aren't ready to talk yet.
Good for them because they would have to take the heat for killing OSM -
or nonchalantly thinking about doing that.
> Even if people weren't privately proposing notching up our traffic a
> few orders of magnitude
Please stop thinking of this as a technical issue. It isn't!
> But the direction of supporting and encouraging basic
> things like scaling is I think well within the bounds.
>
> I haven't a clue what we should use to scale horizontally.
I think "encouraging scaling" is within the bounds of what OSMF could
think about. Whether or not such scaling needs to be horizontal,
vertical, deep-bore or whatever other method is currently en vogue,
that's perhaps something best left to the TWG.
If OSMF board wants to realistically construct a scenario that has the
number of new users, and the amount of edits and traffic, multiplied by
100 in the frame of a few months, then I'm sure that TWG can provide
ideas how to get there. But you would *have* to find someone who creates
that scenario for the social side and looks into the issues I've brought
up above or else we'll be fucked on a level that's much harder to fix
than a slow database.
I don't know how much your personal view might be tainted by the US
perspective. Compared to Europe, ans especially taking into account the
population base, community size in the US is still negligible. You might
be tempted to think that you can play va banque there as there isn't a
lot to lose. If that's the case then maybe we should work on providing
more separation - allowing you to supercahrge and horizontally scale
OSM-US with the help of one megacorp or the other, but keeping us here
in Europe out of harm's way.
(Indeed one way of scaling would be to "devolve" the central database -
have several of them for different regions. That would make it more
difficult for people to write bots that wreak havoc with the world-wide
dataset but I wouldn't consider that a loss ;)
> So, how do we get from here to there? Speaking strictly personally, I
> think one of the best uses of funds in or out of OSM has been bug
> bounties.
Does anyone have an example where this has worked in OSM? I always
perceived these bounties as your personal hobby horse that never worked
for us but maybe my thinking has clouded my vision here.
Personally, I think that people tend to be encouraged by the hope that
their solution might become THE solution used in OSM. Tell a hacker that
he gets $1000 if he gets something to work, and he'll say "oh well,
let's leave that to those who need the cash". Tell a hacker that you'll
roll out his idea big time if he proves it can work, and he'll happily
do it...
> So, what do you think? And if you agree it's worth doing, how do we
> achieve it either as individuals or the board or companies supporting
> it?
First of all, stop talking of the kind of explosive growth you're
talking about. Set yourselves a realistic growth scenario - I'd say
something like "number of contributors & data size double every year" or
so. Sit down with TWG and find out how long the traditional approach
will hold out, and how and when to scale, then make plans for that. I'm
sure TWG will be mature enough to say what they can do and what they
can't, and where they will need input & concepts from the project at
large. Find suitable ways to solicit such.
If you are looking for a project that can support the kind of explosive
growth you envisage - factor 100 in a few months -, then start a new
project, one that relies much less on individual human beings. A project
where you can simply scale by adding another server, without also having
to tear down the pub and erect a town hall in its place.
Bye
Frederik
--
Frederik Ramm ## eMail frederik at remote.org ## N49°00'09" E008°23'33"
More information about the dev
mailing list