Sunday, November 11, 2012

INTERNET SPECIAL..How Google's India maps are built



How Google's India maps are built

Google Maps here is an entirely user-driven project, aided by some sophisticated code
How did Google Inc., a start-up that had majored in the art and science of Internet search, create maps that have become the go-to guide for commuters world over? Last month, The Atlantic offered a behind-the-grids view of how the technology major went about building the maps that have come to become one of the most expansive and accurate sources of geo-information on the Web.
This first of its kind report on what Google dubs as ‘Project Ground Truth’, its very secret mapping programme, revealed that base layers for these maps source data from the U.S. Census Bureau, the Geological Survey, and of course, from the famous – infamously discontinued after its launch in India – Google Street View vans that have clocked over 5 million miles till date. All this is, of course, crunched by Google’s sophisticated algorithms and clever, rigorous code.
Evidently, the process of creating among the most accurate maps in the world is complex, and fascinating. But in India, where an increasing number of smartphone users are logging on to maps, the ‘ground truth’ is significantly different: all that you see on the grid on Google Maps in India has been created by users. An entirely crowd-sourced product, our maps are created by us.
Altogether different from how the search giant went about mapping in these developed countries, in India, Google is only now exploring partnerships with some government establishments to map rural India better. That its Street View vans are now resting in the basement of Google’s office in Bangalore after the police stymied the project citing security concerns, is also perhaps one of the reasons why Google may be sticking to this user-only approach.
But Lalitesh Katragadda, Google India, Country Head (Products), one of the co-creators of Google Map Makers, insists that the reason they choose to focus on users is because emerging markets like India are in “hyper-growth”. “This means, the maps here change every day. If I leave Bangalore for a year, I come back to see an altogether different city, unlike say Silicon Valley where not much changes. There is constant churn of points of interest, road geometry, one-ways and building profiles. That kind of data can’t be given fresh to users without getting users on board,” he says. This, he says, is a problem that hasn’t been solved.
Brushing aside questions on what competitors are doing, and competition, he says everybody is fighting fierce over the first billion users (in developed economies), but the real challenge is to take this to the next 4.5 billion users and “getting that right is more important”. Since its launch in 2005, Google claims it has covered 1.5 billion more people across the world, and managed to map over 180 developing countries that had practically nothing in the way of high quality maps before. These numbers, the company clarifies is based on its own estimations based on populations of areas that have access to high quality maps. For instance, the population of a slum settlement in Bangalore doesn't get counted as its user base for the purpose of this estimation. "When we sat down to write code in 2004-05, the entire developing world was dark. Today, we've managed to pushed the needle from around 1 billion in the developed world to 2.5 billion (worldwide)." This, he adds, would not have been possible if it weren't for a platform like Google Map Maker that provided users with a simple, lively and interactive platform to map their streets, their neighbourhoods or their cities. It solved a problem that an individual company couldn't, by simply tapping into the wisdom and initiative of the crowd.
A "socio-technological" task
Map Maker, the “socio-technological phenomenon” that Mr. Katragadda and a small team at Google started in 2006 when they geo-tagged Ulsoor lake here, is a tool that allows users everywhere to edit (or create from a scratch) the maps they use. Maps, whether user-created or otherwise, are scientifically some of the most challenging data sets ever dealt with; far from simple text or images, they involve what in computer science parlance is called a graph comprising connected edges or nodes. And most of all, the demand for accuracy is so high that maps less than 97 or 98 per cent accurate are simply irrelevant. So how does Google ensure that these maps, created by random but enthusiastic users, meet such high standards?
How it's done
Creating maps out of crowd-sourced data presents technological and systems challenges that are huge. To begin with, how does a user make changes, and how does the machine learn to recognise these gestures, trust the data that is being fed, and update the maps in real time?
The high-resolution satellite imagery on the Map Maker platform is where you start. Once the user feeds in a landmark, or say names a road, the Map Maker encodes the science of computational geometry to automatically synthesise the map based on individual gestures from users. This begins with a user visually gesturing their knowledge, i.e. sketch on top of satellite imagery. Map Maker then takes these small inputs from a whole bunch of users and synthesises that into a connected graph that is geo-spatially overlaid on top of the world, Mr. Katragadda explains.
Another key challenge, he explains, is making all this happen in real time. This, he says, is what sets Google’s platform apart from other community-driven Open Source mapping projects such as Open Street Maps. Besides code written by “very smart engineers” to solve what is obviously a hard system challenge, it also leverages the Google Cloud.
This gives Google the ability to manipulate the entire map of the world in real time in its data centres. So, every change flows to all the data centres that are serving maps.
But what about accuracy, undoubtedly, the most important prerequisite of a map? The “secret sauce” is what Mr. Katragadda calls the “trust system”, which computer engineers will call a supervised learning system. This, he explains, is a machine learning system that notes all that people are doing — who is editing, who is moderating and what which moderator is saying.
This means that when you come in as a new user, with no mapping history, the first few edits go straight into moderation. This moderation is done mostly by the community, but also a small team of engineers that provides ‘support and guidance’ to the mapping community. It is, essentially, PageRank (Google's patented link analysis algorithm, for internet search) applied to people for the task of mapping. This “trust system” coupled with the community approach lies at the heart of the entire Google Map experience in India. “Over time, the machine learns based on hundreds of signals — where can you be trusted to map, what you map, which regions and what kind of mapping you do. It creates a profile that informs when you come in to map, the system knows that you are more trustworthy,” Mr. Katragadda explains, adding that over 80 per cent edits in India are approved immediately in real time.
When asked if this isn’t similar to the kind of system they have at the other community-run Open Source project, Wikipedia, he says that it’s radically different because there, the moderators are hand-picked, and the machine doesn’t continually learn and change your trust parameters based on your history. “That’s static, while this is a truly dynamic system.”
Not the 'California Indians'
While these huge technical challenges have more or less been solved, Mr. Katragadda feels that the path ahead is dotted with “socio-technological” hurdles. In India, for instance, he explains, the first 100 million users are taken care of. He calls them the “California Indians”, who behave and have needs similar to his friends back in California. But for the next billion and more, the needs are “extraordinarily different”: their ability to afford devices is different, their language and content needs are local and education levels are different. This, he says, translates into them needing a visual Web. The challenge for us is to understand what they need and build services that are different from what Google, and literally the entire Internet has built for them till now, he says. That is, how to get the mainstream of the world online, make the Internet useful for them and remove the huge information asymmetry that we now live with.
And the solution to this, Mr. Katragadda recognises, goes deeper than offering turn-by-turn navigations for motorcyclists in local languages. “It’s a lot more, and a lot more different.”
Deepa Kurup TH121028

No comments: