Google Index Reaches 1 Trillion URLs - InformationWeek
IoT
IoT
Mobile // Mobile Applications
Commentary
7/25/2008
05:23 PM
Thomas Claburn
Thomas Claburn
Commentary
Connect Directly
Google+
LinkedIn
Twitter
RSS
E-Mail
50%
50%
RELATED EVENTS
Building Security for the IoT
Nov 09, 2017
In this webcast, experts discuss the most effective approaches to securing Internet-enabled system ...Read More>>

Google Index Reaches 1 Trillion URLs

Three years after Google declared that its index was three times larger than any other search engine and then declined to cite a specific number to support that claim, it was widely believed that Google had tired of index one-upmanship and that it would no longer be measuring its index.

Three years after Google declared that its index was three times larger than any other search engine and then declined to cite a specific number to support that claim, it was widely believed that Google had tired of index one-upmanship and that it would no longer be measuring its index.Well, Google has its yardstick in hand once again.

Two Google engineers on Friday said that Google's index of the Web now contains 1 trillion unique URLs.

That's a lot of URLs. However, there's a lot of chaff in there. Consider that "Google" alone returns 2,740,000,000 Google search results, "Yahoo" returns 2,930,000,000, and "eBay" returns 1,080,000,000. Add up the number of results generated by searching for the top 100 keywords and you'd have a significant fraction of 1 trillion.

In 1998, when Google opened for business, it had 26 million URLs. By 2000, it had reached 1 billion. In 2005, Google claimed it had more than 8 billion Web URLs in its index, at least until it took the index count off its home page. In 2008, Google's measure of the Web is 1 trillion Web URLs.

So it appears that Google's index is exploding with new Web pages. From 2000 to 2005, Google's index grew by a factor of 8. From 2005 to 2008, it grew by a factor of 125.

There you have evidence of the information explosion that Google and other companies are trying to fight through the Information Overload Research Group.

Maybe.

Google software engineers Jesse Alpert and Nissan Hajaj admit that they don't really know how many unique Web pages there are. And they acknowledge that Web URLs are essentially infinite because dynamic page generation for things like future calendar months means there's always another page to crawl. "But we're proud to have the most comprehensive index of any search engine," they say in a blog post.

So forget the math. The numbers are too slippery. Really, this is about bragging rights.

Comment  | 
Print  | 
More Insights
Comments
Newest First  |  Oldest First  |  Threaded View
How Enterprises Are Attacking the IT Security Enterprise
How Enterprises Are Attacking the IT Security Enterprise
To learn more about what organizations are doing to tackle attacks and threats we surveyed a group of 300 IT and infosec professionals to find out what their biggest IT security challenges are and what they're doing to defend against today's threats. Download the report to see what they're saying.
Register for InformationWeek Newsletters
White Papers
Current Issue
2017 State of IT Report
In today's technology-driven world, "innovation" has become a basic expectation. IT leaders are tasked with making technical magic, improving customer experience, and boosting the bottom line -- yet often without any increase to the IT budget. How are organizations striking the balance between new initiatives and cost control? Download our report to learn about the biggest challenges and how savvy IT executives are overcoming them.
Video
Slideshows
Twitter Feed
Sponsored Live Streaming Video
Everything You've Been Told About Mobility Is Wrong
Attend this video symposium with Sean Wisdom, Global Director of Mobility Solutions, and learn about how you can harness powerful new products to mobilize your business potential.
Flash Poll