The InformationWeek -- Blogs
Google

Topics:   Google

  • Email this page E-mail this page
  • Print this page Print this page
  • Bookmark and Share
  • icon

Google Index Reaches 1 Trillion URLs


Posted by Thomas Claburn, Jul 25, 2008 05:23 PM

Three years after Google declared that its index was three times larger than any other search engine and then declined to cite a specific number to support that claim, it was widely believed that Google had tired of index one-upmanship and that it would no longer be measuring its index.


Well, Google has its yardstick in hand once again.

Two Google engineers on Friday said that Google's index of the Web now contains 1 trillion unique URLs.

That's a lot of URLs. However, there's a lot of chaff in there. Consider that "Google" alone returns 2,740,000,000 Google search results, "Yahoo" returns 2,930,000,000, and "eBay" returns 1,080,000,000. Add up the number of results generated by searching for the top 100 keywords and you'd have a significant fraction of 1 trillion.

In 1998, when Google opened for business, it had 26 million URLs. By 2000, it had reached 1 billion. In 2005, Google claimed it had more than 8 billion Web URLs in its index, at least until it took the index count off its home page. In 2008, Google's measure of the Web is 1 trillion Web URLs.

So it appears that Google's index is exploding with new Web pages. From 2000 to 2005, Google's index grew by a factor of 8. From 2005 to 2008, it grew by a factor of 125.

There you have evidence of the information explosion that Google and other companies are trying to fight through the Information Overload Research Group.

Maybe.

Google software engineers Jesse Alpert and Nissan Hajaj admit that they don't really know how many unique Web pages there are. And they acknowledge that Web URLs are essentially infinite because dynamic page generation for things like future calendar months means there's always another page to crawl. "But we're proud to have the most comprehensive index of any search engine," they say in a blog post.

So forget the math. The numbers are too slippery. Really, this is about bragging rights.

« Open Text Rounds Out Offerings With Strategic Acquisitions, Partnerships | Main | Mobile Set To Revitalize Local Newspaper Business? »



Sign Up Now
For InformationWeek News Alerts




This is a public forum. United Business Media and its affiliates are not responsible for and do not control what is posted herein. United Business Media makes no warranties or guarantees concerning any advice dispensed by its staff members or readers.

Community standards in this comment area do not permit hate language, excessive profanity, or other patently offensive language. Please be aware that all information posted to this comment area becomes the property of United Business Media LLC and may be edited and republished in print or electronic format as outlined in United Business Media's Terms of Service.

Important Note: This comment area is NOT intended for commercial messages or solicitations of business.




 
Sign Up For The Grok on Google Newsletter
Every Thursday, Tom Claburn and his fellow analysts offer all the news, insight, analysis, and strategic thinking you need to understand the company and complex phenomenon known as Google.

Sign up for our free, weekly newsletter today!

Newsletter Archives


  :: THE LATEST GOOGLE NEWS ::



 

  1. Sequential Programming: Like Eating Peas with a Straw.
  2. Biomolecular device using self-assembled DNA nanostructures?
  3. Coreinfo v2.0: A Simple Utility to Understand the Manycore Complexity in Windows


Join The InformationWeek Group On LinkedIn


                           


  1. Too Much Netbook For Too Litl?
  2. Sprint And T-Mobile Headed The Wrong Direction
  3. More Reasons Why Linux Misses The Desktop
  4. Windows 7 Is Broken, So What?


  1. Florida Hospital Dials Up iPhones For Nurses
  2. Is Antivirus Software Dead?
  3. Securing The Cyber Supply Chain
  4. CIO Profiles: Christopher Rence, Chief Information And Business Transformation Officer Of FICO
  5. InformationWeek Analytics Research: Federated Search
  6. Practical Analysis: The Fastest-Growing Security Threat

 

  Ars Technica
Boing Boing
Channel 9 Forums
CRN Blogs
Dr.Dobb's Portal: Blogs
Engadget
Gizmodo
GrokLaw
  Lifehacker
Schneier on Security
Slashdot
TechCrunch
Techdirt
Techmeme
Valleywag

  DECEMBER 2008
NOVEMBER 2008
OCTOBER 2008
SEPTEMBER 2008
AUGUST 2008
JULY 2008
JUNE 2008
MAY 2008
  APRIL 2008
MARCH 2008
FEBRUARY 2008
JANUARY 2008
DECEMBER 2007
NOVEMBER 2007
OCTOBER 2007
SEPTEMBER 2007