It's the latest step in the agency's efforts to move beyond basic keyword search engine capabilities and apply search technologies that better help its personnel connect the dots.
Weeks from now the platform's reach will be extended to include intelligence gathered through the interception of radio and other signals and news feeds such as Reuters as well as message traffic from the State Department, "our largest authoritative repository of database information about foreign military capabilities," said Lewis Shepherd, chief of requirements and research for the DIA, which has 11,000 military and civilian employees worldwide.
When searching for information, DIA personnel may have as many as 300 individual data feeds, databases, and data from other intelligence agencies they can access. The 20 sources available today through Endeca represent more than half of all the agency's data, and the agency plans to add more over the next two years. Other sources can be searched using search technology from Autonomy Corp., Google, and Vivisimo Inc.
Shepherd and his staff first met with Endeca in 2004 because the agency needed a more advanced way of extracting useful information from an enormous volume of data. Prior to using Endeca, responding to a specific query from Congress or another agency could take as long as five days. Now it takes minutes. At the time, Endeca's guided navigation approach was unique in terms of the methods it used for finding information, Shepherd said.
Today, there's no shortage of search technologies out there. What continues to appeal to the DIA is Endeca's ability to offer users guided navigation as part of the search summaries it delivers. Endeca uses what the company calls a "faceted classification system" to read meta data tags on all of the data flowing through its systems and sort query results by any category in any order. The agency created these data tags using a number of different XML formatting and meta data tagging software packages, including those from Attensity and Inxight Software as well as Lockheed Martin's AeroText and SRA International's NetOwl.
Endeca's searches come back to the user in a series of menus. "It's almost like having a librarian organize your data and summarize what's on your shelves for you," said Endeca CEO Steve Papa. If one was to search on the late singer Frank Sinatra, for example, the search results might be broken down into a number of categories, including year, record labels, and movies, as opposed to a Google-like listing of all keyword matches. (In Sinatra's case, that's more than 2.7 million matches).
More specific to the DIA, when an analyst searches for information related to a "tank," the search engine is able to understand that, if the analyst is also searching for information about artillery, then "tank" refers to a vehicle rather than a container of water. The search results will be prioritized to reflect this.
Page 2:
History Of Search Technologies At DIA
![]()
1
|
2
Next Page »
Stay connected and informed by visiting the CA Solutions Center Community!

Become a member today for instant access to free InformationWeek research, expert advice, peer perspectives, and more on the following topics:
- Application Performance Management (APM)
- Security Management
- Mainframe 2.0
- IT Automation
- Service Assurance
Also, visit our Government and Financial Services groups to see how these technologies apply specifically to those industries.
NOTE: Offer valid for U.S., U.S. possessions, & Canada only.