dtSearch Corp

The Smart Choice for Text Retrieval since 1991, dtSearch offers 17 years of experience in text search. The dtSearch product line includes enterprise and developer text search products, meeting some of the largest-capacity text retrieval needs in the world. dtSearch products have received hundreds of excellent press reviews and case studies. (Please see for these.) The company has distributors worldwide, including coverage on six continents.

Our Website:

Latest Content From dtSearch Corp

Whitepaper: Searchable Web Content on Portable Media

by dtSearch CorpFeb 15, 2010

Searchable Web Content on Portable Media

Whitepaper: Indexed vs. Unindexed Searching

by dtSearch CorpJan 01, 2008

Both indexed and unindexed searching have their place in the enterprise. Indexed text retrieval is typically more efficient for uses such as general information retrieval, distributed searching, and security classification systems. But unindexed searching, too, has its place�in outgoing e-mail filtering, searching of live data sources such as RSS news feeds, and sometimes in forensics. This article will attempt to explain which search technique to use when, and why.

Whitepaper: How Developers Can Cross the Full-Text Search/Fielded Data Divide

by dtSearch CorpJan 01, 2008

Where individual PCs can store gigabytes of data, and enterprise intranets and public sites terabytes of data, finding the correct document (or Web page) requires a complete arsenal of full-text indexed and fielded data search tools. While this combination makes sense for the end user, from a development perspective, these two approaches to data are very different � the equivalent of �apples and oranges.� This article discusses methods for synthesizing the "apples" of full-text searching with the �oranges� of fielded data, using the dtSearch Text Retrieval Engine as an example.

Whitepaper: Methods of Integrating Full-Text and Fielded Data Search

by dtSearch CorpJul 01, 2007

This article addresses methods of integrating fielded data with full-text indexed search, with the aim of providing more relevant search results. The discussion relies on the dtSearch� Text Retrieval Engine for its specific examples, although the general concepts have broader applicability.

Document Metadata: The simplest option for integrating fielded data and full-text searching is to use existing fields in documents. For example, MS Office, OpenOffice, PDF, HTML, and other documents all contain metadata fields. Using the fields inside these documents has the advantage of making each document its own self-contained data unit. The diversity of document types and the size of a document collection can, however, make adding fields to each document prohibitively time consuming. The fielded data itself may also require a more complex table or hierarchical data structure than the underlying documents' fielded data options support.

Database Metadata: Another alternative is to store fielded data for each document within a separate database such as SQL or XML. The documents themselves can either remain outside the database with only a filename or other identifier in the database. Or the documents can be inside a BLOB field in the database. Because a structured database holds the fields, the database approach supports a more complex relational metadata structure.