TechWeb Digital Library

Effectively Mining and Using Coverage and Overlap Statistics for Data Integration

Source: Microsoft
Date: January 2008
Type: White Paper
Rating: (0)

Overview: Recent work in data integration has shown the importance of statistical information about the coverage and overlap of sources for efficient query processing. Despite this recognition there are no effective approaches for learning the needed statistics. The key challenge in learning such statistics is keeping the number of needed statistics low enough to have the storage and learning costs manageable. Naive approaches can become infeasible very quickly. This research paper presents a set of connected techniques that estimate the coverage and overlap statistics while keeping the needed statistics tightly under control.


Click here to download now

View all content from this source

Not what you’re looking for? Search again
Go Advanced »
Email Alert

Receive an email alert whenever new content is added to the Business Intelligence section of the TechWeb Digital Library

More Business Intelligence Resources

Content Everywhere: 10 Gotchas That Can Derail Your ECM Initiative
Enterprise content management deployments are fraught with dangers, such as weak search capabilities and poor requirements definitions, that can grind...

Impact Business Performance with BPM
To maximize your chances for success, you need the ability to continually optimize, streamline and align business processes to meet changing business needs for greater performance. Business process management (BPM) from IBM can help.

Upcoming Webcasts

More On Business Intelligence