Commentary
Text-Analytics Demand Approaches $1 Billion
Brand-reputation management, market research, competitive intelligence, and customer-service needs have led to fast growth. Here what's driving interest.Software and service text-analytics revenues now total $835 million globally, according to a 2010 market study I completed recently.
Growth is steepest for applications that seek business insight in social networks, online media, and surveys. Applications include brand-reputation management, market research, competitive intelligence, and customer service and support. For these applications and others, text analytics brings automated, natural-language processing techniques to bear to identify and extract names, facts, relationships, sentiment, and other information in blogs, forums, news, social updates, e-mail, and a range of enterprise sources.
More Software Insights
Webcasts
- Perform Better in a Hybrid Cloud World
- Entering the Scrum: Taking the First Steps on Your Agile Journey
White Papers
- Big Data Management for the Enterprise
- Six Ways to Improve Customer Loyalty and Drive Retention with Integrated CRM
Reports
More >>I'll will describe my findings in detail at next week's 2011 Text Analytics Summit, slated for May 18-19 in Boston. These content-analysis capabilities extend business intelligence (BI) and predictive-analytics coverage, as Doug Henschen reported with an online focus in Social Media Shapes Up As Next Analytic Frontier.
My $835 million market-size estimate covers software licenses, service subscriptions, and vendor-provided technical support and professional services. Despite strong growth, it remains a small fraction of Gartner's $10.5 billion 2010 valuation of the broader BI, analytics, and performance-management software market.
My estimate captures the value both of core content-analysis capabilities and of text analytics' contribution to four content-related application categories: Information capture (most often via Web scraping), information management (descriptive metadata and "unstructured" text), text-fueled enterprise applications, and search-based applications.
The search-based applications category is worth an estimated $300 million of the $835 million text-analytics total value. It includes Web and enterprise search, e-discovery, and business, scientific, and legal information services. The last three are typically accessed via search interfaces and rely on knowledge bases populated by mining textual sources, such as judicial records, research papers, and online forums. Examples include the West Litigation Monitor from Thomson Reuters, Elsevier's SciVerse platform, and ConsumerBase from NetBase.
The enterprise-applications category comprises software and services for business functions such as customer relationship management (CRM), market research, enterprise feedback management/surveys, and competitive intelligence.
Enterprise information management (EIM) systems store both text and accompanying descriptive metadata (such as author, title, topic, publication date, and tags) that facilitates publishing the stored text as "smart content." EIM places a premium on multi-channel publishing, reuse, and targeting.
Information capture or acquisition starts with Web crawling and page or document retrieval, for example, locating and scraping prices from online commerce sites for competitive intelligence purposes. It further includes text extraction from mark-up and binary formats such as HTML, PDF, and Word; data cleansing that removes ads, menus, spam, and other extraneous content; metadata extraction, and deduplication.
Related Reading
| To upload an avatar photo, first complete your Disqus profile. | View the list of supported HTML tags you can use to style comments. | Please read our commenting policy. |
Subscribe to RSSResource Links
Related Webcasts
- Perform Better in a Hybrid Cloud World
- SMB Server Guide: Meeting Email, Virtualization, and Business Application Challenges
- Techniques for Next-Gen Data Protection using Next-Gen Computing
- Best Practices for Improving Database Testing: Upgrades, migrations, business growth and more - ensuring you can handle the workload!
- Insurance Workforce Optimization: How To Work Smarter To Benefit Your Customers, Employees and the Bottom Line
This Week's Issue
Free Print Subscription
SubscribeCurrent Healthcare Issue
- InformationWeek Healthcare CIO 25: Our second annual honor roll of the health IT leaders driving healthcare's transformation.
- EHR Unreadiness: Only a small percentage of physicians planning to apply for Meaningful Use funds have e-health record systems capable of achieving most of the requirements. .
- And much more!
- Read the Current Issue
Related Whitepapers
Featured Broadcast
Organizations must rigorously protect their data from all threats - including theft by outsiders and insiders, malicious attacks that can distort or destroy data, and inadvertent corruption or misuse by employees.Download this white paper and find out how to safeguard data and fulfill compliance mandates.
Learn More












