Big Data. Big Decisions
InformationWeek
Special Coverage Series


IBM Analytics Help NC State Sell Tech Innovations

Breakthrough NoSQL-style text analysis speeds search for technology developers.

IBM and North Carolina State University (NCSU) have revealed details of an innovative text analytics application that is helping the university's Office of Technology Transfer find potential partners.

NCSU faculty, staff and students developing new technologies don't always know where to turn to commercialize their inventions. It's job of the Office of Technology Transfer at the Cary, NC, campus to manage the university's intellectual property, finding best-possible development partners for early stage technologies.

Seven full-time licensing professionals in the office handle the task of combing through sources including SEC filings, tech blogs and Internet sites to find would-be partners. But conventional search methods were too time-consuming to keep up with demand.

"We manage more than 3,000 technologies at any one time, so the case load is large and we needed something that could speed the process of finding potential partners," said Billy Houghteling, Director of the Office of Technology Transfer.

Houghteling's group started working with IBM about eight months ago as part of a much larger innovation management initiative across the entire 16-campus NCSU system. To meet the unit's fine-grained search needs, an IBM team recommended two enabling technologies: IBM Big Sheets and the IBM Content Analyzer.

Big Sheets is a non-relational data-analysis tool that provides a spreadsheet-style analysis front end for Hadoop. Often associated with the budding NoSQL movement, Hadoop is an open-source alternative database that can integrate huge volumes of unstructured data. Web content and social-network data streams are typical of the kinds of data handled in Hadoop.

IBM Content Analyzer is a language-aware search and text analytics engine. The tool has been integrated with Big Sheet so it can explore and analyze unstructured data exposed therein. The two tools work together to intelligently spot relevant content with far greater precision than simplistic keyword-search tools.

In a pilot test conducted this spring, the Big Sheets-Content Analyzer combination turned up concise, highly targeted lists of partner prospects for two promising technologies developed at NCSU.

"The tools identified the same 10 to 15 companies that we identified with our traditional triage methods, but they also uncovered 25 additional companies, more than doubling the number of potential partners," Houghteling said.

To be precise, the Big Sheets/Content Analyzer combo didn't hand researchers 40 leads on a silver platter. Rather, prospects were winnowed down from relevance-prioritized list of about 1,000 candidate companies. Researchers focused on the first few hundred hits to get to the list of final candidates. But the bottom line is that the Big Sheets/Content Analyzer approach took a much shorter time to yield a more complete list of prospects.

"With these tools I'm confident we can take a triage process that took about four months to identify a licensee down to as little as one month," Houghteling said.

Big Sheets and the IBM Content Analyzer are now being deployed as part of a larger collection of tools and repositories that will enable the Office of Technology Transfer to quickly explore the data sources it has always used. The key difference is that they'll cut to the licensing stage in a matter of weeks instead of months.

Ironically, a well-known analytics language got its start at NCSU before it was commercially developed by SAS. There's even a SAS Hall on campus, but apparently the home-town tech vendor doesn't have a monopoly on NCSU's choice of analytics.



Related Reading


More Insights




Currently we allow the following HTML tags in comments:

Single tags

These tags can be used alone and don't need an ending tag.

<br> Defines a single line break

<hr> Defines a horizontal line

Matching tags

These require an ending tag - e.g. <i>italic text</i>

<a> Defines an anchor

<b> Defines bold text

<big> Defines big text

<blockquote> Defines a long quotation

<caption> Defines a table caption

<cite> Defines a citation

<code> Defines computer code text

<em> Defines emphasized text

<fieldset> Defines a border around elements in a form

<h1> This is heading 1

<h2> This is heading 2

<h3> This is heading 3

<h4> This is heading 4

<h5> This is heading 5

<h6> This is heading 6

<i> Defines italic text

<p> Defines a paragraph

<pre> Defines preformatted text

<q> Defines a short quotation

<samp> Defines sample computer code text

<small> Defines small text

<span> Defines a section in a document

<s> Defines strikethrough text

<strike> Defines strikethrough text

<strong> Defines strong text

<sub> Defines subscripted text

<sup> Defines superscripted text

<u> Defines underlined text

BYTE encourages readers to engage in spirited, healthy debate, including taking us to task. However, BYTE moderates all comments posted to our site, and reserves the right to modify or remove any content that it determines to be derogatory, offensive, inflammatory, vulgar, irrelevant/off-topic, racist or obvious marketing/SPAM. BYTE further reserves the right to disable the profile of any commenter participating in said activities.

Disqus Tips To upload an avatar photo, first complete your Disqus profile. | View the list of supported HTML tags you can use to style comments. | Please read our commenting policy.

Follow InformationWeek

By The Numbers

What Are Your Primary Concerns About Using Big Data Software?

Base: 417 respondents at organizations using or planning to deploy data analytics, BI or statistical analysis software
Data: InformationWeek 2013 Analytics, Business Intelligence and Information Management Survey of 541 business technology professionals, October 2012

What Do You Think?

What's your attitude about SQL analysis on top of Hadoop?
We want fast, standard SQL analysis capabilities on Hadoop ASAP
Hadoop is for unstructured data; SQL is for relational databases
We'll give SQL on Hadoop a try, but relational DBs will remain the mainstay
Given strong SQL support on Hadoop, we'd nix the data warehouse
We're not interested in Hadoop
No opinion



Related Content

From Our Sponsor

Five Big Data Challenges and How to Overcome Them with Visual Analytics

Five Big Data Challenges and How to Overcome Them with Visual Analytics

Business leaders often need a visual snapshot of data to quickly grasp and use it. This paper identifies five challenges in presenting data and how visual analytics can resolve them. Solutions are suggested to overcome the challenges of: speed, data clarity, data quality, displaying meaningful results, and dealing with outliers.

Game-Changing Analytics: How IT Executives Can Use Analytics to Create Innovation and Business Success

Game-Changing Analytics: How IT Executives Can Use Analytics to Create Innovation and Business Success

Today's competitive advantage requires a deeper understanding of your business, your market and your customers. As an IT executive, you can drive that knowledge transformation. In this white paper, learn how to make decisions as a strategic business leader and three steps to begin an analytics initiative within your enterprise.

Data Visualization Techniques: From Basics to Big Data with SAS Visual Analytics

Data Visualization Techniques: From Basics to Big Data with SAS Visual Analytics

High-performance data visualization turns sophisticated analyses into meaningful graphics, leading to faster and smarter decision making. In this white paper, learn how visual analytics can transform big data, with additional features such as real-time functionality, mobile compatibility, robust applications for technical groups and accessibility for nontechnical users.

Big Data: Lessons from the Leaders

Big Data: Lessons from the Leaders

Financial performance, competitive advantage, operational efficiency, strategic decision making - every business goal can extract value from big data, and the time for doubt or inaction has long passed. In this Economist Intelligence Unit report, in-depth interviews with data pioneers reveal the link between the effective use of big data and the bottom line among other results.

Decision-Driven Data Management: A Strategy for Better Decisions with Better Data

Decision-Driven Data Management: A Strategy for Better Decisions with Better Data

Which came first, the data or the decision? This white paper makes the case for having a decision in mind, then tailoring big data's volume, variety and velocity to achieve business results such as overcoming customer dissatisfaction or creating well-informed strategies in real time.

Informationweek Reports

Research: The Big Data Management Challenge

Research: The Big Data Management Challenge

The challenge of big data is real, but most organizations don't differentiate 'big data' from traditional data, and nearly 90% of respondents to our survey use conventional databases as the primary means of handling data. We'll help you understand what constitutes big data (it's not just size) and the numerous management challenges it poses.