Semantic Web Business: Going Nowhere Slowly
Newest First  |  Oldest First  |  Threaded View
Page 1 / 2   >   >>
User Rank: Apprentice
3/28/2014 | 12:58:26 AM
The vital role of ordinary people
A prime concern should be how to give ordinary people an incentive to put their data on the web. The answer must lie in making their work easier e.g. with linkable data on the web rather than dead data in spreadsheets. Talk of RDFs, URIs just puts off people who have this type of data, and is not needed here.

Here is an open email to Phil Archer posted on W3C forum that outlines the rationale:

With a significant part of the semantic web designed jointly by practitioners and technologists, practitioners get the chance to benefit from automated tasks; technologists get the chance to apply their skills for tasks they otherwise would not know exist.
User Rank: Apprentice
1/23/2014 | 1:01:51 PM
Re: The point is not that there's nothing to SemWeb...
14 year ago I was a semantic web researcher and evangelist. 7 years ago i started to become more sceptical and wondering where the industrial uptake was. Now i am neutral and looking for evidence for both views. I am co-chairing the industrial track at the next International Semantic Web conference, so curious what comes out ( I still believe in the grand vision of the Semantic Web, but unclear on what technologies or communities will finally make it happen. 

My observations so far are:
- The SemWeb community has not done a too good job in explaining it to business so businesses clearly understand the implications such as benefits versus TCO. Ask any CIO or CTO of a large corporation on SemWeb or SOA and how much money s/he has invested in them (successfully or not, that is another issue).
- SemWeb seems to be out there (see Frank's list) but the fact that only a few people know such extended lists also says something. To me it says that SemWeb is still with early adopters (BBC), not in the mainstream market (when you wouldn't even think of generating a list). I suspect that several on the list are more referring to trials of the technology in industry than to operational systems. 
- There is a "categorisation" issue. All agree that is out there, but the sceptical says it's not SW while the advocates say it is. Same for Google Graph, FB Graph, etc. I would say the real question is whether those major uptakes of SemWeb-like technology are thanks to the SemWeb community and dedicated (government) investments, or whether they come from alternative efforts with a pragmatic business perspective.  I think the latter, but actually i don't know.
- Semantic Web is on the Gartner Hype Cycle curve since at least 2006. And the estimation is every year that it takes more than 10 year to reach productivity, also in 2013. 
User Rank: Apprentice
1/11/2014 | 10:10:12 AM
Narrow views of the Semantic Web
I think that I too would feel that the Semantic Web was a poor solution if I saw it only as an "alternative vision" to sentiment analysis. The problem as I see it is that the Semantic Web is so much more. I have not personally worked with it in connection with Big Data, but regardless of Grime's views Gartner has just placed Big Data as the driver of Semantic Web and identified the pair as transformative technologies within the not so distant future. (Transformative is the highest rated impact of the technologies analyzed.) Gartner seems to have until recently seen Semantic Web only in terms of the vision of a Global Graph, which it saw as unattainable or at least to far into the future to be of interest. Its realization that it has practical here and now utility has jumped it from ascending towards the peak of inflated expectation and skipped it right over to the trough of disillusionment. David Siegel's story is sad, but what does it really tell us? I have had similar difficulties. I nevertheless still feel that there is tremendous benefits to be reaped from its use. What would be helpful would be to see more discussion about how to convey these benefits.
User Rank: Apprentice
1/8/2014 | 9:09:57 PM
Re: Always Just Beyond The Horizon
I know he's just being provocative to evoke comments, but Seth is usually more insightful than this. I think he's using the ad-hoc social analytics hammer and seeing everything as a nail. And his research on "semantic web" as he so narrowly defines it seems to have stopped in 2006. This is no better than Shirky's old rant on filter failure.

The semweb philosophy has evolved well beyond linked data, and many of the practitioners are many of the same proprietary vendors and services Seth cites. They're just offering opaque SaaSes or APIs, and the stuff is under the covers. So some of the standards haven't panned out, and the dreamy open source, open data vision hasn't been fulfilled--that doesn't mean the tech isn't being used.

Seth seems to be focused on perishable data because that's where the most growth and chaos is, and that's fine. but the data that's not so perishable merits a lot more care. Why did people go through the pain of XBRL? To get beyond the limits of provincial data and into more global reusability that's truly reliable. Similarly, people who've studied semweb methods enough to make best use of them and have endured some pain are seeing scalability benefits when it comes to auto curation for example. See It takes time and expertise to build systems like this one.

Are you guys looking at DAM at all? There are use cases like the Magnum Photo case described in a sidebar here: Involves a clever use of crowdsourced image tagging in a gamified environment. I'm not sure how you'd scale decent search and discovery in huge online photo repositories if you didn't use a method like this.

Another person in this thread cites a dozen case studies that go beyond the vague decks you're describing.

The text analytics methods Seth points to are valid, but they don't go far enough. They need to be used in conjunction with other methods when it comes to non-perishable data/content. What's blooming now is a heterogeneous approach to schema--fixed on the RDBMS side, dynamic, multiple, and schema-on-read on the NoSQL side, and optional, shared and collaboratively built a la I think the bottom line is that the tools and methods are finally starting to fit the jobs, and there's a widening selection of them.

You have posted helpful pieces on NoSQL data use cases in the past. Just think you should look at what's happening on the content processing side once in a while. They're building on top of what's possible with NLP engines, for example. There's more back and forth between the NoSQL and semweb communities than there used to be, and the result could be standards such as JSON-LD that are more aligned with the way developers work.
User Rank: Apprentice
1/8/2014 | 3:15:56 PM
Re: The point is not that there's nothing to SemWeb...
At Google we're seeing markup (an RDF vocabulary) on 5+ million domains - from big names to the long tail. Pizza shops, museums, TV shows, opening hours, research papers, job listings, company logos, govt datasets... across broad range of areas, every walk of life. 

In my view it's a mistake to contrast some perceived "classical", stilted and fragile notion of Semantic Web against the strawman rival of newer flexible/pragmatic approaches. All communities have internally some such axis against which their efforts can be plotted, and variations on the neat/scruffy distintion. The most successful RDF work has always had a pragmatic, hacker side to it, and embraced tooling (NLP, machine learning, databases, ...) from a variety of fields.  It's not a competition - use all the tools that make sense to get a job done! 
User Rank: Apprentice
1/8/2014 | 11:48:40 AM
Re: The point is not that there's nothing to SemWeb...
Hi Seth,

I'm afraid I must take strong exception to your position.

I have heard people redefine the Semantic Web for years in some narrow way so they can say it has failed.  The most fallacious redefinition is that it requires "perfect" (top-down) data definition.  This is exactly contrary to the definition of the Resource Description Framework.  RDF expects people to lie and to be mistaken. It expects and allows for dirty data.  Any other presumption would simply not match the real world.  It specifically allows and encourages bottom-up creation of distributed data.

Quoting the RDF 1.1 Primer, "RDF is intended for situations in which information on the Web needs to be processed by applications, rather than being only displayed to people."  Its use is currently being wildly deployed and quite successful, as pointed out by Frank and Amit (and others, if you ask Google).

Also, it was the Semantic Web that gave us SPARQL, the only standard query language for distributed data.  SPARQL is being widely adopted by vendors for the simple reason that SQL will never allow cross-implementation queries.

I'd love to see you absorb Frank's and Amit's comments and write a more balanced article.  Would you consider doing so?
User Rank: Apprentice
1/8/2014 | 9:49:10 AM
Re: Always Just Beyond The Horizon
Google Knowledge Graph is an example.  It's a graph knowledge base that is isomorphic to RDF. In fact, Google now provides the download files of the open source version, Freebase, as RDF.
User Rank: Apprentice
1/8/2014 | 9:35:47 AM
Re: The point is not that there's nothing to SemWeb...
Ironic to see quoted as a signal that Semantic Web technology will never work....  It is semantic web technology. "No systems using the information from"? Bing, Yahoo and Google harvesting doesn't count? Hmm... 

And what about Google's Knowledge Graph? It's a harvested/copied-and-then-curated RDF graph, now used to power Google's front-page. See e.g. righthand-side of

Reading "one of these slideshows to confirm a suspicion" that "there are no names of actual companies putting them into practice"? "Did anyone really back that vision in a serious financial way?" "Only useful for information enrichment in certain domains"? I've apparently been too slow in updating my "Semantic Web Good News Show" (at ), so here goes with a bunch of recent updates. In no particular order:

Who's using the GoodRelations product vocabulary and markup?
  • Google
  • Yahoo!
  • BestBuy
  • (15 Million items)
  • (250,000 items)

... and 10,000 more ( )

Both Oracle's DB and IBM's DB2 implement semantic web datamodels and protocols

Pretty much every webpage on the BBC website (heard of them?) now hits an RDF triple-store. Examples:,, 60.000(!) BBC news items annotated with RDF (, etc.

NXP is one of the world's biggest makers of microprocessors (4.3b$). On they have data on 26.000(!) products, internal triplestore (Dydra, 250K entitles, 2.5m triples)  to drive a website, this is externally available, to make it part of a broader ecosystem.

Renault publishes configuration options for its cars in RDF

Electricity de France generates 300.000 personalised energy bills using SemWeb technology:

New York Times publishes Linked Open Data (adds from celebreties) goes RDF

Monster Board goes RDF:

Bill and Melinda Gates foundation goes RDF:

Let me know if you want more, I can supply dozens more of these. But of course no number may ever be enough if "you've been a semantic web skeptic for years"....
IW Pick
User Rank: Apprentice
1/7/2014 | 4:40:45 PM
The point is not that there's nothing to SemWeb...
My point is not that there's nothing to SemWeb. I'll repeat a couple of sentences from the opening paragraph: "SemWeb is a narrowly purposed replica of a subset of the World Wide Web. It's useful for information enrichment in certain domains, via a circumscribed set of tools." My point is that there's much, much more available, providing similar capabilities, via more dynamic technologies and methods.
User Rank: Author
1/7/2014 | 2:58:15 PM
Re: Always Just Beyond The Horizon
"We will never achieve its ideal universe of neatly marked up data..." Did anyone really back that vision in a serious financial way?
Page 1 / 2   >   >>

The Business of Going Digital
The Business of Going Digital
Digital business isn't about changing code; it's about changing what legacy sales, distribution, customer service, and product groups do in the new digital age. It's about bringing big data analytics, mobile, social, marketing automation, cloud computing, and the app economy together to launch new products and services. We're seeing new titles in this digital revolution, new responsibilities, new business models, and major shifts in technology spending.
Register for InformationWeek Newsletters
White Papers
Current Issue
InformationWeek Tech Digest - July10, 2014
When selecting servers to support analytics, consider data center capacity, storage, and computational intensity.
Flash Poll
Twitter Feed
InformationWeek Radio
Archived InformationWeek Radio
Join InformationWeek’s Lorna Garey and Mike Healey, president of Yeoman Technology Group, an engineering and research firm focused on maximizing technology investments, to discuss the right way to go digital.
Live Streaming Video
Everything You've Been Told About Mobility Is Wrong
Attend this video symposium with Sean Wisdom, Global Director of Mobility Solutions, and learn about how you can harness powerful new products to mobilize your business potential.