Software // Information Management
Commentary
2/3/2014
09:06 AM
Doug Henschen
Doug Henschen
Commentary
Connect Directly
Google+
LinkedIn
Twitter
RSS
E-Mail

Big Data's 2 Big Years

During the past two years, Hadoop hit the big time, in-memory arrived, and practitioners saw data analysis deliver real results.

Comment  | 
Print  | 
Comments
Threaded  |  Newest First  |  Oldest First
D. Henschen
50%
50%
D. Henschen,
User Rank: Author
2/3/2014 | 12:33:38 PM
Jury is still out on Hadoop
It's still early days for new platforms like Hadoop and (soon to be covered in a separate collection) NoSQL and NewSQL databases. According to recent research by Gartner, nearly half of practitioners in a survey said they're unsure they'll get value from the platform. About 30% of enterprises that have invested in big data technologies including Hadoop and NoSQL database, according to the data.

Breaking down the Hadoop users, Cloudera says 1/3rd are just getting started and experimenting, 1/3 are using the platform for specific, mission-critical workloads, and 1/3 are maturing into multiple production applications and are actually using the platform to replace incumbent data-management tech (like databases, ETL etc.). But back to Gartner's stats, 31% have no plans to roll out a big data project - about the same % as in 2012. Long story short, Hadoop is far from established as a must-have enterprise data-management platform.

 
JeanL536
50%
50%
JeanL536,
User Rank: Apprentice
2/4/2014 | 6:01:25 PM
Re: Jury is still out on Hadoop

As Dr Victor Tang and MIT pointed out.. " I see NO NEW SCIENCE HERE!! only more data to work with!" and I agree with him as well as several others.. in his words.. "slapping together a bunch of bottle rockets with string does not make it a jet engine!!" Creating ever larger and larger containers for data (which is what all present technologies do, save one) by definition will not scale.. a perfect example of this is Bitcoin hash chain.. 2.5 exabytes and counting..? takes days to get any days and 50k $ machines to get any data out of it.

The BIG DATA truth of all this is really quite simple.. there is a reason for the existence theoretically of 6 normal forms of normalization .. it was not put there by accident.. only one or two systems have reach the 4th normal form. and only one has reached the 6th normal form.. the FUTURE is EFFICIENCY not SIZE.. Efficiency grants you speed, simplicity, and capacity.. where anything else simply cannot compete

Associative Data Management and Knowledge Operating System using a Data Instance centric architecture, where Data Instances are typically atomic. Each Data Instance can be at the center with all its associations. The base structures encapsulate the Data Instances and can generally be identical in form and function, and application independent. Encapsulate references can include references to all other directly related


independently encapsulated Data Instances. The encapsulated references can be both unique identifiers for each and every associated Data Instance and also logical indexes that encode the abstracted location of each Data Instance, making it possible to both identify and locate any Data Instance using the same reference key.

http://www.atomicdb.org/

It's been 50 years since Ed Codd announced his famous 12 Rules on how to store data in 2-dimensional tables and gave birth to the Relational Database


In the last 50 years, everything in the world of data management technology has evolved with dizzying speed, except this one area. We still build most of our information systems with 50-year old technology that causes enormous frustration, cost and struggle


Fifty years later, we have invented and proven a technology that easily captures information will all the context and relationships that the organization wants. In this new world
That's not all there is to this new technology. There are other capabilities that make it essential and mandatory for the next 50 years:


  • the ability to change and evolve the system on the fly
  • the ability to connect organically with any other system built in the same technology (solving the warehouse problem)
  • the ability to store and transmit information with a much higher level of inherent security
The Agile Archive
The Agile Archive
When it comes to managing data, donít look at backup and archiving systems as burdens and cost centers. A well-designed archive can enhance data protection and restores, ease search and e-discovery efforts, and save money by intelligently moving data from expensive primary storage systems.
Register for InformationWeek Newsletters
White Papers
Current Issue
InformationWeek Tech Digest, Dec. 9, 2014
Apps will make or break the tablet as a work device, but don't shortchange critical factors related to hardware, security, peripherals, and integration.
Video
Slideshows
Twitter Feed
InformationWeek Radio
Archived InformationWeek Radio
Join us for a roundup of the top stories on InformationWeek.com for the week of December 7, 2014. Be here for the show and for the incredible Friday Afternoon Conversation that runs beside the program!
Sponsored Live Streaming Video
Everything You've Been Told About Mobility Is Wrong
Attend this video symposium with Sean Wisdom, Global Director of Mobility Solutions, and learn about how you can harness powerful new products to mobilize your business potential.