Social Science Site Using Azure Loses Data - InformationWeek

Dedoose, a data analytics system, suffered a failure on Azure that may mean three weeks of lost data for customers.

Dedoose, a social science data company, has lost some of its customers' data while operating on the Microsoft Azure cloud. "This is a horrible moment for our company. We have never lost data or had a breach," said Dedoose president Eli Lieber in an interview.

At best, the company will be able to restore data stored through April 11, Lieber said, and perhaps only up to March 30. For an uncertain minority of customers, data added to their accounts after April 11 has been lost, said Lieber.

"It's impossible to say" how many customers have lost data because the firm doesn't monitor how customers are using their accounts, he continued. "Only users can assess losses in their projects, not Dedoose," he said, but the company is doing everything it can to help them recover data.

The company, founded in 2006 in Manhattan Beach, Calif., provides social scientists with an online analytics application, EthnoNotes, used by both commercial and academic researchers. Lieber said the application is used by marketers, pharmaceutical company researchers, academic social scientists, and others.

Dedoose offers its EthnoNotes application entirely on the Microsoft Azure cloud, and offers a free cloud storage service as backup storage. Many customers put data into Word documents and Excel spreadsheets. Through its consulting services, Dedoose will help customers retrieve that data and rebuild their databases when other sources aren't available.

The Los Angeles Times reported that Dedoose officials sent an email Monday saying that both its operational and storage systems had failed. "The timing of this event was such that our entire data storage container was corrupted -- including the master database and all local backups," the company wrote in the email.

"Within minutes of discovering the problem, we contacted Microsoft Azure support. Unfortunately, Microsoft was unable to recover these data... from its servers," the Dedoose email to customers said. Microsoft officials couldn't be reached for comment.

"Since our initial communication about this issue, we have also learned that the backup files stored to an independent location were also corrupted. We are working with Gladinet services to have the non-corrupted data transferred to alternative storage locations and restore the integrity of these files," the message continued, according to the L.A. Times.

Asked who was responsible for the data loss, Azure or Dedoose, Lieber said: "It's a complicated situation." Dedoose produced its EthnoNotes system and installed it on Azure, but didn't expect that its data could be lost when it failed. He declined to estimate the respective shares of responsibility. Dedoose is still investigating why the system failed. Lieber is one of the authors of the EthnoNotes system.

"Things happened on the platform that we were blind to. Similar events have happened (to other Azure customers) and the data was not recoverable," Lieber said, without naming any other Azure users who suffered a similar failure.

Dedoose is revising its all-cloud approach to include a redundant system separate from the operational one on Azure. "We are putting measures in place so that if the platform is destroyed again, we will live on," said Lieber. "Within two weeks, our systems will be tremendously more robust."

But he added, "Ultimately, we are the responsible party. As we learn more about what happened, we are taking steps to protect our users." Lieber declined to say how many customers were affected or how many customers in total use EthnoNotes. But he said the number affected, while a minority, is higher than the "dozens of customers" suggested by another description.

Charles Babcock is an editor-at-large for InformationWeek and author of Management Strategies for the Cloud Revolution, a McGraw-Hill book. He is the former editor-in-chief of Digital News, former software editor of Computerworld and former technology editor of Interactive ...

User Rank: Strategist
5/14/2014 | 3:56:54 PM
Seems a bit disingenuous to name Azure so prominently
I don't think this article would have been posted without Azure being in the picture, yet it seems fairly clear from the description that Dedoose was treating Azure like a regular VPS/dedicated hosting provider instead of actually architecting for the cloud and failure.  "Storage and system went down at the same time?" I don't think you're supposed to be shocked about that if you have a clue what you're doing--that's a bit like saying, "My laptop was stolen AND so was the hard drive inside it".  Likewise, there's zero excuse on a provider if you aren't testing your backups.  You should constantly be checking your backups.

Perhaps the only reason to feature this type of incident and highlight the "cloud" aspect of it would be to point out that companies are still completely clueless about how to architect their applications for the cloud--and, in particular, blame their cloud providers for issues that are 100% their own fault.  (And absent any additional information, that's the right way to apportion blame here: 0% Azure, 100% Dedoose).
Charlie Babcock,
User Rank: Author
5/14/2014 | 1:35:27 PM
Rumors of its death may be highly exaggerated
I don't know how recoverable the data is by users themselves. This is social science project data, not necessarily business transactions with customer accounts screwed up and revenue lost. I suspect some of it is recoverable or can be reconstructed by means other than the usual backup and recovery. But I don't know for sure. The company may survive this because of the value of its analytics application, EthnoNotes.
Andre Leonard,
User Rank: Apprentice
5/14/2014 | 1:04:58 PM
Catastrophic Failure..
There are certain events in business which are catastrophic. You reach a point you can never recover from. This is one of them. Learn from your mistakes, rebrand yourself and move on.