The InformationWeek -- Blogs
Storage Blog

Topics:   Storage

When Controllers Fail


Posted by George Crump, Dec 9, 2009 10:05 AM

What are the chances of a controller failing in a storage system? I don’t know the exact statistic, but it’s safe to assume that they are pretty low. When controllers do fail, the ramifications can be extreme, especially in the increasingly virtualized data center that counts on shared storage. Active-active controllers provide protection from controller failure, but the term is a bit of a misnomer. Both controllers are in use, but each is assigned to specific workloads.


Most controller-based storage architectures have at least two controllers for redundancy; higher-end systems may have more. As stated earlier, the problem is that these controllers do not typically share a workload. Each controller is assigned a specific set of disks or LUNs to manage. For just those LUNs, that controller is responsible for responding to I/O requests, performing the XOR calculations for RAID, and providing any of the data services that the storage system offers, such as snapshots, thin provisioning or replication.

If a controller fails, access to the LUNs that were assigned to it is rerouted through the surviving controller. In the virtualized world this could mean the instant movement of the I/O requests of dozens of virtual machines to another controller. Of course, this other controller already had a series of workloads of its own to support. The result is that in a dual-controller system one controller is now doing the work of two, so your performance just got cut in half. In today’s environment, with RAID 6 and all the various data services that storage controllers provide, controllers may already be burdened and may not have the excess capacity to support the extra load without noticeable losses in performance to the user.
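The failover described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the controller names, the utilization figures and the `fail_over` helper are hypothetical, and load is modeled as a simple fraction of one controller's capacity.

```python
def fail_over(loads, failed, partner):
    """Return per-controller load after `failed` hands all of its LUNs to `partner`.

    `loads` maps a controller name to its utilization, expressed as a
    fraction of one controller's capacity (1.0 means fully busy).
    """
    if failed == partner:
        raise ValueError("a controller cannot fail over to itself")
    after = dict(loads)
    # The whole workload of the failed controller lands on a single partner.
    after[partner] = after[partner] + after.pop(failed)
    return after


# Hypothetical figures for a dual-controller system.
loads = {"A": 0.60, "B": 0.55}
after = fail_over(loads, failed="A", partner="B")

# Any controller asked to do more than 100% of its capacity is overloaded.
overloaded = {name: load for name, load in after.items() if load > 1.0}
print(after)
print(overloaded)
```

With these made-up numbers the surviving controller is asked for more than it can deliver, which is the situation the paragraph above warns about.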

Having four or more controllers does not really help in this situation, as the failed controller’s load still moves in full to one other controller. The exception could be if the storage system had the intelligence to move the LUNs to the least busy controller. The answer may be to have all the storage workloads already spread evenly across all the available controllers. For example, in a four-controller system, all four controllers respond to the I/O requests for all the LUNs in the storage system. If there is a controller failure, only 25% of the workload needs to be reallocated, and it is shared among the three survivors. Assuming that most systems do not run at a sustained 75% utilization, the failure should cause no noticeable performance loss to the applications.
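The 75% figure follows from simple arithmetic, which the sketch below works through. It assumes an idealized system in which the workload is spread perfectly evenly and every controller has the same capacity; the function names are hypothetical and do not come from any storage product.

```python
def post_failure_utilization(utilization, controllers, failed=1):
    """Utilization of each survivor after `failed` controllers drop out.

    Assumes the total workload stays the same and is re-spread evenly,
    so each survivor's load grows by controllers / survivors.
    """
    survivors = controllers - failed
    if survivors < 1:
        raise ValueError("at least one controller must survive")
    return utilization * controllers / survivors


def max_safe_utilization(controllers, failed=1):
    """Highest pre-failure utilization that keeps survivors at or below 100%."""
    if controllers - failed < 1:
        raise ValueError("at least one controller must survive")
    return (controllers - failed) / controllers


for n in (2, 4, 8):
    print(n, max_safe_utilization(n), post_failure_utilization(0.60, n))
```

Under these assumptions a four-controller system can lose one controller without saturating the rest as long as it was running at or below 75%, while a dual-controller system has that headroom only up to 50%.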

Delivering this type of capability will more than likely require a clustered or grid storage implementation in which the storage I/O workload is shared across all of the controllers or nodes in the system. Without that capability, storage managers should pay very close attention to their storage processor utilization. Anything above 50% on any of the controllers should be a cause for concern and possibly grounds for a hardware upgrade.
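As a rough sketch of that rule of thumb, the check below flags any controller whose utilization exceeds a threshold. The readings and the `controllers_at_risk` helper are hypothetical; real figures would have to come from whatever monitoring interface the storage system exposes.

```python
def controllers_at_risk(utilization, threshold=0.50):
    """Return the names of controllers running above `threshold`.

    `utilization` maps a controller name to its sustained utilization as a
    fraction (0.62 means 62%). The 0.50 default mirrors the 50% guideline.
    """
    return sorted(name for name, value in utilization.items() if value > threshold)


# Hypothetical sustained-utilization readings.
readings = {"controller-a": 0.62, "controller-b": 0.41}
print(controllers_at_risk(readings))
```

The threshold is a parameter because a system that spreads work across more controllers can tolerate a higher figure.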

Track us on Twitter: http://twitter.com/storageswiss

George Crump is lead analyst of Storage Switzerland, an IT analyst firm focused on the storage and virtualization segments. Find Storage Switzerland's disclosure statement here.
