Amazon S3 Slowed By Software 'Misconfiguration' - InformationWeek
8/11/2015
09:06 AM

Amazon S3 Slowed By Software 'Misconfiguration'

A mistake in the configuration of an S3 traffic management system produced severe latencies and elevated error rates for AWS S3 users early Monday morning.

Amazon Web Services reported a rare instance of a slowdown and poor performance of its S3 storage service in the early morning hours of Monday, Aug. 10. After initially applying the wrong fix, AWS backtracked and corrected the problem, getting S3 back in operational shape before 7 a.m. Monday Eastern time.

Amazon's Service Health Dashboard reported at 3:36 a.m. Eastern time that it was investigating elevated error rates on requests to S3, and 24 minutes later it reported it was trying to determine the root cause. The errors were occurring in what it termed the US Standard region, which it didn't define but which includes its most heavily trafficked site, US East in Northern Virginia.

At 4:52 a.m. Eastern the Amazon dashboard reported it was "actively working on the recovery process, focusing on multiple steps in parallel." Customers could expect to continue to see elevated rates of errors and wait times as they attempted to use the service, Amazon reported.

At some point, AWS realized it had identified the wrong root cause and backtracked to reassess and follow a different recovery route. The S3 slowdown was "due to a configuration error in one of the systems that Amazon S3 uses to manage request traffic," it reported.

After identifying the correct cause, AWS began to report recovery from the elevated error rates and latencies at 6:36 a.m. Eastern. By 6:46 a.m. it was able to report that the system was operating normally.

"We pursued the wrong root cause initially, which prompted us to try restorative actions that didn't solve the issue. Once we understood the real root cause, we resolved the issue relatively quickly and restored normal operations," the Service Health Dashboard reported.

The slowdown also affected other services that depend on S3 during the early morning hours, including Elastic MapReduce and the retrieval of customers' Amazon Machine Images from storage, the Service Health Dashboard said.

Charles Babcock is an editor-at-large for InformationWeek and author of Management Strategies for the Cloud Revolution, a McGraw-Hill book. He is the former editor-in-chief of Digital News, former software editor of Computerworld and former technology editor of Interactive ...

Comments
Gigi3,
User Rank: Ninja
8/11/2015 | 11:36:36 PM
AWS S3 performance
"Amazon Web Services reported a rare instance of a slowdown and poor performance of its S3 storage service in the early morning hours of Monday, Aug. 10. After initially applying the wrong fix, AWS backtracked and corrected the problem, getting S3 back in operational shape before 7 a.m. Monday Eastern time."

Charles, yes, you are right. On Monday S3 performance was poor and a bit slow. Any idea why it happened? We thought it was because of some network issues.
CharlesB21101,
User Rank: Strategist
8/11/2015 | 12:49:09 PM
Any further comment, explanation coming?
S3 is the linchpin of AWS services. When it isn't functioning right, other services are affected and one of the main reasons that customers come to Amazon is at risk. I expect the company will have something more to say about this incident, along with a possible acknowledgement that this slowdown was an inconvenience to customers.