Amazon Cloud, Analytics Help Researchers Fight Famine - InformationWeek

InformationWeek is part of the Informa Tech Division of Informa PLC

This site is operated by a business or businesses owned by Informa PLC and all copyright resides with them. Informa PLC's registered office is 5 Howick Place, London SW1P 1WG. Registered in England and Wales. Number 8860726.

9/23/2015
10:05 AM

Amazon Cloud, Analytics Help Researchers Fight Famine

Researchers in the US and China explore rice genomes with AWS analytics tools to develop drought- and disease-resistant crops.

In a gamble against time, Amazon Web Services is making the genomes of 3,024 varieties of rice publicly available in the cloud so that researchers can use the data to come up with the strains most likely to prove resistant to heat, drought, and disease.

Rice supplies 20% of the calories consumed worldwide, but yields are falling in many rice-growing regions where temperatures run above average during the crop's growing season.

Researcher Jan Leach says warmer temperatures also make the diseases that affect rice more virulent at the same time that the plants' inherent resistance to disease appears to be compromised. If warmer temperatures are here to stay, then the nature of the world's rice crop must evolve quickly to meet changing conditions and avoid widespread hunger, she said in an interview.

"You get more diseases at high temperatures and the plant's inherent resistance is not as effective," she said from her office at Colorado State University in Fort Collins, Colo., where she is a distinguished professor in plant research.

A coalition of researchers, including the International Rice Research Institute based in the Philippines and the Chinese Academy of Agricultural Sciences in Shenzhen, is behind an effort to keep the world's rice fields productive. Out of 100,000 varieties of rice, they selected the 3,024 with distinctive characteristics believed to be tied to the crop's future. Researchers in China then unraveled the genome of each variety, a monumental effort that captured 30 million variations in the varieties' nucleotides.

That amounts to 120 TB of data, now stored for free on Amazon Simple Storage Service (S3) and available to researchers around the world. "It's a huge amount of data. It's not like you're going to download it. You can't just email it around," said Leach.
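
To put Leach's remark in perspective, here is a rough back-of-the-envelope sketch of how long a 120 TB download would take. The link speeds are illustrative assumptions, not figures from the researchers or from Amazon, and the calculation ignores protocol overhead and real-world throughput drops.

```python
# Rough transfer-time estimate for the 120 TB rice genome data set.
# The link speeds below are illustrative assumptions, not measured values.

DATASET_BYTES = 120 * 10**12  # 120 TB, using decimal terabytes


def transfer_days(size_bytes: int, link_mbps: float) -> float:
    """Return the days needed to move size_bytes over a link of link_mbps megabits per second."""
    bits = size_bytes * 8
    seconds = bits / (link_mbps * 10**6)
    return seconds / 86_400  # seconds per day


if __name__ == "__main__":
    for label, mbps in [("100 Mbps", 100), ("1 Gbps", 1_000), ("10 Gbps", 10_000)]:
        print(f"{label}: about {transfer_days(DATASET_BYTES, mbps):.1f} days")
```

Under these assumptions, a sustained 100 Mbps link would need roughly 111 days, which is why running the analysis next to the data in the cloud is more practical than copying it.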

But with the data available, research that's been extremely difficult to do in the past may become possible. By comparing the genomes and looking for common characteristics among the different varieties, researchers can narrow the gene sequences they need to examine to pursue certain crop qualities.

(Image: gmutlu/iStockphoto)

"If we know this gene functions to give disease resistance in this variety, we can ask, why isn't it giving resistance in these other varieties?" asked Leach. Instead of needing to examine 1,500 nucleotides in each variety, researchers might be able to narrow the field to 5, based on the 3,024 reference genomes.

By examining the context and surrounding nucleotide sequences, researchers may find a way to turn on resistance in additional varieties. Likewise, they can examine "snips" (SNPs, or single-nucleotide polymorphisms, which are single-base differences between genomes) for such characteristics as heat resistance, drought resistance, taste, and nutritional value.

Much of the world's rice crop is grown underwater or on heavily irrigated land. India, which produces 20% of the world's rice, was predicted to lose 10 million tons of output in 2008-2009, a drought year. If droughts become more common, rice that can survive dry conditions may become the mainstay for significant segments of the world's population.
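
The kind of comparison described above can be illustrated with a toy sketch: line up the same stretch of sequence from several varieties, find the positions where they differ by a single base, and see which of those positions track a trait. The variety names, sequences, and resistance labels here are made up for illustration, and this is not the consortium's actual pipeline, which works on far larger alignment files with dedicated tools.

```python
# Toy illustration of comparing aligned sequences across rice varieties.
# Variety names, sequences, and the resistance labels are hypothetical.

from collections import Counter

aligned = {
    "variety_a": "ATGCCGTA",
    "variety_b": "ATGCTGTA",
    "variety_c": "ATGCTGTA",
    "variety_d": "ATGCCGTT",
}
resistant = {"variety_b", "variety_c"}  # hypothetical trait labels


def variable_sites(seqs: dict[str, str]) -> dict[int, Counter]:
    """Return {position: base counts} for positions where the varieties differ."""
    length = len(next(iter(seqs.values())))
    if any(len(s) != length for s in seqs.values()):
        raise ValueError("sequences must be aligned to the same length")
    sites = {}
    for pos in range(length):
        counts = Counter(s[pos] for s in seqs.values())
        if len(counts) > 1:  # more than one base observed at this position
            sites[pos] = counts
    return sites


def sites_tracking_trait(seqs: dict[str, str], with_trait: set[str]) -> list[int]:
    """Return variable positions where one base occurs in every trait carrier and in no other variety."""
    hits = []
    for pos in variable_sites(seqs):
        carrier_bases = {seqs[name][pos] for name in with_trait}
        other_bases = {s[pos] for name, s in seqs.items() if name not in with_trait}
        if len(carrier_bases) == 1 and not (carrier_bases & other_bases):
            hits.append(pos)
    return hits


if __name__ == "__main__":
    print(sorted(variable_sites(aligned)))           # positions that vary
    print(sites_tracking_trait(aligned, resistant))  # positions that track the trait
```

In this toy data, two positions vary but only one of them separates the hypothetical resistant varieties from the rest, which mirrors the idea of narrowing many candidate nucleotides down to a handful worth examining.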

Marco van den Berg, CIO of the International Rice Research Institute, said in an email interview that no other food crop has been subject to this level of genomic analysis before. "Public availability (of the results) speeds up research and stimulates the availability of additional sequences. The availability of the data opens venues to do research which we probably haven't even imagined yet," he wrote.

"AWS has made it easier to compare the different genomes," said Leach. An initial analysis of the 3,024 genomes has been conducted using the DNAnexus analysis platform on Amazon to study and compare the varieties. Amazon used 37,000 CPUs over two days to conduct the initial analysis. Having that data available to researchers worldwide excites Leach, who has worked in plant research for 30 years.

In addition, Amazon has made open source analytical tools, such as the command-line SAMtools and the user interfaces iobio and Galaxy, available as part of the data set, she said.

"You don't need to be a high-powered compute center to inspect the data," she said. Introductory use of the data is free. At some level, Amazon will charge users (like other cloud customers) for the use of compute, memory, and CPUs. Researchers at cost-conscious institutions will probably learn how to use Amazon Spot Instances, which are available on idle processors to low bidders during periods of low demand, such as the middle of the night.

In fact, the placement of the data and the availability of both the sophisticated DNAnexus platform and open source tools amount to a giant experiment being conducted for the first time in the cloud.

No other food crop has had so many sequences published together, and no one is sure what to expect in the way of results.

But Leach and van den Berg know the depth of interest among agronomists and crop researchers. Leach said her co-researchers in grain crops such as wheat, oats, barley, and maize will be following the outcomes as a gauge of what could be done in their own fields.

"Having this genome information is a very important step … We will help other fields by making this data available to everybody," she said.

Charles Babcock is an editor-at-large for InformationWeek and author of Management Strategies for the Cloud Revolution, a McGraw-Hill book. He is the former editor-in-chief of Digital News, former software editor of Computerworld and former technology editor of Interactive ...

Comments
kstaron,
User Rank: Ninja
9/29/2015 | 3:12:55 PM
Re: Goal is to leave nature's sequencing intact
I imagine that some may work to use the genome information to create a plant through selective breeding and others may want to genetically modify the rice. I hope that, no matter what happens, the new rice is "open source," meaning that it is not owned by a particular entity. I would like to see the end effort of this be a humanitarian one and not a commercial one.
impactnow,
User Rank: Author
9/29/2015 | 11:45:41 AM
Re: Goal is to leave nature's sequencing intact
Charlie, that is great. Hopefully those who seek to genetically modify foods for other purposes can learn from this important work. Poverty in today's era is a travesty, and getting this type of science behind food provision is one of the key variables that can influence it.

Charlie Babcock,
User Rank: Author
9/28/2015 | 7:07:46 PM
Goal is to leave nature's sequencing intact
Impactnow, you make an important point. There are many people wary of the gene splicing that goes into Monsanto seed-engineering efforts for such purposes as greater resistance to pesticides. I don't think the International Rice Research Institute is conducting its research on the rice genomes for purposes of gene splicing. Rather, it and the members of the consortium would like to identify which traits are active in some varieties and the sequences that signal they are active. Then they might be combined through cross pollination or selective breeding, leaving nature's gene sequencing intact.
impactnow,
User Rank: Author
9/28/2015 | 1:24:39 PM
Gene modification

Fascinating look at how technology can impact crops. However, as you are probably aware, there is controversy around genetically engineered food and possible unknown health effects. Do you think these efforts will be influenced by the movement away from genetically engineered foods?

Charlie Babcock,
User Rank: Author
9/24/2015 | 4:11:23 PM
About those drought resistant strains.....
California farmers haven't taken much interest in rice varieties heretofore, but they may get more in tune with rice genetics if the drought continues. 25% of rice fields are fallow due to the drought. On the 431,000 acres under cultivation, rice requires 5.1 feet of irrigation on the field during the growing season. Other users are bidding for that water at $700 per acre-foot, which means rice farmers could make a lot of money without planting. Few jobs are associated with the crop. With almonds, 1,000 acre-feet of irrigation translates into 6 jobs. With rice, which is seeded from the air and harvested by large machines, it's just one job. Half the crop is shipped overseas, mainly to Japan, where Calif. medium-grained rice is used in making sushi.
Charlie Babcock,
User Rank: Author
9/23/2015 | 1:24:35 PM
U.S. doesn't know much about rice variety
While there are thousands of varieties of rice grown around the world, only about 10 "if that many" are grown in the U.S., Leach told me. The two prominent crops are long grain, sticky rice grown in Louisiana, Texas and other Southern states and short grain, non-sticky rice grown in California. California exports much of its crop to Japan, where tastes appreciate its variety.