Indexing Cloud Storage - InformationWeek

1/18/2010
11:05 AM
George Crump

Indexing Cloud Storage

Cloud storage may end up being the great storage repository in the sky, the destination that holds all our data and gets it off of our local storage. Whether you use it as a fourth tier of storage that your internal archive spills over to or as your sole archive, someday you are going to need to find data in it. Should we be indexing cloud storage to find the needle in the haystack?

As we discuss in our article "The Importance of a Cloud Storage API", some cloud storage providers can solve the cloud storage indexing problem up front by offering API sets that allow information to be tagged as it is moved into the cloud. This metadata lets you record information about the information, so that when it comes time to retrieve the data you can supply keywords to help find it. The challenge with an API set is that an independent software developer needs to integrate it into their solution. These providers have either launched Independent Software Vendor (ISV) programs to help popularize this concept or are working on them.
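To make the tagging idea concrete, here is a minimal Python sketch. It does not use any real provider's API; the `TaggedArchive` class, its `archive` and `search` methods, and the object names are all hypothetical stand-ins for whatever a provider's API set exposes. It only illustrates the general pattern: attach keywords when the data goes in, and keep an inverted index so that keywords lead back to the stored objects later.

```python
from collections import defaultdict


class TaggedArchive:
    """Hypothetical in-memory stand-in for a cloud store that accepts tags."""

    def __init__(self):
        self._objects = {}              # object key -> (data, keywords)
        self._index = defaultdict(set)  # keyword -> set of object keys

    def archive(self, key, data, keywords):
        """Store data and record its keywords at the moment of archive."""
        normalized = {k.strip().lower() for k in keywords if k.strip()}
        # If the key is being re-archived, drop its old index entries first.
        if key in self._objects:
            for old in self._objects[key][1]:
                self._index[old].discard(key)
        self._objects[key] = (data, normalized)
        for word in normalized:
            self._index[word].add(key)

    def search(self, *keywords):
        """Return the sorted keys of objects tagged with every given keyword."""
        wanted = [k.strip().lower() for k in keywords]
        if not wanted:
            return []
        matches = set(self._index.get(wanted[0], set()))
        for word in wanted[1:]:
            matches &= self._index.get(word, set())
        return sorted(matches)


store = TaggedArchive()
store.archive("q4-budget.xls", b"...", ["Finance", "2009", "budget"])
store.archive("q4-forecast.doc", b"...", ["finance", "forecast"])
print(store.search("finance"))            # both objects
print(store.search("finance", "budget"))  # only the budget spreadsheet
```

Keywords are normalized to lowercase in this sketch so that "Finance" and "finance" find the same data; a real provider may treat case differently.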

The value of tight integration with ISVs is that you can fill in the keywords at the moment of archive. Check a box, fill in the keywords and click archive. This is when the keyword information is probably going to be the most accurate, because it is fresh in the mind of the person doing the archive. In fact, the application itself may be able to supply the keyword data. Recall, when needed, could then be done right through the application or via a standalone interface provided by the cloud storage provider. The value of the latter is that the search should work across all applications that archive to the provider.
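As one illustration of an application supplying the keyword data itself, the sketch below derives keywords from fields the application already knows about a record. The field names, the `derive_keywords` function, and the small stop-word list are assumptions made for this example, not part of any vendor's product.

```python
import re

# Small illustrative stop-word list; a real application would tune this.
STOP_WORDS = {"a", "an", "and", "the", "of", "for", "to", "in", "on", "re"}


def derive_keywords(fields):
    """Build a sorted keyword list from an application's own record fields.

    `fields` maps hypothetical field names (such as "subject") to text values.
    """
    keywords = set()
    for value in fields.values():
        for word in re.findall(r"[a-z0-9]+", value.lower()):
            # Skip stop words and single characters, which make poor keywords.
            if word not in STOP_WORDS and len(word) > 1:
                keywords.add(word)
    return sorted(keywords)


email = {
    "subject": "Re: Budget for the Denver office",
    "sender": "pat@example.com",
}
print(derive_keywords(email))
```

Keywords derived this way could be offered as defaults that the person doing the archive confirms or edits, rather than replacing hand-entered keywords outright.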

To ease adoption, many suppliers have added a NAS gateway or have used standard Internet-friendly protocols like WebDAV or REST to allow access to the cloud-based storage. The NAS gateway approach should allow indexing solutions from Index Engines, Kazeon (now part of EMC) and others to index the cloud storage as if it were a normal file system. Test this type of solution first to understand what the ramifications might be.
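Because a NAS gateway presents cloud storage as an ordinary file system, even a simple crawler can index it. The sketch below is not how Index Engines or Kazeon work; it is a minimal illustration using only Python's standard library, and the mount path is a hypothetical placeholder. It indexes file names only and never opens the files, since reading contents through a gateway may pull data back from the cloud, which is one of the ramifications worth testing.

```python
import os
import re
from collections import defaultdict


def index_file_names(root):
    """Walk `root` and map each word in a file name to the paths that use it."""
    index = defaultdict(set)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            for word in re.findall(r"[a-z0-9]+", name.lower()):
                index[word].add(path)
    return index


def find(index, word):
    """Return the sorted paths whose file name contains `word`."""
    return sorted(index.get(word.lower(), set()))


if __name__ == "__main__":
    # Hypothetical mount point for the gateway; substitute your own.
    index = index_file_names("/mnt/cloud-archive")
    for path in find(index, "budget"):
        print(path)
```

If the mount point does not exist, `os.walk` simply yields nothing and the index comes back empty.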

Indexing cloud storage should be an early consideration as you start to select a cloud storage platform. Successfully finding the needle in the haystack requires proper planning up front. Waiting until you already have a few terabytes of information stored is going to be problematic.

Track us on Twitter: http://twitter.com/storageswiss


George Crump is lead analyst of Storage Switzerland, an IT analyst firm focused on the storage and virtualization segments. Find Storage Switzerland's disclosure statement here.
