I'm not terribly motivated to do a detailed analysis of data warehouse appliance list prices, in part because everybody knows that data warehouse appliances tend to be deeply discounted... That said, here are some insights on data warehouse appliance prices...
I'm not terribly motivated to do a detailed analysis of data warehouse appliance list prices, in part because:
Everybody knows that in practice data warehouse appliances tend to be deeply discounted from list price.
The only realistic metric to use for pricing data warehouse appliances is price-per-terabyte, and people have gotten pretty sick of that one.
That said, here are some insights on data warehouse appliance prices...
Reasons people criticize per-terabyte data warehouse appliance price metrics include:
Price-per-terabyte metrics ignore issues of throughput, latency, workload, and so on.
Price-per-terabyte metrics ignore quality of storage medium (slow disks, fast disks, Flash, etc.)
Price-per-terabyte metrics can be radically affected by changes in disk size.
Nonetheless, it is common to discuss data warehouse appliance price/terabyte. When one does, it is common to refer to user data rather than some measure of raw disk capacity.
Advantages of this approach include:
User data is what matters.
User data is what users doing product evaluations or setting budgets can best estimate in advance.
User data is a reasonable and popular basis for software-only analytic DBMS pricing.
Disadvantages of this approach include:
It depends on assumptions about compression (and in some cases indexing and so on), which are highly dependent upon the specifics of the data set.
Some vendors and users indeed think in terms of raw disk capacity.
Oracle perhaps excepted, data warehouse appliance vendors tend to be laudably conservative in the compression assumptions they build into their per-terabyte price metrics.
That was based on 2.25X compression. Since then, Netezza has upgraded its compression. Netezza now quotes 4X compression. Accordingly, Netezza's list price is now around $11,000/TB. (A little below, actually, per Phil Francisco.)
And by the way, if you mirror your data on a SAN, you can stuff twice as much into the EMC Greenplum Data Computing Appliance as otherwise, but then you also have to pay for 36 TB of capacity per half-rack appliance on a SAN.
Eric Guyer reminded us that Oracle Exadata has high list prices. He also reminded us that Oracle Exadata is apt to be deeply discounted.
A couple of versions ago, I outlined the complexities of Exadata pricing.I'm not terribly motivated to do a detailed analysis of data warehouse appliance list prices, in part because everybody knows that data warehouse appliances tend to be deeply discounted... That said, here are some insights on data warehouse appliance prices...
The Agile ArchiveWhen it comes to managing data, donít look at backup and archiving systems as burdens and cost centers. A well-designed archive can enhance data protection and restores, ease search and e-discovery efforts, and save money by intelligently moving data from expensive primary storage systems.
2014 Analytics, BI, and Information Management SurveyITís tried for years to simplify data analytics and business intelligence efforts. Have visual analysis tools and Hadoop and NoSQL databases helped? Respondents to our 2014 InformationWeek Analytics, Business Intelligence, and Information Management Survey have a mixed outlook.