Notes on Data Warehouse Appliance Prices
I'm not terribly motivated to do a detailed analysis of data warehouse appliance list prices, in part because:
Everybody knows that in practice data warehouse appliances tend to be deeply discounted from list price.
The only realistic metric to use for pricing data warehouse appliances is price-per-terabyte, and people have gotten pretty sick of that one.
That said, here are some insights on data warehouse appliance prices...
Reasons people criticize per-terabyte data warehouse appliance price metrics include:
Price-per-terabyte metrics ignore issues of throughput, latency, workload, and so on.
Price-per-terabyte metrics ignore quality of storage medium (slow disks, fast disks, Flash, etc.)
Price-per-terabyte metrics can be radically affected by changes in disk size.
Nonetheless, it is common to discuss data warehouse appliance price/terabyte. When one does, it is common to refer to user data rather than some measure of raw disk capacity.
Advantages of this approach include:
User data is what matters.
User data is what users doing product evaluations or setting budgets can best estimate in advance.
User data is a reasonable and popular basis for software-only analytic DBMS pricing.
Disadvantages of this approach include:
It depends on assumptions about compression (and in some cases indexing and so on), which are highly dependent upon the specifics of the data set.
Some vendors and users indeed think in terms of raw disk capacity.
Oracle perhaps excepted, data warehouse appliance vendors tend to be laudably conservative in the compression assumptions they build into their per-terabyte price metrics.
I wrote last year that Netezza provides the traditional industry benchmark for per-terabyte pricing. When I wrote that, the "Netezza price point" had just become a little under $20,000/TB.
That was based on 2.25X compression. Since then, Netezza has upgraded its compression. Netezza now quotes 4X compression. Accordingly, Netezza's list price is now around $11,000/TB. (A little below, actually, per Phil Francisco.)
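The arithmetic behind that drop can be sketched in a few lines of Python. This is only an illustration, under the assumption that the underlying hardware list price and raw capacity stayed the same and only the compression assumption changed; the function name is made up for this example, and $20,000 is used as a round upper bound for "a little under $20,000".

```python
def rescale_price_per_tb(old_price_per_tb: float,
                         old_compression: float,
                         new_compression: float) -> float:
    """Restate a per-terabyte price under a different compression assumption.

    Assumes the system price and raw capacity are unchanged, so the
    price per terabyte of user data scales inversely with compression.
    """
    if old_compression <= 0 or new_compression <= 0:
        raise ValueError("compression ratios must be positive")
    return old_price_per_tb * old_compression / new_compression


# Illustrative inputs taken from the figures quoted above.
netezza_new = rescale_price_per_tb(20_000, 2.25, 4.0)
print(f"${netezza_new:,.0f}/TB")  # $11,250/TB
```

Since the starting figure was a little under $20,000/TB, the result lands a little under $11,250/TB, which is consistent with "around $11,000/TB."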
As Doug Henschen reports, the EMC Greenplum Data Computing Appliance starts at $1 million for 18 terabytes of uncompressed user data. EMC/Greenplum also cites a 4X compression figure. At 4X, those 18 terabytes hold roughly 72 terabytes of user data, which works out to the vicinity of $14,000/TB.
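For anyone who wants to redo that calculation with different compression assumptions, here is a minimal Python sketch. The function name and parameters are hypothetical and exist only for this illustration; the inputs are the list-price figures quoted above, not discounted street prices.

```python
def price_per_user_tb(system_price: float,
                      uncompressed_tb: float,
                      compression_ratio: float) -> float:
    """Price per terabyte of user data, given an assumed compression ratio."""
    if uncompressed_tb <= 0 or compression_ratio <= 0:
        raise ValueError("capacity and compression ratio must be positive")
    # Effective user-data capacity is uncompressed capacity times compression.
    effective_tb = uncompressed_tb * compression_ratio
    return system_price / effective_tb


# $1 million entry price, 18 TB uncompressed, 4X compression.
greenplum = price_per_user_tb(1_000_000, 18, 4.0)
print(f"${greenplum:,.0f}/TB")  # $13,889/TB
```

Changing the compression ratio shows how sensitive the metric is to that one assumption: at 2X the same system comes out at about twice the per-terabyte price.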
And by the way, if you mirror your data on a SAN, you can stuff twice as much into the EMC Greenplum Data Computing Appliance as otherwise, but then you also have to pay for 36 TB of capacity per half-rack appliance on a SAN.
Eric Guyer reminded us that Oracle Exadata has high list prices. He also reminded us that Oracle Exadata is apt to be deeply discounted.
A couple of versions ago, I outlined the complexities of Exadata pricing.