The TPC-H Benchmark is for Decision Support Systems. It’s described very detailed on the TPC.ORG site, but you may find it quite an effort to generate the data and prepare the SQL for table creation and reporting.
At least I did, which is why I thought having that all ready for download and run would be helpful.
What I have prepared for Oracle and for Exasol is:
- The data files (CSV format) for the 1 GB TPC-H
- The DDL for the TPC-H tables
- The loader commands to populate these tables
- The 22 queries for the TPC-H benchmark
You can download it here:
The data volume is of course quite small for a production data warehouse but ideal for quick testing and self-education. I’m using it together with VirtualBox and VMs on my notebook with 16 GB memory.
See here for a demo – I’m setting up the TPC-H for both Oracle and Exasol and then I do a comparison:
Some remarks about the comparison:
I’m an Exasol employee and the outcome is very positive for Exasol.
Never the less, I tried to do a fair comparison. It’s just running the pure 22 SELECT statements, no tuning, no tweaking of the Exasol database or the underlying VM.
The Oracle version is quite recent (18.3) but not the most recent, same with the Exasol version (6.2), not the just released Exasol 7.0.
As you can see, the Exasol database is out of the box about 6 times faster than the Oracle database for the same workload having the same hardware resources – without any tuning.
I suppose you could get better performance from Oracle for the 22 queries with some effort, like analyzing the workload, adding indexes of the certain available types, partitioning the tables, adding SQL Profiles and Optimizer Directives etc.
The point is, that’s all not required with Exasol. I just run the workload twice and everything is self-optimized afterwards.
You could call this an autonomous database 😉
It’s totally easy to reproduce the test for yourself: Just download our free Community Edition; it’s what I’m using in this benchmark.
Keep in mind that this is a Decision Support System benchmark with an analytical workload. Oracle looks much better in a comparison with an OLTP workload.
But for analytics: Exasol stands behind nobody.
To all the other vendor’s presales consultants out there who encounter us on a PoC: Good luck 🙂
Data Vault is just too slow?
You think Data Vault is a great model but you struggle with query performance?
Use Exasol as a Platform!
We have the remedy: Use our analytical database Exasol as a platform and query performance problems with Data Vault disappear.
What’s the problem with Data Vault?
The Data Vault model comes with an increased number of tables and joins. Other data warehouse platforms likely have a problem to deliver acceptable query performance with that.
Exasol solves that problem!
Exasol on the other hand is very good with joins even on many tables. We use columnar storage and we always compress data. And our database is designed to do MPP with commodity hardware.
With these attributes, combining Data Vault with Exasol delivered the best query performance for that model already using earlier versions of Exasol. And for a very reasonable price, if I may add that.
Even faster now with Exasol 7.0
Now with the release of Exasol 7.0, Data Vault performance got even better, thanks to the introduction of the HASHTYPE datatype for hash join columns and several improvements on JOIN operations.
Curious? Take action!
You want to see for yourself? Download our free Community Edition and run a test.
I bet we beat any other platform you have in place at the moment when it comes to query performance on a Data Vault model!
We’re also happy to do a PoC with your data and your queries.
As I like to say: Don’t believe it, test it 🙂
Isn’t it nice if you can make an impact with your tweets and posts? That’s what I thought today while investigating our online learning registrations.
As you may know, Exasol offers free online learning – recently also combined with free certification. We have also recently changed our course curriculum to better suit customer demands.
Together with very many IT people being in home office, that all leads to quite high numbers in registrations to our online learning platform called Exacademy: 617 in 2020 so far.
That’s many for a still quite small training & certification department like ours, and we couldn’t possibly educate that many customers with instructor-led courses.
Today, I looked at the Exacademy registrations per day:
Roughly 10 per day on average, but 74 on March 27th. What happend that day? Now I looked at my tweets from March:
Tweet analytics shows:
Looks like these 2000 impressions contributed to the unusual high number of Exacademy enrollments, which makes me quite happy 🙂
By the way, we provided 200 free certification exams so far. Get your free Exasol online training together with a free certification now: training.exasol.com