I just came back from an Inhouse course in Basel (Switzerland) for the Novartis AG. It was about Partitioning in general and about some 11g New Features in this area. The present Novartis IT staff was very experienced (some of them worked with Oracle since version 5) and friendly – so we had a pleasant atmosphere to work in. I would like to share one of my demonstrations from this class with the Oracle Community.
One very useful feature about partitioning that is available since version 8 is the possibility of partition pruning. That means, that the optimizer during the parse phase of a SQL statement is able to exclude certain partitions from scanning because according to the definition of the partitioned table present in the Data Dictionary, the desired rows can’t be in those partitions.
When we for example have a table that is partitioned by range on a date column with one partition for each quarter of the year, it is implicitly clear for a query like
select * from t where time_id=to_date('01-01-2009','dd-mm-yyyy');
that the desired rows can only be in partition Q1. Without the presence of indexes, this query is round 4 times faster (because we have 4 assumed roughly equal filled partitions) than a Full Table Scan. An 11g New Feature is interval partitioning. Before 11g, you have to create a new partition on a range partitioned table manually, if you want to add a new range (for example a new quarter). Interval partitioning does that automatically.
The following demonstration shows the benefit of partition pruning & interval partitioning. I did it on my notebook with Oracle Enterprise Linux and 126.96.36.199 . The two tables with 20 Million rows need round 600 MB space and it took a couple of minutes (less than 15) to create them.
SQL> create table nonpart (id number, name varchar2(50), salary number); begin for i in 1..10000000 loop -- insert 10 Million rows insert into nonpart values (i, 'John Doe', 9000); end loop; commit; end; / SQL> set timing on SQL>select salary from nonpart where id=4711 -- FTS on 10 Million rows table; SALARY ---------- 9000 Elapsed: 00:00:04.13
Now we create the partitioned table with very little coding effort:
SQL> create table part (id number, name varchar2(50), salary number) partition by range (id) interval (10000) -- one partition each 10000 ids (partition p1 values less than (10001));
The following insert creates 999 partitions automatically! [Updated the following part according to Randolf’s advice in the comment below]
SQL> alter table part nologging; SQL> insert /*+ append */ into part select * from nonpart; SQL> commit;
Due to partition pruning, even with this relatively tiny tables we see a dramatic speed up of the query:
SQL> select salary from part where id=4711 -- scan on 10.000 rows partition; SALARY ---------- 9000 Elapsed: 00:00:00.05
You can also tell from investigating the execution plan via set autotrace on or explain plan for that the above query made use of partition pruning, but I like to see (and show) that in this fashion a little more obviously 🙂