A task that you will most likely encounter during the administration of Exadata is the replacement of a damaged Hard Disk on the storage servers. Fortunately, this is quite easy, because almost everything is done by the system itself 🙂
Especially, the original Celldisks and Griddisks are rebuilt automatically on the Cell Layer. On the Database Layer, the related ASM disks also get rebuilt automatically, while due to the (at least) normal redundancy, the availability of the Database(s), relying on the diskgroups is not affected. The task is briefly described in this MOS Note.
As soon as the Hard Disk failure is noticed by the MS (Management Server) background process on the Cell, it will raise an alert that will also be published to Grid Control, if configured. Immediately, due to Pro-Active Disk Quarantine, the ASM-, Grid- and Celldisks get dropped. ASM rebalancing is triggered. You as the responsible Admin notice the alert and order a replacement Disk resp. use a Spare Disk to plug it into the Cell after you plugged out the damaged one. The Cell can stay online, because the Hard Disks are hot-pluggable.
No further administrative work to be done, typically. Easy, isn’t it? Mr. Sengonul from Turkcell (leading global system provider for mobile communications in Turkey, one of our Customer Exadata references) has published the Logfiles from such an incident with this posting. Thank you for that and also for your fine presentation about the Exadata Migration!
#1 von divakarmehta am Januar 17, 2012 - 13:03
Many thanks Uwe for your wonderful Articles…Really really useful
#2 von Uwe Hesse am Januar 17, 2012 - 13:12
You’re welcome 🙂 Thank you for the nice feedback!
#3 von Ranjeet patil am Juni 15, 2014 - 12:39
Really really useful ………….
#4 von Raju Krishnan am August 17, 2015 - 19:10
Thanks very much for your blogs, lot of useful information!
I have a question – do you have any information on the impact of a storage cell going down on the performance of an exadata half rack? or a full rack?
With normal redundancy / high redundancy?
#5 von souleymane bah am Januar 19, 2017 - 13:36
Really cool topic