Oracle CSI Kernel
Posted: Wednesday, 5 November 2014 15:19
Two staff members from Oracle will give a presentation on Thursday 4th December, from 18.00 to 19.30. The venue is MAL B30.
Abstract: In 1990, a typical Sun system had a single 10mhz cpu and 4MB of memory and might have run in the region of 30-50 processes. Today Oracle ships systems with 512 cpu's and 4TB of memory and such systems have a few 100,000 processes at most. The roadmap suggests that by 2015 this will rise to cpu counts around 13,000 and 150TB of memory to serve workloads in excess of 1 million processes. Like C.S.I., when a system fails a post-mortem is required. A crash dump is the body, an image of the system memory at the time of failure. This talk looks at technical, logistical and tools challenges of diagnosing system failure after the fact when the body is 4TB in size and the challenges of scaling post-mortem failure diagnosis to ever larger configurations.
The presentation will be given by Clive King. Dr Clive King is a Senior Staff Engineer in Oracle Solaris Engineering. He has worked for Sun -> Oracle for 15 years in a variety of support, professional services and kernel engineering roles. His particular areas of focus are system performance, scalability and root causing system failures. He teaches problem solving, risk management and root cause analysis within Solaris Engineering. He is a BCS examiner for a professional examination, a BCS academic panel member and has been a PhD external examiner. Outside work, he splits his time between family and mountain running.
An IEEE software paper on which the talk is based can be accessed here. Clive states that "the talk won't be rocket science". A second Oracle staff member, Tim Jones, will give a few examples from his wide experience.
No booking is required. If you have an interest in the presentation then please attend.