Sivakumar, Head – Technology & Development (siva.k@appnomic.com)
The ITIL standard based guidelines recommend the following set of process for Operational Management –
- Incident Management – Process to handle any disruption of service with measurable Service Level Agreements
- Problem Management – Process to find the root cause and closure for repeating incidents or new incidents without known solution with the intent to build the knowledge base so that the recurrence of the same problem can be solved quickly.
- Change Management – Process to handle any technology, system or application addition or change in a stable production setup with minimal impact on services.
- Release Management – Process to test & verify any change before rollout into the production setup.
- Configuration Management – Process to keep the Configuration Database updated with the latest information as and when any change is implemented in the production setup.
Ref: ITIL Standards
Autonomic Computing defines systems that conform to the following aspects of self-management –
- Self-configuration - Automated configuration of components and systems follows high-level policies. Rest of system adjusts automatically and seamlessly.
- Self-optimization - Components and systems continually seek opportunities to improve their own performance and efficiency.
- Self-healing - System automatically detects, diagnoses, and repairs localized software and hardware problems.
- Self-protection - System automatically defends against malicious attacks or cascading failures. It uses early warning to anticipate and prevent system-wide failures.
Ref: www.research.ibm.com/autonomic/research/papers/AC_Vision_Computer_Jan_2003.pdf
Imagine ITSM after Autonomic Computing systems are in abundance. Incident Management could become non-existent as the systems would be able to correct themselves when it detects a failure. Problem Management may still exist as there would always be complex issues that cannot be self-healed by the system and hence would need the intelligence of a human being. But then the human-being needs to be far more advanced than the complex system to solve such problems.
Change & Release Management would continue to exist in one form or the other as there would always be a need to change & improve the systems for automation and efficiencies. Configuration Management that is at the heart of ITIL may be fully automated with complete versioning & backup available as of any point in time. Configuration changes may be done automatically by the system based on increased load or localized failures to meet the SLA. Policy definitions (including SLA) will become the only configuration item that needs to be maintained by the user.
Are we anywhere close to reaching the ideal state described above? What are the technologies that helping us progress in this direction? Is autonomic computing a viable solution in the near future? How much progress has been made in this direction? Is there a different approach to solve the ITSM problem? I’m trying to find answers to these questions and would like to discuss my thoughts with groups working in these technology areas. I will keep posting my thoughts and findings in the coming weeks.