cancel
Showing results for 
Search instead for 
Did you mean: 
alexcorcoles
Flight Engineer
Flight Engineer
  • 425 Views

Sharing Incident Management Responsibilities

Alice in the operations team handles the fourth out of hours incident in the week for the new application. Alice feels the new application has more incidents than other applications.

Alice talks to Bob in the development team. Bob explains that management requests features for the new application, and the development team does not have time to fix stability bugs.

Alice measures that the new application averages five incidents per week, while other applications average two incidents per week.

Alice proposes to her manager Carol that they start sharing incident management work with the development team. Carol discusses the idea with Dave, the development team manager. Carol negotiates a compensation plan and a rotation schedule so developers can handle incidents out of hours.

After handling some incidents, Bob identifies some issues that lead to instability. Bob negotiates with Dave and management to hold off some new features so that developers can address the instability issues.

After implementing the fixes, out of hour calls decrease, and Bob rarely needs to handle calls. Alice and the operations team's load also returns to normal.

More importantly, Bob learns about some reliability issues, and takes those issues into consideration when developing new applications. Therefore, newly developed applications are more reliable when they are released than before Bob's experience working with operations. Additionally, the organization learns that aligning the incentives for different teams is effective.

Reflect about the preceding history.

Post your opinion about what would happen if a system administrator proposed this change in an organization you know. Review other students' posts and discuss.

0 Kudos
Join the discussion
You must log in to join this conversation.