Anonymous
During anonymous 360 employee survey as well as in all hands meeting, my team expressed concern regarding growing operation burden , on-call burden. The band to be reserved for operation load was 20%, in reality, 40% of team bandwidth was being consumed for KTLO activities.
This was indeed a serious concern and I wanted this to be addressed on immediate basis.
I worked with the managers in my team to come up with a plan on how to address in increasing burden. We operationalises the plan which consisted of following actions -
1. We invested some budget to get SRE team onboarded to take the load of production system caused issues
2. Identified recurring issues and started implementing automation
3. With a target of 5% reduction of operation you, short term projects were started that directly fixes the issue causing components.
This having a focus on addressing the very valid teams input to better the execution, we had a very productive quarter which not only made our product better, but also increased engineer productivity and engagement.