SYSTEM ADMINISTRATOR - Linux L3
Publication Date:
Dec 11, 2024
Ref. No:
520669
Location:
Talawade Software Technology P, IN
SENIOR CONSULTANT
Candidate must posses L3/SME level experties in Linux,AIX,NIM,VCS,HMC.
Must be certified in AIX and or Linux
Candidate with Linux Build/engineering as secondary knowledge will be given preference
Responsibilty:
• Lead and drive the diagnosis and technical restoration of Major incident for
o Drive the analysis and Permanent Corrective Actions (PCA) for AtoS related to incidents that occur within the tower
o Based on historic events, jointly analyse and develop, or enhance proactive alerting and health check actions or identify baseline measurements to drive fast identification or repair actions.
• o Leads and drives the incident diagnosis and technical restoration on behalf of Atos for their respective tower and associated applications that they are trained in
Leads the tower service recovery and stabilization activities during each incident/event related to the designated applications. Each support team will have their own SOP health-check to be presented upon joining the incident bridge
o Change
Plan: Evaluates and approves changes for the core BC5 applications which they are trained in
Execution: If Change is High/Medium risk, execute the change. If Change is Low risk, oversee the execution of Change
o Problem
Drive the analysis and Permanent Corrective Actions (PCA) for AtoS related to incidents that occur within the applications they are trained in
Based on historic events, jointly analyse and develop, or enhance proactive alerting and health check actions or identify baseline measurements to drive fast identification or repair actions
o All resolutions or remediation will be documented for future reference
• Establish, monitor/measure and report on key metrics to ensure the Tower is effective in all facets of ITIL & Operational management
• Identify a min of 2 key Service Assurance actions per year and have them closed showing the benefits of the implementation
• Take lead to drive and drill down RCA by working collaborative mindset across other technology towers .
• Review critical infrastructure related change plan for Critical Apps on Unix space and planning of critical infrastructure changes ( cross domain).
• Act as escalation points during Implementation of critical infra changes for Unix Domain and infra change for BC4/5 critical applications.
• Identify and propose industry best practice for any Single Points of Failure or Architecture that is not ‘fit for purpose’
• Participate in planning of Infra best practice and infra stability actions :- patching ,CCM fix, Hygiene reboot, Platform NC remediation and monitoring enhancement.
• Automate and Optimize day to day operational task :- analysing events trend ,recurring Incidents and changes, user request for information and data collection.
• Regular review of Unix Operation technical procedures, process ,work instructions and technical change Implementation plan.
• Suggestions on Industry best practices to improve/expand the efficiency of UNIX operation.
• Work closely with Unix Domain peers to build engagement for engineering roadmap, Unix service offering, alignment of technical initiatives and framework .
• While working on office hours, shall be flexible to work on demand situation (On Call ROTA)
• Ensure that identification of skills is done proactively ensuring any gaps are remediated by training either external, internal, or via personal guidance to other team members
• Ensure that identification of new skill requirements are proactively captured and recommended training course is given to Tower Manager
• Work in close collaboration with other Tower SME’s on any joint initiatives that will benefit internal or external operations
• Ensure that all appropriate documentation is in place, up to date and accurate
• Be focal point for planning of DR exercises and ensure Tower readiness