Incident Management (IM) Operations Lead Job at Infosys, Irvine, CA

ZlY0aDRiZUo3T3ppdDRMUzhXY2tFc0JBK1E9PQ==
  • Infosys
  • Irvine, CA

Job Description

Job Title: Incident Management (IM) Operations Lead

Location: ONSITE – Irvine, CA

Duration: Contract

 

HIRING – Incident Management (IM) Operations Lead with around 10+ years  of relevant experience .

 

ROLE SUMMARY

  • We are seeking an Incident Management (IM) Operations Lead for a 24/7 operations program supporting a key player in the automotive industry's connected car space.
  • This role will lead an onsite–offshore IM function, act as the single point of accountability for all incidents and active outages which occur during onsite business hours and ensure adherence to contractual SLAs.
  • The IM Operations Lead will orchestrate end-to-end incident handling—from anomaly detection to bridge initiation, stakeholder communications, resolution, and post-incident governance—with a strong background in Telematics and Connected Vehicle systems (including Remote Services).

 

KEY RESPONSIBILITIES:

  • Lead and mentor the Incident Management team in an onsite–offshore operating model; establish clear handoffs and coverage to support 24/7 operations.
  • Own and drive the lifecycle of every major incident/outage (not a shift-based support engineer role), ensuring rapid triage, containment, and resolution.
  • Serve as the primary point of contact (POC) for all IM matters—including executive escalations—maintaining authoritative situational awareness across platforms.
  • Accountable for all IM SLAs (MTTD, MTTA, MTTR, communication SLAs); ensure measurement, reporting, and continuous improvement.
  • Should be proficient in handling the incidents ranging from high priority ones (P1, P2) to the low priority incidents (P3, P4).
  • Initiate and run incident bridges/war rooms; coordinate cross-functional responders (application, infrastructure, network, OEM partners, and third parties).
  • Oversee proactive detection through data monitoring tools (Dynatrace, Datadog, Grafana, MaxGauge) and ensure alert quality, runbooks, and signal-to-noise optimization.
  • Establish and enforce SOPs for incident declaration, severity classification, response roles and decision logs.
  • Drive disciplined stakeholder communications: timely updates to product, operations, OEM/customer contacts, leadership, and impacted regions; maintain comms cadence and channels.
  • Ensure post-incident governance: facilitate RCA, document contributing causes, corrective and preventive actions (CAPA), and circulate the RCA across teams for sign-off.
  • Partner with Problem Management to track and eradicate recurring issues; maintain problem backlog, KEDB (Known Error Database), and follow-through on remediation.
  • Define and maintain IM dashboards, KPIs, and executive reports; present weekly/monthly service reviews with trend analysis and action plans.
  • Collaborate with Product/Engineering to influence reliability roadmaps (resiliency patterns, observability, capacity, release safeguards).
  • Ensure compliance with information security, data privacy, and OEM contractual obligations during incident handling and communications.
  • Continuously refine IM playbooks, runbooks, and training; conduct simulations/game days and readiness audits across onsite–offshore teams.

 

Domain Expertise: Telematics & Connected Car

  • Hands-on knowledge of Telematics Control Unit (TCU), eSIM/OTA provisioning, backend telematics platforms, and data flows between vehicle, cloud, and mobile apps.
  • Familiarity with Remote Services (e.g., remote lock/unlock, start/stop, charge control, climate pre-conditioning), geo-services, and safety/assist features.
  • Understanding of service dependencies: identity/auth, messaging, device management, CAN bus signals, firmware/OTA update orchestration, and regional compliance.
  • Experience coordinating incidents across OEM partners, Tier-1 suppliers, cloud providers, and customer support operations.

 

Required Qualifications

  • 8 –12+ years in Operations/Service Management with 4+ years leading Incident Management in large-scale, 24/7 environments.
  • Demonstrated experience running bridges for P1/P0 incidents; proven incident commander skills and decision-making under pressure.
  • Strong background in Telematics/Connected Car domain and vehicle remote services concepts.
  • Proficiency with observability and monitoring tools: Dynatrace, Datadog, Grafana, MaxGauge (dashboards, alerting, traces, logs).
  • Expertise in IM processes: detection → triage → severity assignment → bridge initiation → stakeholder updates → resolution → post-incident RCA.
  • Working knowledge of ITIL practices (Incident, Problem, Change, Service Level Management).
  • Should have good hands on experience in using tools like JIRA, Confluence, Jenkins, XMatters.
  • Excellent communication (written/verbal), executive presence, and stakeholder management across onsite–offshore teams.
  • Ability to analyze telemetry and time-series data to drive root cause hypotheses and corrective actions.
  • Experience defining SLAs/SLOs and building KPI dashboards (MTTD/MTTA/MTTR, incident volume, recurrence, comms SLA, customer impact).

 

Tools & Technologies:

  • Dynatrace, Datadog, Grafana, MaxGauge (APM, logs, metrics, dashboards).
  • Incident & Comms: ticketing (Jira/ServiceNow), chat/bridge tools (Teams/Zoom), status pages.
  • Data: time-series analysis, log aggregation, tracing, alerting policies and noise reduction.

Job Tags

Contract work, Remote work, Shift work

Similar Jobs

Circle Logistics

Class A CDL A Owner Operator Truck Drivers- Dedicated Dry Van Lanes- MI, IN, OH Job Job at Circle Logistics

Class A CDL A Owner Operator Truck Drivers- Dedicated Dry Van Lanes- MI, IN, OH JobCDL A Owner Operators- Regional Dedicated Lanes- IN, OH, MI- West Chester, OH to Zeeland, MI, Zeeland, MI to Mt Vernon, IN, Evansville, IN back to Michigan or broker load back towards... 

Yale New Haven Health

Volunteer Coordinator Job at Yale New Haven Health

 ...Values. These values - integrity, patient-centered, respect, accountability, and compassion - must guide what we do, as individuals and professionals...  .... Under the direct supervision of the Senior Manager of Volunteer Services, coordinates Volunteer Services programs.... 

Trace3

Security Operations Center (SOC) Analyst (Remote) Job at Trace3

 ...struggles and failures not just their success. We appreciate the individuality of the people around us. About the Role: The SOC Analyst is responsible for monitoring detecting analyzing and responding to cybersecurity events and incidents. This role is critical to... 

BlackStar

Chief Financial Officer (CFO) - Community Bank Job at BlackStar

 ...with a well-established, community-focused bank seeking a Chief Financial Officer / Comptroller to join its executive leadership team....  ...a key advisor to the CEO and Board of Directors on financial planning, risk, capital management, and performance optimization.... 

Focused HR Solutions

1.20. QA Tester Job at Focused HR Solutions

 ...the Raleigh area (Triangle Region of NC). Candidate must work EST Business Hours. Our direct client has an opening for a QA Tester position # 729722. This position is for 12+ months, with the option of extension, and will be worked mostly remotely with on-site meetings...