Senior Reliability Manager
The Senior Reliability Manager plays a critical role in ensuring the smooth and efficient functioning of an organization's IT infrastructure, applications, systems, and operations. They must possess strong leadership, technical, and communication skills to effectively manage a team of IT professionals and support the organization's business objectives.
As a Senior Reliability Manager and engineering leader, you will lead a highly technical team consisting of full-stack engineers, DevOps engineers, SREs, and take ownership for keeping critical applications and platforms available, resilient, performant, and secure, while continually enabling our engineering teams to efficiently deliver new and engaging capabilities and adopt new tools and technology. Provide thought leadership and ownership of SRE engineering while delivering best-in-class CI/CD DevSecOps pipelines, cloud, container, and Kubernetes provisioning and management. Continue building a strong culture of high-quality engineering practices, continuous learning, and growing multi-functional collaboration.
A Day in the Life
Responsibilities may include the following and other duties may be assigned.
- Lead a team of site reliability engineers (SREs), provide support for critical applications and services supporting Diabetes ecosystem.
- Defines, develops, and manages a comprehensive and integrated IT Service Management (ITSM) landscape, based on SRE best-practice processes, disciplines, and related toolsets.
- Drive adoption of industry standard methodologies and practices for deployment, observability, and reliability.
- Enable teams to operate and run through automation and tooling.
- Lead day-to-day team activities using the Agile/Scrum methodology.
- Establish and maintain service level objectives (SLOs) for key services and systems, and work to meet or exceed those objectives through engineering efforts.
- Analyze and track operational metrics to identify trends, areas for improvement, and opportunities for optimization.
- Maintain the Application Portfolio Management, ensuring a clear understanding of application landscapes.
- Lead efforts to improve the reliability, availability, and performance of the organization's systems and services through automation, monitoring, and proactive maintenance.
- Lead and maintain IT security measures for business applications, safeguarding data and systems.
- Create and present operational dashboards to senior leadership, providing insights into performance and trends.
- Coordinate and lead strategic core infrastructure and networking projects, ensuring alignment with business objectives.
- Coach, motivate and inspire support team members to achieve and exceed performance results.
- Participate in all critical and high severity issues offering support, attend war room meetings, and provide updates to leadership.
- Compile detailed timelines to include operational impact and technical assessment for high severity issues in support of root cause analysis.
- Oversee ServiceNow tickets to closure and review ticket dashboards to address gaps and trends and develop action plans to address.
- Collaborate with business operations teams to ensure coordination of IT changes
- Manage relationships with IT vendors and service providers, including negotiating contracts, monitoring performance, and resolving disputes.
- Bachelors degree required
- Minimum of 7 years of relevant experience with 5+ years of managerial experience, or advanced degree with a minimum of 5 years of relevant experience with 5+ years of managerial experience
Nice to Have
- Bachelor’s degree in related field (Computer Science, Computer Networking, IT Systems)
- At least 5 years as a manager with prior experience in performance management as well as an emphasis in coaching, mentoring, and managing a team.
- Minimum of 5+ years in SRE/DevOps.
- Proven ability to create, revamp, improve processes to increase efficiency and Service Level Agreement responses of a technical team.
- Familiarity with monitoring tools like Prometheus, Grafana, Nagios, or Splunk for monitoring system performance, availability, and reliability metrics.
- Excellent written and verbal communication skills.
- Ability to present technical information in a clear and concise manner to executives and non-technical leaders.
- Excellent Team player with proven ability to accomplish goals through collaboration.
- Demonstrated proficiency in ServiceNow, Microsoft products
- Knowledge of Linux, UNIX, infrastructure, and application monitoring tools (Prometheus, Datadog, Dynatrace); previous exposure to Kubernetes, EKS, or similar container orchestration systems.
- Experience in deploying, operating, and running services in AWS, Azure or other cloud environments.
- Experience with IaC tools like Terraform, Ansible, or Chef for automating infrastructure provisioning and configuration management.
- Experience with distributed systems / micro service architecture.
Together, we can change healthcare worldwide. At Medtronic, we push the limits of what technology, therapies and services can do to help alleviate pain, restore health and extend life. We challenge ourselves and each other to make tomorrow better than yesterday. It is what makes this an exciting and rewarding place to be.
We want to accelerate and advance our ability to create meaningful innovations - but we will only succeed with the right people on our team. Let’s work together to address universal healthcare needs and improve patients’ lives. Help us shape the future.
Physical Job Requirements
The physical demands described within the Responsibilities section of this job description are representative of those that must be met by an employee to successfully perform the essential functions of this job. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions. For Office Roles: While performing the duties of this job, the employee is regularly required to be independently mobile. The employee is also required to interact with a computer, and communicate with peers and co-workers. Contact your manager or local HR to understand the Work Conditions and Physical requirements that may be specific to each role. (ADA-United States of America)
A commitment to our employees lives at the core of our values. We recognize their contributions. They share in the success they help to create. We offer a wide range of benefits, resources, and competitive compensation plans designed to support you at every career and life stage. Learn more about our benefits here.
This position is eligible for a short-term incentive plan. Learn more about Medtronic Incentive Plan (MIP) on page 6 here.
The provided base salary range is used nationally (except in certain CA locations). The rate offered is compliant with federal/local regulations and may vary by experience, certification/education, market conditions, location, etc.