Skip to main content
Laptop

Lead Systems Operations Engineer

About this role: 
We are seeking a highly skilled and forward‑thinking Lead Systems Operations Engineer to join our Technology Operations team. This role is ideal for someone who excels in Kubernetes and OpenShift platform operations, drives operational excellence, and leads initiatives that improve stability, automation, and service reliability. You will play a key role in operating and improving our cloud‑native platforms, reducing operational toil, and ensuring the resilience and compliance of critical infrastructure services. 

In this role, you will: 

  • Lead complex, broad impact initiatives including provision of high-level systems consultation for the technology teams
  • Platform Operations Leadership: Lead day‑to‑day Platform (REDIS, OpenShift) platform operations, including cluster maintenance, upgrades, performance monitoring, and troubleshooting
  • Operations Excellence – Improving operations practices to meet new Incident SLA and improving practices during incident & problem management
  • Incident Response & Problem Management: Serve as an operational lead during incidents, driving rapid diagnosis, resolution, root‑cause analysis, and long‑term corrective actions
  • Operational Automation: Develop or enhance automation (Python, Bash, GitOps workflows, or AI‑assisted tools), build AI Agents, MCP server and tools, add skill in MCP that eliminates manual effort and streamlines run processes
  • Platform Readiness: Lead Platform lifecycle activities, including new cluster builds, configuration, onboarding, upgrades, and cluster decommissioning, ensuring consistency, reliability, and compliance across environments
  • Collaboration & Enablement: Partner with engineering, SRE, security, and development teams to implement repeatable operational patterns, guardrails, and platform readiness standards
  • Security, Compliance & Governance: Ensure platform operations follow organizational policies, security standards, audit controls, and regulatory requirements
  • Continuous Improvement: Identify operational gaps, recurring issues, or inefficiencies and lead initiatives to enhance reliability, resiliency, and operational maturity.

Required Qualifications: 

  • 5+years of Systems Engineering, equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
  • 5+ years of hands-on experience in Python for platform operations automation
  • 5 + years of designing and building complex observability solutions leveraging industry standard toolset and or custom-built solutions
  • Strong proficiency in writing production-quality Python code by using Python libraries and client integrations
  • Ability to develop automation solutions, including remediation procedures & workflows and operational tools using Python
  • 3+years of experience managing complex, enterprise-scale applications in production environments
  • Extensive experience with configuration and monitoring tools such as Grafana, Splunk, and Prometheus
  • Deep platform expertise, including cluster build-outs, CI/CD pipeline integration, troubleshooting, debugging, remediation, patching, upgrades, and root cause analysis (RCA)
  • 2+ years of hands-on Linux system administration experience
  • Deep expertise with Platforms includes building clusters with pipelines, diagnosing, debugging, remediation, upgrades, patching, and RCA.

Desired Qualifications: 

  • Strong experience with Open shift, Kubernetes, Public cloud
  • Experience in AI development, including agents, MCP, tools, and related frameworks
  • Hands-on experience with operational tooling such as Grafana, Splunk, Prometheus, Jira, or GitHub, SDLC
  • Demonstrated ability to influence operational improvements across teams.
  • Strong analytical and operational problem‑solving skills.

Job Expectations: 

  • Participation in on-call rotations 
  • This position is hybrid and must be located at one of the posted locations.
  • This position is not eligible for visa sponsorship.
  • Develop automated remediation scripts in Python to proactively resolve alerts and enhance operational efficiency, reducing Mean Time to Resolution (MTTR).

Posting End Date: 

27 May 2026

*Job posting may come down early due to volume of applicants.

We Value Equal Opportunity

Wells Fargo is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other legally protected characteristic.

Employees support our focus on building strong customer relationships balanced with a strong risk mitigating and compliance-driven culture which firmly establishes those disciplines as critical to the success of our customers and company. They are accountable for execution of all applicable risk programs (Credit, Market, Financial Crimes, Operational, Regulatory Compliance), which includes effectively following and adhering to applicable Wells Fargo policies and procedures, appropriately fulfilling risk and compliance obligations, timely and effective escalation and remediation of issues, and making sound risk decisions. There is emphasis on proactive monitoring, governance, risk identification and escalation, as well as making sound risk decisions commensurate with the business unit’s risk appetite and all risk and compliance program requirements.

Candidates applying to job openings posted in Canada: Applications for employment are encouraged from all qualified candidates, including women, persons with disabilities, aboriginal peoples and visible minorities. Accommodation for applicants with disabilities is available upon request in connection with the recruitment process.

Applicants with Disabilities

To request a medical accommodation during the application or interview process, visit Disability Inclusion at Wells Fargo.

Drug and Alcohol Policy

 

Wells Fargo maintains a drug free workplace.  Please see our Drug and Alcohol Policy to learn more.

Wells Fargo Recruitment and Hiring Requirements:

a. Third-Party recordings are prohibited unless authorized by Wells Fargo.

b. Wells Fargo requires you to directly represent your own experiences during the recruiting and hiring process.


Join our talent community

Learn about upcoming events and career opportunities at Wells Fargo

Talent Community
JK 1212 1236 B 4MP