[Skip To Content]
Laptop

Senior System Operations Engineer

  • Technology & Data
  • Full time
  • R-391307

JD - Senior Systems Operations Engineer.

Engineer should have a mindset to maximize system availability through proactive means. The candidates should build robust automation solutions to eliminate or minimize incidents as well as investigate and resolve issues in response to production incidents. The candidates should perform trend analysis on production issues, perform bug fixes/enhancements in the scripts used for operational tasks and should be comfortable working with Development/scrum teams and support their demanding needs to ensure availability & performance of the applications & platform.

The candidate must have 6+ years of relevant industry experience, proven track record of working in a large scale and global SRE or DevOps implementation projects with application Development experience. The candidate should demonstrate a proactive, hands-on approach and strong system and analytical skills with focus on streamlining the operational tasks using automation.

  • In this role, you will:

  • Lead or participate in managing all installed systems and infrastructure within the Systems Operations functional area
  • Contribute in increasing system efficiencies and lowering the human intervention time on related tasks
  • Required Qualifications:

  • 7+ years of Systems Engineering, Technology Architecture experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education.
  • Review and analyze moderately complex operational support systems, application software, and system management tools to ensure the highest levels of systems and infrastructure availability
  • Work with vendors and other technical personnel for problem resolution
  • Lead team to meet technical deliverables while leveraging solid understanding of technical process controls or standards
  • Collaborate with vendors and other technical personnel to resolve technical issues and achieve highest levels of systems and infrastructure availability

Responsibilities:

  • Drive innovation in digital technology & Innovation application portfolios, increase efficiency through automation, SRE and Agile with an emphasis on enhancing end user experience.
  • Leading Team on all technical issues related to APP and WEB tier.
  • Expert in Middleware Administration (WebLogic, Tomcat, Apache, IIS) and Strong working Experience in production support of middleware applications.
  • Drive automation of manual repetitive operational tasks and Engineer solutions to automate production game plans.
  • Perform trend analysis of repetitive production issues and engage relevant operation/development teams to address the failure patterns and incidents.
  • Drive adoption of self-healing and resiliency patterns.
  • Enhance the end-to-end application or system observability by enhancing the alarm setup or developing new dashboards using the monitoring/log analysis/analytic tools such as splunk, AppD, Elastic Search, PowerBI, Tableau etc.
  • Closely work with enterprise SRE team and perform SRE maturity assessment for applications in scope, baseline current state metrics, establish SLI/SLOs, Error budget, Service Levels, monitoring, alerting and recovery objectives and perform periodic resiliency testing for all applications in scope.
  • Manage the Toil Registry created for the group & Reduce toil by fine tuning existing monitoring/alarming setup or by developing tools to automate the routine tasks using ansible, shell scripting etc.
  • Develop a solution for self-healing of alarms thus aiding in production Incident reduction.
  • Enhance or fix the bugs in the existing patching & production release install scripts for improving the success ratio and own/participate in the root cause analysis using 5-Why approach.
  • Recommend infra level solutions by proactively analyzing low level errors in application logs which are undetected to enhance the customer experience.
  • Direct large-scale projects and application implementations from proof of concept through testing and installation.
  • Troubleshoot high severity production incidents in real time, improve system availability & reliability by facilitating blameless postmortems to prevent problem recurrence.
  • Apply analytics on historical monitoring or incident data for predicting issues and take proactive actions.
  • Statistical gathering and analysis to assist architecture engineering and development teams in capacity planning requirements to support projected transaction volumes, response times and system availability targets.
  • Collaboration with enterprise partners on issues and initiatives that impact the infrastructure.
  • Add value to team delivery and work with team to complete tasks with high quality and actively learn new skills/technologies.

Essential Qualifications:

  • Bachelor’s Degree or equivalent experience in any software engineering discipline.
  • 6+ years’ experience in production support & SRE implementation in a large scale environments (preferably in banking domain.
  • Hands on experience in web & middleware platform (apache, tomcat, WebLogic, PCF etc.) in Linux/windows environments.
  • Hands on experience in supporting PCF applications and microservice architecture-based applications.
  • Hands on experience with monitoring/log analysis/dashboard tools such as Appdynamics, splunk, Elastic Search, Netcool, PowerBI, Tableau etc.
  • Proficiency in shell scripting, ansible and one programming language such as python or JavaScript.
  • Good knowledge in DevOps tools - GitHub, Jenkins, UCD and cloud platforms such as GCP.
  • Knowledge in Database and network environments.
  • Good knowledge in Agile and ITIL framework.
  • Strong analytical and problem-solving abilities, with quick adaptation to new technologies, methodologies and systems.
  • Demonstrate strong written, oral communication skills and documentation skills and able to work independently.
  • Self-learner, understand technology environment and deliver faster.
  • Willing to work in shifts (24x7 models) 

Desired Qualifications:

  • Experience in Unix /Linux Server Support domain.
  • Cloud certification
  • Experience with Tableau/ MicroStrategy or similar BI tools
  • Bachelors or Master's degree in Computer Science, Software Engineering or a related field

Posting End Date: 

5 Oct 2024

*Job posting may come down early due to volume of applicants.

We Value Diversity

At Wells Fargo, we believe in diversity, equity and inclusion in the workplace; accordingly, we welcome applications for employment from all qualified candidates, regardless of race, color, gender, national origin, religion, age, sexual orientation, gender identity, gender expression, genetic information, individuals with disabilities, pregnancy, marital status, status as a protected veteran or any other status protected by applicable law.

Employees support our focus on building strong customer relationships balanced with a strong risk mitigating and compliance-driven culture which firmly establishes those disciplines as critical to the success of our customers and company. They are accountable for execution of all applicable risk programs (Credit, Market, Financial Crimes, Operational, Regulatory Compliance), which includes effectively following and adhering to applicable Wells Fargo policies and procedures, appropriately fulfilling risk and compliance obligations, timely and effective escalation and remediation of issues, and making sound risk decisions. There is emphasis on proactive monitoring, governance, risk identification and escalation, as well as making sound risk decisions commensurate with the business unit’s risk appetite and all risk and compliance program requirements.

Candidates applying to job openings posted in US: All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other legally protected characteristic.

Candidates applying to job openings posted in Canada: Applications for employment are encouraged from all qualified candidates, including women, persons with disabilities, aboriginal peoples and visible minorities. Accommodation for applicants with disabilities is available upon request in connection with the recruitment process.

Applicants with Disabilities

To request a medical accommodation during the application or interview process, visit Disability Inclusion at Wells Fargo.

Drug and Alcohol Policy

 

Wells Fargo maintains a drug free workplace.  Please see our Drug and Alcohol Policy to learn more.

Wells Fargo Recruitment and Hiring Requirements:

a. Third-Party recordings are prohibited unless authorized by Wells Fargo.

b. Wells Fargo requires you to directly represent your own experiences during the recruiting and hiring process.


Rejoignez notre communauté de talents

Renseignez-vous sur les événements à venir et les possibilités de carrière chez Wells Fargo.

Adhérer maintenant
JK 1212 1236 B 4MP