Systems Reliability Engineer

hace 2 días


Lima, Perú Scotiabank A tiempo completo

Hola Felicitamos y valoramos tu interés por seguir creciendo dentro del Grupo Scotiabank, nos encontramos en búsqueda de talento que aporte con sus conocimientos y experiência a la posición y sobre todo con OPTIMISMO.
**Purpose**:
As a member of the Global Systems Reliability team,the Global System Reliability Engineer (SRE) will work in collaboration with a team that will work with Senior management, peers, and business partners to continuously improve the stability, reliability and efficiency of our Global systems through Site Reliability Engineering (SRE) based principles and practices that will include continuous people, process and technology (automating all the things”) enhancements in support of our rapidly changing technology product portfolio.

You will work cross-functionally amongst a variety of teams and be a contributor in all significant engineering service or solution delivered to the Global Systems Reliability Office and stakeholders.You will also have an understanding ‘what could go wrong’, help to solve complex problems and have a flare for communicating and participating in discussions with technical and business partners. You will work directly with our Software Engineering teams to both maintain and operate our existing technology and build our next generation of technologies.

**Key Accountabilities**:

- Work in collaboration with Director, Global System Reliability Engineering as well as with software development, Quality, Product and Data Engineering teams to Champion SRE/ DevOps culture and practices
- Assist management of Service Level Objectives with senior development and business leads
- Assist and participate in initiatives to continuously refine our build, plan and deploy practices for improved stability, reliability, efficiency, repeatability and security. You’ll help to create plans, collaborate with other SROs and DevOps team members - coordinating activity with development and business leads to increase service levels, lower costs, and support delivery velocity objectives
- Working closely with Development and operations teams to assist troubleshooting of our most severe incidents - contributing senior stakeholder communication, driving problem-solving(e.g., log analysis, non-invasive tests) and debugging with best practice techniques
- Assisting and contributing to continuous improvement and execution of quality and timely major incident root cause analysis and blameless post mortem activitiesto ensure we take action to avoid similar problems in the future
- Participate in prioritization of reliability features and contribute to the design, development and delivery of effective tooling, alerts, and automated responses to identify and address reliability risks.
- Contribute to In-depth data analysis to gauge service trends and drive improvements.
- Play a key role in proactive communication of reliability, stability and efficiency results (based on Service Level Objectives), service health (via dashboards) key reliability risks and issues to senior business and technology stakeholders - to prioritize activity (based on trend analysis ) and direct investment and action
- Enable and design/developing reliability solutions - this may include writing code and scripts to automate provisioning of services and to configure services
- Assisting in improving infrastructure automation, efficiency, and cost
- Actively pursue effective and efficient operations of his/her respective areas, while ensuring the adequacy, adherence to and effectiveness of day-to-day business controls to meet obligations with respect to operational risk, regulatory compliance risk, AML/ATF risk and conduct risk, including but not limited to responsibilities under the Operational Risk Management Framework, Regulatory Compliance Risk Management Framework, AML/ATF Global Handbook and the Guidelines for Business Conduct.
**Education and experience**
- Top notch engineer with ability to work globally across the Enterprise.
- Performance and results oriented leadership skills - with a developmental bias (coaching)
- Experience with ITSM tools (ServiceNow, a plus) with strong understanding of SRE and service management principles
- Strong organizational skills and the ability to effectively manage multiple tasks simultaneously
- Capability of working in a complex and fast paced environment
- Ability to represent the team in meetings and presentations that include SeniorBusiness Technology executives
- Ability to maintain calm during stressful situations
- Degree in Computer Science, Engineering, or equivalent experience. ITIL V3 Foundation Cert. in ITSM would be an asset
- 8 + years’ experience in IT
- 2-3 years professional coding experience in one or more of the following: C, C++, Java would be asset.
- Mastery of one or more scripting languages for automating systems, eg. Bash, Python, Ansible would be asset.
- Well-rounded broad knowledge of OS platforms (Linux/UNIX), Networking, Web Systems an



  • Lima, Perú Scotiabank A tiempo completo

    ID de la solicitud: 227737 Gracias por tu interés en ser parte de Scotiabank Perú, apreciamos tu postulación. Estamos en la búsqueda de personas con talento que quieran crecer y lograr los objetivos de nuestra organización. ¡Te deseamos mucho éxito dentro de este proceso! **Senior Systems Reliability Engineer** - Business Line: Operaciones &...


  • Lima, Perú Groupon A tiempo completo

    Groupon is a marketplace where customers discover new experiences and services everyday and local businesses thrive. To date we have worked with over a million merchant partners worldwide, connecting over 16 million customers with deals across various categories. In a world often dominated by e-commerce giants, we stand out as one of the few platforms...


  • Lima, Perú Hunt Consolidated, Inc. A tiempo completo

    **ROLES AND RESPONSIBILITIES**: - Monitoring and calculation of reliability KPI (RAM, MTBF, etc). - Analyze predictive alerts from machine learning software ( for Rotaing and Mechanical assets) - Identify threats and opportunities for Plant production and manage them in MTO (mitigate Threats and Opportunities) process. - Analyze data and perform reliability...


  • Lima Metropolitan Area, Perú OpenLoop A tiempo completo

    OpenLoop is looking for a Senior Site Reliability Engineer to join our team in Lima, Peru.About the RoleCross-Functional CollaborationPartner with engineering teams to improve system reliability and deployment practices.Engage with teams on SRE guidelines and best practices for automation and infrastructure.Work with security teams to implement secure,...


  • Lima, Perú Careers at SunDevs A tiempo completo

    **Descripción del puesto**: Como Site Reliability Engineer en SunDevs, colaborarás con otros ingenieros de software senior y Platform Engineers para diseñar y desarrollar sistemas y plataformas en la nube altamente disponibles, escalables, seguras y mantenibles para resolver grandes desafíos. Brindarás asesoramiento y guía a nuestros ingenieros de...

  • Cloud Systems Engineer

    hace 4 días


    Lima, Perú WTW A tiempo completo

    The Cloud System Engineer will participate in configuring and managing the cloud infrastructure services. - Ensure the efficient ongoing operation of the infrastructure. Provisioning virtual machines, configuring load balancers setting up auto-scaling, and establishing connectivity between cloud resources. - Contribute to existing and new IT infrastructure,...

  • IT Systems

    hace 4 días


    Lima, Perú Llamabara Tech A tiempo completo

    About the RoleWe are looking for a proactive Junior IT Systems & Infrastructure Engineer to join our team in Lima.You will be responsible for ensuring the stability, performance, and security of our IT infrastructure, across on-premise environments, cloud platforms, and networking systems.This role combines hands-on technical execution with strategic systems...


  • Lima, Perú Willis Towers Watson A tiempo completo

    **The Role** We are a group of passionate engineers who have built the largest private Medicare marketplace in the United States. We focus on the continuous improvement of our systems and culture. We improve and maintain a platform that provides the best possible experience to shop for insurance plans, and allows our insurance carriers to be be confident...


  • Lima, Perú Canonical - Jobs A tiempo completo

    **Site Reliability Engineer**: To become a member of this team, you need to be a software engineer fluent in Python, you need a genuine interest in the full open source infrastructure stack from metal to containers, and you need the ability to work in a high pressure operations environment with mission-critical services for global brand name customers. As a...


  • Lima, Perú Product Perfect, LLC A tiempo completo

    **Job Title**: Senior Database Engineer Consultant (NetSuite Specialist) **Company**: Product Perfect **Location**: Remote (Orange County, California) **Job Type**: Freelance, 1099 Contract **Overview**: Product Perfect is seeking a highly skilled Senior Database Engineer Consultant with expertise in NetSuite and extensive experience in SQL database...