Site Reliability Engineer

hace 3 semanas


Lima, Perú Careers at SunDevs A tiempo completo

**Descripción del puesto**:
Como Site Reliability Engineer en SunDevs, colaborarás con otros ingenieros de software senior y Platform Engineers para diseñar y desarrollar sistemas y plataformas en la nube altamente disponibles, escalables, seguras y mantenibles para resolver grandes desafíos.

Brindarás asesoramiento y guía a nuestros ingenieros de software y SRE para implementar altos estándares y prácticas de seguridad durante el ciclo de vida del desarrollo de software para las nuevas funciones y correcciones de errores en nuestros productos y servicios.

Tendrás que liderar algunas reuniones con clientes y partes interesadas del negocio para explicar tus planes para mejorar la seguridad, escalabilidad, disponibilidad y fiabilidad en nuestros sistemas, tus descubrimientos y las soluciones propuestas.

**Lo que buscamos**:
En SunDevs estamos en una etapa de crecimiento, tratando con clientes basados en Estados Unidos, Canadá y Latinoamérica que exigen un alto nível de compromiso y transparencia sobre el progreso de las nuevas funciones e ideas que les proponemos.

En SunDevs aprendemos y nos movemos rápido, estamos implementando varios cambios en toda la empresa, incluida la adopción de prácticas de SRE, Road Maps basados en resultados y una sólida cultura de Equipos de Producto.

Buscamos un Site Reliability Engineer calificado y experimentado para unirse a nuestro equipo dinámico, alguien con un fuerte sentido de pertenencia con su equipo y la misión del producto o servicio que están construyendo, y un alto sentido de urgencia para entregar resultados que generen un impacto positivo en los objetivos de negocio. Como Ingeniero de Fiabilidad del Sitio, desempeñarás un papel crucial en garantizar la disponibilidad, escalabilidad y fiabilidad de nuestros sistemas. Colaborarás con equipos multifuncionales para diseñar, construir y mantener una infraestructura altamente eficiente y automatizada.

**Responsabilidades clave**:

- Diseñar, implementar y mantener una infraestructura robusta y escalable para respaldar nuestras aplicaciones y servicios.
- Desarrollar y mantener sistemas de monitoreo y alerta para identificar y resolver proactivamente problemas potenciales.
- Colaborar con ingenieros de software para optimizar el rendimiento, la escalabilidad y la disponibilidad de las aplicaciones.
- Automatizar procesos manuales para mejorar la eficiencia y reducir la carga operativa.
- Realizar análisis regulares de rendimiento y capacidad para identificar y abordar cuellos de botella.
- Implementar planes de recuperación ante desastres y continuidad del negocio para garantizar la resiliencia del sistema.
- Solucionar y resolver incidentes de producción y proporcionar una respuesta oportuna a los incidentes.
- Colaborar con equipos multifuncionales para definir y hacer cumplir las mejores prácticas y estándares para la fiabilidad y el rendimiento del sistema.
- Mantenerse actualizado con las tendencias de la industria y las tecnologías emergentes, y evaluar su impacto potencial en nuestros sistemas y procesos.
- Mantener una actitud positiva, empática y profesional hacia los clientes, terceros interesados, gerentes de producto, gerentes de entrega, diseñadores de producto, ingenieros de software y cualquier otro miembro de tu equipo.
- Asegurarse de entregar a tiempo todas las tareas programadas a las que tú y tu equipo se comprometieron.
- Notificar rápidamente y de manera oportuna al cliente, a las partes interesadas, a otros gerentes y a tu equipo sobre cualquier cambio o riesgo que pueda afectar la entrega a tiempo de tus tareas y resultados.
- Hacer que el estado de las tareas del producto/proyecto sea siempre visible para los clientes y cualquier otra parte interesada relevante.
- Participar en una reunión 1:1 con el resto de tu equipo.
- Proporcionar retroalimentación oportuna a tu equipo.
- Participar en la definición de los OKR del producto para tu equipo.
- Participar en la Encuesta de Retroalimentación 360 para los miembros del equipo.

**Requisitos**:

- Inglés B1 como mínimo
- Excelentes habilidades de comunicación con partes interesadas de alto nível y de negocios
- Licenciatura en Ciencias de la Computación, Ingeniería de Software o Sistemas, o experiência práctica equivalente en un campo relacionado con el software.
- Más de 2 años manejando sistemas operativos Linux
- Más de 3 años de experiência escribiendo código seguro en lenguajes como Python, Java, JavaScript, GO y Bash, lo que significa que puedes automatizar tareas y procesos
- Amplia experiência con protocolos de enrutamiento, encriptación, firewalls, Nubes Privadas Virtuales (VPC) y redes privadas virtuales (VPN).
- Familiaridad con herramientas de monitoreo y análisis de rendimiento (por ejemplo, Prometheus, Grafana, CloudWatch).
- Comprensión de los sistemas de bases de datos y experiência en administración de bases de datos (por ejemplo, MySQL, PostgreSQL, MongoDB).
- Conocimie


  • Site Reliability Engineer

    hace 3 semanas


    Lima, Perú Rappi A tiempo completo

    It is time for you to join us to show the world that we are the company that is coming to change paradigms, where we revolutionize hours, minutes and seconds. Because in Rappi WE SEE OPPORTUNITIES where others see problems. WE SEE CLOSENESS where others see distance. WE SEE ADRENALINE where others see pressure. Join a team where we are all capable of...

  • Site Reliability Engineer

    hace 2 semanas


    Lima, Perú Willis Towers Watson A tiempo completo

    We have spent many years growing and fostering a DevOps culture by bridging the divide between our Software and Infrastructure Engineering departments. We want the cross-functional teams that we are building to include Site Reliability Engineers. We operate in a complex, multi-tenant, hybrid cloud and on-premises infrastructure that spans both the Windows...


  • Lima, Perú Willis Towers Watson A tiempo completo

    We have spent many years growing and fostering a DevOps culture by bridging the divide between our Software and Infrastructure Engineering departments. We want the cross-functional teams that we are building to include Site Reliability Engineers. We operate in a complex, multi-tenant, hybrid cloud and on-premises infrastructure that spans both the Windows...

  • Site Reliability Engineer

    hace 3 semanas


    Lima, Perú WTW A tiempo completo

    We have spent many years growing and fostering a DevOps culture by bridging the divide between our Software and Infrastructure Engineering departments. We want the cross-functional teams that we are building to include Site Reliability Engineers. We operate in a complex, multi-tenant, hybrid cloud and on-premises infrastructure that spans both the Windows...


  • Lima, Perú Neara A tiempo completo

    Neara is a high-growth, venture-backed Series B, tech company headquartered in Sydney, Australia. We work with 75% of the utilities in Australia and New Zealand and are growing rapidly across the US and Europe. Our mission is to revolutionise the utilities industry by helping them future-proof their infrastructure and navigate the challenges of the clean...


  • Lima, Perú Neara A tiempo completo

    Neara is a high-growth, venture-backed Series B, tech company headquartered in Sydney, Australia. Recognized as one of the **Times 100 Most Influential Companies of 2024**, our vision is to change how the world operates, build and designs critical infrastructure. We work with 75% of the utilities in Australia and New Zealand and are rapidly expanding across...


  • Lima, Perú Canonical - Jobs A tiempo completo

    **Site Reliability Engineer**: To become a member of this team, you need to be a software engineer fluent in Python, you need a genuine interest in the full open source infrastructure stack from metal to containers, and you need the ability to work in a high pressure operations environment with mission-critical services for global brand name customers. As a...


  • Lima, Perú Hunt Consolidated, Inc. A tiempo completo

    **ROLES AND RESPONSIBILITIES**: - Monitoring and calculation of reliability KPI (RAM, MTBF, etc). - Analyze predictive alerts from machine learning software ( for Rotaing and Mechanical assets) - Identify threats and opportunities for Plant production and manage them in MTO (mitigate Threats and Opportunities) process. - Analyze data and perform reliability...


  • Lima, Perú Nucleus Health A tiempo completo

    A U.S.based company that is on a mission to develop the largest online marketplace and media platform in the world is looking for a Senior DevOps/SRE Engineer. The engineer will be working with cross-functional teams to raise system performance, reliability, and effectiveness. The company is developing a knowledge-commerce platform that connects clients and...


  • Lima, Perú Hunt Consolidated, Inc. A tiempo completo

    ROLES AND RESPONSIBILITIES: Monitoring and calculation of reliability KPI (RAM, MTBF, etc). Analyze predictive alerts from machine learning software ( for Rotaing and Mechanical assets) Identify threats and opportunities for Plant production and manage them in MTO (mitigate Threats and Opportunities) process. Analyze data and perform reliability analysis for...

  • Site Reliability Engineer

    hace 2 semanas


    Lima, Perú Wikimedia Foundation A tiempo completo

    SummaryThe Wikimedia Foundation is looking for a Site Reliability Engineer (Database) to join our SRE team to build, optimize and support the platform serving the world's favorite encyclopædia to millions of people around the globe. Wikipedia and its sister projects are a globally distributed architecture powered strictly by Free and Open Source software....

  • Site Reliability Engineer

    hace 2 semanas


    Lima, Perú Wikimedia Foundation A tiempo completo

    SummaryThe Wikimedia Foundation is looking for a Site Reliability Engineer (Database) to join our SRE team to build, optimize and support the platform serving the world's favorite encyclopædia to millions of people around the globe. Wikipedia and its sister projects are a globally distributed architecture powered strictly by Free and Open Source software....


  • Lima, Perú Kyndryl Peru SRL A tiempo completo

    **Why Kyndryl** Kyndryl is a market leader that thinks and acts like a start-up. We design, build, manage, and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl? We are always moving forward - always pushing ourselves to go further in our efforts to build a more equitable, inclusive world for our...

  • Site Reliability Engineer

    hace 3 semanas


    Lima, Perú Wikimedia Foundation A tiempo completo

    **Summary** The Wikimedia Foundation is looking for a Site Reliability Engineer (Database) to join our SRE team to build, optimize and support the platform serving the world's favorite encyclopædia to millions of people around the globe. Wikipedia and its sister projects are a globally distributed architecture powered strictly by Free and Open Source...


  • Lima, Perú Kyndryl Peru SRL A tiempo completo

    **Why Kyndryl** Kyndryl is a market leader that thinks and acts like a start-up. We design, build, manage, and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl? We are always moving forward - always pushing ourselves to go further in our efforts to build a more equitable, inclusive world for our...


  • Lima, Perú Kyndryl Peru SRL A tiempo completo

    Why KyndrylKyndryl is a market leader that thinks and acts like a start-up. We design, build, manage, and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl?We are always moving forward - always pushing ourselves to go further in our efforts to build a more equitable, inclusive world for our...


  • Lima, Perú Torre A tiempo completo

    We're helping one of our clients, Agile Dream Team, hire a Site Reliability Engineering Leader. “We transform your vision into reality with our on-demand & fully managed Agile software development teams.” Compensation: To be agreed upon. Location: Remote (anywhere). Skills: Proficient in Azure DevOps server, SQL and leadership. Responsibilities and...


  • Lima, Perú Torre A tiempo completo

    We're helping one of our clients, Agile Dream Team, hire a Site Reliability Engineering Leader. “We transform your vision into reality with our on-demand & fully managed Agile software development teams.” Compensation: To be agreed upon. Location: Remote (anywhere). Skills: Proficient in Azure DevOps server, SQL and leadership. Responsibilities and...


  • Lima, Perú Torre A tiempo completo

    We're helping one of our clients, Agile Dream Team, hire a Site Reliability Engineering Leader. “We transform your vision into reality with our on-demand & fully managed Agile software development teams.” Compensation: To be agreed upon. Location: Remote (anywhere). Skills: Proficient in Azure DevOps server, SQL. Responsibilities and...


  • Lima, Perú Torre A tiempo completo

    We're helping one of our clients, Agile Dream Team, hire a Site Reliability Engineering Leader. “We transform your vision into reality with our on-demand & fully managed Agile software development teams.” Compensation: To be agreed upon. Location: Remote (anywhere). Skills: Proficient in Azure DevOps server, SQL. Responsibilities and...