Monitoring and Observability Analyst
2 days ago
About Us
Coderio designs and delivers scalable digital solutions for global businesses. With a strong technical foundation and a product mindset, our teams lead complex software projects from architecture to execution. We value autonomy, clear communication, and technical excellence. We work closely with international teams and partners, building technology that makes a difference.
In this role, as a Monitoring and Observability Analyst, you will design, implement, and maintain proactive monitoring and alerting systems to ensure the availability, performance, and health of IT infrastructure, applications, and services. Your main focus will be on designing end-to-end monitoring solutions using metrics, logs, and traces, configuring business-impact-based alert thresholds (SLIs/SLOs), and supporting incident resolution by providing detailed monitoring data for Root Cause Analysis (RCA). You will work closely with Operations and Development (DevOps) teams to minimize MTTR (Mean Time to Recovery) and support the continuous improvement of the ecosystem.
The role of Monitoring and Observability Engineer/Analyst is critical to our operation and requires continuous coverage (24/7).
Because we support infrastructure in the United States, all shifts and holidays follow the U.S. time zone and holiday schedule.
Saturdays, Sundays, and all U.S. holidays require 24-hour coverage, which is divided into full 12-hour shifts.
It is essential that you have the availability and willingness to work this shift pattern (evening/night and weekends/holidays) to ensure service continuity and compliance with our SLAs.
What to Expect in This Role (Responsibilities)
Contribute to the definition of the company's observability strategy, aligned with industry best practices (SRE/DevOps).
Design and implement end-to-end monitoring solutions.
Configure alert thresholds (SLIs/SLOs) based on business impact and minimize notification noise.
Develop and maintain informative and visually clear dashboards (e.g., Grafana, Kibana) for real-time visibility.
Implement and optimize monitoring automation, from agent deployment to automated alert response (basic to intermediate AIOps).
Administer and maintain monitoring platforms (updates, patches, cost optimization).
Create and maintain technical documentation (runbooks, monitoring procedures, service maps).
Requirements
Minimum 3 years of experience in Monitoring, IT Operations, or SRE roles.
Advanced experience with one or more monitoring platforms: Prometheus/Grafana, ELK Stack, New Relic, Datadog or similar.
Proficiency in monitoring cloud environments (AWS/Azure/GCP) and containers (Docker, Kubernetes).
Solid understanding of log management (Fluentd, Logstash, Loki) and distributed tracing (Jaeger, Zipkin, OpenTelemetry).
Practical experience with scripting languages (e.g., Python, Bash) for task automation and custom check development.
Deep knowledge of Linux operating systems.
Strong analytical skills: the ability to correlate events and data from multiple sources to identify the root cause of complex problems.
A proactive mindset: the ability to anticipate problems instead of just reacting to alerts.
Excellent oral and written communication skills.
Experience in a collaborative work environment with a DevOps mindset.
Bachelor's degree in Systems Engineering, Computer Science, or a related field.
Nice to Have
Certifications related to Cloud (AWS, Azure).
Certifications related to Observability Platforms (Datadog, Dynatrace).
Certifications related to DevOps/SRE practices.
Understanding of basic networking concepts (TCP/IP, DNS, Load Balancers).
Benefits
100% remote
Long-term commitment, with autonomy and impact
Strategic and high-visibility role in a modern engineering culture
Collaborative international team and strong technical leadership
Clear path to growth and leadership within Coderio
Why join Coderio?
At Coderio, we value talent regardless of location. We are a remote-first company, passionate about technology, collaborative work, and fair compensation. We offer an inclusive, challenging environment with real opportunities for growth. If you are motivated to build solutions with impact, we are waiting for you.
Apply now.