Site Reliability Engineer M/d01
hace 5 meses
**Responsibilities**:
- Help define the future of and contribute directly to Euronet’s infrastructure
- Ensure high uptime (99.98%) of our platform, performance, and scalability by leading the architecture, deployment, automation, maintenance, and management of mission-critical production systems.
- Manage major incidents to mitigation/resolution, perform post-incident reviews of all major incidents and determine action items required to avoid similar issues/minimize downtime for future incidents.
- Build tools and automation that eliminate work and reduce the time it takes to resolve an issue for public cloud and on-premises resources.
- Staying calm under pressure
- Coach team members, provide knowledge transfer to coworkers and encourage acquisition of new skills.
- Provide rotational on-call support where you’ll respond, detect, triage and resolve production incidents
- Meet all Euronet information security best practices to ensure all compliance requirements are met
**Requirements**:
- 8 or more years’ experience in software development and/or systems engineering
- Bachelor’s degree in a related field or equivalent experience required
- Strong knowledge of Linux and Windows operating systems and environment
- Strong knowledge of Networking, Load balancers, DNS, NTP, and TCP/IP
- Strong knowledge of AWS technologies (e.g., EC2, S3, RDS, VPC, EKS, ALB, NLB, CloudFormation)
- Experience with containers (Docker)
- Knowledge of container orchestration (Kubernetes)
- Experience with Infrastructure Automation tools like Terraform, Ansible, Puppet
- Experience with web servers IIS and Apache
- Proficiency in the design principles for monitoring and alerting systems
- Solid scripting skills; experience with Shell, Bash, Ansible, Python, PowerShell, Ruby
- Experience in setting up CI/CD pipelines (GitHub or AWS CodePipeline)
- Excellent organizational, verbal, and written communication skills
- A willingness to learn on the job and take on tasks as needed
**Additional Desired Experience**:
- Experience with one or more of the technologies used for big data: ELK, Beats, Kafka, Redis, Searchguard.
- Experience with Postfix
- Experience with one or more of the following F5 products: LTM, ASM, GTM, AFM, BIGIQ
- Experience with monitoring tools like Nagios, Icinga, SolarWinds, New Relic, Grafana
**Benefits**:
Life insurance 100% covered
50/50 Health insurance (optional)
50/50 Half scholarships (option)
Annual salary appraisal
Internal savings and credits association
Christmas bonus above the law
Opportunities for growth within the company
-
Site Reliability Engineer
hace 7 meses
El Salvador, Perú transact elektronische Zahlungssysteme A tiempo completoThis position is a part of the Infrastructure Engineering organization, and is a full-time, permanent position. The position is ideal for someone who wants to work on Linux systems supporting the observability platforms that the rest of the organization uses and depends on. This position provides a unique opportunity to work on large deployments of the...
-
Site Reliability Engineer W/a01
hace 5 meses
El Salvador, Perú transact elektronische Zahlungssysteme A tiempo completo**Responsibilities** - Help define the future of and contribute directly to Euronet’s infrastructure - Ensure high uptime (99.98%) of our platform, performance, and scalability by leading the architecture, deployment, automation, maintenance, and management of mission-critical production systems. - Manage major incidents to mitigation/resolution, perform...