Facebook Pixel

Ulubione oferty

Aplikuj

Devops SRE

nr ref: 154/8/2024/KB
Konsultant prowadzący: Karolina Bucka
24 października 2024

W Antal zajmujemy się rekrutacją od ponad 20 lat. Dzięki działaniu w 10 wyspecjalizowanych dywizjach, świetnie orientujemy się w aktualnych trendach branżowych. Precyzyjnie określamy specyfikę stanowiska, klasyfikując kluczowe umiejętności i niezbędne kwalifikacje. Naszą misją jest nie tylko znalezienie kandydata, którego kompetencje wpisują się w wymagania danego ogłoszenia, ale przede wszystkim stanowiska, spełniającego oczekiwania kandydata. Numer rejestru agencji zatrudnienia: 496.

SRE DEVOPS

If you’re looking for a career that will help you stand out, join us and fulfill your potential. Whether you aim to reach the top or simply explore an exciting new direction, we offer opportunities, support, and rewards that will take you further.

Our Work Culture

We invest heavily in an Agile culture, adopting DevOps processes, CI/CD pipelines, and cloud technologies. We plan to establish a new development team in Krakow in 2023 as part of a long-term strategy to develop and support our platform in Europe.

This is an exciting opportunity to join a team in its early stages and make a key contribution.

Your Responsibilities

  • Manage application support operations, focusing on resiliency, availability, and monitoring system health and performance.
  • Coordinate resolution of production incidents, conducting post-mortem/RCA to identify root causes and improve processes.
  • Investigate, triage, and resolve production incidents with a focus on technical signals and root cause analysis.
  • Document post-incident recovery steps, contributing to process improvements, identifying deviations, and creating a Knowledge Base.
  • Actively participate in the service management community, engaging in Incident Management, Problem Management, and Service Delivery.
  • Define and deliver tactical and strategic service improvements across the technical and process landscape.
  • Apply SRE principles to continuously improve platform reliability, capacity, and performance, reducing toil and enhancing observability.
  • Develop observability tools and techniques for monitoring, alerting, incident detection, response, capacity management, and release safety.

What You Need to Succeed in This Role

  • 3+ years of experience in developing and supporting distributed systems written in Java.
  • Experience with Disaster Recovery methods and processes.
  • A methodical approach to troubleshooting and problem-solving skills.
  • Experience in application lifecycle management tooling: JIRA/Confluence, Ansible, Vulnerability Remediation, CI/CD automation.
  • Experience implementing and managing Logging, Monitoring, and Alerting frameworks for hybrid cloud using tools such as Geneos, Grafana, InfluxDB, Splunk, Loki, or similar tools.
  • Understanding of RDBMS Database, Cloud Technology, Unix/Linux, Job scheduling e.g., Control-m or autosys.
  • Ability to lead technical conversations with various technical support groups.
  • Excellent communication skills and experience working in Agile methodology.

Join us and grow your career in a dynamic and innovative environment!