What makes the role special
We are seeking a Senior DevOps Engineer to join our Development team. In this role, you will be at the core of our technology stack — building, maintaining, and scaling high-availability systems that power our global services. You'll work with a diverse set of modern technologies, including Linux, Kubernetes, Infrastructure as Code (IaC) tools, monitoring systems, and distributed databases.
This position offers the opportunity to work at the heart of a high-performance global platform, directly influencing its scalability, reliability, and resilience. You will collaborate with experienced engineers, shape best practices, and contribute to the continuous evolution of our infrastructure — ensuring that our systems remain robust, secure, and efficient at scale.
Tasks to solve
Maintain and optimize core server infrastructure, including bare-metal servers, LXC containers, virtual machines, and cloud environments.
Operate and support core infrastructure services such as Nginx, Puppet, GitLab, Artifactory, Nexus, Harbor, Grafana, etc.
Manage and evolve infrastructure following the Infrastructure as Code (IaC) paradigm.
Handle and resolve incidents related to infrastructure operations.
Collaborate closely with cross-functional teams (network engineers, developers, and other technical stakeholders).
Design and implement high-availability, fault-tolerant, and scalable software solutions.
Monitor service performance and availability using modern observability tools, ensuring system reliability and optimal resource utilization.
This role is perfect for you if
4+ years of experience in Linux administration, DevOps, or Site Reliability Engineering (SRE).
Strong proficiency in automating tasks using Bash or similar scripting languages.
Solid understanding of networking fundamentals (TCP/IP stack, routing, DNS, etc.).
Hands-on experience managing bare-metal infrastructure in production environments.
Experience with configuration management systems such as Ansible or Puppet.
Experience with distributed databases (Elasticsearch, Cassandra, MongoDB, MySQL, PostgreSQL, etc.).
Expertise in Kubernetes administration and managing containerized workloads.
Experience with IaC tools such as FluxCD or ArgoCD.
Ability to design and implement high-performance, fault-tolerant, and secure infrastructure solutions.
Experience with monitoring and observability systems — Zabbix, VictoriaMetrics, Loki, Grafana, etc. — including building dashboards and configuring alerting.
What else we appreciate in our team
Programming experience in Python, Go
Experience with cloud providers (AWS, GCP, Alibaba Cloud, or others).
Familiarity with distributed storage systems (Ceph).
Experience with service meshes (e.g., Istio).
Proven track record of working on high-load, large-scale distributed systems.
Experience with cloud providers (AWS, GCP, Alibaba Cloud, or others).
Familiarity with distributed storage systems (Ceph).
Experience with service meshes (e.g., Istio).
