Site Reliability Engineer
We are seeking a highly motivated and skilled Site Reliability Engineer to join our team and play a critical role in maintaining the stability, security, and scalability of our infrastructure, which supports both our backend and frontend applications. In this position, you will have the opportunity to work with cutting-edge technologies and collaborate with a team of talented engineers to build and enhance our infrastructure. You will be instrumental in ensuring optimal performance, high availability, and a seamless user experience.
What You Will Deliver
- Uphold System Reliability: Maintain and improve the reliability, scalability, and security of our infrastructure, guaranteeing system uptime and performance.
- Kubernetes Expertise: Develop and maintain code for our Kubernetes clusters, ensuring efficient resource allocation and orchestration.
- CI/CD Optimization: Build and maintain robust CI/CD pipelines, facilitating rapid and dependable software deployments.
- Cloud Infrastructure Management: Utilize public cloud services (GCP/GKE, AWS) to design and implement a resilient and scalable infrastructure.
- Cross-Functional Collaboration: Partner with development teams to identify and resolve performance bottlenecks, ensuring optimal application
performance. - Proactive Issue Resolution: Proactively identify and address potential issues, minimizing service disruptions and downtime.
- Technological Advancement: Implement new technologies and best practices to continuously enhance our infrastructure.
Who You Are
- Demonstrated Experience: A minimum of 3 years of experience as a Site Reliability/DevOps Engineer or a related role, showcasing your ability to build and maintain high-performance infrastructure.
- Technical Proficiency: A strong understanding of Kubernetes, CI/CD pipelines, and public cloud services (GCP/GKE, AWS, Argo).
- IaC Implementation: Experience with Infrastructure as Code tools and concepts (Terraform, Ansible, etc.).
- Analytical Acumen: Excellent problem-solving and analytical skills, enabling you to identify and resolve complex issues effectively.
- Collaborative Spirit: The ability to work both independently and collaboratively, fostering a positive and supportive team environment.
- Dedication to Learning: A genuine passion for technology and a commitment to continuous learning and professional development.
- SRE Tool Familiarity: Experience with SRE tools and practices (Prometheus,
Grafana, AlertManager, etc.). - Cloud-Native Expertise: Familiarity with cloud-native technologies (Docker,
Helm, Istio, etc.). - Security Focus: Experience with security tools and best practices.
What we offer:
- Competitive Compensation: A competitive salary and benefits package commensurate with your experience and contributions.
- Cutting-Edge Technology: The opportunity to work with the latest technologies and tools in the industry.
- Collaborative Environment: A supportive and collaborative work environment where your ideas are valued and respected.
- Growth Opportunities: Dedicated support for your professional development and career advancement.
- Diverse Team: A dynamic and multicultural team that fosters innovation and collaboration.
If you are a highly motivated and skilled Site Reliability Engineer seeking a challenging and rewarding opportunity to contribute to a world-class infrastructure, we encourage you to submit your application.