Site Reliability Engineer - 1
Bijak
This job is no longer accepting applications
Software Engineering
Gurugram, Haryana, India
Posted on Tuesday, January 30, 2024
Job Brief:
As a Site Reliability Engineer I at Bijak, you will play a crucial role in ensuring the reliability, scalability, and performance of our infrastructure. You will collaborate with cross-functional teams to support and monitor applications in production. This role offers an exciting opportunity to contribute to a cutting-edge technology environment and drive the reliability of our services.
Key Responsibilities:
- Collaborate with development and operations teams to enhance the reliability, scalability, and performance of our systems
- Identify and troubleshoot issues related to infrastructure, applications, and networks
- Participate in on-call rotations to ensure 24/7 availability and respond to incidents in a timely manner
- Implement and improve monitoring and alerting solutions to proactively identify and address potential issues
- Work on capacity planning, performance analysis, and optimization of our infrastructure
- Contribute to the documentation of system architecture, configurations, and operational procedures
- Stay up-to-date with industry best practices and emerging technologies to continuously improve our systems
- Track issues and tasks in a project management tool
- Aid in the deployment process and perform quality assurance (testing) on production environments
Candidate Profile:
- Bachelor’s degree in Computer Science, Information Technology, or related field.
- Proven experience in a Site Reliability Engineer or similar role.
- Strong proficiency in scripting and automation (e.g., Python, Bash, Shell).
- Experience with containerization technologies (e.g., Docker, Kubernetes).
- Solid understanding of cloud platforms (e.g., AWS, Azure, GCP).
- Knowledge of infrastructure as code (e.g., Terraform, Ansible).
- Familiarity with monitoring tools and frameworks (e.g., Prometheus, Grafana).
- Strong problem-solving skills and ability to work collaboratively in a team environment.
- Excellent communication skills and the ability to document complex technical solutions.
- Familiarity with log management and analysis tools.
- Good understanding of Linux operating systems and environments, and fluency with Linux terminal commands.
- Hands-on operational experience in a high-volume or critical production service environment.
- Good experience with cloud platforms and databases.
- Experience with project management tools such as Jira.
- Experience with ticketing systems for logging issues and creating tasks.
- Good understanding of DevOps tools (e.g., Jenkins, Git, and Docker).
- Experience maintaining and deploying systems and software in diverse environments.
- Good debugging skills; mobile app testing experience is an added advantage.