SRE - 1
Bijak
This job is no longer accepting applications
See open jobs at Bijak.See open jobs similar to "SRE - 1" Omnivore.Remote
Posted on Jul 29, 2024
As a Site Reliability Engineer I at Bijak, you will play a crucial role in ensuring the reliability, scalability, and performance of our infrastructure. You will collaborate with cross-functional teams to support and monitor applications in Production. This role offers an exciting opportunity to contribute to a cutting-edge technology environment and drive the reliability of our services.
Key Responsibilities
- Collaborate with development and operations teams to enhance the reliability, scalability, and performance of our systems
- Identify and troubleshoot issues related to infrastructure, applications, and networks
- Participate in on-call rotations to ensure 24/7 availability and respond to incidents in a timely manner
- Implement and improve monitoring and alerting solutions to proactively identify and address potential issues
- Work on capacity planning, performance analysis, and optimization of our infrastructure
- Contribute to the documentation of system architecture, configurations, and operational procedures
- Stay up-to-date with industry best practices and emerging technologies to continuously improve our systems
- Tracking issues and tasks in a project management tool
- Aid in the deployment process and perform Quality Assurance(Testing) on production environments
Candidate Profile
Must Have
- Bachelor’s degree in Computer Science, Information Technology, or related field.
- Proven experience in a Site Reliability Engineer or similar role.
- Strong problem-solving skills and ability to work collaboratively in a team environment.
- Excellent communication skills and the ability to document complex technical solutions.
- Familiarity with log management and analysis tools
Good To Have
- Strong proficiency in scripting and automation (e.g., Python, Bash, Shell).
- Experience with containerization technologies (e.g., Docker, Kubernetes).
- Solid understanding of cloud platforms (e.g., AWS, Azure, GCP).
- Knowledge of infrastructure as code (e.g., Terraform, Ansible).
- Familiarity with monitoring tools and frameworks (e.g., Prometheus, Grafana).
- Good Understanding of Linux operating systems and environment and fluent with Linux terminal commands
- Hands-on operational experience in a high-volume or critical production service environment.
- Good experience in Cloud and Database.
- Worked with project management tools like Jira.
- Should have experience in ticketing systems for logging issues and task creation.
- Good understanding of DevOps tools like (Jenkins, GIT, and Dockers)
- Experience maintaining and deploying systems and software in diverse environments.
- Good debugging skills and Mobile App Testing skill is an added advantage
This job is no longer accepting applications
See open jobs at Bijak.See open jobs similar to "SRE - 1" Omnivore.