hero

Build the future of Indian agriculture with us.

34
companies
210
Jobs

SRE - 1

Bijak

Bijak

Remote
Posted on Jul 29, 2024

As a Site Reliability Engineer I at Bijak, you will play a crucial role in ensuring the reliability, scalability, and performance of our infrastructure. You will collaborate with cross-functional teams to support and monitor applications in Production. This role offers an exciting opportunity to contribute to a cutting-edge technology environment and drive the reliability of our services.

Key Responsibilities

  • Collaborate with development and operations teams to enhance the reliability, scalability, and performance of our systems
  • Identify and troubleshoot issues related to infrastructure, applications, and networks
  • Participate in on-call rotations to ensure 24/7 availability and respond to incidents in a timely manner
  • Implement and improve monitoring and alerting solutions to proactively identify and address potential issues
  • Work on capacity planning, performance analysis, and optimization of our infrastructure
  • Contribute to the documentation of system architecture, configurations, and operational procedures
  • Stay up-to-date with industry best practices and emerging technologies to continuously improve our systems
  • Tracking issues and tasks in a project management tool
  • Aid in the deployment process and perform Quality Assurance(Testing) on production environments

Candidate Profile

Must Have

  • Bachelor’s degree in Computer Science, Information Technology, or related field.
  • Proven experience in a Site Reliability Engineer or similar role.
  • Strong problem-solving skills and ability to work collaboratively in a team environment.
  • Excellent communication skills and the ability to document complex technical solutions.
  • Familiarity with log management and analysis tools

Good To Have

  • Strong proficiency in scripting and automation (e.g., Python, Bash, Shell).
  • Experience with containerization technologies (e.g., Docker, Kubernetes).
  • Solid understanding of cloud platforms (e.g., AWS, Azure, GCP).
  • Knowledge of infrastructure as code (e.g., Terraform, Ansible).
  • Familiarity with monitoring tools and frameworks (e.g., Prometheus, Grafana).
  • Good Understanding of Linux operating systems and environment and fluent with Linux terminal commands
  • Hands-on operational experience in a high-volume or critical production service environment.
  • Good experience in Cloud and Database.
  • Worked with project management tools like Jira.
  • Should have experience in ticketing systems for logging issues and task creation.
  • Good understanding of DevOps tools like (Jenkins, GIT, and Dockers)
  • Experience maintaining and deploying systems and software in diverse environments.
  • Good debugging skills and Mobile App Testing skill is an added advantage