hero

Companies you'll love to work for

39
companies
436
Jobs

Senior Software Engineer (SRE - DevOps)

emids

emids

Software Engineering
Bengaluru, Karnataka, India
Posted on Sep 18, 2024

Job Title : Senior Software Engineer (SRE – DevOps)

Location: Bangalore

Experience: 5-7 yrs

Notice Period: 15-30 Days

Skills to Focus: DevOps, AWS, IaC

Job Description:

Summary:

  • We are building a data infrastructure and family of products to help transform the HealthCare industry.
  • Our data scientists and engineering teams are creating smart products that use clinical patient data to do this and need great Site Reliability Engineers to build, deploy and support the infrastructure that brings their work to our customers.
  • Our Site Reliability Engineering team supports our AWS footprint as infrastructure as code, are super security minded and know how to help the engineering team create a highly reliable system.
  • SRE also automate everything. Our team shares an on-call rotation backed by our engineering teams.


Roles & Responsibilities:

As one of our SREs you will be capable of doing many of the following:

  • Analyze and improve the efficiency, scalability, and reliability of our backend systems
  • Build and mature automation tools for robust continuous integration and deployment pipelines
  • Build scalable, secure, and measurable infrastructure with code
  • Facilitate capacity planning
  • Champion code health, rigorous testing, and maintainability standards
  • Create automation of engineering deployments
  • Create scalable and reliable monitoring and alerting that works
  • Create actionable documentation and playbooks, and when possible automation, to resolve recurring issues and proactively address issues before impact is felt
  • Design, build, and upkeep tools, systems, and self-service options to elevate engineering team productivity and reduce toil
  • Maintain a stable, scalable, and secure development environment while keeping abreast of the latest DevOps innovations
  • Own and maintain client services and data infrastructure in production
  • Support disaster recovery design, implementation, and testing
  • Support engineering teams in implementing system reliability
  • When things go bad, perform advanced troubleshooting of our systems


Required & Desired Skills:

  • Amazon Web Services (AWS) and Application Programming Interface design and best practices
  • Experience with CI/CD tools such as Jenkins, GitLab CI, CircleCI, GitHub Actions) and version control systems such as Git
  • Experience with deployment and config management systems like Salt Stack, Ansible, and HashiCorp
  • Experience with monitoring and logging applications like FluentD, Graylog, and Datadog
  • Familiarity with containerization and orchestration technologies
  • Knowledge of cloud services (e.g., AWS, GCP, Azure) and infrastructure as code (e.g., Terraform, CloudFormation)
  • Proficiency in version control systems like Git and CI/CD tools like Octopus Deploy or ArgoCD
  • Strong communication and collaboration skills to work effectively with cross-functional teams
  • Strong knowledge of best-in-class security practices and testing methods
  • Strong knowledge of internet service architecture (TCP/IP, HTTP, DNS, routing, load balancing)
  • Strong knowledge of the configuration and maintenance of common big infrastructure components such as Cassandra, Redis, FluentD, Apache/Django/Flask, Kafka, Redis, Elasticsearch & Hadoop
  • Strong scripting skills like Python, Ruby, or Bash
  • Strong understanding of Unix and system administration

Thanks.