Senior Software Engineer (SRE - DevOps)
emids
Software Engineering
Bengaluru, Karnataka, India
Posted on Sep 18, 2024
Job Title : Senior Software Engineer (SRE – DevOps)
Location: Bangalore
Experience: 5-7 yrs
Notice Period: 15-30 Days
Skills to Focus: DevOps, AWS, IaC
Job Description:
Summary:
- We are building a data infrastructure and family of products to help transform the HealthCare industry.
- Our data scientists and engineering teams are creating smart products that use clinical patient data to do this and need great Site Reliability Engineers to build, deploy and support the infrastructure that brings their work to our customers.
- Our Site Reliability Engineering team supports our AWS footprint as infrastructure as code, are super security minded and know how to help the engineering team create a highly reliable system.
- SRE also automate everything. Our team shares an on-call rotation backed by our engineering teams.
Roles & Responsibilities:
As one of our SREs you will be capable of doing many of the following:
- Analyze and improve the efficiency, scalability, and reliability of our backend systems
- Build and mature automation tools for robust continuous integration and deployment pipelines
- Build scalable, secure, and measurable infrastructure with code
- Facilitate capacity planning
- Champion code health, rigorous testing, and maintainability standards
- Create automation of engineering deployments
- Create scalable and reliable monitoring and alerting that works
- Create actionable documentation and playbooks, and when possible automation, to resolve recurring issues and proactively address issues before impact is felt
- Design, build, and upkeep tools, systems, and self-service options to elevate engineering team productivity and reduce toil
- Maintain a stable, scalable, and secure development environment while keeping abreast of the latest DevOps innovations
- Own and maintain client services and data infrastructure in production
- Support disaster recovery design, implementation, and testing
- Support engineering teams in implementing system reliability
- When things go bad, perform advanced troubleshooting of our systems
Required & Desired Skills:
- Amazon Web Services (AWS) and Application Programming Interface design and best practices
- Experience with CI/CD tools such as Jenkins, GitLab CI, CircleCI, GitHub Actions) and version control systems such as Git
- Experience with deployment and config management systems like Salt Stack, Ansible, and HashiCorp
- Experience with monitoring and logging applications like FluentD, Graylog, and Datadog
- Familiarity with containerization and orchestration technologies
- Knowledge of cloud services (e.g., AWS, GCP, Azure) and infrastructure as code (e.g., Terraform, CloudFormation)
- Proficiency in version control systems like Git and CI/CD tools like Octopus Deploy or ArgoCD
- Strong communication and collaboration skills to work effectively with cross-functional teams
- Strong knowledge of best-in-class security practices and testing methods
- Strong knowledge of internet service architecture (TCP/IP, HTTP, DNS, routing, load balancing)
- Strong knowledge of the configuration and maintenance of common big infrastructure components such as Cassandra, Redis, FluentD, Apache/Django/Flask, Kafka, Redis, Elasticsearch & Hadoop
- Strong scripting skills like Python, Ruby, or Bash
- Strong understanding of Unix and system administration
Thanks.