Reliability Engineer (Remote)
Inspira Financial
The Reliability Engineer (RE) will report to the Reliability Engineering Manager in the Technology Department. RE will work closely with engineering, security, and infrastructure teams to ensure Inspira’s systems are highly available, scalable, and secure. RE will play a crucial role in deployments, incident response, system reliability, and performance optimization, while also contributing to long reliability-term infrastructure strategies. Working within a team environment, the RE participates directly IT realiabilityin solution creation, providing hands-on support as well as operational support and training. This individual must be creative, client focused, solutions-driven, organized, and have the ability to thrive in a dynamic environment.
- Partner with the Engineering and Security teams to create, implement and apply SRE principles, processes, and controls.
- Build & support Site Reliability function & participate in building tools to monitor and report system KPIs.
- Monitoring of Platform and Environment with tools such as Datadog, Azure Monitor, etc.
- Configure and Support the Disaster Recovery and Business Resumption Plan as it relates to the backup and restoration of the technology infrastructure.
- Ensure run books are updated on a regular basis
- Utilize programming skills to design and develop programs or scripts for various repetitive functions
- Contribute to long-term infrastructure strategies and reliability improvements.
- Performs all duties with a focus on goals of Inspira, which includes risk mitigation
- Support inbound calls/emails, maintaining tickets within the issue tracking application related to Infrastructure Support
- Crosstrain other team members to facilitate coverage
- Other duties as assigned