Senior Site Reliability Engineer

Job summary

Boca Raton

Engineering

Work model

Fully remote

Only United States

3 weeks ago

Job description

Requirements

Must have:

BSc in Engineering, Computer Science, or equivalent practical experience.
5+ years of experience in Site Reliability Engineering.
Strong background in a technical or IT-focused role.
Hands-on experience with configuration management tools such as Ansible, Puppet, Chef, or similar platforms.
Professional experience working in a public cloud environment such as Azure, AWS, or Google Cloud Platform.
Solid troubleshooting and support experience with Linux and Windows servers.
Experience with system and application monitoring tools such as Prometheus, Grafana, Nagios, or CloudWatch.
Familiarity with source control systems such as Git or SVN.
Ability to design cloud architecture and technical solutions that support business priorities.
Broad technical skill set and a proactive, enthusiastic approach to technology.
Excellent verbal and written communication skills.
Ability to serve as a technical point of reference, share best practices, and coach colleagues.

Preferred:

Azure or AWS certifications.
Experience using orchestration tools such as Terraform, Ansible, or CloudFormation.
Experience moving applications from on-premises infrastructure to the public cloud.
Familiarity with blue-green deployment methods.
Experience with continuous integration and delivery tools such as GitLab or Jenkins.
Experience working with containerized environments such as Docker.
Familiarity with log management tools such as Elastic Stack, Graylog, or Splunk.
Experience with enterprise databases such as MySQL or Microsoft SQL Server.
Understanding of change control processes and related procedures.
Experience using secret management services such as HashiCorp Vault.
Familiarity with a high-level programming language.

Responsibilities

Deliver resilient application platforms using Infrastructure as Code and other DevOps practices.
Monitor and support mission-critical, high-revenue business applications on an ongoing basis.
Investigate, diagnose, and resolve complex system and application incidents.
Collaborate closely with development, QA, IT operations, customer operations, and project management teams.
Create and maintain technical documentation for both technical and non-technical audiences.
Participate in an on-call rotation to help maintain 24/7, 365-day system availability.
Work across a diverse range of technologies as a leading member of the team.

Company

We are seeking a remote professional to join a friendly team working across a diverse technology landscape. We invest in our people and offer comprehensive benefits to eligible employees, including medical, dental, and vision insurance, HSA, FSA, 401(k), and life, disability, and ADD insurance. Salaried employees receive paid time off, while hourly employees may receive paid sick leave where required by law. This role does not include bonuses, incentives, or commissions. Compensation is determined by experience, skills, education, certifications, seniority, location, performance, and business needs. We are an equal opportunity employer committed to fair consideration for all qualified applicants.

More Remote jobs in Engineering

Backend / API Engineer (Bethesda (REMOTE), MD, US)

NTT DATA, Inc.

Join NTT DATA as a remote Backend/API Engineer. Design secure, scalable services for federal systems. Requires 3+ years experience and secret clear...

Fully remote· Only US

4 days ago

Sr Manager, Software Development & Engineering Lead (PL)

Charles Schwab

Lead mainframe and distributed development teams in our Mutual Fund trading systems. Join Schwab to drive modernization and innovation in finance.

Join Unqork as a Staff AI Engineer. Build and scale agentic AI products for enterprise applications in a remote-first, innovative environment.

Fully remote· Only US

4 days ago

View all Remote jobs in Engineering