Back to Jobs

Remote Site Reliability Engineer - Production Support Expert for Hybrid Work Environment in Atlanta, GA

Remote, USA Full-time Posted 2025-11-03

Unlock Your Next Career Move with Workwarp

Are you a skilled Site Reliability Engineer looking for a challenging and rewarding role in a dynamic and innovative company? Do you have a passion for ensuring the operational stability of business-critical systems and resolving incidents quickly? If so, we've got an exciting opportunity for you to join our team as a Remote Site Reliability Engineer - Production Support in Atlanta, GA.

At Workwarp, we're committed to fostering a culture of collaboration, innovation, and continuous learning. Our team is dedicated to delivering exceptional results and making a meaningful impact in the industry. As a Remote Site Reliability Engineer - Production Support, you'll be an integral part of our team, working closely with senior engineers and cross-functional teams to drive success.

About the Role

The Production Support Engineer II is responsible for providing day-to-day support for business-critical systems, ensuring operational stability, and quickly resolving incidents. This role focuses on resolving lower to medium-priority incidents, maintaining system health, and supporting the improvement of production environments.

Key Responsibilities

  • Identify, troubleshoot, and resolve lower to medium-priority technical issues with guidance from senior engineers, ensuring minimal disruption to business operations.
  • Support day-to-day monitoring of system performance and use monitoring tools (e.g., Splunk, Dynatrace, CloudWatch) to detect anomalies and take corrective actions.
  • Collaborate with cross-functional teams to resolve technical incidents and escalate higher-complexity issues to senior engineers as needed.
  • Assist in automating routine production support tasks by developing or modifying scripts and tools.
  • Maintain documentation for production issues, troubleshooting steps, and system configurations, contributing to the shared knowledge base.
  • Participate in incident, problem, and change management processes, following ITIL best practices.
  • Perform root cause analysis for recurring issues and assist senior engineers in implementing permanent fixes to improve system stability.
  • Support the implementation of process improvements to enhance system performance and minimize downtime.
  • Assist with mentoring and supporting junior-level engineers, providing guidance as needed.

Essential Qualifications

To be successful in this role, you'll need:

  • A Bachelor's Degree and 4-7 years of experience or equivalent education and software engineering training or experience.
  • Proficiency in using monitoring tools like Splunk, Dynatrace, or CloudWatch to detect and resolve system performance issues.
  • SRE (Site Reliability Engineer) skills.
  • In-depth knowledge in information systems and ability to identify, apply, and implement IT best practices.
  • Understanding of key business processes and competitive strategies related to the IT function.
  • Ability to plan and manage projects and solve complex problems by applying best practices.
  • Ability to provide direction and mentor less experienced teammates.
  • Ability to interpret and convey complex, difficult, or sensitive information.

Preferred Qualifications

While not essential, the following qualifications are highly desirable:

  • 4-8 years of experience in production support, systems administration, or related technical roles.
  • Experience with IT Service Management (ITSM) tools such as ServiceNow with solid understanding of incident, problem, and change management processes.
  • Familiarity with supporting Agile team/processes.
  • Experience in automation tools and scripting for production support tasks.
  • Banking or financial services experience.
  • Experience with cloud technologies such as Configuration Management (ex. Terraform), CICD GitLab, Containerization (ex. Kubernetes), etc.
  • AWS Certified Solutions Architect Associate certification is a plus.

What We Offer

At Workwarp, we're committed to providing a supportive and inclusive work environment that fosters growth and development. As a Remote Site Reliability Engineer - Production Support, you'll enjoy:

  • A competitive salary benchmarked against industry standards.
  • Opportunities for career growth and professional development.
  • A collaborative and dynamic work environment.
  • Flexible working arrangements, including remote work options.
  • A culture of continuous learning and innovation.
  • Access to cutting-edge technologies and tools.

Why Join Us?

If you're a motivated and experienced Site Reliability Engineer looking for a challenging and rewarding role, we want to hear from you. At Workwarp, we're passionate about delivering exceptional results and making a meaningful impact in the industry. As a member of our team, you'll have the opportunity to work on complex and exciting projects, collaborate with talented professionals, and develop your skills and expertise.

So why wait? Apply now and take the next step in your career with Workwarp!

Ready for an Easy Start?

This is a low-stress role with great rewards. If you're reliable and willing to learn, we want you. Apply now and join our team of talented professionals!

Apply for this job  

Similar Jobs