SR. SRE Manager (Remote)

  • Location: Portland , Oregon
  • Type: Direct Hire
  • Job #710

As a Site Reliability Engineer (SRE) Manager in the SaaS Operations team you'll be part of a team who measures and improves production performance reliability through sustainable engineering practices for our suite of applications. Toil will be your number one enemy, observability your closest friend and your mission will be to drive operational burden as close to zero as you can. You will work closely with our Development, Quality Engineering, Security, PaaS and Performance engineering teams to continuously improve our products in production. You are highly empowered and a critical customer interface for how they experience our company and products. You’ll need to guide, direct and teach a teams of SREs towards maturing day to day practices.

Essential Functions

  • Facilitate and lead a cross-functional agile team. Responsible for the productivity, quality, and success of the team.
  • Work closely with Product Management to gather requirements.
  • Organize the team to provide consistent and steady output.
  • Remove blockers and impediments that impact the team.
  • Motivate, guide, and mentor the team to grow in knowledge and skills.
  • Provide technical input as needed, but primarily create an environment where the team can self-organize.
  • Facilitate and enforce all agile ceremonies and meetings. Ensure processes are followed. Adapt as necessary.
  • Provide constructive feedback to team members via regular 1:1 meeting.
  • Listen to feedback from the team and use it to improve the environment and productivity of the team.
  • Other duties as assigned.

Education and Experience

  • BS in Computer Science or equivalent;
  • Minimum of 10+ years of industry experience;
  • Or a combination of both.
  • Experience leading SRE teams in SaaS context
  • Experience working in distributed teams
  • Experience working in cloud native platforms and PaaS
  • Strong mentorship experience taking SRE teams through transformational periods
  • Strong interpersonal skills
  • A can-do attitude and sense of urgency for a high growth/ fast paced environment
  • Proven track record of leading implementations of build and release engineering best practices, both processes and technologies
  • Curious mind, wanting to learn new technologies
  • The ability to think outside of the box to resolve issues and create solutions

Technical Requirements

  • Experience working with containers and container orchestration platforms
  • Experience with declarative IaC frameworks: BOSH, Terraform, Puppet/Chef/Ansible/Salt Experience working inside modern observability platforms like:
    • Centralized logging (ELK, Splunk)
    • APM (New Relic, AppDynamics, Dynatrace)
    • Platform telemetry (DataDog, Nagios)
  • Experience working delivering with CI/CD pipelines (Concourse, Bamboo, Jenkins)
  • Strong linux experience
  • Experience with full stack engineering from Java and/or .Net front end services to backend storage systems in both sql and no-sql contexts
  • Strong experience in public and private cloud contexts (API driven infra)
  • Experience with search platforms like Elastic and Solr at high scale
  • Strong coding experience in any of the following: java, python, ruby, go
  • CloudFoundy experience is a plus
Attach a resume file. Accepted file types are DOC, DOCX, PDF, HTML, and TXT.

We are uploading your application. It may take a few moments to read your resume. Please wait!

About Us

Catapult Recruiting was founded by a group of seasoned IT professionals who are native to the Portland area and love Oregon.

Contact Us

6107 SW Murray Blvd, Unit 269
Beaverton, OR 97008
(503) 970-3111