Site Reliability Engineers (Lead and Sr. Level)

Location: Remote USA
Job Type: Engineering

IQ Workforce is a leading recruiting firm for the engineering, analytics, and data science communities.

Our client is a global restaurant company, which engages in the development and operation of some of the most recognizable brands in the world. They have over 30,000 employees and operate over 50,000 restaurants in 150+ nations and territories.

They are currently looking to grow their Commerce team and one of the immediate needs is a pair of Site Reliability Engineers. These SREs will be part of an embedded software development team focused on building out an enterprise platform to support their franchisees with menu management, POS, online ordering, mobile apps, and much more.

As a Site Reliability Engineer, you will be responsible for ensuring the successful operation of several public-facing, high traffic volume microservices in the cloud. In this role, you will be partnering with software engineers from multiple product delivery teams to ensure that releases are automated, tested, performant, and will not violate the error budgets set by each service.

Your primary job function will be to ensure the highest possible uptime for several cloud-native digital commerce applications. This role will require the ability to troubleshoot complex CI/CD pipelines, backend API software code, and cloud infrastructure components through thoughtful practices such as code reviews, load testing, chaos engineering, monitoring, tracing, documentation, etc.

Qualifications:
Bachelor’s degree in Computer Science or Engineering or equivalent practical experience

6+ years’ experience with one or more programming or scripting languages such as Bash, C#, Java, Node, Python, or C++

Experience with source code control systems, branching and merging, automated unit testing frameworks, automated build tools, and automated deploy frameworks

Strong time management skills with the ability to multitask and deliver quality results.

Preferred:
Experience with container technology such as Docker, Kubernetes, and Helm.

Experience developing CI/CD pipelines for cloud-based infrastructure and applications.

Experience with cloud monitoring and logging technologies such as Prometheus and Elastic Stack.

Experience developing observability dashboards using tools like Grafana or Kibana.

Experience with infrastructure-as-code tools such as Terraform, Chef, Puppet, Ansible, or SaltStack

Experience architecting, deploying, or coding solutions on AWS, GCP, or Azure.