Job openings across our network

26
companies
874
Jobs

Site Reliability Engineer

Sysdig

Sysdig

Software Engineering
Remote
Posted on Wednesday, November 15, 2023

Sysdig is driving the standard for securing the cloud and containers. We created Falco, the open standard for cloud-native threat detection, and consistently contribute to open source software projects. We are passionate, technical problem-solvers, continually innovating and delivering powerful solutions to secure the cloud from source to run.

We value diversity and open dialog to spur ideas, working closely together to achieve goals. We’re an international company that understands how to cultivate a strong culture across a remote team. And we're a great place to work too — we've been named a Bay Area Best Place to Work by the San Francisco Business Times and the Silicon Valley Business Journal for three years now! We were recognized by Deloitte as one of the 500 fastest growing organizations in 2020 and 2021. We are looking for team members who have a passion for container and cloud security and are willing to dig deeper to help our customers. Does this sound like the right place for you?

As a Site Reliability Engineer, you will build solutions to enhance the availability, security, and resilience of the Sysdig services, including backends and data stores. You will collaborate with the Infrastructure, Engineering, and Customer Success teams to provide the best experience for our high-profile customers.


What you will do


  • Deploy, upgrade and migrate large-scale Sysdig services on Kubernetes
  • Enable customers and Sysdig customer-facing teams to solve common issues in productions
  • Enhance the observability and reliability of Sysdig services to meet SLA/SLO
  • Automate manual and repetitive tasks to reduce the toil
  • Work with the Engineering team on security hardening in highly regulated environments

What you will bring with you


  • Working experience in deploying and running workloads on Kubernetes in production is a must
  • Working experience in monitoring production environments using Prometheus is a must
  • Working experience with one of the following data stores is highly preferred: Postgres, Redis, Cassandra, Elasticsearch, Kafka/Zookeeper
  • Ability to write and maintain technical documentation is a must
  • Strong coding skills in a high-level programming language (Python, Golang, etc.)
  • Working experience with Terraform or Helm
  • Experience with well-known CI/CD tool
  • Familiar with common Linux commands
  • Knowledge and experience in public cloud are preferred
  • On call every 6 weeks

Why work at Sysdig?

  • We’re a well-funded startup that already has a large enterprise customer base
  • We have a pragmatic, transparent culture, from the CEO down
  • We have an organizational focus on delivering value to customers
  • Our open source tools (https://sysdig.com/opensource/) are widely used and loved by technologists & developers

When you join Sysdig, you can expect:

  • Competitive compensation including equity opportunities
  • Flexible hours and additional recharge days
  • Mental wellbeing support through Modern Health for you and your family
  • Career growth

Some of our hiring managers are based internationally, an up to date CV in English would be appreciated

#LI-FD1

#LI-Hybrid