Site Reliability Engineer
Sysdig
Software Engineering
Remote
Posted on Sep 18, 2024
In the cloud, every second counts. On the leading edge of security, Sysdig stops attacks in real-time by instantly detecting changes in cloud security risk with runtime insights and open source Falco. We are passionate open source enthusiasts at heart and problem-solvers who are building and delivering powerful solutions to secure cloud-native applications.
We value diverse opinions and open dialogue to spur ideas. We believe in working together to achieve our goals and we pride ourselves on a flexible work culture. We're an international company that understands how to cultivate an inclusive environment across remote teams.
And we're a great place to work too – we've been named a "Best Place to Work" by Inc.,the San Francisco Business Times and the Silicon Valley Business Journal, and we won six workplace awards from Comparably this year. We have been recognized by Deloitte as one of the 500 fastest-growing organizations for the last four years.
We are looking for driven team members who want to join us on our mission to lead cloud security globally. Does this sound like the right place for you?
What you will do
- Reporting to the SRE Manager you will build and manage systems across internal and production Cloud environments with a focus on configuration as code and platform automation
- You will implement reliability improvement projects, including capacity planning, performance tuning, load testing and infrastructure optimization
- You will measure KPI with Service Level Indicators (SLIs), Service Level Objectives (SLOs) and Service Level Agreement (SLAs) and help to define them
- You will help improve our incident response. Perform root cause analysis (RCA), troubleshoot and debug issues across our infrastructure and platform services to identify and fix causes
What you will bring with you
- 2/3 SRE, DevOps or Cloud Infrastructure Engineer experience
- 2/3 experience in containerization (kubernetes, docker and helm charts) - all of them
- 2/3 experience with Linux systems and networking
- Software development skills; Go and Python a big plus
What we look for
- Familiarity with monitoring tools such as Sysdig, Prometheus, Nagios, Icinga, Zabbix
- Tooling and automations development experience
- Experience in CI/CD tools such as Harness or Jenkins
- Experience diagnosing and troubleshooting complex problems in high-throughput applications and network services
Why work at Sysdig?
- We're a well funded startup that already has a large enterprise customer base
- We have an organizational focus on delivering value to customers
- Our open source tools (https://sysdig.com/opensource/) are widely used and loved by technologists & developers
When you join Sysdig, you can expect:
- Great compensation package, including equity opportunities
- Benefits vary based on location
- An international culture with employees in more than 40 countries
- Flexible work arrangement
- Mental well-being support for you and your family and company-wide recharge days
- Development opportunities
We would love for you to join us! Please reach out even if your experience doesn't perfectly match the job description. We can always explore other options after starting the conversation. Your background and passion will set you apart, especially if your career path is different.
Some of our Hiring Managers are globally distributed, an English version of your CV will be appreciated.
Sysdig values a diverse workplace and encourages women, people of color, LGBTQIA+ individuals, people with disabilities, members of ethnic minorities, foreign-born residents, and veterans to apply. Sysdig is an equal-opportunity employer. Sysdig does not discriminate on the basis of race, color, religion, sex, national origin, age, disability, genetic information, sexual orientation, gender identity, or any other legally protected status.
#LI- JG1
#LI-Hybrid