Site Reliability Engineer 100 remote, KÖLN

The Basics


We are looking for a Site Reliability Engineer (m/f/d). You will be a key member of a tight-knit group of talented Engineers who are responsible for keeping our own and our customer’s Kubernetes clusters operational and healthy. You’ll also have a key role in the development of the product itself, working together with our Platform Engineers to deliver the greatest Kubernetes service possible. You will be joining our Cloud Integration Team working with Go and Kubernetes on AWS, Azure and GCP. Giant Swarm is a fast-growing open-source infrastructure management platform used by modern enterprises. Our vision is to empower developers around the world to ship great products. We are a diverse, fully remote (since 2014), and experienced team that is growing and spreading across Europe - with headquarters in Cologne.


Your Job



  1. You maintain, operate, and upgrade our own and our customer’s Kubernetes clusters.

  2. You will design, configure, build, and maintain distributed systems, as part of our managed Kubernetes offering.

  3. You will use a wide variety of open-source technologies and tools from across the open-source community, including Kubernetes, Cluster-API, and Flatcar.

  4. You understand how servers and systems work and you tweak their behavior to your needs, from kernel parameters to the infrastructure provider templates.

  5. You will help resolve incidents on our own and our customer’s clusters.

  6. You participate in the on-call support schedule.

  7. You are a go-to person in case our developers need advice regarding infrastructure.


Requirements



  1. You have deep hands-on knowledge of the inner workings of a Kubernetes cluster.

  2. You must be able to configure all cluster components from the ground up with no automated deployment tools (think Kubernetes the Hard Way).

  3. You have solid practical experience programming in Go. The ideal candidate also has experience working with Cluster API.

  4. You have worked with one (or more) of the major cloud providers.

  5. You’re comfortable debugging systems at all levels, from kernel & networking fundamentals right up to workloads running on Kubernetes.

  6. You’re happy troubleshooting a wide variety of issues and you’re not afraid to parse thousands of lines of logs in pursuit of an answer.

  7. You have experience with maintaining infrastructure with code and you know the pros and cons of various automation tools.

  8. You automate all the things by writing code.


About us


Every new team member changes the team. We love to learn from each other and we are looking for people who know things we don’t. Becoming part of Giant Swarm means that, by extension, you also become part of the Cloud Native community. We actively contribute to upstream projects and our quarterly hackathons will give you space to work on out-of-the-box projects. Occasionally, when we, as a team, want to fully focus on one project, we scratch all meetings and routines for a certain time to better focus during our hive-sprints. Continuous learning is important to us - we foster this through bi-yearly personal development talks, a budget for training/certifications/coaching as well as regular feedback talks and workshops. Our teams are cross-functional and collaboration is key.


Basics



  1. We currently operate on a 32 hour workweek (or 4 day workweek, you decide!).

  2. We don't count holidays but set a minimum number; You choose your own hard- and software.

  3. As a company that has almost, if not more, kids than employees, family-friendliness is crucial to us and paid parental leave is a no-brainer.

  4. We pay monthly perks that cover your costs for working remotely.

  5. We meet twice a year as an entire company and (if possible) see conferences as an important place to catch up with team members.

  6. We aim to be fully transparent (finance, salaries) unless it hurts people and trust you, based on this to make the best decisions.


Important note: We are not hiring job descriptions. We hire humans. :) We welcome applications from everybody, regardless of ethnic or national origin, religion, gender identity, sexual orientation, or age.


#J-18808-Ljbffr
Data publikacji: 2024-05-01
Jetzt bewerben