Site Reliability Engineer (w/m/d) at Douglas GmbH

Posted on: 03/27/2021

Location: Düsseldorf (ON-SITE)

full time

Original Source

Tags: openstack python docker

Douglas is one of the leading premium beauty retailers in the European beauty industry with around 2,400 stores and the No 1 online premium beauty shop in Europe. Our success and future path is defined by the #FORWARDBEAUTY strategy. Douglas is already the first address for beauty in 26 countries and offers a unique consumer experience and a modern portfolio of about 55.000 beauty and health products from more than 750 brands. The fast growing E-Commerce Platform was just upgraded to a curated marketplace. With that, Douglas achieved a turnaround of 3.5 million Euros in 2018/2019. The Douglas team with more than 20.000 Beauty Experts encourages and inspires their customers every day to live their own kind of beauty. #forwardbeauty #liveyourownkindofbeauty #doitforyou **FOR OUR HEADQUARTERS IN DÜSSELDORF WE ARE CURRENTLY LOOKING FOR A** **SITE RELIABILITY ENGINEER (f/m/d)** Douglas is in the middle of a cloud transformation. And, site reliability engineer would help Douglas in this journey by help maintain the Linux systems & give an operating perspective to the team. ****EXPECT EXCITING TASKS:**** * Operation and expansion of our high-availability private cloud, as well as the expansion of the public cloud operating environment * Implementation of operational requirements * Development of solutions within the framework of projects * Close cooperation with the software development teams * Work in an international, open and fast-moving environment * You find it exciting to guarantee the trouble-free operation of our services ****WHAT YOU OFFER:**** * Broad experience with technologies for operating Linux systems, web services and CI/CD tools * Experience in operation and Linux system administration with knowledge of OpenStack, CentOS and fixes of incidents in the network infrastructure * “Infrastructure as code” mindset * Experience in automation. Shell / Python scripting * Strong understanding of: Ethernet, VLAN, IPv4/IPv6, ARP, DHCP, DNS, and TCP * Hands on experience in Docker,Kubernates, LXC, namespaces/cgroups. * Comfortable configuring DNS, DHCP, and LAN/WAN technologies **KEY TASKS AND RESPONSIBILITIES:** * Responsible for software reliability in production. * Works with dev teams to ensure software resilience, scalability and alignment with infrastructure. * Also, can make code changes. * Build and maintain CI/CD pipelines. * Builds and maintains observability and monitoring systems. * On-call duty and initial incident response. * Design and implement load tests and resilience tests. * Measure SLOs and escalate SLO issues. * Facilitate blameless post-mortems and promote a culture of quality **EXPECT FROM US:** * A high degree of responsibility and decision autonomy * an international team of passionate technologists, using state-of-the-art technology stack and collaboration models * A fast-moving environment that values "getting things done" while also steadily increasing process maturity * A wealth of challenging projects, and a team that will support you in mastering them Does this apply to you? Then become a part of our international company and send your application including your earliest possible starting date and your salary expectations. We look forward to hearing from you!