Senior DevOps Engineer - Kaptain at D2iQ

Posted on: 09/01/2021

Location: United States (REMOTE)

Original Source

Tags: jupyter spark helm kubernetes unix python docker

D2iQ is on a mission to provide the best multi-cloud and hybrid-cloud services platform in the industry, built with Kubernetes. We’re looking for highly motivated engineers to join us. This team is responsible for delivering [Kaptain](https://d2iq.com/products/kaptain): a world-class global-scale, end-to-end machine learning platform to some of the most innovative names in tech, cloud, healthcare and financial services. Kaptain is based on Kubeflow and includes an SDK that simplifies the model development lifecycle and workloads orchestration on Kubernetes. This gives Data Scientists tools for maintaining repeatable and versioned workflow. [Kaptain v1.1](https://d2iq.com/blog/kaptain-is-aboard-v-1-0-is-ga) This position will give you the opportunity to collaborate with the brightest engineering minds in cloud infrastructure and distributed systems, as you design and develop a reliable, resilient, and scalable machine learning platform with Kubernetes. As an ideal candidate, you would have empathy for your customer and your team, and welcome customer feedback. You would excel with minimal technical supervision, and design and implement solutions independently, while also supporting your team members. You would embrace time constraints, and work with team members to deliver high-quality products and features, with rapid iteration. #### **Responsibilities** * Develop new and maintain existing KUDO Operators and Helm Charts * Maintain the OSS components versions current in the product * Own and continuously improve CI/CD and image building processes and tooling * Maintain Platform compatibility and Security Patching * Improve documentation and customer-facing tutorials based on personal insights * Manage, tune and troubleshoot issues with large scale distributed systems * Perform code reviews and give constructive, critical, and cordial feedback * Engage with the Kubeflow community on projects that are important to our product #### **Qualifications** * Expert knowledge in Kubernetes, Docker, and Helm * Maintained and Built Tooling for CI/CD infrastructure * Experience with Linux Shell programming * +3 years of work experience (and/or relevant academic experience) * Solid understanding of DevOps best practices * Are self-driven and motivated, with a strong work ethic and a passion for problem-solving * Can debug, troubleshoot and resolve complex technical issues reported by customers * Know Linux or other Unix-like operating systems #### **The following is a plus** * Experience writing and testing software in Go and Python * Experience with Kubernetes Operators * Experience with Machine Learning and distributed model training * Understand cloud platforms architecture, especially networking, security, storage, and resilient application topologies. ### **D2iQ’s Core Values** **We are customer-driven** We are driven by the success of our customers and we deliver a top-notch partnership throughout their journeys. We believe if our customer wins, we win. **We are team always** We are dependable and effective in building positive relationships with our peers. We are relentlessly humble and demonstrate a high-level of respect in our interactions. **We are champions of change** We have a hunger to continuously learn and the courage to take intelligent risks. We’re pioneers, innovators, and change-makers to the core. **We are relentless** We raise the bar and execute flawlessly. We’re serious about ownership and effectively balance quality output with efficient speed. **------** **Our Vision** Enabling organizations to change the world through open source innovation. **Our Mission** We maximize our customer's business value by relentlessly delivering deep expertise and unrivaled technology that utilizes automation to solve the toughest of cloud native challenges including state, scale and resiliency. ------ ### **About D2iQ** D2iQ is the leading provider of enterprise-grade cloud platforms that enable organizations to embrace open source and cloud native innovations while delivering smarter Day 2 operations. With unmatched experience driving some of the world's largest cloud deployments, D2iQ empowers organizations to better navigate and accelerate cloud native journeys with enterprise-grade technologies, training, professional services and support. Whether you are deploying your first Kubernetes workload, optimizing your business analytics with Spark or Jupyter, or looking to educate your developers on the benefits of cloud native, D2iQ has the expertise, services and technology to enable you on the journey. D2iQ is headquartered in San Francisco with additional offices in London and Hamburg, Germany. D2iQ investors include Andreessen Horowitz, Hewlett Packard Enterprise, Khosla Ventures, Microsoft, and T. Rowe Price Associates, Inc. Find us at <https://d2iq.com/> ------ D2iQ is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.