Lead CloudOps Engineer at Deepfactor, Inc

Posted on: 05/06/2022

Location: United States (REMOTE)

Tags: consul puppet chef sql grafana prometheus python kubernetes bash circleci aws ansible terraform kafka jenkins

**Responsibilities:**

* Build, maintain, and support all cloud-based infrastructure that powers the Deepfactor SaaS platform and back-end systems
* Help foster a culture of security, and enforce comprehensive security best practices throughout the Deepfactor infrastructure
* Set up and maintain automated monitoring of all systems to ensure maximum uptime and peak performance
* Architect infrastructure solutions for maximum scalability, employing techniques and technologies such as scale-out, redundancy, automated failover, and microservices
* Work closely with software development teams to support their infrastructure needs, and provision and maintain all needed development, testing, and production environments
* Employ DevOps automation, infrastructure as code, and continuous integration/deployment practices as much as possible
* Work closely with compliance, engineering, and product teams to implement necessary security certifications, such as SOC2
* Be on call and respond within defined SLAs, as the position requires

**Qualifications:**

* Strong background administering scalable infrastructure in a public cloud environment (AWS preferred)
* Experience setting up monitoring tools such as Graphite, Grafana, and Prometheus
* Experience with HashiCorp technologies such as Consul, Vault, Terraform, and Vagrant
* In-depth experience with CI/CD tools (Jenkins, CircleCI, etc.)
* In-depth experience with GitOps tools (ArgoCD, FluxCD, etc.)
* Proficiency in Linux system administration, network administration, and security, along with expert knowledge of associated tools and utilities
* Proficiency in scripting languages such as Bash and Python, as well as SQL
* Experience managing Kubernetes clusters in a production environment (required)
* Experience managing large-scale deployments of message-oriented middleware such as Kafka, NATS, etc.
* Experience monitoring and diagnosing issues with high-throughput microservices (required)
* Experience with infrastructure automation tools such as Chef, Puppet, Ansible, etc. is a plus
* Strong debugging and problem-solving skills, as well as verbal and written communication skills
* Experience managing multiple technologies or roles is a plus
* BS in Computer Science or a related field, or equivalent experience