Intuit Senior Software Engineer in Mountain View, California
Interested in building and managing platforms at massive scale that enable billions of dollars of revenue? Does living in Kubernetes world, running 100s of Kubernetes clusters that scale over 1000+ nodes sound like a fun challenge? Intuit is seeking a Platform Operations engineer for its Modern SaaS platform that powers TurboTax, Quickbooks and Mint. Platform Operations Engineers operate right at the intersection of Software Engineering and Infrastructure Engineering to build and operate large scale systems that are secure, fault-tolerant, highly available, affordable, and scalable. Using industry best practices, tools, and principles from software engineering, architecture, and security to build them into our software tools to solve operations problems.
What you'll bring
Bachelor's degree in Computer Science (or related technical field involving programming), or equivalent work experience
Expertise with building, upgrading and managing Kubernetes clusters
Familiarity with logging and metrics framework, such as Fluentd, Prometheus and Jaeger
Proficiency with AWS technologies (NAT Gateway, EC2, ALB/NLB, Cloud watch, IAM, VPC, Route53 etc)
Strong understanding of Unix/Linux operating systems
Expertise in designing, analyzing, and troubleshooting large-scale distributed systems.
Strong experience in writing high quality Python or GO code
Ability to debug, optimize code, and automate routine tasks.
Systematic problem-solving approach, coupled with effective communication skills and a sense of drive.
5+ years of experience with production operations managing large scale systems.
2+ years of experience with software development or working knowledge of software development. Open source development is a plus.
How you will lead
Ownership and accountability for the reliability of the Intuit Kubernetes Service platform, construction of monitoring solutions, improving cluster upgrades, etc.
Leverage development skills to deliver and deploy monitoring as code.
Deliver an always-on (high available, scalable and performant platforms) operational excellence for our services
Manage infrastructure (across AWS, containers and hosted data centers) as code
Continuously automate newer features to maintain and improve SLAs
Troubleshooting complex issues, and managing stakeholders expectations during incidents while troubleshooting.
Contribute to open-source projects (Kubernetes, Keiko, Argo etc.)
Participate in 12/7 oncall rotations along with dev team
Document the important processes and procedures, and perform required KT sessions to disseminate the information across the team.
Supporting and coaching other engineers, pair programming or peer reviewing code, helping to ensure that all engineers are growing and part of a community
Drive and own Root Cause Analysis (RCA) for specific applications.
EOE AA M/F/Vet/Disability. Intuit will consider for employment qualified applicants with criminal histories in a manner consistent with requirements of local law.