Thermo Fisher Scientific Systems Reliability - Sr. Engineer in Minneapolis, Minnesota
How will you make an impact?
A Reliability Engineering - Sr. Engineer is responsible for optimizing the Enterprise Global Technical environment at Thermo Fisher Scientific. Responsibilities would consist of root cause analysis and identifying/resolving gaps within the environment while implementing a sound/reliable solution including automation. Reliability Engineering will build up to do AI/ML within the cloud environment promoting self-healing among other features within that realm. This role will allow thinking outside the box and empower you to make decisions and implement with guardrails not gates.
What will you do?
Play a key role in Thermo Fisher's digital transformation. Drive company's Reliability Engineering initiative and be part of a DevOps culture, working with global platform and development teams.
Tools and automation
Own continuous delivery framework and focus on automation.
Automate different pieces of the environment through Ansible, Chef or Puppet
Ensure stability and integrity of high performance and high availability cloud based systems for the organization.
Design, develop and test automation workflows.
Experience building and managing in a cloud environment, preferably AWS or Azure
Key driver suggesting continuous improvement in systems operations through tools and automation
Report on overall health and optimization of cloud services to management
Participate in the definition of the roadmap for cloud services in collaboration with the Platform Engineering and architecture teams.
Drive root cause analyses, and coach others on doing them, in collaboration with software development teams
Analyze and adjust designs to assist in predicting and improving system stability.
Determine areas initially needing testing and develop a plan to obtain standard data for troubleshooting
Review engineering specifications and drawings, proposing design modifications to improve reliability within cost and other performance requirements
Evaluate environment on and off premise for environmental factors, such as numbers and causes of unit failures
Monitor failure data generated by a customer using product to ascertain potential requirement info product improvement
Responsible/assist in incident and problem management of cloud platform services
Works closely with our external partners and industry leaders to ensure we have a keen understanding of where industry trends and technologies are going and factor those into our strategies and roadmaps
Previous experience with Service Now and proficiency in creating workflows
Ensure the solutions the Reliability Engineering team designs and delivers are meeting defined business requirements and will stand the test of time from an operational excellence perspective
Assist in creating roadmap and work towards DevOps model of service engineering
Work closely with IT Security to ensure the solutions we're designing and delivering meet data security and compliance requirements
Provide regular communication to peers on areas for improvement, progress, milestones, and areas of success
Be available for scheduling for 24/7 oncall rotation to respond to and resolve issues
Ensure documentation and processes are well defined
Experience with Enterprise Systems management tools
Experience maintaining the health of the environment by keeping the systems current with upgrades and patches and by troubleshooting and resolving issues with tools
How will you get here?
- 4-year degree with major in Computer Science/ Engineering (or equivalent) from an accredited university
Must have a minimum of 3-5+ years of Linux experience
Must have a minimum of 1-3 years of experience in Cloud Solutions Delivery and Cloud architecture, especially public cloud platforms such as AWS, Azure, GCP. Strong preference for AWS experience
Tool & Technology Skills
In depth understanding and experience in AWS and a cloud first/cloud only initiative
AWS proficiency (ec2, s3, RDS, Route 53, Lambda, IAM, VPC, Security groups) and other services
Scripting languages: Python and overall Linux shell scripting skills
Ability to identify performance bottlenecks utilizing data in the environment while using data to do so
Experience in the Unix administration/engineering
Working knowledge in Docker
Thermo Fisher Scientific is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, creed, religion, color, national or ethnic origin, citizenship, sex, sexual orientation, gender identity and expression, genetic information, veteran status, age or disability status.