Oracle System Administrator 3-IT in Noida, India
System Administrator 3-IT
Oracle Managed Cloud Service is seeking a Site Reliability Engineer with 8 to 12 years of experience to work with our Oracle Cloud Infrastructure Support Team. The successful candidate will use their hands-on operational experience to identify areas that can be automated and then design, build, implement and support these solutions which will improve operational efficiencies.
The role requires, Linux operational support, Strong backend and frontend development skills preferably, cloud infrastructure automation with Ansible, Terraform, Python. Additional skills with networking and services running on cloud platforms will be beneficial. The role’s focus will combine operational support and automated solutions for infrastructure and services by leveraging their Devops skills and industry standards.
You will deliver the solutions that directly contribute to our esteem customer’s success. You will also be required to perform systems, networking automation running on virtualized platforms in cloud through automation. Other duties include researching, proofing OCI cloud services, their features for improving operations and authoring technical documentation that are beneficial to the company and the team.
What will you do
Act as ultimate escalation point for complex or critical issues at OCI and OCI-C infrastructure.
Utilize a deep understanding of service topology and the dependencies required to troubleshoot issues and define mitigations.
Use your experience and wisdom from building & running systems and infrastructures as a multiplier to drive operational improvements to support our customers infrastructure.
Optimize Oracle Cloud Infrastructure for maximum performance, scalability and reliability.
Serve as part of a 24x7 customer infrastructure support team.
Identify problems and/or opportunities for customer cloud infrastructure support.
Use your excellent written & oral communication skills to communicate our customers effectively.
Collaborate with other team members and stakeholders.
10 years of experience in Linux server administration
5 years of experience in compute, network, storage troubleshooting for improving capacity, reliability, scalability, availability working as a site reliability engineer
Proficient in cloud infrastructure technologies and automation.
System Administration including Linux / windows internals, TCP/IP, DNS, Load balancing technologies. Installing and configuring application / database servers.
OS image build for Linux, Windows and patch automation using Python, PowerShell.
Infrastructure automation through Terraform, Chef, Ansible, Puppet
Experience with Cloud Orchestration frameworks, development and SRE support of these systems
Experience working with fault tolerant, highly available, high throughput, distributed, scalable production systems.
Experience in a 24×7 high-availability production environment
Ability to come with best solution by capturing big picture instead of focusing on minor details and root cause analysis.
Aptitude to be a good team player and the desire to learn and implement new Cloud technologies.
Certifications Preferred if any
Cloud Certifications - OCI Certified Professional
Network Certifications – CCNA
OS Certifications - OEL certified, RHCE certified
Security Certifications - Cloud security certs
Education (Preferred Degree)
- M.S / B.S. in Computer Science, Computer Engineering, Software Engineering, or related areas is preferred
1: Bias for Action
Evaluates acts and communicates in SLA time. Is decisive. Makes timely, practical, effective decisions. Takes initiative without being asked. Plans efficiently while avoiding analysis paralysis. Knows how to take smart risks. Demonstrate strong follow-through and consistently keep commitments to customers and employees. Take ownership and responsibility for priority customer issues where and when required review urgent and critical incidents for quality.
Ability to prioritize the assignments at hand even in loosely structured situations. Effectively handles multiple projects or tasks at the same time and complete them within a set time frame.
3: Self development and teaching
Understands personal strengths and development needs. Initiates self-development actions. Seeks and shares job-relevant learning, developmental experiences, and feedback to enhance performance. Encourages others to take personal responsibility for continual learning and skill growth. Shares knowledge with others.
4: Dealing with ambiguity
Able to function well in loosely structured situations. Works effectively in situations involving uncertainty or lack of information. Effectively handles multiple projects or tasks at the same time. Is open to and responds flexibly to change.
5: Teamwork and willingness to roll up sleeves
Fosters cross-functional and cross business teamwork. Builds and promotes team morale. Works efficiently and effectively on teams to meet customers' needs. Contributes outside the scope of the job. Meets all team commitments. Consistent effort, intense commitment, and willingness to go above and beyond when needed. Willing to do low profile, non-challenging work to get the project done.
Special Requirements: Successful candidates might be required to perform on-call duty on rotational bases.
Detailed Description and Job Requirements
Define, design, and implement network communications and solutions within a fast-paced, leading edge database/applications company.
Perform performance trend analysis and manage the server/network capacity. Propose client configuration and implement technical solutions to enhance and/or troubleshoot the system. Work with others to define, coordinate vendor purchase needs. Responsible for support documentation as well.
Job duties are varied and complex utilizing independent judgment. May have project lead role. 5 years of related experience in a medium to large network distributed and computing environment. BS in Computer Science or related field.
Job: Information Technology
Job Type: Regular Employee Hire