Job description: As a Site Reliability Engineer (SRE), you'll help build a meaningful engineering discipline, combining software and systems to develop creative engineering solutions to operations problems. Much of our support and software development focuses on optimizing existing systems, building infrastructure, and reducing work through automation. You'll join a team of curious problem solvers with a diverse set of perspectives who are thinking big and taking risks. In this environment, you'll take the lead on relevant projects, backed by an organization that provides the support and mentorship you need to learn and grow. As an SRE, you'll be focused on running better production applications and systems.
Responsibilities:
Lead designs of major software components, systems, and features to improve the availability, scalability, latency, and efficiency of Visibility services
Provide guidance to other team members on managing end-to-end availability and performance of mission-critical services, on building automation to prevent problem recurrence, and on building automated responses for non-exceptional service conditions.
Mentor and train other team members on design techniques and coding standards, and cultivate innovation and collaboration across multiple teams.
Manage individual project priorities, deadlines, and deliverables.
Be ready to learn new tools and technologies for current and new project requirements.
Your experience:
Minimum 7 years of experience in a production environment
Well-honed problem-solving skills in large-scale distributed environments, with excellent written and oral communication.
Working experience with a configuration management tool such as Ansible or Chef.
Solid understanding of DNS, TCP, CDN, VCP and the rest of modern networking and security architectures.
Experience designing, building, and managing large-scale distributed systems on private or public (AWS/GCP/Azure) clouds.
Working experience with Unix/Linux systems internals (networking, file systems, virtualized environments, containers, etc.) and administration.
Experience in at least one programming language.
Employment Category:
Employment Type: Full time
Industry: IT
Functional Area: IT
Role Category: Software Engineer
Role/Responsibilities: Site Reliability Engineer