Staff Database Reliability Engineer - (PostgreSQL + Cloud) @ Rackspace Technology

Rackspace Technology
8 - 11 years
Noida, Gurugram

Job Description

What We're Looking For

Someone who can work from the office (Hyderabad location)
8-10+ years in DBA / Platform Engineering
Strong multi-cloud experience (at least two of Azure, AWS, GCP)
Deep HA/DR & performance tuning expertise
Automation-first mindset (Terraform, scripting, CI/CD)
Experience in SaaS/DBaaS environments preferred

For a Site Reliability Engineer (SRE) in a DBaaS (Database-as-a-Service) support role, the following mandatory skills are typically required:

1. Database Administration (DBA) Skills

Primary Database: PostgreSQL
Secondary Database: MySQL, Oracle, MS SQL Server

Database Backup & Recovery: Tools and strategies for database backups and disaster recovery.
Performance Tuning: Query optimization, indexing strategies, and database performance troubleshooting.
Database Security: User management, roles, access control, and auditing.

2. Cloud Infrastructure Knowledge (DBaaS)

Cloud Platforms: AWS (RDS, Aurora), Azure (Cosmos DB, SQL Database), GCP (Cloud SQL, Firestore).
Infrastructure as Code (IaC): Terraform, CloudFormation, Kubernetes.
Kubernetes & Containers: Running databases in containers (for example, on Kubernetes).
Observability Tools: ELK stack (Elasticsearch, Logstash, Kibana).
Database Migration: Migrating databases across different platforms or cloud environments.
Database Scaling: Vertical and horizontal scaling techniques in cloud environments.

3. SRE Principles (Site Reliability Engineering)

Incident Management: Handling database outages, incident response, and on-call rotations.
Monitoring and Alerting: Tools like Prometheus, Grafana, Datadog, CloudWatch.
Service Level Objectives (SLOs) / Service Level Agreements (SLAs): Ensuring uptime and performance targets.
Disaster Recovery Planning: Ensuring high availability (HA) and disaster recovery (DR) solutions.

4. Scripting and Automation

Scripting Languages: Python, Shell scripting, Bash, PowerShell.
Automation Tools: Ansible, Puppet, Chef.
Infrastructure Automation: Automating database deployment, patching, and scaling.

5. Networking and Infrastructure

Networking Basics: TCP/IP, DNS, Firewall, Load Balancers.
Database Connectivity: Connection pooling, failover strategies, and multi-region deployment.
Storage and Disk Management: Understanding IOPS, latency, and throughput.

6. OS Skills

Expertise in Linux OS (RHEL, Ubuntu, CentOS)

Understanding of file systems (ext4, XFS, etc.), permissions, and ownership (chmod, chown, ACLs).
Knowledge of process monitoring, management, and troubleshooting (ps, top, htop, kill, pkill, etc.).
Proficiency with tools like top, htop, vmstat, iostat, sar, and dstat to monitor CPU, memory, disk I/O, and network usage.
Ability to analyze system logs (/var/log/, journalctl, dmesg) for troubleshooting.
Understanding of resource limits (CPU, memory, disk, network) and how they impact database performance.
Knowledge of partitioning tools (fdisk, parted) and file system management (mkfs, mount, umount).
Understanding of RAID configurations and Logical Volume Management (LVM) for storage scalability.

7. Troubleshooting and Debugging

Log Analysis: Reading and analysing database and system logs.
Root Cause Analysis (RCA): Performing in-depth analysis after major incidents.
Query Performance: Analysing slow queries, deadlocks, and resource contention.

8. Soft Skills

Communication Skills: Clear communication with stakeholders and engineering teams.
Problem-Solving: Ability to troubleshoot complex database issues under pressure.
Collaboration: Working closely with DevOps, Infrastructure, and Engineering teams.

About Rackspace Technology

We are the multicloud solutions experts. We combine our expertise with the world's leading technologies, across applications, data and security, to deliver end-to-end solutions. We have a proven record of advising customers based on their business challenges, designing solutions that scale, building and managing those solutions, and optimizing returns into the future. Named a best place to work, year after year according to Fortune, Forbes and Glassdoor, we attract and develop world-class talent. Join us on our mission to embrace technology, empower customers and deliver the future.

More on Rackspace Technology

Though we're all different, Rackers thrive through our connection to a central goal: to be a valued member of a winning team on an inspiring mission. We bring our whole selves to work every day. And we embrace the notion that unique perspectives fuel innovation and enable us to best serve our customers and communities around the globe. We welcome you to apply today and want you to know that we are committed to offering equal employment opportunity without regard to age, color, disability, gender reassignment or identity or expression, genetic information, marital or civil partner status, pregnancy or maternity status, military or veteran status, nationality, ethnic or national origin, race, religion or belief, sexual orientation, or any legally protected characteristic. If you have a disability or special need that requires accommodation, please let us know.

Job Classification

Industry: Oil & Gas
Functional Area / Department: Engineering - Software & QA
Role Category: DevOps
Role: Site Reliability Engineer
Employment Type: Full time

Contact Details:

Company: Rackspace Technology
Location(s): Noida, Gurugram

Keyskills: kubernetes iostat networking artificial intelligence dr ansible cloud postgresql gcp linux cloud computing python cyber security performance tuning cpu microsoft azure engineering machine learning database administration puppet high availability terraform bash aws

₹ Not Disclosed
