DevOps Team Lead

We are looking for a DevOps Team Lead who can build and scale highly reliable infrastructure for globally deployed SaaS platforms. This is not a maintenance-only role.

The expectation is to architect systems that improve uptime, deployment speed, scalability, observability, security, and operational efficiency across multiple products and business units.

The role requires someone who can think beyond servers and CI/CD pipelines. We need a systems thinker who understands how infrastructure impacts customer experience, release velocity, support load, product scalability, and business growth.

The candidate will lead the DevOps function across multiple SaaS products handling:

Real-time telematics workloads

IoT device communication

High-ingestion APIs

Live tracking systems

Video and sensor-based platforms

Multi-tenant SaaS deployments

Key Responsibilities

Infrastructure & Cloud Management

Design, manage, and optimize cloud and on-premise infrastructure.

Ensure high availability, scalability, redundancy, and disaster recovery planning.

Manage Linux-based production environments.

Optimize infrastructure cost without compromising reliability.

Handle scaling strategies for increasing device load and customer growth.

CI/CD & Release Engineering

Build and maintain robust CI/CD pipelines.

Reduce deployment risks and deployment time.

Automate build, deployment, rollback, and environment provisioning processes.

Standardize deployment practices across teams and products.

Monitoring & Reliability

Establish strong monitoring, alerting, and observability systems.

Implement proactive incident detection and root cause analysis.

Reduce downtime and improve platform stability.

Drive SRE-oriented operational maturity.

Security & Compliance

Implement infrastructure security best practices.

Manage access control, secrets management, SSL, firewall policies, backups, and vulnerability handling.

Ensure infrastructure hardening and operational compliance.

Containerization & Orchestration

Manage Dockerized environments and orchestration platforms.

Improve deployment consistency and environment portability.

Support microservices architecture where applicable.

Database & Performance Optimization

Work closely with backend and database teams on:

Performance tuning

Query optimization support

Load balancing

Caching strategies

Replication and failover systems

Team Leadership

Lead and mentor DevOps engineers.

Create operational SOPs and infrastructure standards.

Build accountability, documentation culture, and ownership within the team.

Coordinate with Development, QA, Support, and Product teams.

Incident Management

Handle production incidents with urgency and ownership.

Build escalation systems and incident response frameworks.

Conduct postmortem analysis and preventive planning.

Required Technical Skills

Strong Expertise In

Linux Server Administration

AWS / GCP / Azure

Docker

Kubernetes

Jenkins / GitHub Actions / GitLab CI

Nginx / Apache

Load Balancers & Reverse Proxies

Networking & Security

Monitoring Tools (Prometheus, Grafana, ELK, Zabbix, etc.)

Infrastructure Automation

Shell Scripting / Python

Good Understanding Of

High-availability architecture

Distributed systems

Scaling real-time applications

Database replication and clustering

Message brokers (RabbitMQ, Kafka, Redis Streams, etc.)

API infrastructure

SSL, DNS, VPN, CDN, WAF

Nice to Have

Experience in IoT or telematics platforms

Experience managing large-scale real-time tracking systems

Terraform / Infrastructure as Code

SRE practices

Cost optimization at scale

Multi-region deployment experience

Leadership Expectations

This role is not for someone who only executes tickets.

We expect the person to:

Think proactively instead of reactively

Build systems before problems become incidents

Create operational leverage through automation

Reduce dependency on manual intervention

Build infrastructure that supports aggressive business growth

Create visibility and measurable operational KPIs

KPIs / Success Metrics

The DevOps Team Lead will be evaluated on:

Platform uptime

Deployment frequency & stability

MTTR (Mean Time to Recovery)

Infrastructure scalability

Security incident reduction

Alert quality and monitoring maturity

Automation coverage

Infrastructure cost efficiency

Team efficiency and operational discipline

Experience Required

5+ years in DevOps / Infrastructure Engineering

2+ years leading teams or handling critical production infrastructure

Experience managing production SaaS environments at scale

Ideal Candidate Profile

We are not looking for a “server administrator.”

We are looking for someone who:

Understands the business impact of infrastructure decisions

Can scale systems under uncertainty

Handles pressure calmly during outages

Builds processes, not heroics

Has a strong ownership mindset

Can challenge poor engineering practices

Thinks in terms of reliability engineering, not firefighting

Why This Role Matters

For most SaaS companies, DevOps becomes a support function.

For us, it is either a growth constraint or a growth accelerator.

A weak DevOps team creates:

Slow releases

Customer dissatisfaction

Downtime

Engineering bottlenecks

Support overload

Revenue risk

A strong DevOps function compounds the effectiveness of every other department.

That is why this role is strategically important.

Skills

Linux Server Administration

AWS / GCP / Azure

Docker

Kubernetes

CI/CD (Jenkins, GitHub Actions, GitLab CI)

Monitoring Tools (Prometheus, Grafana, ELK, Zabbix)

Infrastructure Automation

Team Leadership

Incident Management

Systems Thinking