Principal Engineer - Open Source Data Platform (ODP)

Bengaluru R&D (Dev)

Create a free account to apply in seconds

ABOUT THE ROLE

We are seeking a Principal Engineer with at least 14 years of experience to serve as a technical visionary and thought leader for Acceldata's Open Data Platform (ODP). In this role, you will define the long-term technical strategy, lead the most complex architectural initiatives, and represent Acceldata in the global open-source and distributed systems community. You will work directly with engineering leadership and executives to shape product direction while driving innovation across the organization.

WHY JOIN US?

As a Principal Engineer at Acceldata, you will be at the forefront of shaping the future of enterprise data platforms, defining the technical vision for systems that power mission-critical workloads across the world's largest organizations. Your decisions will directly influence not only our platform but also the broader data ecosystem through open-source contributions and industry leadership.

You'll work alongside Apache members, committers, and industry veterans who are passionate about solving the hardest problems in distributed computing. This is a rare opportunity to combine deep technical impact with strategic influence, building technology that matters while shaping the direction of a growing data observability company.

RESPONSIBILITIES

• Define and drive the long-term technical strategy and architecture for the Open Data Platform, aligning with business objectives and industry trends.

• Own the design of the most complex, high-impact systems and establish architectural principles and patterns that scale across the organization.

• Identify emerging technologies and industry trends; lead research and development initiatives that position Acceldata at the cutting edge of data platform innovation.

• Serve as a recognized leader in the open-source community; drive Apache project contributions, represent Acceldata at conferences, and influence project roadmaps.

• Collaborate with CTO, VP of Engineering, and Product leadership to translate business strategy into technical execution; provide technical due diligence for strategic initiatives.

• Influence engineering practices, tools, and culture across multiple teams; establish best practices that elevate the entire engineering organization.

• Mentor Staff Engineers and Senior Engineers; develop technical leadership capabilities across the organization.

• Lead resolution of the most challenging technical problems spanning architecture, performance, scalability, and reliability.

• Engage with strategic customers and partners on complex technical discussions; translate customer needs into platform capabilities.

• Drive alignment across engineering, product, and operations on technical decisions with broad organizational impact.

• Work across diverse environments: Bare Metals, VM, Kubernetes, multi-cloud, and hybrid architectures at enterprise scale.

REQUIREMENTS

• 15+ years of hands-on software development experience with at least 8 years focused on distributed systems, big data platforms, or data infrastructure.

• Proven track record of leading large-scale technical initiatives from conception to production across multiple teams.

• Expert-level proficiency in Java or Scala; strong skills in Python and systems languages.

• Deep expertise in distributed computing, including consensus protocols, distributed transactions, data replication, partitioning strategies, and optimization with modern table formats.

• Extensive experience in architecting and scaling systems using Hadoop, Spark, Hive, Trino, Kafka, Flink, and related technologies at production scale (100s to 1000s of nodes).

• Demonstrated ability to design and evolve complex systems that handle petabyte-scale data with high availability and performance requirements.

• Expert knowledge of cloud-native architectures, Kubernetes orchestration, and multi-cloud deployment patterns.

• Track record of diagnosing and resolving complex distributed system issues, including performance optimization, resource management, and failure mode analysis.

• Significant contributions to major open-source projects; experience working with distributed global teams and open-source governance models.

• Exceptional written and verbal communication skills; proven ability to influence technical direction across organizations and with external stakeholders.

• Ability to balance long-term technical vision with near-term delivery requirements; experience making build vs. buy decisions.

DESIRED BACKGROUND

• PMC member or committer status in Apache projects (e.g., Spark, Kafka, Hive, Hadoop, Iceberg, Flink, Trino).

• Speaker at major conferences (ApacheCon, Spark Summit, Kafka Summit, QCon, etc.); published papers or widely-read technical content.

• Experience building or contributing to query engines, optimizers, or execution frameworks.

• Deep experience with modern lakehouse architectures, table formats (Iceberg, Delta, Hudi), and data mesh patterns.

• Experience with ML infrastructure, feature stores, or MLOps platforms.

• Experience scaling engineering organisations in high-growth environments.

• Master's or PhD in Computer Science, with a focus on distributed systems, databases, or related fields.

Skills

Distributed SystemsBig Data PlatformsJavaScalaPythonArchitectural DesignCloud-native ArchitecturesKubernetesTechnical LeadershipCollaboration