Deploy Generative AI 60% Faster!

Data Architect – Big Data/Open Source

www.edgematics.ai
Back

Experience: 12–15 Years
Location: Pune / Hybrid
Company: Edgematics Group

About Edgematics

Edgematics is a global data and AI consulting company empowering Fortune 500 enterprises across the Middle East, Europe, the UK, North America, and India to unlock the true value of their data.

Through our advanced platforms — PurpleCube AI (Data Orchestration & Analytics) and Axoma (Agentic AI) — we help organizations accelerate data modernization, governance, and AI adoption at scale.

Our mission is to transform raw data into actionable intelligence through world-class engineering, innovation, and deep domain expertise.

Job Description

We are seeking an experienced Enterprise Data Architect (Lakehouse & Governance) to lead the end-to-end architecture design and implementation of enterprise-scale Telecom Datalake, Datamart, and Data Governance platforms.

This is a hands-on, governance-heavy leadership role aligned with large-scale delivery expectations. The architect will define scalable, secure, and high-performance Lakehouse architectures while ensuring tight coordination across Platform Engineering, Governance, BI, and DevOps streams.

In alignment with the proposed stack, the architecture focus includes NiFi + dbt + CI/CD orchestration + Iceberg + Spark + Trino, avoiding dependency on non-proposed orchestration tools.

Key Responsibilities

  • Define and design scalable end-to-end Big Data architecture across ingestion, storage, processing, governance, and consumption layers.
  • Architect Iceberg-based Lakehouse solutions integrated with Spark, Trino, NiFi, dbt, and object storage (MinIO/S3).
  • Define orchestration strategy using dbt + CI/CD + platform-native orchestration (where applicable).
  • Design and implement enterprise Data Governance frameworks using Atlas and Ranger.
  • Establish high availability (HA), disaster recovery, workload isolation, and performance optimization strategies.
  • Define best practices for metadata management, fine-grained access control, and data lifecycle management.
  • Lead performance benchmarking ownership, capacity planning, and scalability validation for Telecom-scale workloads.
  • Architect Telecom domain data models (Customer, Usage, Network, Billing domains) aligned with Datamart consumption.
  • Design and govern Data Migration Factory frameworks for legacy-to-Lakehouse transitions.
  • Provide architectural governance, delivery oversight, and mentoring across Platform Engineer, Governance Engineer, BI Engineer, and DevOps Engineer roles.
  • Evaluate emerging technologies and recommend modernization strategies aligned with enterprise and regulatory objectives.

Requirements

  • 12–15 years of experience in Big Data and distributed systems architecture.
  • Strong hands-on expertise with Spark, Iceberg, Trino, NiFi, dbt, and object storage platforms (MinIO/S3).
  • Proven experience designing enterprise-scale Datalake and Datamart architectures.
  • Mandatory OpenShift / Kubernetes production architecture experience.
  • Strong experience in Telecom data models (Customer, Usage, Network, Billing domains).
  • Experience designing and governing Data Migration Factory implementations.
  • Proven ownership of performance benchmarking and platform scalability validation.
  • Deep understanding of distributed systems, concurrency planning, workload isolation, and query optimization.
  • Experience implementing enterprise Data Governance solutions (Atlas, Ranger or similar tools).
  • Strong knowledge of hybrid or cloud-native infrastructure environments.
  • Excellent stakeholder management, executive communication, and leadership skills.
Experience: 12–15 Years
Company: Edgematics Group
Job Location: Pune / Hybrid

Apply for this position

Allowed Type(s): .pdf, .doc, .docx