Data Architect
Virtusa.com
Office
TN
Full Time
Data Architect - (CREQ236053)
Description
- • Architect and implement scalable Azure Databricks solutions, including Data Lakehouse, Delta Lake, Structured Streaming, Unity Catalog, and Medallion Architecture.
- • Design and manage advanced ETL/ELT pipelines utilizing PySpark, Databricks Workflows, Data Factory, and Azure Storage services.
- • Lead ML engineering and MLOps implementation, leveraging MLflow for experiment tracking, model registry, versioning, deployment, and monitoring.
- • Establish and enforce enterprise-grade data governance, security, lineage, and compliance using Unity Catalog, Azure AD, and cloud-native controls.
- • Optimize Databricks clusters for performance, scalability, reliability, and cost-effectiveness in production environments
