Senior Data Solution Architect with 11+ years of experience designing scalable cloud-native data platforms, real-time analytics systems, and AI-ready Lakehouse architectures across AWS, Azure, and GCP. Expertise in Databricks, Snowflake, Spark, Kafka, ETL/ELT pipelines, and data warehousing, with a strong track record of optimizing performance, reducing infrastructure costs, and delivering enterprise-scale data solutions across healthcare, finance, and cloud ecosystems.
Little more about me!
A quick snapshot of my toolkit

Designed and led the development of a real-time healthcare analytics platform integrating EHR and claims data using Apache Kafka, Apache Flink, and AWS Kinesis.
Enabled predictive insights for population health management and reduced data processing latency by 60%.
Deployed HIPAA-compliant data pipelines with Apache NiFi and Airflow on AWS, enhancing care quality and regulatory compliance.

Led the migration of legacy on-premises data infrastructure to a unified cloud-based lakehouse using Databricks and Delta Lake on Azure.
Streamlined ETL workflows using Apache Spark and Talend, improving data refresh rates by 70%.
Integrated machine learning models with MLflow to forecast energy demands, increasing predictive accuracy by 30%.
Developed scalable ETL pipelines with Apache Beam, Python, and Google Cloud Dataflow, processing over 10 million financial records daily.
Designed a cloud-native data lake on GCP, enabling seamless access to structured and unstructured data for cross-team analytics.
Implemented automated data validation and quality checks using Great Expectations, reducing data inconsistencies by 40%.
Designed and deployed a centralized ML Feature Store using Databricks, MLflow, and Feast, enabling 3× faster model iterations. Reduced fraud detection false positives by 18% through real-time feature engineering.