SYSTEM ONLINE

Vardhan Seepala

Lead Data Engineer | Data Systems Architect

Building scalable, secure data platforms across AWS, Azure, and GCP. Focused on measurable impact: lower cost, faster insight, higher reliability.

01. CORE COMPETENCIES

PRIMARY STACK

Python / PySpark 95%
Cloud Architecture (AWS/Azure/GCP) 90%
SQL / Data Modeling 95%

TECHNOLOGY MATRIX

Languages
Python SQL Java Scala
Data Engineering
Spark Airflow Kafka dbt Databricks
Infrastructure
Terraform Docker Kubernetes CI/CD

02. PROFESSIONAL TRACK RECORD

Lead Data Engineer

Jun 2023 – Present
Viral Nation
  • Spearheaded CDC pipelines (Cloud SQL → BigQuery) using Datastream, reducing transformation overhead by 40%.
  • Deployed Terraform templates to configure Datastream on a private VPC, slashing manual setup time by 50%.
  • Engineered DBT models unifying 10+ data sources, reducing data errors by 80%.
  • Architected Dataflow pipelines streaming MongoDB/PostgreSQL data to BigQuery for near real-time decision-making.
  • Led deployment of brand detection models via Kafka for 1M+ daily interactions.
  • Built CI/CD pipelines for AI model deployment, cutting time-to-market by 35%.
  • Optimized PySpark-based ETL pipelines in Azure Synapse, enabling real-time campaign performance insights.

Data Engineer / Analyst Co-Op

Jan 2023 – Apr 2023
BDO
  • Redesigned Azure Data Factory pipelines, reducing processing time from 13 to 9 hours (30% cost savings).
  • Developed Power BI dashboards integrating MQTT server data with digital twin metrics for real-time production tracking.
  • Automated KQL query generation, accelerating data extraction by 25%.

Senior Software Engineer

Aug 2019 – Dec 2021
Oracle Cerner
  • Designed AWS EMR pipelines, reducing processing costs by 30% and runtime by 50%.
  • Streamlined AWS IAM role creation via CloudFormation, reducing manual errors by 90%.
  • Built automated testing scripts (Java/Python), cutting integration testing time by 70%.
  • Standardized healthcare data using Java, Python, and microservices to meet global compliance.
  • Optimized AWS Glue ETL pipelines and managed Redshift clusters for large-scale analytics.

03. EDUCATION

University of Windsor

Windsor, ON, Canada

Master of Applied Computing

Sir MVIT

India

Bachelor of Engineering in Computer Science

04. SELECTED PROJECTS

Multi-Cloud Migration

Orchestrated on-prem to AWS/Azure/GCP migration of 100TB+ data with zero downtime. Achieved 60% OpEx reduction.

Terraform Kubernetes Airflow

Enterprise Data Vault 2.0

Consolidated 15+ systems into an auditable EDW. 10TB+ daily processing with 99.95% uptime.

Snowflake dbt Python

Medallion Data Lake

Designed Bronze/Silver/Gold lake on Azure Databricks processing 50TB+/day. Enabled 200+ self-serve users.

Databricks Spark Azure

Real-time Marketing

Built Kafka → Dataflow → BigQuery pipeline handling 1M+ events/day with sub-30s latency.

Kafka BigQuery Looker