Data Scientist and Architect specializing in scalable data platforms, machine learning, and cloud systems. Driving public health data modernization in Chicago, serving over 2.7M citizens through interoperable, analytics-ready solutions built on Azure and big data technologies.
High-impact Data Scientist and Data Architect with deep expertise in designing scalable, production-grade data models that drive decision-making across public sector and healthcare environments. Specializing in transforming fragmented, high-volume datasets into centralized, interoperable platforms that support analytics, governance, and regulatory compliance.
Currently at the City of Chicago, leading data modernization efforts to consolidate multi-agency sources into unified, longitudinal models that enable real-time insights and strategic planning. Experienced in USCDI standards, HL7/FHIR integration, and building data infrastructure that scales securely across industries.
Core Competencies: Data Architecture • Cloud Solutions • Scalable Data Pipelines • Machine Learning • Metadata Governance • Standards-Driven Data Modeling • Agile Cross-Functional Collaboration
Illinois Institute of Technology
GPA: 3.7/4.0 • 2023
Specialized in Big Data Technologies, Neural Networks, Time Series Analysis, Data Mining, Statistical Learning, and Cloud Computing
Amrita Vishwa Vidyapeetham
Computer Science • GPA: 8.95/10.0 • 2021
Focus on Neural Networks, Machine Learning, Cloud Computing, Database Management, and Software Engineering
Exploring new technologies and building cool stuff using it. Attending Hackathons. Code is my canvas, data is my art.
Finding clarity and inspiration in nature. Every hike teaches patience, perspective, and perseverance.
Running towards goals, both personal and professional. Maintaining balance through active lifestyle.
Driving data modernization and interoperability efforts serving 2.7M+ citizens through scalable Azure-based data systems.
Built predictive analytics and IoT-driven insights to optimize industrial heating systems and minimize downtime.
Mentored 200+ graduate students in Big Data Technologies and scalable data engineering practices using Hadoop, Spark, and Kafka.
Led editorial operations for the college newspaper, overseeing content strategy, publication workflow, and a cross-functional team of writers and editors.
Developed marketing analytics and customer retention models to drive sales and optimize business performance.
Real-time CV system achieving 99% mask detection and 93% pose accuracy, reducing violations by 15%.
CatBoost models increasing impressions by 20%, traffic by 10%, and ROI by 15%.
Python DBMS with B+-tree indexing reducing query times by 30%.
ML model with 92% accuracy enabling retention strategies.
Real-time anomaly detection reducing false positives by 20%.
ARIMA and LSTM models predicting sales with 95% accuracy.