A showcase of practical projects demonstrating my skills in data science, analytics, and visualization.
Tech Stack: Python, R (psych, lavaan), Apache Airflow, Streamlit, Docker, PostgreSQL
Developed a comprehensive clinical analytics platform simulating real-world workflows – from raw EHR and PRO/COA data ingestion to survival analysis, psychometric evaluation, and interactive dashboards.
Tech Stack: Kafka, Spark Structured Streaming, Airflow, Flask, Plotly, Parquet, Tableau
Built a real-time data pipeline simulating e-commerce product clicks using Kafka, Spark, and Airflow. Live dashboards powered by Flask + Plotly and advanced analytics enabled via Tableau.
Tech Stack: Python, SQL, Apache Airflow, Prophet, Scikit-Learn, Power BI, Parquet
Developed an end-to-end procurement analytics pipeline simulating SAP ERP data ingestion, processing, forecasting, and risk assessment. The solution automates ETL workflows and delivers actionable insights to support procurement decision-making.
Developed an interactive dashboard using D3.js to visualize car performance data with coordinated charts and brushing features.
Tools: D3.js, JavaScript, HTML/CSS
Built a CNN-based pipeline to classify and localize brain tumors in MRI scans. Achieved 99.65% accuracy with VGG19.
Tools: Python, TensorFlow, Keras
Built a Natural Language Processing pipeline to detect fake news from online articles using supervised learning techniques. The project combines text cleaning, feature extraction, and classification models to separate real and fake news effectively.
Tools: Python, Scikit-learn, Pandas, NLTK, Matplotlib
Built a regression model to forecast sales for car seat products across different retail stores. The project involved exploratory analysis, feature selection, and predictive modeling using both linear and ensemble techniques.
Tools: R, glmnet, caret, ggplot2, dplyr