Karant15/README.md

Hi, I'm Karan Trivedi

MS Data Analytics | Webster University (Dec 2024)
Piscataway, NJ | Open to US opportunities (Remote & Relocate) | STEM OPT Active


What I Do

I build data-driven solutions that solve real business problems - not just notebooks that sit on a laptop.

7+ years of combined experience across healthcare, recruitment, and business analytics - including managing data relationships with 30+ NHS hospitals in the UK. Now applying that domain knowledge to data science.

Lean Six Sigma Black Belt - I don't just find problems in data. I frame them as business solutions.


Open To

Track 1 - Analytics Roles: Data Analyst · Business Analyst · Healthcare Analyst · BI Analyst
Track 2 - Healthcare Recruitment: Account Manager · Client Success Manager · Workforce Analytics Manager

7+ years managing NHS hospital accounts + MS Data Analytics + LSS Black Belt


Release Schedule - 12 Projects | May-September 2026

One new project published every other week. Building in public.

| # | Project | Type | Status | Live |
|---|---------|------|--------|------|
| 1 | Healthcare Workforce Analytics Dashboard | Python · Streamlit | Live | Open |
| 2 | Supply Chain KPI Dashboard + DMAIC + SQL | Python · SQL · Streamlit | Live | Open |
| 3 | Healthcare Readmission ML Pipeline | XGBoost · SHAP · Streamlit | Live | Open |
| 4 | Supply Chain Power BI Dashboard | Power BI · DAX | Building | Releasing May 2026 |
| 5 | SQL Business Analytics Dashboard | SQL · SQLite · Python · Streamlit | Building | Releasing May 2026 |
| 6 | HR Attrition ML Pipeline + SHAP | XGBoost · SHAP · SQL · Streamlit | Planned | Releasing June 2026 |
| 7 | Demand Forecasting ML | Prophet · ARIMA · XGBoost · SQL | Planned | Releasing June 2026 |
| 8 | LLM Chat With Data Tool | LangChain · OpenAI · SQL · Streamlit | Planned | Releasing June 2026 |
| 9 | Finance Fraud Detection ML | XGBoost · SHAP · SQL · Streamlit | Planned | Releasing July 2026 |
| 10 | Resume Analyzer AI Tool | LangChain · Hugging Face · SQL | Planned | Releasing July 2026 |
| 11 | Healthcare RAG Document Q&A | LangChain · ChromaDB · RAG | Planned | Releasing August 2026 |
| 12 | Cricket Analytics Dashboard | Python · Plotly · SQL · Streamlit | Planned | Releasing August 2026 |

Featured Projects

Healthcare Workforce Analytics Dashboard - LIVE

Analyzed 9.6M real US Medicare records to identify physician staffing gaps across all 50 states

  • Processed 1.1M unique providers across 104 medical specialties
  • Built interactive 5-tab Streamlit dashboard with US choropleth maps
  • Applied Lean Six Sigma DMAIC framework to structure recruitment gap analysis
  • Identified Wyoming (97.7%), Vermont and Alaska as most critically underserved states
  • Full analysis run locally on 9.6M records - dashboard shows 50k representative sample
  • Live: https://karan-healthcare-analytics.streamlit.app
  • Stack: Python · Pandas · Plotly · Streamlit · CMS Medicare Data

Healthcare Readmission ML Pipeline - LIVE

End-to-end ML pipeline predicting 30-day hospital readmission risk - 101,745 real patient records

  • Trained and compared 4 models: Logistic Regression, Random Forest, Gradient Boosting, XGBoost
  • XGBoost selected with ROC-AUC 0.598 - best balance for imbalanced medical data
  • SMOTE oversampling to handle 11.2% minority class imbalance
  • SHAP explainability showing clinicians exactly why a patient is flagged high risk
  • Live patient risk predictor with gauge chart and clinical recommendations
  • Live: https://karan-healthcare-ml.streamlit.app
  • Stack: Python · XGBoost · SHAP · SMOTE · Streamlit · UCI Diabetes Dataset
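Why ROC-AUC rather than accuracy: with an 11.2% minority class, a model that always predicts "no readmission" would score about 88.8% accuracy while ranking patients no better than chance. The sketch below computes ROC-AUC from its pairwise definition in plain Python. It is an illustration only, not the code used in the pipeline; the function name `roc_auc` and the toy labels and scores are hypothetical.

```python
from itertools import product


def roc_auc(labels, scores):
    """ROC-AUC as the probability that a random positive outranks a random negative.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p, n in product(pos, neg)
    )
    return wins / (len(pos) * len(neg))


# Hypothetical toy data, not taken from the UCI dataset.
labels = [0, 0, 1, 1]
scores = [0.1, 0.4, 0.35, 0.8]
print(roc_auc(labels, scores))
```

The pairwise form is quadratic in the number of records, so on a dataset of this size a library routine such as scikit-learn's `roc_auc_score` would be the practical choice.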

Supply Chain KPI Dashboard + DMAIC + SQL - LIVE

Analyzed 180,519 real orders - found that 57% of deliveries are late across 23 global regions

  • Only 42.7% on-time delivery rate - Central Africa worst at 60.7% late rate
  • 15 SQL queries via SQLite covering late rates, revenue, customer segments
  • ABC inventory segmentation identifying Class A products driving 80% of revenue
  • Full DMAIC Six Sigma structured analysis - Define through Control
  • Live: https://karan-supply-chain.streamlit.app
  • Stack: Python · SQL · SQLite · Plotly · Streamlit · DMAIC
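The ABC segmentation idea can be sketched in a few lines: rank products by revenue, then assign classes by cumulative revenue share. This is a minimal sketch, not the dashboard's code. The 80% Class A cut comes from the project description; the 95% Class B cut is a common convention assumed here, and the function name and product revenues are hypothetical.

```python
def abc_classify(revenue_by_product, a_cut=0.80, b_cut=0.95):
    """Label each product A, B, or C by cumulative share of total revenue.

    A product's class depends on the share already covered by
    higher-revenue products before it is added.
    """
    total = sum(revenue_by_product.values())
    if total <= 0:
        raise ValueError("total revenue must be positive")
    ranked = sorted(revenue_by_product.items(), key=lambda kv: kv[1], reverse=True)
    classes = {}
    cumulative = 0
    for name, revenue in ranked:
        share_before = cumulative / total
        if share_before < a_cut:
            classes[name] = "A"
        elif share_before < b_cut:
            classes[name] = "B"
        else:
            classes[name] = "C"
        cumulative += revenue
    return classes


# Hypothetical revenues, not from the DataCo dataset.
revenue = {"P1": 500, "P2": 300, "P3": 100, "P4": 60, "P5": 40}
print(abc_classify(revenue))
```

Conventions differ on which class the product that crosses a threshold belongs to; this sketch places it in the higher class.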

Supply Chain Power BI Dashboard - BUILDING

Same 180,519 order dataset rebuilt in Power BI - demonstrating Microsoft stack proficiency

  • 4-page interactive report: Executive Summary, Delivery Performance, Revenue, ABC Inventory
  • DAX measures for KPI calculations
  • Designed for business stakeholders - not just technical audiences
  • Stack: Power BI · DAX · DataCo Supply Chain Dataset

SQL Business Analytics Dashboard - BUILDING

Standalone SQL showcase - 25 advanced queries across healthcare and supply chain datasets

  • Basic through advanced SQL: window functions, CTEs, subqueries, running totals
  • Cross-domain analysis: CMS Medicare + DataCo supply chain in same SQLite database
  • RANK, DENSE_RANK, ROW_NUMBER, LAG, and LEAD across real business datasets
  • Streamlit dashboard showing queries alongside results
  • Stack: SQL · SQLite · Python · Plotly · Streamlit
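As an illustration of the query style listed above, the sketch below runs a CTE with RANK, LAG, and a running total against an in-memory SQLite database from Python. The `orders` table, its columns, and the rows are hypothetical and are not the project's schema. Window functions require SQLite 3.25 or newer.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (region TEXT, month TEXT, revenue REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [
        ("East", "2024-01", 100.0),
        ("East", "2024-02", 150.0),
        ("East", "2024-03", 120.0),
        ("West", "2024-01", 200.0),
        ("West", "2024-02", 180.0),
    ],
)

query = """
WITH monthly AS (
    -- Aggregate to one row per region and month
    SELECT region, month, SUM(revenue) AS revenue
    FROM orders
    GROUP BY region, month
)
SELECT region,
       month,
       revenue,
       RANK() OVER (PARTITION BY region ORDER BY revenue DESC) AS revenue_rank,
       LAG(revenue) OVER (PARTITION BY region ORDER BY month) AS prev_revenue,
       SUM(revenue) OVER (PARTITION BY region ORDER BY month) AS running_total
FROM monthly
ORDER BY region, month
"""

rows = conn.execute(query).fetchall()
for row in rows:
    print(row)
```

Each row carries its rank within the region, the previous month's revenue (NULL for the first month), and the cumulative revenue to date.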

Human Capital Analysis

Predicting employee turnover to reduce hiring costs

  • Analyzed 15,000+ employee records using Logistic Regression and Decision Trees
  • Achieved 90% prediction accuracy - job satisfaction identified as top turnover driver
  • Recommended strategies projected to reduce turnover by 20%
  • Stack: R · Logistic Regression · Decision Trees · k-NN · SVM
  • Repo: Human-Capital-Analysis

Bank Loan Risk Model

Loan default prediction reducing misclassification cost by $3M

  • Built Logistic Regression and Decision Tree models on 5,960 loan applicants
  • Improved sensitivity to 80.65%, reducing false negatives
  • Demonstrated $3M cost reduction through optimized approval strategy
  • Stack: R · Logistic Regression · Decision Trees
  • Repo: Bank-Loan-Decision-Making-Analysis
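The link between sensitivity and misclassification cost can be sketched as follows. Here a false negative is an approved loan that later defaults, and a false positive is a rejected applicant who would have repaid. All counts and unit costs are hypothetical and are not the figures from the repo, which is written in R; Python is used only for illustration.

```python
def sensitivity(tp, fn):
    """Share of actual defaulters that the model flags: TP / (TP + FN)."""
    return tp / (tp + fn)


def misclassification_cost(fp, fn, cost_fp, cost_fn):
    """Total cost of wrong decisions, weighting each error type by its unit cost."""
    return fp * cost_fp + fn * cost_fn


# Hypothetical unit costs: a missed default is assumed to cost far more
# than a wrongly rejected good applicant.
COST_FN = 50_000
COST_FP = 5_000

# Hypothetical confusion-matrix counts before and after lowering the
# decision threshold to catch more defaulters.
baseline = {"tp": 60, "fn": 40, "fp": 50}
tuned = {"tp": 80, "fn": 20, "fp": 90}

baseline_cost = misclassification_cost(baseline["fp"], baseline["fn"], COST_FP, COST_FN)
tuned_cost = misclassification_cost(tuned["fp"], tuned["fn"], COST_FP, COST_FN)

print(sensitivity(tuned["tp"], tuned["fn"]))
print(baseline_cost - tuned_cost)
```

Raising sensitivity usually adds false positives, so the net saving depends on the ratio of the two unit costs.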

Consumer Segmentation Analysis

Customer segmentation and brand loyalty prediction

  • Segmented 600 consumer profiles using K-Means clustering
  • Applied Random Forest and Logistic Regression for brand loyalty prediction
  • Built for AXANTEUS market research agency
  • Stack: R · K-Means · Random Forest · Logistic Regression
  • Repo: Consumer-Segmentation-Analysis

Currently Learning

| Course | Platform | Section | Target |
|--------|----------|---------|--------|
| Data Analysis: SQL · Power BI · Tableau · Excel | Udemy | SQL section - feeds Project 5 directly | May 2026 |
| Google Data Analytics Professional Certificate | Coursera | Week 4 | May 2026 |
| Microsoft PL-300 Power BI Associate | Microsoft Learn | After Power BI section | June 2026 |
| Unilever Supply Chain Analytics | Coursera | Week 7 | June 2026 |

Tech Stack

Languages:        Python · R · SQL (basic through advanced: window functions, CTEs)
Visualization:    Plotly · Streamlit · Power BI · Tableau · Seaborn
ML/Analytics:     Scikit-learn · XGBoost · SHAP · Logistic Regression · Decision Trees
                  Random Forest · Clustering · Time Series · Predictive Modeling
Imbalance:        SMOTE (imbalanced-learn)
Database:         SQL · SQLite · PostgreSQL · MySQL · Excel (Advanced) · DAX
AI/LLM:           LangChain · OpenAI API · Hugging Face · ChromaDB RAG (coming soon)
Process:          Lean Six Sigma Black Belt · DMAIC · SIPOC · RCA · FMEA
Domain:           Healthcare · Supply Chain · HR Analytics · Finance · Recruitment

Certifications

  • Lean Six Sigma Black Belt - Benchmark Six Sigma (2021)
  • Lean Six Sigma Green Belt - Benchmark Six Sigma (2021)
  • MS Data Analytics - Webster University (Dec 2024) | GPA 3.31
  • Google Data Analytics - Coursera (in progress)
  • Microsoft PL-300 Power BI - Microsoft Learn (in progress)

Experience Highlights

Senior Accounts Manager - ID Medical LLP (Healthcare Staffing, UK)
Managed data relationships with 30+ NHS hospitals · Improved forecasting accuracy by 25% · 15% YoY revenue growth

Senior Recruitment Consultant - QX KPO Services
453 shifts booked in one month · £25,000 revenue · Led 4-member analytics team

International Peer Mentor & Writing Coach - Webster University
CRLA Level 2 Certified · Improved student outcomes 94%


Let's Connect

krntrivedi@gmail.com · LinkedIn · Healthcare Dashboard · Supply Chain Dashboard · ML Pipeline · GitHub


12 projects in progress. One new deployment every 2 weeks. Check back soon.

Popular repositories

  1. Human-Capital-Analysis (Public)

    Predicting employee turnover using Logistic Regression, Decision Tree, k-NN & SVM on 14,999 employees. Decision Tree achieved 97% accuracy & 0.97 AUC. Built in R. Dataset: 10 attributes.

    R

  2. Bank-Loan-Decision-Making-Analysis (Public)

    Predicting home improvement loan defaults using Logistic Regression & Decision Tree in R. 77.47% accuracy, 80.65% sensitivity, $1.165M cost reduction. Dataset: 5,960 applicants | 13 variables.

    R

  3. Consumer-Segmentation-Analysis (Public)

    Consumer segmentation & brand loyalty prediction for 600 profiles using K-Means clustering, Logistic Regression & Random Forest. Built for AXANTEUS market research agency. Built in R.

    R

  4. Karant15 (Public)

  5. Healthcare-Workforce-Analytics (Public)

    Analyzing 9.6M US Medicare records to identify physician staffing gaps and recruitment priorities across 50 states

  6. Supply-Chain-Analytics (Public)

    Analyzed 180,519 supply chain orders to identify delivery failures and inventory gaps using Lean Six Sigma DMAIC framework