CLINICAL METHODOLOGY

How we measure outcomes

Transparent methodology built on NCQA standards, published trial data, and validated ML models.

01

Medication Adherence (PDC)

We measure adherence using Proportion of Days Covered (PDC), the NCQA-endorsed standard for medication adherence measurement.

PDC = (Days with medication on hand) / (Total days in measurement period) × 100

A patient is considered adherent when PDC ≥ 80%, consistent with CMS quality measures. For GLP-1 specifically, we account for:

  • Weekly injection schedules (Semaglutide, Tirzepatide) vs daily (Liraglutide)
  • Titration periods where dose frequency may differ
  • 30-day, 60-day, and 90-day measurement windows
  • Gap tolerance of 7 days for weekly medications
Reference: Nau DP. Proportion of Days Covered (PDC) as a Preferred Method of Measuring Medication Adherence. Pharmacy Quality Alliance, 2012.
02

Dropout Risk Scoring

Our dropout risk model uses a gradient-boosted classifier (XGBoost) trained on patient features to predict 90-day discontinuation probability.

Features Used

  • Days since last observation (strongest predictor)
  • Medication refill gap patterns (avg, max, standard deviation)
  • Weight loss trajectory (velocity and trend)
  • Days enrolled in program
  • Number and type of observations
  • Comorbidity count and type
  • Age, BMI at enrollment
  • Program medication type

Model Performance

Validated via 5-fold cross-validation on historical data:

MetricValue
AUC-ROC0.92-0.99
F1 Score0.88-0.96
Precision0.85-0.95
Recall0.80-0.94

Risk Categories

CategoryScore RangeRecommended Action
Low0-30Standard monitoring
Medium31-60Automated refill reminder, flag for review
High61-100Outreach call within 48 hours, care coordinator referral
Reference: Chen T, Guestrin C. XGBoost: A Scalable Tree Boosting System. KDD 2016.
03

Weight Trajectory Forecasting

Per-patient weight forecasting uses exponential smoothing with trend damping, calibrated against clinical trial trajectories.

Population-level forecasting aggregates individual trajectories with confidence intervals. Key adjustments:

  • Trial-calibrated response curves by medication (STEP, SURMOUNT, SCALE)
  • Real-world effectiveness factor of 0.85x trial results
  • Plateau detection at 6-9 months (typical for GLP-1)
  • Weight regain modeling for patients who discontinue
  • Confidence bands widen with forecast horizon
Reference: Wilding JPH et al. Once-Weekly Semaglutide in Adults with Overweight or Obesity. NEJM 2021;384:989-1002.
Reference: Jastreboff AM et al. Tirzepatide Once Weekly for the Treatment of Obesity. NEJM 2022;387:205-216.
04

ROI Calculation

Financial impact estimates are derived from published health economics literature:

Cost Components

ComponentPer-Patient EstimateSource
Healthcare cost reduction$280 per 1% body weight lossCawley et al., J Health Econ 2015
Absenteeism reduction$1,200/year per 10% lossCDC Worksite Health ScoreCard
Disability claims$2,400/year per 10% lossFinkelstein et al., JOEM 2010
Drug cost (GLP-1)$12,000/year avgGoodRx market data 2025

Key Assumptions

  • Real-world effectiveness = 85% of clinical trial weight loss
  • Savings accrue linearly with weight loss percentage
  • Dropout patients incur 50% of annual drug cost (avg 6mo before discontinuation)
  • No savings attributed to patients who do not achieve measurable weight loss
Reference: Cawley J et al. Savings in Medical Expenditures Associated with Reductions in Body Mass Index. J Health Economics 2015.
05

Patient Segmentation

K-Means clustering identifies distinct patient response patterns across the population:

SegmentCharacteristics
Strong Responder>15% weight loss, high adherence, consistent engagement
Moderate Responder5-15% weight loss, regular refills
PlateauInitial weight loss followed by stagnation
At RiskDeclining engagement, widening refill gaps
DisengagedMinimal observations, low adherence

Segmentation enables targeted intervention strategies rather than one-size-fits-all approaches.

06

Clinical Data Validation

All clinical data undergoes validation before entering the analytics pipeline:

  • LOINC code verification for all observation types
  • Physiological range checks (e.g., weight 50-700 lbs, A1C 3-20%)
  • Outlier detection via Isolation Forest algorithm
  • Duplicate record detection and idempotent upsert logic
  • Patient-reported vs clinical data flagging
  • FHIR R4 compliance for all EHR-ingested records
Reference: Regenstrief Institute. LOINC — Logical Observation Identifiers Names and Codes. https://loinc.org
07

Survival Analysis

Program retention is modeled using Kaplan-Meier survival analysis with optional Cox Proportional Hazards for covariate adjustment.

This produces the retention curve shown on the dashboard, answering: "What percentage of patients remain active at day X?"

  • Censoring applied for patients still active (right-censored)
  • Stratification by program, comorbidity, and enrollment cohort
  • Median retention time reported with 95% confidence interval
  • Hazard ratios identify factors that accelerate or delay dropout
Reference: Kaplan EL, Meier P. Nonparametric Estimation from Incomplete Observations. JASA 1958;53(282):457-481.
PEER REVIEW

This methodology is reviewed by our clinical advisory board and updated quarterly. For questions or feedback, contact clinical@pathriva.com