Data & Model Monitoring (Evidently)¶
Status: 📋 Planned — architecture designed; Evidently not yet integrated or deployed.
No drift detection is active today. The service is monitored via Prometheus metrics only — see Metrics.
Intended usage¶
Evidently will be used to: - detect feature distribution drift against the training baseline, - monitor prediction distribution changes over time, - generate offline batch reports as a first step.
Planned monitored signals¶
Data drift¶
- Feature distribution shift (PSI / KS test) against training reference.
- Missing or unexpected values.
- Schema changes.
These connect to the data pipeline in src/data/ and the feature engineering layer in src/features/.
Prediction drift¶
- Output probability distribution shift.
- Class balance changes.
Ground truth evaluation¶
- Match results arrive ~90 minutes after kick-off.
- Full accuracy evaluation requires a ground truth feedback loop — planned separately from drift detection.
Offline-first plan¶
- Batch Evidently report on recent predictions vs training baseline (initial phase).
- Export key drift signals as Prometheus metrics (future phase).
- AlertManager rule on confirmed drift signal (future phase).
Related¶
- Data — source of training reference distributions
- ML — model lifecycle and retraining triggers
- Metrics (implemented)
- Monitoring Status