Model Retraining

How to trigger, monitor, and validate a model retraining run.

Work in progress

This page is a placeholder; it will be updated after the first automated retraining cycle.

When to Retrain

Retraining should be triggered when any of the following conditions are met:

  1. Scheduled: Weekly retraining job via Airflow DAG (retrain_model_dag)
  2. Drift detected: Evidently reports feature drift above threshold (see Monitoring)
  3. Manual: New season data available or model performance drops below baseline
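The trigger conditions above can be sketched as a small decision helper. This is illustrative only: the threshold value, interval, and function name are assumptions, not taken from the actual DAG or monitoring config.

```python
from datetime import datetime, timedelta

# Illustrative values; the real ones live in the Airflow/monitoring config.
DRIFT_THRESHOLD = 0.2
RETRAIN_INTERVAL = timedelta(days=7)

def should_retrain(last_run: datetime, now: datetime,
                   drift_score: float, manual_request: bool) -> bool:
    """Return True if any retraining condition is met."""
    scheduled = now - last_run >= RETRAIN_INTERVAL  # weekly schedule
    drifted = drift_score > DRIFT_THRESHOLD         # Evidently drift report
    return scheduled or drifted or manual_request   # manual trigger
```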

Steps

1. Pull Latest Data

dvc pull data/processed/

2. Run Full Pipeline

dvc repro

This re-runs all changed stages: feature engineering → training → evaluation.
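For reference, a dvc.yaml describing such a three-stage pipeline might look like the sketch below. Stage names, scripts, and paths are illustrative and not taken from this repository.

```yaml
stages:
  features:
    cmd: python src/features.py
    deps: [data/processed/]
    outs: [data/features/]
  train:
    cmd: python src/train.py
    deps: [data/features/]
    outs: [models/model.pkl]
  evaluate:
    cmd: python src/evaluate.py
    deps: [models/model.pkl]
    metrics: [metrics.json]
```

`dvc repro` only re-executes stages whose dependencies changed, so an unchanged feature set skips straight to training.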

3. Review Metrics in MLflow

mlflow ui --backend-store-uri ./mlruns

Compare the new run against the model version currently holding the champion alias.
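The comparison can also be scripted. A minimal sketch: the metric key `log_loss` and the tie-breaking rule are assumptions about this project, and the commented-out retrieval code requires a reachable tracking store.

```python
def challenger_beats_champion(new_metrics: dict, champ_metrics: dict,
                              metric: str = "log_loss") -> bool:
    """Lower log-loss wins; ties go to the challenger (trained on newer data)."""
    return new_metrics[metric] <= champ_metrics[metric]

# Fetching the metrics via the MLflow client (needs a tracking store):
# from mlflow.tracking import MlflowClient
# client = MlflowClient()
# champ = client.get_model_version_by_alias("soccer-predictor", "champion")
# champ_metrics = client.get_run(champ.run_id).data.metrics
```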

4. Promote if Criteria Pass

# Via the MLflow Python client
from mlflow.tracking import MlflowClient

client = MlflowClient()
client.set_registered_model_alias("soccer-predictor", "champion", version=<new_version>)

Promotion rules are defined in Model Registry & Promotion Rules.

5. Redeploy API

After promotion, restart the API pod to load the new model:

kubectl rollout restart deployment/soccer-api -n soccer
kubectl rollout status deployment/soccer-api -n soccer

Validation Checklist

  • [ ] dvc repro completes without errors
  • [ ] New model log-loss ≤ current champion
  • [ ] No data leakage detected (temporal split audit)
  • [ ] API /ready returns 200 after pod restart
  • [ ] Smoke test prediction returns valid probabilities
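The last two checklist items can be automated with a small smoke test. The probability check below is self-contained; the commented-out HTTP calls assume the `/ready` and `/predict` paths and a `probabilities` response field, which should be verified against the actual API.

```python
import math

def valid_probabilities(probs: list[float], tol: float = 1e-6) -> bool:
    """Valid if every probability is in [0, 1] and they sum to 1."""
    return (all(0.0 <= p <= 1.0 for p in probs)
            and math.isclose(sum(probs), 1.0, abs_tol=tol))

# Smoke test against the live service (endpoints are assumptions):
# import requests
# assert requests.get("http://soccer-api/ready", timeout=5).status_code == 200
# resp = requests.post("http://soccer-api/predict", json=payload, timeout=5)
# assert valid_probabilities(resp.json()["probabilities"])
```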

Automated Retraining

Automated retraining via Airflow is planned. See Airflow DAGs.