Skip to content

Feature Engineering & Offline/Online Parity

Feature design principles

  • features must be computable before match start,
  • features must be stable across environments,
  • feature definitions are deterministic and versioned.

Feature categories

Examples include: - team historical performance, - recent form statistics, - head-to-head aggregates, - contextual signals (home/away, rest days).


Offline feature pipeline

Offline features are: - computed in DVC pipelines, - stored as versioned feature tables, - validated via data contracts.


Online feature parity

The system enforces train/serve parity: - feature logic is shared between offline and online code paths, - no ad-hoc transformations at inference time.

If parity cannot be guaranteed, the feature is excluded from the model.


Anti-patterns avoided

  • manual feature hacks in notebooks,
  • inference-only features,
  • implicit feature ordering.