Every classification model example I've seen uses the binary target on the home team winning or losing. So that's how I've always built my classification models. Target is "home_win" or some variation of that.

I started thinking today about cutting my feature set in half and only using data related to the home team. Partly as an efficiency exercise. Reduce the number of variables in play, don't throw the kitchen sink at the model.

What prompted this was my collection of "differential" variables. For example, if I have "away average points" and "home average points," I'll turn that into "home average points differential" and use that engineered feature instead of the two individual features. But my differential features are always keyed to the home team, and those features are consistently among the best performers.

I'll backtest my theory of course, but I was curious how many of y'all use models that emphasize home team data vs a mix of both home and away.