Adding Context To Existing Hockey Possession Metrics: An Update On The Advancements In Modeling

Back in February I presented the first post on adding valued context to existing hockey possession metrics. The need for enhancing stats of this nature was to provide additional (missing, but meaningful) context to the existing advanced analytics.

Since February I’ve been working on a series of analytics models that provide enhanced resolution and greater context for today’s standard possession metrics. The focus of the initial effort was to normalize all game metrics and make comparable throughout the season. This will assist in the identification of false peaks and valleys, identify quality of play regardless of game results and assist in identifying strengths and weaknesses in the Capitals as a team.

The purpose of this post is to provide a very brief update on the progress of modeling and goals for the upcoming 2023-24 season.

OPPOSITION MATTERS

But first, a brief recap. Our first step in the model development was to factor in the strength of opposition for each game played. In short an expected goals for percentage (xGF%) against the Anaheim Ducks is not comparable to an xGF% against the Boston Bruins. In other words, how well did the Capitals perform considering the strength of opposition?

The first version of the model simply multiplied the Capitals expected goals for percentage for a game by the oppositions pre-game winning percentage. In addition, it was determined that an average game score was an xGF% of 50.0% against a .500 team, which equates to constant value of 25. Thus, 25 is subtracted from the product of (xGF% x OppWin%) in order to derive a differential.

[xGF%(game) X OppWin%] – 25

Pretty straight forward, but now we can more accurately compare any game against any other game because we have normalized the standard game values.

The following graphic is from the initial post, which summarizes the first 53 games of the Capitals season.

In taking a quick glance at the resultant “game scores”, the Capitals overall performance over time begins to stand out. You can clearly see the slow start to the season but improvement in performance in November, the positive spike in performance in December and the early signs of a wobble and decline in January.

We can also ascertain, to a certain degree, the significance (weight) of the trend. Rather than seeing games as over or under the 50% plateau, we can also see weighted scoring for each game, with the strength of opposition factored in. Game score values over stretches of games can be summed to find a higher level of understanding.

VERSION 2

Like all initial iterations of an analytics model, anomalies surface that require assessment, modification and fine tuning. Towards the end of the 2022-23 season we begin to modify the initial model to provide additional resolution to the quantitative performance measures.

Enhanced Performance Index

There is no question that teams win and lose games they shouldn’t have won or lost. All of the stats, eye-tests and supporting data say the team outplayed the opposition, but because of all sorts of outside factors, including puck luck, penalties, injuries in the game, etc., the final results didn’t agree.

In the first release of the model, it was noticed that there were three or four games games where the performance score did not completely agree with the overall performance of the team. As a result, it was determined that by adding the goals differential and expected goals scored differential, the final game performance score was more accurately represented when compared to the ground truth.

[[xGF%(game) x OppWin%] – 25] + (GF-GA) +(xGF -xGA)

The following screen shot from the second-generation model reflects those changes for all games since the return from the 2023- All-Star break, and provides new enhanced game scores. It also provides a color coding for each game to assist in identifying the trends of the team.

You may recall the Capitals dominating the Boston Bruins in the first game back from the All-Star Break, followed by a dud of a performance against the San Jose Sharks and a solid performance against the Carolina Hurricanes in the first three games back from the break. You can also clearly identify the slow decay to the season that follows.

VERSION 3 AND NEXT STEPS

Up to this point I’ve focused on the performance of the team as a whole, but in order to gain greater insight, we need to also apply advanced performer measures to logical components of the team, including forward lines, defensive pairs and individual performance. Next steps for version 3 of the model will begin to account for those factors, as well as begin to consider:

injuries and other lineup variations against strength of opposition,
line configuration and performances vs. strength of opposition,
odds and betting lines.

Incremental Development

The aforementioned fortification of the expected goals stat is only one additional brush stroke to the overall painting, but as we’ve often stated, the more brush strokes the better. In follow-up posts we will look to build on these qualitative performance measures as well as explore other areas for enhancing the meaning of other existing advanced metrics.

[The statistics used in this post are courtesy of Natural Stat Trick and the NoVa Caps Advanced Analytics Model (NCAAM). If you’d like to learn more about the statistical terms used in this post, please check out our NHL Analytics Glossary]

Much more to come this summer and fall.

By Jon Sorensen