Jump to content

[Discussion] Analytics in Hockey


Recommended Posts

A thread for the discussion of how analytics are developed, implemented, and used to draw meaning about the value of players. I'll build on this intro post later and provide some resources, but I wanted to get started by going on a bit of a rant in my second post, as I have some thoughts about analytics in hockey but haven't really had a place to voice my thoughts.

  • Upvote 1
Link to comment
Share on other sites

Felt like I was in STAT 226 all over again reading that.

 

Good thoughts.

 

I like to think that theoretically, the perfect set of statistical analyses exist out there that can perfectly measure a player's effectiveness. Unfortunately, the complexity of this "perfect analysis" is beyond what is reasonably possible and a drastically more efficient proxy, albeit still imperfect proxy, is simply, the "eye test". I believe this is more or less what proponents of the eye test are claiming when they express their disdain for advanced stats.

 

All that said, I still enjoy dabbling into advanced stats (though I focus more on basic stats) as a complement to my eye test, not a replacement. If my eye test and the statistics mismatch, I've learned over the years to lean more on my eye test than the statistics. It's more likely that there is something flawed with my statistic than my 20 years of eye test experience is flawed.

  • Upvote 3
  • ThereItIs 1
Link to comment
Share on other sites

4 minutes ago, -AJ- said:

Felt like I was in STAT 226 all over again reading that.

Good thoughts.

I like to think that theoretically, the perfect set of statistical analyses exist out there that can perfectly measure a player's effectiveness. Unfortunately, the complexity of this "perfect analysis" is beyond what is reasonably possible and a drastically more efficient proxy, albeit still imperfect proxy, is simply, the "eye test". I believe this is more or less what proponents of the eye test are claiming when they express their disdain for advanced stats.

All that said, I still enjoy dabbling into advanced stats (though I focus more of basic stats) as a complement to my eye test, not a replacement. If my eye test and the statistics mismatch, I've learned over the years to lean more on my eye test than the statistics. It's more likely that there is something flawed with my statistic than my 20-years of eye test experience is flawed.

I don't even think that we need a perfect measure of a player's effectiveness, but to grow the idea of analytics as a valuable tool it is important to stop doing things that we know are wrong. Guys in the Twitter analytics community are being incredibly hypocritical claiming that they see value in advanced stats while doing things that directly undermine the value of advanced stats. 

  • Upvote 2
Link to comment
Share on other sites

7 hours ago, Down By the River said:

I don't even think that we need a perfect measure of a player's effectiveness, but to grow the idea of analytics as a valuable tool it is important to stop doing things that we know are wrong. Guys in the Twitter analytics community are being incredibly hypocritical claiming that they see value in advanced stats while doing things that directly undermine the value of advanced stats. 

 

Yeah, whenever I see some advanced stat I'm unfamiliar with, my first thought always goes to things like the collection methods etc. My brother would talk to me about high danger scoring chances, but before I consider that stat, it's important to define exactly what a "high danger scoring chance" is and assess the level of objectivity with which these chances can be recorded. How much wiggle room is there in determining whether a chance is high danger or not? Does our definition of a "high danger" chance even actually realistically predict for goal scoring?

 

Feels like there's a progression when it comes to peoples' perceptions of stats in sports from distrust when they first learn about them, to full trust when they begin digging into them, and then back to a bit of skepticism again when you really get into it.

  • Cheers 2
Link to comment
Share on other sites

25 minutes ago, -AJ- said:

Yeah, whenever I see some advanced stat I'm unfamiliar with, my first thought always goes to things like the collection methods etc. My brother would talk to me about high danger scoring chances, but before I consider that stat, it's important to define exactly what a "high danger scoring chance" is and assess the level of objectivity with which these chances can be recorded. How much wiggle room is there in determining whether a chance is high danger or not? Does our definition of a "high danger" chance even actually realistically predict for goal scoring?

Feels like there's a progression when it comes to peoples' perceptions of stats in sports from distrust when they first learn about them, to full trust when they begin digging into them, and then back to a bit of skepticism again when you really get into it.

My excitement to see analytics brought into hockey was quickly replaced with resentment at how it seemed that people just wanted to dupe NHL teams into giving them jobs as heads of analytic departments using smoke and mirror techniques. The people who are hiring heads of analytic departments probably don't have the background knowledge to determine whether they can trust the metrics being peddled. 

  • Cheers 1
Link to comment
Share on other sites

You have explained the number one problem in so called advanced stats in hockey.  

 

They saw the nice, neat, one number WAR in baseball and tried to replicate it in hockey. It absolutely sucks balls as a measure. 

 

On the topic of shot quality measurements, I seem to recall Calgary Flames being the analytic darlings last year who were deemed to be ready to break out and "regress back to mean" throughout the whole season, but they never did. Watching them play, you could tell that they dominated games in terms of quantity of shots from everywhere, but rarely had east to west plays to set up those shots.  

 

Kevin Woodley (the goalie analyst) does dip into some private analytics that look at chances more in depth then what's available publicly. 

 

I used to get annoyed when I heard Thomas Drance and Rachel Doerrie talking about advanced stats as they are both just talking out of their ass, but I've learned to just let it go and look at it all with a grain of salt. Expecting someone who studied English Literature and Sports Management to understand what they know or don't know with statistics is foolish. 

  • Like 1
Link to comment
Share on other sites

1 hour ago, AatuD2 said:

You have explained the number one problem in so called advanced stats in hockey.  

 

They saw the nice, neat, one number WAR in baseball and tried to replicate it in hockey. It absolutely sucks balls as a measure. 

 

On the topic of shot quality measurements, I seem to recall Calgary Flames being the analytic darlings last year who were deemed to be ready to break out and "regress back to mean" throughout the whole season, but they never did. Watching them play, you could tell that they dominated games in terms of quantity of shots from everywhere, but rarely had east to west plays to set up those shots.  

 

Kevin Woodley (the goalie analyst) does dip into some private analytics that look at chances more in depth then what's available publicly. 

 

I used to get annoyed when I heard Thomas Drance and Rachel Doerrie talking about advanced stats as they are both just talking out of their ass, but I've learned to just let it go and look at it all with a grain of salt. Expecting someone who studied English Literature and Sports Management to understand what they know or don't know with statistics is foolish. 

 

Everyone should be immediately skeptical whenever Drance say something along the lines of "Team X out-chanced Team Y" because he basically relies on data that are basically just pulling shot location information from NHL.com to determine quality scoring chances. It is insanely frustrating to have him tout analytics while simultaneously misusing the term bell curve on a daily basis. 

 

But you are 100% right that it is far healthier to just let it go, lol. 

  • Cheers 1
  • Upvote 1
Link to comment
Share on other sites

Great post! And spot on.

 

It always bothered me observing people stormimg down from the proverbial mountain top with a book of 'advanced' stats in their hand acting like they're the word of freaking God. Like... where do you even start, you know?

 

Here is a really good blog post/article I I found awhile back discussing some of the deeper methodological issues on NHL analytics maybe you'd like to read:

 

https://www.neutralzone.net/mens/2023/03/15/a-journey-in-hockey-analytics-and-the-shortcomings-of-the-discipline/

 

 

Link to comment
Share on other sites

Excellent points about independence and lack of standardization in classifying countable events.  On top of that there are misleading proxy stats, like shots as a measure of possession.   Contemporary 3 on 3 overtime and the old soviet teams show how wrong that is.  No doubt some proprietary analytics are more reliable than the lazy slop that dominates public discussion, but as of now, analytics seems more like a good idea than a reality. 

Link to comment
Share on other sites

  • 2 months later...

PDO, lol. 

 

Why is PDO a bad indicator of team performance?

  1. Does not capture quality of scoring chances
  2. Score effects. Related to the above point, teams that lead tend to take fewer shots, teams trailing tend to get desperate and take lots of shots from poor angles. 
  3. Fails to account for shot attempts
  4. If PDO does include 5-4 play, it penalizes teams who (a) have above-average PPs and/or (b) generate a lot of PP opportunities
  5. Does not account for better defensive structure that clogs up middle of ice (hallmark of Tocchet's system)
  6. Does not reward teams for simply having elite goaltenders/shooters
  7. Shooting percentage inflation is not actually that impactful on overall results. The difference between an 11% shooting percentage (Vancouver) and 9% shooting percentage  (league-worst Chicago) amounts to about 2 additional goals every 3 games. When your goal differential is insanely high (Vancouver), an elevated shooting percentage is not the reason why you're winning games. 

Guys like Thomas Drance speak out of both sides of their mouth. They like to preface PDO discussions by saying it is just a 'gut check' measure to indicate whether what they are seeing is in line with performance. However, he uses it to suggest that Canuck performance is not sustainable. It is sustainable if you actually look under the hood of what PDO is measuring and understand why (a) it might poorly measure team performance and (b) mathematically, a decline in PDO for the Canucks would not have impacted their current record that much because they are outscoring opponents by such a wide margin.

 

For the record, I do not think this team is a cup contender. However, there are a bunch of reasons to point to why this is the case, none of which are related to PDO. 

Link to comment
Share on other sites

7 hours ago, Down By the River said:

PDO, lol. 

 

Why is PDO a bad indicator of team performance?

  1. Does not capture quality of scoring chances
  2. Score effects. Related to the above point, teams that lead tend to take fewer shots, teams trailing tend to get desperate and take lots of shots from poor angles. 
  3. Fails to account for shot attempts
  4. If PDO does include 5-4 play, it penalizes teams who (a) have above-average PPs and/or (b) generate a lot of PP opportunities
  5. Does not account for better defensive structure that clogs up middle of ice (hallmark of Tocchet's system)
  6. Does not reward teams for simply having elite goaltenders/shooters
  7. Shooting percentage inflation is not actually that impactful on overall results. The difference between an 11% shooting percentage (Vancouver) and 9% shooting percentage  (league-worst Chicago) amounts to about 2 additional goals every 3 games. When your goal differential is insanely high (Vancouver), an elevated shooting percentage is not the reason why you're winning games. 

Guys like Thomas Drance speak out of both sides of their mouth. They like to preface PDO discussions by saying it is just a 'gut check' measure to indicate whether what they are seeing is in line with performance. However, he uses it to suggest that Canuck performance is not sustainable. It is sustainable if you actually look under the hood of what PDO is measuring and understand why (a) it might poorly measure team performance and (b) mathematically, a decline in PDO for the Canucks would not have impacted their current record that much because they are outscoring opponents by such a wide margin.

 

For the record, I do not think this team is a cup contender. However, there are a bunch of reasons to point to why this is the case, none of which are related to PDO. 

Honestly I never even heard about this PDO stuff before this season. Canucks start playing well and all of a sudden it's all people talk about now.

  • Vintage 1
Link to comment
Share on other sites

5 minutes ago, Mando27 said:

Honestly I never even heard about this PDO stuff before this season. Canucks start playing well and all of a sudden it's all people talk about now.

These fancy stats are for geeks, who couldn’t play, to feel like they know the game. One glaring glance from Zadorov and these fancy stats guys would be running to hide behind skirts. 
Hockey is a tough game that includes intimidating the opponent. The fancy stat guys seem to never accept (because they don’t understand) that nuance. Isn’t it kind of ironic how the players actually value plus/minus but the fancy stat guys don’t? Ask Quinn Hughes about plus/minus. The players value that stat more than those PDo and pretty charts. Plus stats are winners. Minus stats are losers. Very easy to understand and it actually reflects if a player is helping his club win. Fancy stats are for pick suckers. 
Yes, Alf is drunk. Of course! 

  • Like 1
  • Cheers 1
Link to comment
Share on other sites

Love hockey and stats are for sure part of it.   Advanced stats have been around for a couple decades now,  two clubs even hired guys who were in love with it to be their GMs (Chakya for sure and Dubas to a lesser degree).    ARI sucked then and well still kind of sucks today, with all those guys now well into their primes, either on that team, traded or retired/didn't work out. 

 

WAR.   Have brought this up before, when the media was making a divide, traditionalist's, folks that watch a ton of hockey, and can take a scouts card and fill it out (shot, skating, compete etc etc etc etc) by watching a game, and on the other side of the spectrum, math geeks who are figuring out stuff using new metrics, and rarely watch a game (there was a real rivalry going on about a player like Bolland, and how this WAR/advanced stat stuff, back then it was mostly about puck could suck so badly with some advanced lens stuff, but be a hero..as in eye test versus math test), .    CHI was in the midst of winning their second cup and a few guys, using Corsi's idea on how to and when to give a number one a break (counting all shot attempts not just shots because, well butterfly and that sort of thing require a goalie to cover the bottom of the net as soon as anyone comes in the zone), and WAR was gaining some traction.  

 

Thing is, WAR came out and said Alex Steen was the most complete, and best player in the NHL.    Alex Steen.  lol.    Better then Crosby, the Sedins, Thornton, Ovi, Kane, Toews, Kopitar,  and a dozen or so guys like Mike Richard's, Carter, Kesler etc etc etc.   

 

I'm not a math genuis, but am pretty good at arithmetic.    Maybe they've come a long way since then, but when I read that and some of their other claims, it just didn't match up the eye test, or the basic stat lines.  

 

Things like zone entries and exits, are kind of neat.    Green was recording this with his staff using pencils way before  it became a thing.   

 

But when it comes time to evaluate players, the basic stats still are king.   Especially TOI.  It shows how much the coach trusted that player, and quality of competition.   To a degree where a coach puts you as well.   It's well known the Sedins got to start with the puck 70% plus of the time in their own zone, after a stoppage.   Guys like Sutter 25%.   Their job is to get the puck back down the ice etc.     You always need to consider the quality of competition.  TOI is a quick way to understand, is he middle six, or top six.   Does he PK.  Is he on the first PP unit etc.    What is his EV production like?   And what is his plus minus, compared to his peer group? 

 

A good example of that, the best line we've had since EP came in, was Hogs EP Podz for pure goals going in or not going in.   They plays 50-60 minutes together, EP was in a funk and demoted to the third line, played 3rd line 5 x 5 against other teams bottom six.   Guess what happened?   They didn't get scored on.    And scored a few goals.   Doesn't make them the best line though...but from that lens, it was our best line Green/Bruce season.  

 

Im with the OP.   Advanced stats, don't always add value.   And some advanced stat darlings, aren't nearly as good on the WAR cards, as they should be on the ice as a result.    Funny how WAR cards all go up once playing with better players right?   Wonder how Myers is doing...So far, 3 - that's 3 different coaches have played him in the top four,  sometimes as the top pairing.   That's because he's better than Stetcher or Burroughs.   Doesn't take a genuis to figure that out.  Kuzmenko makes almost what he does, yet is benched.   I'm sure, Myers also would have been benched, if the coaches had 3 top four RHD's to work with at times too. 

Edited by IBatch
  • Like 1
  • Vintage 1
Link to comment
Share on other sites

1 hour ago, IBatch said:

Love hockey and stats are for sure part of it.   Advanced stats have been around for a couple decades now,  two clubs even hired guys who were in love with it to be their GMs (Chakya for sure and Dubas to a lesser degree).    ARI sucked then and well still kind of sucks today, with all those guys now well into their primes, either on that team, traded or retired/didn't work out. 

 

WAR.   Have brought this up before, when the media was making a divide, traditionalist's, folks that watch a ton of hockey, and can take a scouts card and fill it out (shot, skating, compete etc etc etc etc) by watching a game, and on the other side of the spectrum, math geeks who are figuring out stuff using new metrics, and rarely watch a game (there was a real rivalry going on about a player like Bolland, and how this WAR/advanced stat stuff, back then it was mostly about puck could suck so badly with some advanced lens stuff, but be a hero..as in eye test versus math test), .    CHI was in the midst of winning their second cup and a few guys, using Corsi's idea on how to and when to give a number one a break (counting all shot attempts not just shots because, well butterfly and that sort of thing require a goalie to cover the bottom of the net as soon as anyone comes in the zone), and WAR was gaining some traction.  

 

Thing is, WAR came out and said Alex Steen was the most complete, and best player in the NHL.    Alex Steen.  lol.    Better then Crosby, the Sedins, Thornton, Ovi, Kane, Toews, Kopitar,  and a dozen or so guys like Mike Richard's, Carter, Kesler etc etc etc.   

 

I'm not a math genuis, but am pretty good at arithmetic.    Maybe they've come a long way since then, but when I read that and some of their other claims, it just didn't match up the eye test, or the basic stat lines.  

 

Things like zone entries and exits, are kind of neat.    Green was recording this with his staff using pencils way before  it became a thing.   

 

But when it comes time to evaluate players, the basic stats still are king.   Especially TOI.  It shows how much the coach trusted that player, and quality of competition.   To a degree where a coach puts you as well.   It's well known the Sedins got to start with the puck 70% plus of the time in their own zone, after a stoppage.   Guys like Sutter 25%.   Their job is to get the puck back down the ice etc.     You always need to consider the quality of competition.  TOI is a quick way to understand, is he middle six, or top six.   Does he PK.  Is he on the first PP unit etc.    What is his EV production like?   And what is his plus minus, compared to his peer group? 

 

A good example of that, the best line we've had since EP came in, was Hogs EP Podz for pure goals going in or not going in.   They plays 50-60 minutes together, EP was in a funk and demoted to the third line, played 3rd line 5 x 5 against other teams bottom six.   Guess what happened?   They didn't get scored on.    And scored a few goals.   Doesn't make them the best line though...but from that lens, it was our best line Green/Bruce season.  

 

Im with the OP.   Advanced stats, don't always add value.   And some advanced stat darlings, aren't nearly as good on the WAR cards, as they should be on the ice as a result.    Funny how WAR cards all go up once playing with better players right?   Wonder how Myers is doing...So far, 3 - that's 3 different coaches have played him in the top four,  sometimes as the top pairing.   That's because he's better than Stetcher or Burroughs.   Doesn't take a genuis to figure that out.  Kuzmenko makes almost what he does, yet is benched.   I'm sure, Myers also would have been benched, if the coaches had 3 top four RHD's to work with at times too. 

 

 

Yeah but how hot was Alex Steen's girlfriend?

 

 

Link to comment
Share on other sites

13 hours ago, Mando27 said:

Honestly I never even heard about this PDO stuff before this season. Canucks start playing well and all of a sudden it's all people talk about now.

PDO (puck luck as they basically want to call it...and well in real life, just like hockey, you make your own luck!) has been around for quite awhile.    It's not anything to be too concerned about.    Funny how Wayne Gretzky's shooting percentage gets de-bunked by folks who never watched him play.   All he did his entire youth was pick corners, and fake our goalies with his shoulders and fake shots - then opened them up and made them look stupid.     Kuzmenko's however, was way too high given what he accomplished in the KHL.    That's the difference between using stats properly.    A lot of folks expected a regression.    Some fans on the CDC a wanted to lock Kuzmennko up for 6-8 years.   Good thing Allvin is the GM.     To get a baseline you need a sample size.     The "people" talking about PDO, are doing their best to invalidate the teams accomplishments.    The Vancouver media including the goofs that are podcasting or whatever, need to start validating a little more.

 

At least we have a coach that keeps it real.   

  • Like 1
  • Cheers 1
Link to comment
Share on other sites

Congratulations, you guys managed to bring back the stats headaches I used to get when I had to take those classes in school.

 

I always found stats interesting but never enough to really understand why some people loved those classes. I was reading the original post and trying to think of a meaningful way to contribute to the discussion but ended up with, better to keep one’s mouth shut and be thought a fool than open it and remove all doubt…

 

I do wonder though if after a significant amount of time wearing the on ice trackers the players are now wearing, say 10 years or more, if the amount of raw data will be available to really analyze some of the things people are discussing. Like will the tracking data that will be available remove some of the observation bias they’ve had to rely on over the years? 
 

And that’s enough stats for me for another 5 years. Back to the eye test…

  • Like 1
Link to comment
Share on other sites

11 minutes ago, Flipper said:

Congratulations, you guys managed to bring back the stats headaches I used to get when I had to take those classes in school.

 

I always found stats interesting but never enough to really understand why some people loved those classes. I was reading the original post and trying to think of a meaningful way to contribute to the discussion but ended up with, better to keep one’s mouth shut and be thought a fool than open it and remove all doubt…

 

I do wonder though if after a significant amount of time wearing the on ice trackers the players are now wearing, say 10 years or more, if the amount of raw data will be available to really analyze some of the things people are discussing. Like will the tracking data that will be available remove some of the observation bias they’ve had to rely on over the years? 
 

And that’s enough stats for me for another 5 years. Back to the eye test…

Observation bias? We a fans, so of course there will be bias.

And stats are fine but need to be connected to the actual game by those who played and understand the game. Otherwise stats are just numbers and lack meaningful context. 
 

Link to comment
Share on other sites

7 minutes ago, Alflives said:

Observation bias? We a fans, so of course there will be bias.

And stats are fine but need to be connected to the actual game by those who played and understand the game. Otherwise stats are just numbers and lack meaningful context. 
 

 

Anything beyond the most basic stats also have bias, its called assumptions. 

  • ThereItIs 1
Link to comment
Share on other sites

21 minutes ago, Flipper said:

Congratulations, you guys managed to bring back the stats headaches I used to get when I had to take those classes in school.

 

I always found stats interesting but never enough to really understand why some people loved those classes. I was reading the original post and trying to think of a meaningful way to contribute to the discussion but ended up with, better to keep one’s mouth shut and be thought a fool than open it and remove all doubt…

 

I do wonder though if after a significant amount of time wearing the on ice trackers the players are now wearing, say 10 years or more, if the amount of raw data will be available to really analyze some of the things people are discussing. Like will the tracking data that will be available remove some of the observation bias they’ve had to rely on over the years? 
 

And that’s enough stats for me for another 5 years. Back to the eye test…

 

In other fields, we've known for nearly a century that unstructured judgement leads to poor decision-making. People can talk about the eye test as a way to debunk stats, which is fine/fair, but the eye test alone can also be pretty deceiving, especially if there are no checks/balances. To me, not talking about hockey here, but if you want to be able to get closer to the "truth", quantitative research methods are absolutely necessary. 

  • Cheers 1
Link to comment
Share on other sites

2 minutes ago, Bob Long said:

 

Anything beyond the most basic stats also have bias, its called assumptions. 

Statistical assumptions are not the same as bias. Quite the opposite. Assumptions of ordinary least squares regression are there to ensure that bias does not occur, or, at the very least, does not impact your analysis in ways that create bias. 

Link to comment
Share on other sites

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Guest
Reply to this topic...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...