Jump to content
Twins Daily
  • Create Account
  • Twins News & Analysis

    Scouting the Mental Game: What Are Current Analytics Missing?


    Hans Birkeland

    We should be past the point of treating analytics in baseball like a boogeyman. At the same time, the curiosity that drives this pursuit should recognize its current limitations.

    Image courtesy of Peter Aiken–USA TODAY Sports

    Twins Video

    "Scouting the stat line" is a pejorative nowadays, meaning you evaluate a player strictly on their baseball-reference page. But not too long ago, it was a pretty smart thing for a front office to do.

    In 2002, all you had to do was look at David Justice’s on-base percentage to determine that he might have something left in the tank late in his career. A decade or so later, we started caring less about raw rate stats and focusing more on who was the least lucky player, using expected stats. 

    xFIP and xSLG came into fashion as ways to measure what a player should have done given a luck-neutral environment. Then came batted ball data, and everyone rushed to figure out who hit the ball the hardest and whose slider had the most spin. If a player’s performance was subpar but the ball data told a different story, you may have a bargain on your hands. 

    We now know that someone with a good OBP can be pretty useless if they lack power or defensive value. We also, arguably, are starting to realize that expected rate stats, while useful in predicting who might be good next year, don't offer much for tomorrow’s game. 

     

    The Limitations of Current Baseball Analytics
    Batted ball data, while useful, offers even less in player evaluation, because it doesn’t truly factor in, for instance, a feel for hitting. Luis Arraez has looked terrible under the lens of batted ball data, but would the Marlins prefer anyone else at the plate when all you need is a base hit to swing the game in the late innings? 

    His mindset as a hitter is elite, and evaluating that could usher in a new wave of analysis.

    I always go back to the debate over clutch stats. It’s true that over a big enough sample, players will hit and pitch at the same level in clutch situations as they do normally. But I like to imagine myself, in eighth-grade Babe Ruth League, up at the plate with the game on the line. 

    I hit scared. Not of the ball, but of striking out, especially when the team needed me to perform. I would flail at everything and could usually put a ball in play, weakly, given that I had three shots at it. I was also kind of fast, the opposing defense was usually poor and I probably went one-for-four in close and late situations. 

    My approach was terrible. My thought process was terrible. The moment felt too big for me. I made bad swing decisions. My season average? .250. The stats didn’t say I was a bad clutch hitter, but in my mind, I knew that I was.

    How do you think David Ortiz in his Red Sox heyday felt with the game on the line? He knew the opposing pitcher’s tendencies and their plan of attack. He knew what pitches he did well against. He’d done it before. A mixture of confidence and savvy is a great combination. 

    I don’t even doubt that we can find a way to measure both of those variables. We’ll start park-adjusting it and call it SAVCON+. It’ll weigh chase percentage, exit velocity in close and late situations, number of pitchers/hitters faced, number of teammates weddings he attended on a rate basis, and mold it all into a two-digit percentile in which 100 is average. 

    But tongue in cheek or not, the deeper we go into which stats can predict with higher probability what a player will be going forward, and in certain situations, the more players will succumb to their own analysis. Because once guys start thinking about why their expected performance is so good, the confidence game resets. 

    Why a Results-Driven Mindset Matters
    Once a player hears about a stat and which extreme he falls on, the possibility arises where he says to himself, “I’ve been a groomsman for six different guys on this team and was an usher in four,” or in more realistic terms, “My exit velocity is good, so I don’t need to make any adjustments.” He’s broken the glass.

    Showing somebody an impression you do of them has the same effect. The joke is less funny, more mean, and you probably won’t do it as much. Then trying to perform it under pressure is a recipe for disaster.

    This ties into the cat-and-mouse game between the pitcher and hitter. If either side is in his own head thinking about how he needs to make swing or pitch decisions based on attaining better expected or batted ball stats, the opponent has a huge advantage. 

    Every year, we hear about how Max Kepler is going to adjust and start posting a higher average on balls in play, but the self-described tinkerer hasn’t been valuable as a hitter outside of 2019, when Rawlings was producing superballs. 

    Sometimes Kepler tries to lift the ball more, sometimes he tries to go the other way more. He’s a pretty smart, introspective guy, but he lets pitchers not named Trevor Bauer carve him up on a routine basis. For all we know, the pitchers are Lisa Simpson playing rock paper scissors with Bart, thinking to themselves, “Poor predictable Max, always picks rock.” To which Max thinks, “Good ol’ rock, nothing beats that!”

    Measuring the Mental Game
    Despite our efforts, we can’t know the truth of what goes on in players’ heads, but it's clear some guys are better at the mental game than others. You can call this sought-after ability character, intangibles, baseball IQ, BALLPAYER™, or Derek Jeter-like. An overemphasis on it by front offices leads to the type of teams that got lapped in the mid-2000s by teams with a basic analytical understanding. An under-emphasis can lead to performance that falls well below a team’s talent level, like the 2022 White Sox (although injuries and Tony La Russa played a big part, as well).

    Do the Twins have these kinds of guys?

    I have no idea. I rely on beat reporters and league insiders to provide me the occasional morsel of information on a bi-annual basis, such as the revelation that Tyler Mahle didn’t do shoulder-strengthening exercises last year after his injury and then went straight to Driveline after the season concluded to work on his slider. Otherwise, all I have is the eye test. 

    Joe Ryan looked confident last year, but he definitely seemed to lose a lot of battles, especially against tougher opponents. Did he rely on his fastball too much deep in counts? Will he trust his new offspeed pitches enough to go to them in more high-leverage situations? 

    Jose Miranda looked pretty excitable at the plate and in the field in his rookie year but also had a lot of big hits. Can he be more selective and let the game come to him in his follow-up campaign? (The opening series was a promising indicator, with Miranda drawing three walks in three games.)

     

    Is Gilberto Celestino a mistake-prone head case or was he a young player pushed into action before he was ready and trying too hard to keep his roster spot?

    It’s the front office’s job to figure out these answers, and if the conclusion is a clear negative, it behooves them to try and trade the player to a team that overvalues his statistical output. That could be the new market inefficiency for teams to capitalize on. 

    In Moneyball, old-timey scouts were mocked by Billy Beane for saying guys “looked like a ballplayer,” or that they might not be confident players based on the women they dated. That kind of analysis is still pretty gauche, but in a world where everything a player does has a metric assigned to it, and every team has access to all of that data beyond even what the public does, perhaps effectively evaluating a player’s mind is a way to win Moneyball 3.0.

    Follow Twins Daily For Minnesota Twins News & Analysis

    Recent Twins Articles

    Recent Twins Videos


    User Feedback

    Recommended Comments

    Featured Comments

    This ties in nicely with the Correa leadership thread.  A ton of this stuff is very hard, if not impossible to quantify in a data driven way.  What analytics would predict that Kepler would own prime Bauer?

    I think the analytics, as a term, is badly misused in baseball.  Its talked about like moneyball was the first time analytics were used.  In reality, analytics have been part of the game from the beginning, only the data used and available has changed.  Noted analytical manager Earl Weaver said something like, pitching, defense and the 3 run homer, which is very oddly mostly correct still to this day.  He didn't have a computer running the data, he was probably mostly doing it in his head but was performing the same processes with different data.

    Now that all of us slappys have access spin rates and tendencies and tons of other data we don't need the key to modern analytics comes back to knowing which data to use at the correct time.  Does L/R override batter v hitter historical or vs versa?  It was an interesting call yesterday when Rocco left Gallo in to face a tough lefty in Garrett, who he had never faced before.  Probably thought it was too early in the game and leading off the inning but no one would have raised a single question if Solano hit there.  We all saw how that worked out.  By leaving him in at that point he got another at bat against a righty later with runners on.  The computer can tell you the moves to make but it can't always tell you when to make them or not depending on several of the factors mentioned in the article.

    Now that the extra inning runner rule is permanent, I'm hopeful they will spend some analytical capital on this as a situation.  My hunch is that there will be a few more bunts or situational hitting for WPA moment of 1.00.

    I also misspelled anylitics every time I typed it in this post.  Use that data point as you see fit. 

    I think OBP is still a very important stat. Give a few guys with high OBP and a couple with power to drive them in. The more they get on base, the better chance you have of scoring. After all, its mighty hard to win if you don't score!

    I'm all for correlating any information that makes a team better.  "The good face"?  "Looks like a young Barry Bonds to me"? An analytics team should at least look at what goes into a statement like that from a veteran scout, and see if either they can break it down to something with predictive power, or can be validated as meaningful and not simply something an old guy says about every hitter of a certain build who happens to hit the ball over the fence the day he's watching.

    Inside the player's head?  I'm optimistic that sound conclusions can be drawn.  But those will come from trained professionals who know what they are looking for and then can specify it in a way others can recognize - if it were easier than that then it would already be done.

    I take a very broad view of analytics, and a table that has a column containing simply "yes" or "no" for a given player, or "low", "medium" "high", and so forth, can be of very high value to a front office making decisions.  It's all about breaking down intractable problems into smaller ones that can be more easily tackled.  A never ending process, and I disagree strongly with Manfred that it's in any way harmful.  It's what problem solvers (managers, team executives, whoever) do.

    43 minutes ago, Jocko87 said:

    I think the analytics, as a term, is badly misused in baseball.  Its talked about like moneyball was the first time analytics were used.  In reality, analytics have been part of the game from the beginning, only the data used and available has changed. 

    Bingo.  They used to give the batting title to the player who had the most hits in a season.  Then someone stopped to wonder about frequency, and batting average was born, and I imagine some diehards took it hard.

    6 minutes ago, ashbury said:

    A never ending process, and I disagree strongly with Manfred that it's in any way harmful.  It's what problem solvers (managers, team executives, whoever) do.

    Not necessarily harmful but the rule changes with the clock, batter minimum etc were largely implemented to counter the effects of hyper analytics. Paralysis by analysis was slowing down the game.  As you say, they are solving that problem and evolution continues. 

    24 minutes ago, ashbury said:

    A never ending process, and I disagree strongly with Manfred that it's in any way harmful.  

    I think the modern sabermetrics push has resulted in things like heavy shifts, multiple pitching changes dragging out games, early hooks and prioritizing pitcher velocity over health/longevity resulting in the game's best pitchers throwing less, reduction in strategic plays like stolen bases and bunting, and the trend toward 3 outcome at bets.  All of which has been dramatically worse for the entertainment value of the game in my opinion.  And it's not just me - I read an article that MLB lost HALF of its World Series viewership in '22 vs '16.  Attendance is also down almost across the board. So yeah I think these things matter. 

    Cannot freakin believe I just sided with Manfred on anything, I'm going to go shower now :)

    I think the main reason fans don't like analytics as a tool for building rosters is that they're generally used for bargain hunting and fans don't like bargain hunting. You don't need analytics to tell you that Aaron Judge is great, and fans would much rather see their team just be the highest bidder for Aaron Judge than get a bargain on a Joey Gallo bounceback that someone's computer said might happen.

    In my (former) world, as a band director, we were forced to find ways for data to drive everything we did.  That's a fine idea when the question is "How many math problems can Johnny do correctly on this test?" but not always useful when describing someone's consistency in producing a quality product in music or art.  So, as fine arts teachers, we are faced with using anecdotal data, meaning what and at what level do we observe something and and do we observe it frequently. As the teacher/manager/director/coach you develop a "feel" for how someone can perform something.  I don't think that's better (or worse) than a good healthy statistical analysis.  There are things that anecdotal data describes very well and things that it does not.  In fact, I think that the best knowledge comes from having both statistical data and anecdotal data, but that's not always easy or even possible.   In the case of the mental part of the game, anecdotal data may be the best (if imprecise and inconsistent) way of measuring it. 

    1 hour ago, Woof Bronzer said:

    I think the modern sabermetrics push has resulted in things like heavy shifts, multiple pitching changes dragging out games, early hooks and prioritizing pitcher velocity over health/longevity resulting in the game's best pitchers throwing less, reduction in strategic plays like stolen bases and bunting, and the trend toward 3 outcome at bets.  All of which has been dramatically worse for the entertainment value of the game in my opinion.  And it's not just me - I read an article that MLB lost HALF of its World Series viewership in '22 vs '16.  Attendance is also down almost across the board. So yeah I think these things matter. 

    Cannot freakin believe I just sided with Manfred on anything, I'm going to go shower now :)

    There's no question that today's analytics* has figured out some new things.  But just as infielders long ago figured out how to get cheap double plays on popups, leading to the Infield Fly Rule, the solution is always to adjust the rules and/or playing conditions if it's ever felt to have gotten out of balance.  If batters and pitchers and fielders figure out methods to increase the chance of winning that also makes the game boring, change the conditions they're operating under.  And it doesn't have to be drastic in most cases.  If an absolute newcomer watches a game today and asks why the shortstop is positioned where he is rather than a few feet even farther over, you can tell them the rule and they'll go "oh, okay."  Soften the baseball, lower the pitcher's mound or move it backward to 62 feet, change the strike zone slightly - things like this have happened through the years and can be done again, and years later you don't even remember that it was ever any different.

     

    * Longtime SABR member and yet I more or less hate the term sabrmetrics.  SABR's about the study of baseball, the majority of its efforts being in research to establish the historical record or correct it, and Bill James's co-option of the organization's name was short-sighted.  In business they call the same thing analytics, so I go with that.

    1 hour ago, Jocko87 said:

    Not necessarily harmful but the rule changes with the clock, batter minimum etc were largely implemented to counter the effects of hyper analytics. Paralysis by analysis was slowing down the game.  As you say, they are solving that problem and evolution continues. 

    I started reading your post thinking we were in disagreement but with your closing sentence I see us as aligned.  I don't want to see the game changed to have 3 or 5 bases, or make it 2 strikes and yer out, and my fantasy that out of the park home runs (usually borrrrring!) should be foul balls had its last real chance of happening before the turn of the 20th century*.  No need for something as annoying as the Offside rule in soccer which takes away the thrill just as something interesting starts to happen.**  But less radically than that, I think the problems plaguing the game of baseball (Three True Outcomes being IMO the worst) can be dealt with gently and subtly.

    I'll quibble that I don't see the direct connection between the new pitch clock and analytics.  Batters and pitchers alike recognized (and still do) that their best performance comes when they take time to collect themselves.  No need for spin rates and exit velocities to accept that, and to then take steps to limit both competitors equally.

    * USAFChief is probably the last one among us who could have lobbied for it, so I blame him for short-sightedness

    ** In both futbol and American football, when an attacker is midway through an exciting long play, you are looking with one eye at the player and with the other eye for indications from the ref that it's all coming back.

    1 hour ago, ashbury said:

    I started reading your post thinking we were in disagreement but with your closing sentence I see us as aligned.  I don't want to see the game changed to have 3 or 5 bases, or make it 2 strikes and yer out, and my fantasy that out of the park home runs (usually borrrrring!) should be foul balls had its last real chance of happening before the turn of the 20th century*.  No need for something as annoying as the Offside rule in soccer which takes away the thrill just as something interesting starts to happen.**  But less radically than that, I think the problems plaguing the game of baseball (Three True Outcomes being IMO the worst) can be dealt with gently and subtly.

    I'll quibble that I don't see the direct connection between the new pitch clock and analytics.  Batters and pitchers alike recognized (and still do) that their best performance comes when they take time to collect themselves.  No need for spin rates and exit velocities to accept that, and to then take steps to limit both competitors equally.

    * USAFChief is probably the last one among us who could have lobbied for it, so I blame him for short-sightedness

    ** In both futbol and American football, when an attacker is midway through an exciting long play, you are looking with one eye at the player and with the other eye for indications from the ref that it's all coming back.

    I think the analytics leading to increases in three true outcome results played a large role in the pitch clock. Yes, all the walking around the mound, or adjusting batting gloves, or whatever have grown as well, but the games were starting to last longer and longer as teams began building teams based around hitters working counts (also a strategy to get starters out earlier), and pitchers striking people out. It all kind of piled on and created a game that was lengthening. To your point about changing the number of bases or balls and strikes, it wasn't going to be easy to stop hitters from working counts or convince guys striking out wasn't that big of a deal so they had to go with the clock. So I think there was some analytics driven game length stuff on top of the just super slow pace of some guys.

    Analytics was a big deal when only a few teams could do it.  Now everyone has a computer full of statheads and thus it is an almost even ballgame which then comes down to the mental aspects of the game in each players hear, attitude (I too would choke up in the crucial moments) and natural aspects like speed and coordination.  

    I still like old fashioned ball players taking advantage of new information and not managers who follow the data only. 

    14 hours ago, Woof Bronzer said:

    All of which has been dramatically worse for the entertainment value of the game in my opinion.  And it's not just me - I read an article that MLB lost HALF of its World Series viewership in '22 vs '16.  Attendance is also down almost across the board. So yeah I think these things matter.

    Blame the blackouts, inability to view the game and complete and utter failure to market the game on that. Plus the league is freaking draconian when it comes to highlights being posted on alternate sites like Twitter/Facebook/even Tik Tok I'd bet.

    New thing is tried and it works. Advantage acquired. 

    Eventually one by one competitors copy new thing until all are doing it. 

    Advantage negated.

    New thing is tried and it works. Advantage acquired

    Eventually one by one competitors copy new thing until all are doing it. 

    Advantage negated. 

    New thing is tried and it works. Advantage acquired

    Eventually one by one competitors copy new thing until all are doing it. 

    Advantage negated. 

     

     

    I like the article.  I have long argued that relying 100% on analytics is a poor way to predict prospects.  Yes, relying on things like he looks funny so I do not like him, is poor way too.  However, I feel many are relying on analytics.  For some, it will tell some stories, but the eye test still should be used.  The writer points out how for years people have talked about how Kepler is just unlucky and one day it will balance out, but eventually, you just need to decide it is not just luck.  The write points out how Arraez was always attacked for his lack of power and eventually that will catch up to him.  However, he still keeps on getting base hits.  Sure, he is not going to hit HR and drive in a ton of runs, but he still provides good value.  

    One thing analytics never will do is truly predict future outcomes.  It will not predict if a player makes adjustments to fix issues.  All it does is tries to see if there is expected change in the outcomes that has happened, assuming everything being the same down the road.  You assume the player will continue to everything the same, or the pitcher will and no adjustments will be made.  Even then, it assumes the player is like everyone else.  Take Arraez for instance.  The assumption of his regression is that he would not keep his high BABIP because his hard hit rate was not high enough to track with history.  However, that assumes he is not good at trying to direct the ball in the area where there is no defense.  I do not care how hard you hit the ball, if you hit it where they are not, you will get a hit. If you can put the ball where you want, because you are that great, you will get hits even on weak hits. 

    Thanks Hans, you're right on. I missed your article & finally got to read it. I often get hit by fellow fans arguing stats, stats, stats. Most stats are history & are often spun. What matters more is what are the underlying conditions (positive or negative) which might or might not exists now, that drives those stats.

    For example I don't put a lot of stock in '20 & '19 into evaluations which a lot of people do. There are a lot of underlying conditions that some have been expressed needs to be figured in to the equation. Like intangibles, a player putting it together or player on a bad or good team when  leaves his production changes.



    Create an account or sign in to comment

    You need to be a member in order to leave a comment

    Create an account

    Sign up for a new account in our community. It's easy!

    Register a new account

    Sign in

    Already have an account? Sign in here.

    Sign In Now

×
×
  • Create New...