There is a magic to the number. It is nice and round, a multiple of 100, which itself may have a claim as the roundest of the round. It is a perfect square. And it is sufficiently high to seem unattainable. No player has had a batting average of 400 points—that’s 400 tenths of a percent, or, if you speak baseball, simply .400 with a silent decimal point—over the course of an entire season since Ted Williams did it at age 23 in 1941. Every year or two, someone carries an average in the .390s into July, and we spend the next couple of months discussing whether the magic number is in reach. They never make it.
In 1986, biologist Stephen Jay Gould published a seminal article about .400 hitting that explains its disappearance as the inevitable result of the maturation of baseball play. As the average performances for batter, pitcher, and fielder move closer to their natural upper limits, he argues, the highs and lows tighten around the mean. Standard deviation declines over time. In his words:
Variation in batting averages must decrease as improving play eliminates the rough edges that great players could exploit, and as average performance moves toward the limits of human possibility. … Declining variation arises as a general property of systems that stabilize and improve while maintaining constant rules of performance through time. The extinction of .400 hitting is, paradoxically, a mark of increasingly better play. (Stephen Jay Gould, “Entropic Homogeneity Isn’t Why No One Hits .400 Any More,” Discover, August 1986.)
It is now a quarter century later, nobody has hit .400, and, presumably, the extremes have tightened still further around the average. That average has not changed, with the AL and NL hitting a combined .262 in 2009, almost exactly the historical average. We still have .400 chases—this past season Joe Mauer had a .392 average on July 1, and Chipper Jones’s average on July 1 was .393 the previous year. But those chases inevitably end with averages slowly sagging to more common levels, with Mauer finishing 2009 at .365 and Jones wrapping up 2008 with a .364 average. Can it be true that we will never again see a .400 hitter? Is it time to stop wasting ink every summer on whether the latest hot-handed player can make it to the mark?
To answer that question, and the question of who might do it, I will look at the anatomy of the theoretical modern .400 hitter. Joe Posnanski took this on early last season. His inspiration came while watching Jack Cust, a man who strikes out at an astonishing rate and who, among above average hitters, is possibly the least likely to hit .400. Posnanski compares past .400 hitters with great hitters of the present. The big difference, he finds, is that players struck out far less in the past. If current players could keep everything else equal and simply strike out less, he argues, we would see a .400 hitter again.
The key to this observation lies in Batting Average on Balls in Play, or BABIP. A player’s BABIP is his average on balls hit into the field of play, which includes every at bat that does not end in a home run or a strikeout. We calculate BABIP by subtracting home runs from hits, and dividing by at bats minus home runs and strikeouts. Some calculations, such as those at Wikipedia and The Hardball Times, add sacrifice flies to the denominator. It is also possible to add bases reached on error to the numerator. However, the simplest formula omits both of these factors. We will use this simple definition, which makes sense here, because sacrifice flies are omitted from batting average, and errors already count against it—if we assume official scorers are competent, errors would be outs if repeated.
| BABIP = | Hits – Home Runs |
| At Bats – (Home Runs + Strikeouts) |
Overall, balls in play turn into hits somewhere around 30 percent of the time. Batters seem to have some control over this, with good batters often able to hit .330 or higher on their balls in play. There is an enormous amount of variation. A player’s best and worst BABIP seasons may be as many as 50 to 60 points away from his career average.
Over the past 50 years, in seasons of at least 502 plate appearances (the number needed to qualify for the batting title) the highest BABIP marks have been .411, by Rod Carew in 1977; .408, by Manny Ramirez in 2000; .406, by Jose Hernandez in 2002; and .405, by Andres Galarraga in 1993 and Roberto Clemente in 1967. Ichiro Suzuki, Derek Jeter, and David Wright are the only others to have reached a .400 BABIP. Thus, in 50 years, a batter has had at least a .400 BABIP while qualifying for the batting title only eight times. I looked at players with high career BABIPs and at least 1,800 plate appearances, or about 3 seasons worth of playing time—players who might be within striking distance of .400 on their balls in play—and noticed one common characteristic.
| Table 1: Top Career BABIP (Min. 1,800 PA) | ||
| Player | Career BABIP | Career K% |
|---|---|---|
| Matt Kemp | .366 | 24.8 % |
| Derek Jeter | .362 | 16.9 % |
| Ichiro Suzuki | .359 | 9.8 % |
| Matt Holliday | .354 | 18.7 % |
| Hanley Ramirez | .354 | 18.4 % |
| Bobby Abreu | .351 | 21.5 % |
| David Wright | .350 | 20.1 % |
| Miguel Cabrera | .350 | 21.0 % |
| Joe Mauer | .349 | 11.5 % |
| B.J. Upton | .345 | 27.7 % |
With a couple of exceptions, these high BABIP guys strike out a lot. As Posnanski writes, if they could strike out less, there is a good chance someone might get lucky on all of those balls in play and reach a .400 overall batting average. But saying “if he just struck out less he would hit .400″ is kind of like saying “if Jason Kendall just hit more home runs he would be a good batter.” A batter’s strikeout rate is an integral part of his hitting skill set; a player may have no more control over his strikeout rate than he does over his capacity for power. When players do consciously try to cut back their strikeouts, the results are not encouraging. Jack Cust, a natural strikeout artist who provides value with his power and on-base skills, tried to reduce his strikeouts in 2009. Both his power and his walks suffered.
Ichiro and Mauer are the two exceptions to the strikeout-happy norm of the career BABIP leaders. It makes sense that they would have a very good chance at .400, because when trying to hit for a high average, strikeouts are bad. It helps to have less of them. A strikeout counts as an at bat, thus factoring into batting average, but it is an automatic out. However, a strikeout is not the only type of at bat that does not end up in play. Home runs also do not factor into BABIP, and home runs are always hits. Just as a player cannot be lucky and get a hit on a strikeout, he also cannot be unlucky and not get a hit on a home run. Luck is for balls in play only.
So strikeouts are bad, but how bad are they? How do we quantify that? We cannot know until we look at the other rate that factors into batting average, a rate I will call BABnIP, for Batting Average on Balls not in Play. Again, two types of at bats do not factor into BABIP: home runs and strikeouts. To determine a player’s BABnIP, we will simply divide home runs, the hits, by home runs plus strikeouts, all balls not in play.
| BABnIP = | Home Runs |
| Home Runs + Strikeouts |
To find a player who has what it takes to hit .400, we need to look for someone who has a sufficiently high BABnIP that he could get there with reasonable luck on balls in play. That is, a player who not only does not strike out much, but who also hits for power. Here is a list of the highest single-season BABnIPs by currently active players from the past 10 seasons, with the BABIP the player would have needed to reach a .400 overall batting average.
| Table 2: Top BABnIP Seasons, Last 10 Years (Active Players) | |||||
| Player (Year) | AB | HR | K | BABnIP | BABIP for .400 |
|---|---|---|---|---|---|
| Albert Pujols (2006) | 535 | 49 | 50 | .495 | .378 |
| Albert Pujols (2004) | 592 | 46 | 52 | .469 | .386 |
| Albert Pujols (2009) | 568 | 47 | 64 | .423 | .394 |
| Gary Sheffield (2003) | 576 | 39 | 55 | .415 | .397 |
| Todd Helton (2000) | 580 | 42 | 61 | .408 | .398 |
| Albert Pujols (2008) | 524 | 37 | 54 | .407 | .399 |
| Vlad Guerrero (2005) | 520 | 32 | 48 | .400 | .400 |
| Albert Pujols (2003) | 591 | 43 | 65 | .398 | .401 |
| Albert Pujols (2005) | 591 | 41 | 65 | .387 | .403 |
| Gary Sheffield (2000) | 501 | 43 | 71 | .378 | .407 |
Joe Mauer and Ichiro Suzuki do not make the list. This past season, Mauer would have needed a .419 BABIP to reach .400. If he keeps the power from his recent surge, he could get closer in the coming years. The closest Ichiro has ever been was in 2001, when he would have needed a .426 BABIP for .400. His career BABIP is a little higher than Mauer’s; if he got very lucky over a short season and beat that career mark by 60 points, he might have a chance. Sheffield, Helton, and Guerrero all have had good BABnIP seasons, but they all came in the first half of the decade, and all three are aging quickly.
Two other sluggers, Aramis Ramirez and Carlos Lee, just missed the cut for Table 2. Both have had strong BABnIP seasons. However, both suffer from .295 career BABIP marks, which effectively puts a .400 BABIP season out of reach. Bonds, whom I left out because it seems unlikely he will play again, had great BABnIP seasons toward the end of his career. In 2004, he hit 45 home runs against only 41 strikeouts. But he also did not have a good career BABIP mark, hitting only .288 on his balls in play.
Albert Pujols is the player who consistently puts himself in strong position with his many home runs and few strikeouts. His career BABIP is .321, not nearly as good as the players in Table 1, but within striking distance given the BABnIP numbers he routinely puts up. Will Pujols ever hit .400? Probably not. But if he maintains his outstanding power and contact skills, one of these years he will get lucky on his balls in play and will make a serious run at the mark.




Decent theory here, but you left out a huge factor in hitting…walks. Ted Williams walked 147 times in ‘41 and had an on base percentage of .551. The reason Ichiro doesn’t get close to hitting .400 is cuz he doesn’t walk much, not cuz he doesn’t hit for much power. The most walks he’s had in a season is 68 and that is about 20 higher than his average during his career. The more times you walk to fewer hits you need. He hit .372 one year. If he walked 100 times that year (51 more than he actually did and still 47 times less than Williams in ‘41) he would have hit just over .400 (assuming all the walks replaced outs). If he walked 147 times like Williams he would have hit way over .400 (even if all the walks didn’t replace outs, but instead replaced hits). So while you theory is pretty solid it is incomplete. Strikeouts are the reason guys like Cust and Dunn don’t hit for high averages, but walks are why Ichiro doesn’t.
Thank you very much for your thoughts.
I agree, walks are an important factor. However, I don’t think we can just replace outs with walks, because then we’re really getting into fantasyland. If we want to do the thought experiment, “What would happen if Ichiro walked more?” then I’d prefer to keep his HR, K, and BABIP rates constant and simply adjust his total at bats down, which would reflect more of his plate appearances ending in walks. I actually do something similar in part 2 of this series, which is here:
http://www.ballyourbase.com/posts/2010/03/how-to-hit-400-part-2
Also, no matter how much Ichiro walks, his batting average is still going to be fueled only by his BABIP and BABnIP (weighted appropriately). So, yes, power does matter. The best HR/K ratio he’s ever had was 15/66 in 2005. That’s a .185 average over 81 at bats, and that weighs him down big time. It means he has to hit for a very high average on his balls in play — well over .400 — to make up the difference. And while he has been amazing at getting hits on balls in play, he’d have to do something really unprecedented in the modern era, like hit for a .420 or .430 BABIP.
Ted Williams in 1941? Yes, he did walk a lot, but that served to limit his at bats to about 450 (Ichiro is usually near 700), rather than actually affecting his batting average. He hit over .400 with just a .378 BABIP mark, and he was able to do that because he hit 37 home runs and struck out only 27 times, giving him a .578 average over those 64 at bats. That’s a really great head start, and something Ichiro will never come close to.
The main point here was that while Pujols would still have to get incredibly lucky to hit for an overall .400 average, he wouldn’t have to do anything unprecedented. And that’s because he does so well on his HR/K ratio, approaching 1/1 in some seasons. So while I agree with you that walks do serve a purpose in limiting at bats, they’re not directly related to batting average.