DIPS

moonstone2
Veteran

Posts: 1,241

DIPS Jul 1, 2014 16:43:46 GMT -5

Quote

Post by moonstone2 on Jul 1, 2014 16:43:46 GMT -5

I'm saying that major-league-caliber pitchers, when facing major-league-caliber hitters, pitching in front of major-league-caliber defenses, and playing in major-league stadiums, generally have very little control over the rate at which their balls in play fall for hits. There is a voluminous amount of research on the subject, most of which strongly supports this conclusion.

If you actually read the links that you post you would realize that this is NOT true. Pitchers do indeed have significant controll over balls in play.

Tippett drew three main conclusions:

Pitchers do influence BABIP.

A pitcher shows statistically significant BABIP consistency across the length of his career.

Small influence over BABIP (much smaller than influence over strikeouts and walks) is still significant because such a large percentage of balls are put into play.

jmei
Global Moderator

Posts: 15,224

DIPS Jul 1, 2014 17:11:00 GMT -5

Quote

Post by jmei on Jul 1, 2014 17:11:00 GMT -5

If you actually read the research, you would realize that subsequent analysis has demonstrated that virtually all of that individual impact on BABIP is explained by some combination of the ability to generate popups, the ability to generate weak ground balls that extreme ground ball pitchers possess (think of this as the GB analogue of a popup), and the fact that pitchers with lower contact rates (and thus higher strikeout rates) can induce slightly lower BABIPs and HR/FBs.

Outside of a pitcher's batted ball profile (which definitely can affect BABIP in a significant way) and his strikeout rate (which has a small, independent effect on a pitcher's BABIP; note, however, the high strikeout pitchers generally also induce high popup rates, so the simple correlations in that link are slightly misleading), pitchers have virtually no impact on the rate at which their batted balls fall for hits.

I have mentioned and alluded to these caveats multiple times in this thread.

Last Edit: Jul 1, 2014 17:33:19 GMT -5 by jmei

jimed14 Veteran Posts: 25,830	DIPS Jul 1, 2014 17:16:27 GMT -5 Quote Select Post Deselect Post Link to Post Member Give Gift Back to Top Post by jimed14 on Jul 1, 2014 17:16:27 GMT -5 At the very minimum, we should be looking BABIP-IFFB instead of straight BABIP.
	“We just lost a World Series game in 18 innings. But after that [meeting], it didn’t feel like we lost. It felt like we won.”

Oregon Norm
Global Moderator

Senior Moderator

Scouting the Cactus League

Posts: 6,617
Member is Online

DIPS Jul 1, 2014 18:06:18 GMT -5

Quote

Post by Oregon Norm on Jul 1, 2014 18:06:18 GMT -5

BABIP is a random variable. For a given pitcher it has an expected mean, and variation around it. The mean can be different from pitcher to pitcher given certain tendencies, but it will still vary year to year.

All it takes to move a ball twenty feet to the left or right of an outfielder is a change of less than 6 degrees in the angle off the bat. For a line drive, that can be the difference between a nice catch and a ball hit in the gap for extra bases. Now ML batters are a talented bunch, but that sort of consistent at-bat after at-bat fine-grained control is asking a bit much. So a batter's BABIP will vary largely depending how their luck is running, and because of that a pitcher's will also, though probably to a lesser extent. These aren't hypotheses either. Studies have shown that variability clearly.

jimed14
Veteran

Posts: 25,830

DIPS Jul 1, 2014 18:43:57 GMT -5

Quote

Post by jimed14 on Jul 1, 2014 18:43:57 GMT -5

I don't think anyone can argue with that. But the variability of BABIP can either be luck (ball getting hit right at players) or pitching better (weak contact) than normal.

Simply dismissing great starts because of FIP, SIERRA and luck is not valid IMO. In the minors, that's scouting the box score.

Last Edit: Jul 1, 2014 18:48:32 GMT -5 by jimed14

“We just lost a World Series game in 18 innings. But after that [meeting], it didn’t feel like we lost. It felt like we won.”

chavopepe2
Veteran

Posts: 1,608

DIPS Jul 1, 2014 18:55:13 GMT -5 via Tapatalk dewey1972 likes this

Quote

Post by chavopepe2 on Jul 1, 2014 18:55:13 GMT -5

I think "dismissing" is too strong of a term. The mediocre peripherals lead me (and others) to be skeptical about the sustainability of the success. They were great starts. From a results perspective, he was terrific. But when you're looking at a 5-10 game sample with the peripherals he has had, it is perfectly reasonable to question whether we can really count on those great results to continue.

And thats all DIPS really is...

jmei
Global Moderator

Posts: 15,224

DIPS Jul 1, 2014 19:00:06 GMT -5

Quote

Post by jmei on Jul 1, 2014 19:00:06 GMT -5

Jul 1, 2014 18:43:57 GMT -5 jimed14 said:

Simply dismissing great starts because of FIP, SIERRA and luck is not valid IMO. In the minors, that's scouting the box score.

Maybe this is just semantic (what exactly constitutes "simply dismissing?"), but I wholeheartedly disagree. You should absolutely use peripherals to differentiate between performances, and if you're just looking at surface stats like hits, runs, ERA, OPS, etc., you're going to make a lot of poor evaluations. A guy who gives up three runs but has a 10:0 K/BB absolutely had a better start than a guy who have up two hits and no runs but has a 2:2 K/BB. Especially in the high minors, you can draw very valuable conclusions based on stats (especially peripherals), including conclusions that scouts may miss. This is particularly true for a forum like this one, because most of us do not have the opportunity or training to scout minor league games, but all of us can learn to analyze stat lines and make accurate predictions based on that analysis.

(Also, it's SIERA. One R.)

jimed14
Veteran

Posts: 25,830

DIPS Jul 1, 2014 19:29:08 GMT -5

Quote

Post by jimed14 on Jul 1, 2014 19:29:08 GMT -5

You were dismissing his last 5 starts as luck using FIP and SIERA even with peripherals such as 6/24 BB/K. I'm having trouble finding LD%, IFFB%, GB%, FB% on a game log basis, but he only had 2 FO last night vs. 5 GB. Sometimes, when a pitcher is pitching better than usual, it's not luck. Especially with prospects who are inconsistent. Sometimes, they're hitting their potential. Again, I don't know it's the case, but I'm way more uncertain of it than you seem to be thinking it's just luck and therefore unsustainable.

What it comes down to, way more than it being about Ranaudo is to be so quick to look at FIP every time a pitcher goes on a run or seems to be outperforming it. You did it in the Owens thread too when Eric was doing his #3 pitcher analysis. I'll argue that Owens' ERA will be continually be lower than his FIP while he's in the minors so there's no reason to look at it. Inducing weak contact is a skill.

Also, a question. Do you know if FIP or SIERA gives credit for IFFB? This is a mistake if it doesn't, IMO.

Last Edit: Jul 1, 2014 19:30:38 GMT -5 by jimed14

“We just lost a World Series game in 18 innings. But after that [meeting], it didn’t feel like we lost. It felt like we won.”

moonstone2
Veteran

Posts: 1,241

DIPS Jul 1, 2014 21:26:23 GMT -5

Quote

Post by moonstone2 on Jul 1, 2014 21:26:23 GMT -5

A couple of notes.

Since Voros McCracken published his original article DIPS theory is probably the most misused theory on the internet. Not only do several so called "sabermatricians", misapply the research, but McCracken's research wouldn't have passed the strict scruitiny required in other fields. McCracken used a low Rsquared and only used two seasons of research.

First off the idea of DIPS theory makes absolutely no sense. A batted ball in play is by far the most common end to a plate appearance by far. I would estimate that roughly 2/3 of plate appearances end with a ball in play. To suggest that the speed, location, and movement of the pitch thrown has little or no influence on the ball landing safely is somewhat silly. In the article that jimei posted it was estimated that 30% of a the variance of BABIP could explained by factors influenced by the pitcher. Considering that it's around 65% of outcomes that's a lot. For those math geeks out there, because n is large, the standard deviations are going to be smaller, much smaller.

Second there are significant limitations on the research.

All the research suggested was that established major league starting pitchers had little influence on BABIP over an extended period of time. Any other analysis extending this research is a complete misuse of the ideas presented. You cannot use the theory to analyze a minor league prospect or to dismiss a single start.

Players who are much better than their competition are clearly able to influence BABIP. Even in AAA the competition is far more diffuse than it is in the majors. A AAA pitcher who is really doesn't belong there might be able to get minor league hitters to make consistently bad contact. Can you imagine a high school team making consistently bad contact against Felix Hernadez and some guy in the stands declaring that the weak contact was just bad luck? It's not.

A pitcher's influence on BABIP over long periods says nothing about his influence over short periods. Over one start the pitcher may just be facing a team swinging away early in the count. Or a pitcher may have exceptionally good movement that start. I wouldn't be surprised if the real reason that pitcher BABIP shows less constancy than K's or BB's is that BABIP is influenced by movement, which there is some evidence for, while K's are influenced more by velocity and BB's by control. It would make sense that a pitcher would have more ability to be consistent with velocity and control than movement. You would need to, analyze how hard the ball was hit, location etc. as opposed to scouting a box score.

Finally it should be mentioned that most statistical studies are influenced by survival bias. I remember reading a post once that suggested that Zach Stewart was actually a strong pitcher because he had a much lower XFIP than ERA. Simply put a pitcher who has a substantially higher HR/FB ratio or BABIP than average won't be on the mound very long and won't be part of the study. It's likely that Stewart simply wasn't good enough for the majors and that was the reason for the variance.

I really don't know why I posted this as it's not going to stop people from polluting the internet by misusing research that they do not understand. Some people love to scout the box score and pretend that they are smart. I am sure no one cares.

jimed14
Veteran

Posts: 25,830

DIPS Jul 1, 2014 21:34:29 GMT -5

Quote

Post by jimed14 on Jul 1, 2014 21:34:29 GMT -5

I appreciate the post moonstone. That was said with way more conviction than I could have said. I'm relatively new with advanced stats but I think I'm on the right track. There's just no way I'd ever use FIP or BABIP for a single game or a stretch of five games to try to explain anything. I actually brought up the high school stats point in the Ranaudo thread.

“We just lost a World Series game in 18 innings. But after that [meeting], it didn’t feel like we lost. It felt like we won.”

James Dunne
Staff

Managing Editor

Posts: 12,364

DIPS Jul 2, 2014 0:25:01 GMT -5

Quote

Post by James Dunne on Jul 2, 2014 0:25:01 GMT -5

Inducing weak contact can absolutely be a skill. But before we go around accusing everyone who disagrees with us of "scouting the box score," can we at least provide evidence that Ranaudo is actually doing that? Otherwise everyone is just looking at the same box scores with a different set of biases.

Over 5-10 starts, peripherals are certainly a more reliable indicator of anything than runs and hits allowed. If you've been doing it as long as Matt Cain or Mark Buehrle or Tom Glavine, sure. But ten starts of Ranaudo outpitching his peripherals doesn't mean much. The more important thing is that he's producing a lot of grounders, which is going to keep the ball in the park, which will keep his ERA down whenever his BABIP regresses to whatever his mean is. Because it's not .202.

joshv02
Veteran

Posts: 600

DIPS Jul 2, 2014 7:31:25 GMT -5

Quote

Post by joshv02 on Jul 2, 2014 7:31:25 GMT -5

Again: Voros didn't say that players do not have control over BABIP. The contention was just that the players that survive the minor leagues and make it to the majors demonstrate a small spread of skill in controlling their BABIP such that you cannot tell which players (by looking at their stats) do and do not have a significantly out-of-the-major-league-norm skill in limiting BABIP. People continue to read it wrong.

And, of course Voros's "study" wouldn't pass muster in a peer reviewed journal. It was done for a usenet group in the late 1990s so that he could figure out a better way to win a roto league. This was a different time, when people would waste their days ~~when they should be studying for the bar~~ reading on the newly democratized web; BPro didn't really make money at the time, and the best place to read Tango and Dolphin and MGL was in the corner of AOL's fanhome.

But otherwise, lots of good (if not repeated) points that I don't disagree with.

jmei
Global Moderator

Posts: 15,224

DIPS Jul 2, 2014 7:32:32 GMT -5

Quote

Post by jmei on Jul 2, 2014 7:32:32 GMT -5

Jul 1, 2014 19:29:08 GMT -5 jimed14 said:

You were dismissing his last 5 starts as luck using FIP and SIERA even with peripherals such as 6/24 BB/K. I'm having trouble finding LD%, IFFB%, GB%, FB% on a game log basis, but he only had 2 FO last night vs. 5 GB. Sometimes, when a pitcher is pitching better than usual, it's not luck. Especially with prospects who are inconsistent. Sometimes, they're hitting their potential. Again, I don't know it's the case, but I'm way more uncertain of it than you seem to be thinking it's just luck and therefore unsustainable.

What it comes down to, way more than it being about Ranaudo is to be so quick to look at FIP every time a pitcher goes on a run or seems to be outperforming it. You did it in the Owens thread too when Eric was doing his #3 pitcher analysis. I'll argue that Owens' ERA will be continually be lower than his FIP while he's in the minors so there's no reason to look at it. Inducing weak contact is a skill.

I noted repeatedly that he had pitched really well over those five starts. My point was just (a) that he pitched worse than his ERA would suggest (i.e., advanced metrics regarded him as having pitched more good than spectacular), (b) he still had things to work on in Pawtucket and was not major league ready, and (c) that I still liked Webster and Barnes more than him.

Part of the problem is that we use the word "luck" in different ways. I don't mean to suggest that he's actually a bad pitcher who just fluked into some good results. I mean that he pitched really well in that stretch but we shouldn't expect him to continue to pitch that well going forward. There are no value judgments there; rather, just the suggestion that that stretch probably doesn't represent his true talent level.

I'm generally suspicious anytime someone points to a small sample of great results and suggests that that prospect has turned a corner and is blossoming into the guy we always thought he was. Having followed the system for eight or nine years now, I can tell you that (a) folks do this all the time and (b) the vast, vast majority of the time, it's just a little blip in the radar rather than a breakout. You're right to suggest that it's certainly a possibility that this is Ranaudo reaching his potential, but the odds are very much against it, and I tend to think that we should avoid getting too worked up about that possibility until he sustains it over a longer period of time.

Re: Owens-- I think we agree more than we disagree. I think Owens' deception is a legitimate skill, as is his ability to get a good number of popups, and he will likely be able to outperform his peripherals. But the magnitude of that skill is very large. The best BABIP-outperformers can sustain something like a .280-.290 BABIP over the long haul, which means that they'll put up a 3.10 ERA as compared to their 3.30 FIP. Even if outperforming your BABIP is a skill in the minor leagues, it's one that translates less well to the majors and has a pretty narrow spread there. As such, I don't think we can just ignore his minor league FIP, as he's not going to be able to outperform it as much in Boston as he has in Portland and Salem.

Also, a question. Do you know if FIP or SIERA gives credit for IFFB? This is a mistake if it doesn't, IMO.

SIERA does, FIP doesn't. If you have questions about these metrics, I would just google around and read more about them. You're going to encounter them a lot, and it's worth knowing how exactly they work.

jmei
Global Moderator

Posts: 15,224

DIPS Jul 2, 2014 8:34:35 GMT -5

Quote

Post by jmei on Jul 2, 2014 8:34:35 GMT -5

Jul 1, 2014 21:26:23 GMT -5 moonstone2 said:

A couple of notes.

Since Voros McCracken published his original article DIPS theory is probably the most misused theory on the internet. Not only do several so called "sabermatricians", misapply the research, but McCracken's research wouldn't have passed the strict scruitiny required in other fields. McCracken used a low Rsquared and only used two seasons of research.

First off the idea of DIPS theory makes absolutely no sense. A batted ball in play is by far the most common end to a plate appearance by far. I would estimate that roughly 2/3 of plate appearances end with a ball in play. To suggest that the speed, location, and movement of the pitch thrown has little or no influence on the ball landing safely is somewhat silly. In the article that jimei posted it was estimated that 30% of a the variance of BABIP could explained by factors influenced by the pitcher. Considering that it's around 65% of outcomes that's a lot. For those math geeks out there, because n is large, the standard deviations are going to be smaller, much smaller.

It's not like Voros' analysis was the only study which found that pitchers have very little control over their BABIP. Other researchers have conducted a multitude of follow-up studies using more rigorous methods that have almost unanimously reaffirmed his conclusion that pitchers have very little control over their BABIPs. Of course, most of the nuance is in what constitutes "very little," but the general conclusion that the vast majority of BABIP variation is explained by factors other than the pitcher has been found again and again. These analyses have been completed with much smaller minimum IP thresholds and relief pitchers, and the theory still held.

Plus, as I've mentioned before, we have isolated pretty much all the ways that a pitcher influences his BABIP. If he is a knuckleballer, generates lots of popups, is an extreme ground ball pitcher, or is a high strikeout pitcher, he can probably sustain a below-average BABIP. If he does not do any of those things, he probably can't. Moreover, BABIP skill has a narrow spread in the major leagues. Even the best pitchers are only a little better than the league-average. Even if a guy meets some or all of that criteria, if he's rocking a .230 BABIP, it's going to regress, even if it regresses to .280 rather than .300.

Jul 1, 2014 21:26:23 GMT -5 moonstone2 said:

Second there are significant limitations on the research.

All the research suggested was that established major league starting pitchers had little influence on BABIP over an extended period of time. Any other analysis extending this research is a complete misuse of the ideas presented. You cannot use the theory to analyze a minor league prospect or to dismiss a single start.

Players who are much better than their competition are clearly able to influence BABIP. Even in AAA the competition is far more diffuse than it is in the majors. A AAA pitcher who is really doesn't belong there might be able to get minor league hitters to make consistently bad contact. Can you imagine a high school team making consistently bad contact against Felix Hernadez and some guy in the stands declaring that the weak contact was just bad luck? It's not.

A pitcher's influence on BABIP over long periods says nothing about his influence over short periods. Over one start the pitcher may just be facing a team swinging away early in the count. Or a pitcher may have exceptionally good movement that start. I wouldn't be surprised if the real reason that pitcher BABIP shows less constancy than K's or BB's is that BABIP is influenced by movement, which there is some evidence for, while K's are influenced more by velocity and BB's by control. It would make sense that a pitcher would have more ability to be consistent with velocity and control than movement. You would need to, analyze how hard the ball was hit, location etc. as opposed to scouting a box score.

Finally it should be mentioned that most statistical studies are influenced by survival bias. I remember reading a post once that suggested that Zach Stewart was actually a strong pitcher because he had a much lower XFIP than ERA. Simply put a pitcher who has a substantially higher HR/FB ratio or BABIP than average won't be on the mound very long and won't be part of the study. It's likely that Stewart simply wasn't good enough for the majors and that was the reason for the variance.

The bolded is just not true. If we're trying to determine how well a pitcher is going to perform going forward, looking at his walk, strikeout, and batted-ball numbers over a single start or over a small number of starts will much, much better predict his future success than looking at his ERA or his hits given up. Because we're dealing with small samples, there are significant error bands around any stat that you want to look at, but the ones around BB/K/batted ball numbers are much lower than those around ERA or H/9. Indeed, the smaller your initial sample size, the more that advanced stats like FIP or SIERA outperform ERA in terms of predicting a guy's future ERA. The smaller the sample we're looking at, the more we should be using advanced stats rather than ERA or WHIP to project a given pitcher.

Moreover, there is no reason to think that DIPS just falls apart in the minor leagues. You're right to suggest that it holds less and less as we go down the minor league ladder due to the spread in competition and survivor bias, but evaluating players by their peripherals is still absolutely more predictive than evaluating them based on their ERA/WHIP. This is especially true the closer you get to the major leagues, and we're talking about a guy in Ranaudo who pitches in AAA. Teams absolutely use DIPS principles to evaluate their prospects, which is a pretty strong indicator that they matter.

Last Edit: Jul 2, 2014 8:36:05 GMT -5 by jmei: typos

jmei
Global Moderator

Posts: 15,224

DIPS Jul 2, 2014 10:45:34 GMT -5

Quote

Post by jmei on Jul 2, 2014 10:45:34 GMT -5

Just wanted to briefly elaborate how on I evaluate small sample data for pitchers. I generally hold a rebuttable presumption that the high variance stats (think BABIP, LOB%, etc.) are driven by chance rather than skill, especially when we're talking small samples (if it's a really small sample, I add HR/9 to the list). As such, I think that ERA estimators (FIP, SIERA, etc.) are a much better estimate of a player's future performance in small samples than their ERA is.

If the player possesses skills that suggest that they have a true-talent ability to outperform a league-average BABIP (think deception, a high IFFB%, etc.), those ERA estimators might slightly underrate them, but the effect tends to be pretty small. Therefore, I still think their ERA estimators are a better estimate of their future performance than their ERA is, though their true-talent levels are likely a little better than those ERA estimators suggest.

(I generally follow the same process with hitter BABIP, although because hitter BABIP is much, much more of a skill than pitcher BABIP, I mentally regress less with hitters than I would with pitchers. This is especially true if the player has either the skills (elite batspeed, speed, etc.) to suggest the ability to maintain above-average BABIPs or has shown the ability to maintain above-average BABIPs over the course of a large sample in the minors.)

I do the above because it's just playing the odds. Study after study indicates that BABIP is highly variable and more often driven by chance than skill, especially in small samples. When it is driven by skill, we can roughly identify most of the factors that drive it, so we can identify those guys who might be able to sustainably outperform a league-average BABIP. As such, when those factors aren't present, we should regress BABIP (and HR numbers in small samples) to league-average marks when evaluating a pitcher. When those factors are present, we should still regress a clearly-out-of-line BABIP to what we estimate that pitcher's true-talent BABIP might be.

Here's an illustration. It's like knowing that nine out of ten coins are 50/50 odds, but one out of ten is 60/40 odds. Someone presents us with a coin of unknown providence and flips it four times, getting four heads in a row. Our reaction should be that it's probably a fair coin that just happened to get four heads in a row due to chance. Our reaction should not be that maybe it's a coin that flips heads every single time.

ericmvan
Veteran

Supposed to be working on something more important

Posts: 9,027

DIPS Jul 2, 2014 14:25:51 GMT -5

Quote

Post by ericmvan on Jul 2, 2014 14:25:51 GMT -5

Jul 1, 2014 17:34:29 GMT -5 jmei said:

I don't want to derail this thread more than I already have, so I've moved the most recent DIPS-related posts to its own thread in the Off-Topic Forum. Feel free to continue that discussion there. Let's keep this thread to Ranaudo. Thanks.

Can I give a few sentences on the actual reality, which is not what was described earlier in the thread?

Not true, and obviously not true when you think about it: with a few exceptions, MLB pitchers all have the same innate league-average BABIP skill relative to their groundball vs. ball-in-air rates, which means you should ignore ERA in favor of FIP or SIERA.

Actual truth: MLB pitchers have been selected to have a narrow range of BABIP skills relative to their groundball rates, and there is so much noise in the data, even after controlling for defense and park as best we can, that it's often hard to say whether a guy is somewhat better than average or somewhat worse--it's too easy to have numbers opposite to your skill. We can only really be certain about the outliers of the distribution.

What the reality means is that when a AAA pitcher has had a low BABIP like Ranaudo (especially when that low BABIP is accompanied by a known cause, like excellent command or a funky delivery), there's good reason to believe that as an MLB pitcher, his likeliest outcome will be a lower than expected BABIP and an ERA lower than FIP and SIERA, even though it's even more likely that those differences will not be large enough to identify him as having that skill for certain.

So, no, Ranaudo can not be expected to perform like this in MLB and have an ERA way lower than FIP and SIERA, but he certainly can be expected to have a somewhat or slightly lower ERA.

Another way to look at this is that the MLB distribution of (ERA-FIP) or (ERA-SIERA) is wider than you'd expect it to by chance.

I wish I knew how much to regress (ERA-FIP) to zero for minor leaguers. In the advanced minor league pitching stats I used to do, I'd adjust BABIP for team defense and then regress it halfway to league average, and then use that in a BaseRuns-based component ERA. That seemed to work OK.

"You either need some medication or you're an a******." -- David Ortiz correctly diagnosing Bobby Valentine

jmei
Global Moderator

Posts: 15,224

DIPS Jul 2, 2014 14:46:24 GMT -5

Quote

Post by jmei on Jul 2, 2014 14:46:24 GMT -5

Jul 2, 2014 14:25:51 GMT -5 ericmvan said:

Not true, and obviously not true when you think about it: with a few exceptions, MLB pitchers all have the same innate league-average BABIP skill relative to their groundball vs. ball-in-air rates, which means you should ignore ERA in favor of FIP or SIERA.

Actual truth: MLB pitchers have been selected to have a narrow range of BABIP skills relative to their groundball rates, and there is so much noise in the data, even after controlling for defense and park as best we can, that it's often hard to say whether a guy is somewhat better than average or somewhat worse--it's too easy to have numbers opposite to your skill. We can only really be certain about the outliers of the distribution.

I understand these these statements are descriptively different, but practically speaking, because the BABIP spread amongst major leaguers is so narrow and determining true-talent BABIP skill is so difficult, I tend to think that we can regard the first statement as essentially accurate and thus can and should heavy-handily apply DIPS principles in evaluating pitchers.

It's like how classical physics is, strictly speaking, not entirely accurate (see quantum mechanics and relativity), but for for the purposes of teaching 11th graders basic science, it's accurate enough. Yes, in reality, it's more complicated than F=ma, but for our back-of-the-envelope internet discussion, it's close enough.

jimed14
Veteran

Posts: 25,830

DIPS Jul 2, 2014 14:59:41 GMT -5

Quote

Post by jimed14 on Jul 2, 2014 14:59:41 GMT -5

With a multi-season sample size, that's much more true than in a handful of games or one game. A good results pitching start can be because the ball is hit hard right at players or great defensive plays, or it can be because no one hit the ball harder than 70 mph all game because of great command/control (which can easily explain a 0% HR/FB ratio or low BABIP). This is why I dismiss single game FIP/SIERA discussions without seeing the game(s). Pitchers can pitch the games of their lives or they can get lucky with BABIP or they are underrated and/or improving.

“We just lost a World Series game in 18 innings. But after that [meeting], it didn’t feel like we lost. It felt like we won.”

jmei
Global Moderator

Posts: 15,224

DIPS Jul 2, 2014 15:06:59 GMT -5

Quote

Post by jmei on Jul 2, 2014 15:06:59 GMT -5

Jul 2, 2014 14:59:41 GMT -5 jimed14 said:

With a multi-season sample size, that's much more true than in a handful of games or one game. A good results pitching start can be because the ball is hit hard right at players or great defensive plays, or it can be because no one hit the ball harder than 70 mph all game because of great command/control (which can easily explain a 0% HR/FB ratio or low BABIP). This is why I dismiss single game FIP/SIERA discussions without seeing the game(s). Pitchers can pitch the games of their lives or they can get lucky with BABIP or they are underrated and/or improving.

Did you read this link? The smaller the sample size, the more that you want to look at a guy's peripherals as opposed to his ERA. Remember, this isn't a dichotomy between having a scout at the game giving us a detailed report versus just looking at his FIP. There's that somewhat famous quote about how, in small samples, a good scout is always better than stats, which I absolutely agree with. In general, a good scouting report will be way more valuable for a minor league prospect than any statistical analysis, for many of the reasons discussed above.

But here, we're talking about the difference between looking at a guy's box score stats or looking at his peripherals. In general, few of us have the ability to see many minor league games, and the TV feeds are generally of poor quality (blurry, from a bad angle, don't show batted ball trajectory, etc.). Even fewer of us has any scouting background. In light of that, we're forced to predominantly look at stats and secondhand scouting reports to evaluate prospects. And if we're going to cite numbers to evaluate a player, there are certain ones that are more predictive than others, and we should be using the more predictive ones.

Last Edit: Jul 2, 2014 15:16:43 GMT -5 by jmei

jimed14
Veteran

Posts: 25,830

DIPS Jul 2, 2014 15:53:04 GMT -5

Quote

Post by jimed14 on Jul 2, 2014 15:53:04 GMT -5

I'm not talking about predicting anything. I'm talking about attributing results to luck vs being better than usual for 1-5 games. I think in small samples, pitchers probably regularly exceed their median ability and just pitch better. I don't agree with looking at a single game where a pitcher gave up no home runs and saying that he really should have given up 0.6 home runs even if no one ever hit the ball hard on any of the 4 fly balls he gave up. Maybe it's true, but it is not certain.

And FIP seems to be way low when pitchers have really bad games as well.

“We just lost a World Series game in 18 innings. But after that [meeting], it didn’t feel like we lost. It felt like we won.”

jmei
Global Moderator

Posts: 15,224

DIPS Jul 2, 2014 16:17:51 GMT -5

Quote

Post by jmei on Jul 2, 2014 16:17:51 GMT -5

Jul 2, 2014 15:53:04 GMT -5 jimed14 said:

I'm not talking about predicting anything. I'm talking about attributing results to luck vs being better than usual for 1-5 games. I think in small samples, pitchers probably regularly exceed their median ability and just pitch better. I don't agree with looking at a single game where a pitcher gave up no home runs and saying that he really should have given up 0.6 home runs even if no one ever hit the ball hard on any of the 4 fly balls he gave up. Maybe it's true, but it is not certain.

I mentioned this earlier-- we're using the word "luck" to mean different things. I'm not trying to take anything away from someone who had an great performance-- no-hitters and perfect games (and, to a lesser extent, shutouts or big wins, etc.) are awesome and worth celebrating. But they're also unsustainable in the sense that that guy is probably not going to continue to pitch that well, and that's the question I'm almost always intent on answering. I'm usually looking forwards rather than backwards, and for those purposes, looking at ERA estimators rather than ERA generally makes more sense.

But, more broadly, just because a guy gave up a bunch of shallow fly balls does not necessarily mean that he pitched well. The difference between a shallow fly ball and a home run is maybe a quarter of an inch and a fraction of a second. There just aren't many pitchers who are good enough at tricking major league hitters into consistently mis-hitting their pitches (unless they're getting them to swing-and-miss).

jmei Global Moderator Posts: 15,224	DIPS Jul 18, 2014 13:19:31 GMT -5 Quote Select Post Deselect Post Link to Post Member Give Gift Back to Top Post by jmei on Jul 18, 2014 13:19:31 GMT -5 On the subject of DIPS, here's an old article that I had trouble finding earlier but which I just stumbled onto, explaining why using SIERA to evaluate pitchers is generally a good idea: www.baseballprospectus.com/article.php?articleid=10281
	Last Edit: Jul 18, 2014 13:19:43 GMT -5 by jmei

mgoetze
Veteran

Posts: 5,057

DIPS Jul 18, 2014 14:45:29 GMT -5

Quote

Post by mgoetze on Jul 18, 2014 14:45:29 GMT -5

Jul 18, 2014 13:19:31 GMT -5 jmei said:

On the subject of DIPS, here's an old article that I had trouble finding earlier but which I just stumbled onto, explaining why using SIERA to evaluate pitchers is generally a good idea: www.baseballprospectus.com/article.php?articleid=10281

But "everyone still at BP" has since come to the conclusion that SIERA is actually completely awful at anything anyone could possibly want, right?

Ceterum censeo John Farrell esse dismissiendam.

ericmvan
Veteran

Supposed to be working on something more important

Posts: 9,027

DIPS Aug 8, 2014 6:34:34 GMT -5

Quote

Post by ericmvan on Aug 8, 2014 6:34:34 GMT -5

Jul 2, 2014 14:46:24 GMT -5 jmei said:

Jul 2, 2014 14:25:51 GMT -5 ericmvan said:

Not true, and obviously not true when you think about it: with a few exceptions, MLB pitchers all have the same innate league-average BABIP skill relative to their groundball vs. ball-in-air rates, which means you should ignore ERA in favor of FIP or SIERA.

Actual truth: MLB pitchers have been selected to have a narrow range of BABIP skills relative to their groundball rates, and there is so much noise in the data, even after controlling for defense and park as best we can, that it's often hard to say whether a guy is somewhat better than average or somewhat worse--it's too easy to have numbers opposite to your skill. We can only really be certain about the outliers of the distribution.

I understand these these statements are descriptively different, but practically speaking, because the BABIP spread amongst major leaguers is so narrow and determining true-talent BABIP skill is so difficult, I tend to think that we can regard the first statement as essentially accurate and thus can and should heavy-handily apply DIPS principles in evaluating pitchers.

It's like how classical physics is, strictly speaking, not entirely accurate (see quantum mechanics and relativity), but for for the purposes of teaching 11th graders basic science, it's accurate enough. Yes, in reality, it's more complicated than F=ma, but for our back-of-the-envelope internet discussion, it's close enough.

Just noticed this reply ...

Following on your analogy, the point I was trying to make is that the assumptions of DIPS theory represent not the truth but a very useful simplification of it. But just as in physics, done at certain scales, one should always be aware that you might be looking at a situation where quantum effects come into play, one should always be open to the possibility that a real BABIP skill is in play. Folks who take the simplified version of the truth as the whole truth tend to dismiss that possibility a priori.

It is possible to ferret out real BABIP skills with intelligent analysis, and (obviously) even more so with pitch/fx data. I've never tackled the latter, but I think I've used regular stats to demonstrate that 2010 Buchholz had it, and I just presented a very interesting case that Kelly may well have it. Almost everyone else has dismissed the latter possibility out of hand, but it didn't take me long at all to find evidence that supported it. I think that supports my argument that when we see BABIP outliers, we should more often be looking at them closely rather than assuming them to be subject to complete regression.

"You either need some medication or you're an a******." -- David Ortiz correctly diagnosing Bobby Valentine

jimed14
Veteran

Posts: 25,830

DIPS Aug 8, 2014 10:52:42 GMT -5

Quote

Post by jimed14 on Aug 8, 2014 10:52:42 GMT -5

Am I crazy in thinking that over an entire career, BABIP should be pretty similar amongst all pitchers if skill were not involved? Because when you look at career BABIP, a lot of the lowest ones are right in line with the most dominant pitchers. I mean the spread is over 100 points for careers. I think to believe that BABIP isn't a skill is to believe that to be dominant, you need to be lucky. Or that every no-hitter is purely luck and pretty much every pitcher has an equal chance of getting one (not counting for strikeouts of course).

If you put a high school pitcher in the majors, his BABIP is going to be way higher, obviously. I don't know that anyone would dispute that. And if you put a major league pitcher in high school, his BABIP would be way lower, obviously.

This is an extreme example by design. But it's clear that extreme differences in talent result in extreme differences in BABIP so it's easy for me to expect that smaller differences in talent between major league pitchers could also result in differences in BABIP.

Last Edit: Aug 8, 2014 10:53:54 GMT -5 by jimed14

“We just lost a World Series game in 18 innings. But after that [meeting], it didn’t feel like we lost. It felt like we won.”

Post by moonstone2 on Jul 1, 2014 16:43:46 GMT -5

Post by jmei on Jul 1, 2014 17:11:00 GMT -5

Post by jimed14 on Jul 1, 2014 17:16:27 GMT -5

Post by Oregon Norm on Jul 1, 2014 18:06:18 GMT -5

Post by jimed14 on Jul 1, 2014 18:43:57 GMT -5

Post by chavopepe2 on Jul 1, 2014 18:55:13 GMT -5

Post by jmei on Jul 1, 2014 19:00:06 GMT -5

Post by jimed14 on Jul 1, 2014 19:29:08 GMT -5

Post by moonstone2 on Jul 1, 2014 21:26:23 GMT -5

Post by jimed14 on Jul 1, 2014 21:34:29 GMT -5

Post by James Dunne on Jul 2, 2014 0:25:01 GMT -5

Post by joshv02 on Jul 2, 2014 7:31:25 GMT -5

Post by jmei on Jul 2, 2014 7:32:32 GMT -5

Post by jmei on Jul 2, 2014 8:34:35 GMT -5

Post by jmei on Jul 2, 2014 10:45:34 GMT -5

Post by ericmvan on Jul 2, 2014 14:25:51 GMT -5

Post by jmei on Jul 2, 2014 14:46:24 GMT -5

Post by jimed14 on Jul 2, 2014 14:59:41 GMT -5

Post by jmei on Jul 2, 2014 15:06:59 GMT -5

Post by jimed14 on Jul 2, 2014 15:53:04 GMT -5

Post by jmei on Jul 2, 2014 16:17:51 GMT -5

Post by jmei on Jul 18, 2014 13:19:31 GMT -5

Post by mgoetze on Jul 18, 2014 14:45:29 GMT -5

Post by ericmvan on Aug 8, 2014 6:34:34 GMT -5

Post by jimed14 on Aug 8, 2014 10:52:42 GMT -5