December 23rd, 2007
How to lie with statistics, academia edition: Here’s what your $40,000 a year is paying for
In yesterday’s New York Times, a pair of academics — Columbia professor of sociology Jonathan Cole and University of Chicago professor of statistics Stephen Stigler — published an article titled “More Juice, Less Punch,” which aimed to ask the question: “Do [PEDs] make a difference sufficient to be detected in the players’ performance records?” Their answer, not surprisingly, is no (otherwise there wouldn’t have been any point in publishing their story in the first place): “An examination of the data on the players featured in the Mitchell report suggests that in most cases the drugs had either little or a negative effect.”
I feel sorry for the students who are forced to sit through these boobs’ courses.
Cole and Stigler try to prove their point by comparing stats from before and after a given player is accused of using roids (or HGH, or whatever). They explain their methodology thusly: “For pitchers identified by the report, we looked at the annual earned run average for their major league careers. For hitters we examined batting averages, home runs and slugging percentages. We then compared each player’s yearly performance before and after he is accused of having started using performance-enhancing drugs. After excluding those with insufficient information for a comparison, we were left with 48 batters and 23 pitchers.” The results, they say, show no net gain in performance.
This in itself would seem to demonstrate, intuitively, that PEDs do, in fact, work: baseball players, like mathematicians and physicists, show a dramatic tail-off at a very young age (for the geeks, their best work is usually done in their 20s; for ballplayers, the peak years usually come between 28 and 32), and if players with extended careers don’t show any decline in performance, that would indicate an unusual pattern.
Anyone with even a slight degree of sophistication would also realize that it’s next to meaningless to compare raw data: you need to make sure you understand what the data you’re looking at actually means. In this case, that means realizing that comparing stats like ERA or home runs or OPS or anything else tells you much less about a player’s relative performance than ERA+ or OPS+. (OPS+ normalizes OPS for the park and the league the player played in; ERA+ shows the player’s ERA in relation to the league’s ERA. This explains why Pedro’s 1.74 ERA in 2000, when the league ERA was 5.07, earned him an ERA+ of 291, while Sandy Koufax’s 1.74 ERA in 1964, when the league ERA was 3.25, only garnered him an ERA+ of 187. It also helps show why Pedro’s 2000 season was arguably the best ever. It resulted in the highest ERA+ since 1880, and the second best ever. Koufax’s top season ranks as 56th.)
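The ERA+ figures quoted above can be reproduced with a simplified version of the formula. What follows is a minimal sketch, assuming ERA+ is just 100 times the league ERA divided by the pitcher’s ERA, rounded to the nearest whole number. The published stat also adjusts for ballpark, which this ignores, and the function name era_plus is made up for illustration.

```python
def era_plus(pitcher_era: float, league_era: float) -> int:
    """Simplified ERA+: 100 means league average, higher is better.

    Assumption: no park adjustment, unlike the published statistic.
    """
    if pitcher_era <= 0:
        raise ValueError("pitcher_era must be positive")
    return round(100 * league_era / pitcher_era)


# Figures quoted in the post.
pedro_2000 = era_plus(1.74, 5.07)
koufax_1964 = era_plus(1.74, 3.25)
print(pedro_2000, koufax_1964)  # 291 187
```

Same raw ERA, very different league context: that gap of more than 100 points is exactly what a raw before-and-after ERA comparison throws away.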
Let’s drill down a little more. Cole and Stigler write, “Roger Clemens is a case in point: a great pitcher before 1998, a great (if increasingly fragile) pitcher after he is supposed to have received treatment. But when we compared Clemens’s E.R.A. through 1997 with his E.R.A. from 1998 on, it was worse by 0.32 in the later period.” As I pointed out last year, the salient point here is how Clemens performed in his late 30s compared to his mid 20s. In the 12 years from Clemens’ breakout year in 1986, when he was 23, he had an ERA+ above 180 twice; in the 10 years from age 35 to 44, he had two more. Compare that to other Hall of Fame pitchers from this era like Greg Maddux, who had four years with an ERA+ of 180 or higher before age 35 and none afterwards, or Tom Glavine, whose five best years all came before age 35. Heck, compare it to Tom Seaver, the guy who was voted into the Hall with the highest percentage ever: his six best years all came before age 34.
Cole and Stigler are just as ignorant when it comes to hitters. “What should not be overlooked,” they write, “is that Bonds’s profile is strikingly like Babe Ruth’s high performance level until near the end of his career, with one standout home run year — a year in which other players on other teams also exceeded their previous levels.”
Actually, what should not be overlooked is the fact that Bonds has put up an OPS+ of greater than 200 in three out of the last six years, compared with comparable numbers in three of his first 14 years in the bigs. Ruth also had an OPS+ higher than 200 in three of his final six years…and another eight in the previous 14. (Another thing that should not be overlooked: Bonds has played the majority of his career in a home ballpark that has a spacious right field, unlike Ruth, who got to hit in Yankee Stadium.)
I know it’s not a shocker that a pair of academics don’t really understand baseball; it has taken autodidacts like Bill James to help illuminate the game. What is shocking is how little Cole and Stigler, professors who not only deal with numbers but teach at elite institutions, seem to understand about analyzing data.
December 17th, 2007
The Sox and the Bruins, dragging down the locals
Boston’s four pro sports teams–the Sox (11-3), Pats (11-0), C’s (20-2), and Bruins (18-11-3)–are a combined 60-16-3 since October 1, which is good for a .759 winning percentage. Take the B’s out of the equation and the three first place teams are a combined 42-5, which is good for approximately 9 wins for every 10 games. Back in 1986, I remember thinking that Boston had had a pretty decent year–the Sox and Pats both played for the championship and the Celts won their 2nd title in 3 years. Let’s see how this year ends up…
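The arithmetic for the three first place teams is easy to check. Here is a minimal sketch using the records quoted above; the records dictionary and the combined helper are names made up for illustration.

```python
# Win-loss records since October 1, as quoted in the post.
records = {"Sox": (11, 3), "Pats": (11, 0), "C's": (20, 2)}


def combined(recs):
    """Return total wins, total losses, and wins per game played."""
    wins = sum(w for w, _ in recs.values())
    losses = sum(l for _, l in recs.values())
    return wins, losses, wins / (wins + losses)


wins, losses, pct = combined(records)
print(f"{wins}-{losses}, {pct:.3f}")  # 42-5, 0.894
```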
December 16th, 2007
Meet the new boss, redux: I say yes, I say no
“This is not a bluff; it’s just reality.”
– Hank Steinbrenner, December 2, 2007, when asked about his “firm deadline” of midnight, December 3, for the Yankees and Twins to complete a deal for Johan Santana.
“No. It’s over. When it’s over, it’s over.”
– Hank Steinbrenner, December 4, 2007, when asked if the Yankees were willing to continue negotiations if the Twins became willing to accept the Yankees’ final offer.
“We’re still thinking about it. We haven’t ruled it out completely. We’re still considering it. I haven’t closed the door completely on Santana.”
– Hank Steinbrenner, December 14, 2007
December 16th, 2007
Day Three: The situation’s looking worse all the time for Roger
Roger Clemens, through his lawyer, has been sticking with his Casablanca-evoking outrage that he was fingered as a ‘roids user. He shouldn’t be surprised, and neither should anyone else. (Compare this picture of a middle-aged Clemens to this one when he was in Boston. It certainly looks like his body went through a Bonds-like transformation.) I’ve been curious as to why more people weren’t asking questions about Clemens since last January, when Boston was in the hunt for his services.
In the last two days, the situation for Clemens has, remarkably, gotten even worse. There are ex-big leaguers like C.J. Nitkowski defending Brian McNamee after he was called “troubled” by Clemens’ lawyer–a remarkable breach of the unspoken code of omerta among current and former ballplayers. There’s Curt Schilling, who looks up to Clemens as an idol, saying “I believe it” when asked about the contents of the Mitchell report. There are the results of ESPN’s Jerry Crasnick’s informal poll of Hall voters–a full two-thirds of whom say they either wouldn’t vote for Clemens or are undecided.
And now there’s Andy Pettitte’s classy confirmation of McNamee’s revelations about his use of HGH. (Classy so long as his statement that there were only two times he used is, in fact, true.) Not only did Pettitte not say that McNamee was troubled, he confirmed exactly what McNamee had told investigators.
The steroid mess isn’t going to be one of those Watergate/Monica situations where the cover-up is worse than the crime…but it may be a case where the public, and the press, are a lot quicker to grant absolution to guys who come clean on their own. I’m willing to bet dollars to doughnuts that Pettitte gets the biggest ovation of any player when the Yankees are announced on opening day at the Stadium.
December 16th, 2007
In defense of the Yankees dynasty
The presence of Clemens and Pettitte–and, to a lesser extent, of the likes of Chuck Knoblauch and David Justice–has predictably caused some people to question the Yankees’ ‘96-’00 dominance. I’ll count myself as one of the voices for the defense. No one will ever know what the Mitchell report would have looked like had there been a strength and conditioning coach in every clubhouse who talked, on the record, about what happened while they were with their respective teams…but it’s a safe bet that more teams would look like the Yankees, with more than a dozen players named, than like the Sox, who don’t have a single major player cited for actions during his time in Boston.
December 13th, 2007
What - you want more on the Mitchell Report?
Lots and lots and lots and lots of actual and virtual ink will be spilled on the Mitchell Report, which is going to make life hell for a whole mess of people. I’ll resist adding too much of my drivel and will instead limit myself to a few quick points on issues such as…
Roger Clemens. Why, you might ask, would a sure-fire Hall of Famer risk his reputation and legacy over these last five or so years by taking PEDs? People asked me that question again and again during the pre-season frenzies of last season and 2006. I have no way of knowing; for some reason, Clemens won’t talk to me. But I do have an idea: because he has never, in his entire life, had to deal with the consequences of his actions. He can act like a teenage mutant ninja freak and throw broken bats across the field and it’s chalked up to competitive fire. He can demand ludicrous contract clauses like Hummers and private transportation and he’s indulged. Why, after years and years of this, would he suddenly think that the rules applied to him? (Clemens is far from alone in this regard; this is something that crops up again and again in ballplayers, who are constantly reminded that the normal rules of society–stay faithful to your spouse, clean up after yourself, don’t eat McDonald’s for breakfast–don’t apply to them.)
I Love (the fact that I’m not playing in) New York. Plenty of teams’ fans are going to be crowing/letting out a huge sigh of relief…so long as those fans aren’t rooting for the Mets or the Yankees. A quick scan of what is destined to become known as the list shows current and former New Yorkers including Kevin Brown, Paul Lo Duca, Mo Vaughn, Todd Pratt, Ron Villone, David Justice, Chuck Knoblauch, Clemens, Andy Pettitte, and Lenny Dykstra. Does that mean that other teams–like, say, the Sox–are (or were) any cleaner? Hell no. It just means no one else had a clubhouse attendant who got popped.
The non-inclusion of any of the Idiots: Earlier today, what turned out to be a fake list was leaked; that one included names like Nomar, Johnny Damon, and Trot Nixon, along with other usual suspects like Pudge, Pujols, and Milton Bradley. (Later in the day, well-circulated rumor had Varitek also on the list.) Back in 2005, a member of the Sox’s front office physically shuddered at the thought of what would happen in Boston if news ever broke about someone on the ‘04 team roiding up. It looks like that won’t happen…for now, anyway. That brings us to…
Eric Gagne. Gagne, as everyone now knows, was on the list, which can’t be a surprise to anyone. (Also included in the report is news that the Sox inquired about Gagne’s supposed doping before acquiring him at the deadline.) It turns out that the biggest favor Gagne may have done Boston is sucking ass for the second half of the season–now, at least, no one can point to him as one of the reasons for the team’s success.
That’s all for now. I’ve written plenty about steroids in the past, including last August, when I wondered why no one was wondering about Roger, and way back in October ‘06, when I mocked the press’s surprise that Clemens had been fingered in the Grimsley affidavit. I also tagged Jason Giambi a gutless punk, ripped into the Players Union for defending the players’ right to destroy their livers, lamented the fact that Jose Canseco seemed to be the only honest guy around, and talked about how Bill James compared steroids to going through a divorce. (Sort of, anyway.)
More later, I’m sure.
December 12th, 2007
Can you hear me now? The Daly fiasco echo chamber, day two.
After getting roundly hammered for his asinine post yesterday, BU journalism professor Chris Daly apparently decided that he hadn’t sufficiently proved his lack of insight and threw up several hundred more words of absolute drivel. It’s hard to tease out the biggest laughlines, but here are a couple to start with:
“As a professor of journalism, I work with dozens of talented young people every year, and I know just how capable they are. I also know that they often need guidance, backgrounding, and careful editing. I regret leaving the impression that people in their 20s are somehow inherently unqualified to cover presidential politics or anything else.”
and
“Like many blogs, mine is a venue for criticism, analysis and commentary. It is not an outlet for reporting or research. I googled Mr. Bacon to begin to address the question, Could experience have been a factor?”
So, to summarize: Daly defends his own ignorance by writing that the young’uns out there need “backgrounding and careful editing”…and then goes on to say that he didn’t have any responsibility to provide any kind of background, context, or careful analysis because he declined to do any reporting and research before publishing online. (This last point is particularly ironic in light of a piece Daly has posted on his site titled “Are Bloggers Journalists?” in which he invokes the two Thomases: Paine and Jefferson.) As a j-school professor, Daly sure raises some interesting points, such as: Does a self-proclaimed professional journalist and educator have any responsibility to maintain any standards when writing for his blog, which heralds his profession (www.journalismprofessor.com) and his professional affiliation? What standards should blogs be held to if they want to be taken seriously? Etc, etc. Unfortunately (for his students, anyway), Daly raises these issues implicitly, and only by his own negative example.
This little bout of industry indignation also raises another interesting issue: the Romenesko echo chamber effect. Since the first link to his original post yesterday, Daly’s musings have been the subject of four more posts on Jim Romenesko’s “daily fix of media industry news,” including this one detailing initial reactions (including my own), an unintentionally ironic post from Washington Post executive editor Len Downie chastising Romenesko for linking to Daly’s piece in the first place, a link to a letter from Time’s David Von Drehle, and this morning’s post detailing Daly’s semi-apologia. Numerous other people weighed in on Romenesko’s letters page, including the Times’ Adam Nagourney, the Boston Globe’s Erica Noonan, and Eric Alterman’s Eric Alterman. In an era of continual griping about newsroom cutbacks, why are so many highly-respected (and relatively highly paid) journalists spending their precious time engaging a man who, according to his own resume, last did time as a working journalist in 1997? (I, for one, have a good excuse: I don’t have a job.)
I’ll venture one answer: journalists are self-obsessed, and, in a time when our public opinion ranking is somewhere below that of politicians, garbage collectors, and lawyers, Romenesko–a site that’s been labeled the industry’s water cooler so many times it’s practically part of the site’s name–is one area where we can remind each other we still matter. The lack of a volume control on Romenesko’s site, where a long Times feature about the future of the Wall Street Journal under Rupert Murdoch gets less attention than Chris Daly, makes it easy for us all to indulge in these self-important feeding frenzies. As a result, we give the Chris Dalys of the world some weight, but that’s really secondary to our main, albeit unconscious, objective: reminding ourselves how much we matter.
Which isn’t to say that the first thing I’ll do when I post this won’t be to send a humble email to Romenesko himself. Because if there’s no link–and no reaction from my peers–how will I know that my voice on this burning issue is being heard?

