Announcement

Collapse
No announcement yet.

History of Baseball Statistics & Sabermetrics

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • History of Baseball Statistics & Sabermetrics

    An radio interview I heard yesterday inspired this thread. We can use this thread document the history of baseball statistics and sabermetrics. I was listening to Brian Kenny on the Tom Tolbert/Eric Byrnes Show (KNBR 680 AM-San Francisco). There were discussing Sabermetrics and boxing. During the interview Kenny was talking about how for a long time certain plays on the field don't give any statistical credit to a batter. He specifically mentioned when a batter, with a man on first, doubles to advance the lead runner to third. The next batter could hit a dribbler to the first baseman and get an RBI while the batter who hit the double gets nothing. Kenny then mentions that back "around 1910" they attempted to give credit to a batter for advancing runners. He said it was called it "base runners advanced". Kenny further stated that it didn't catch on because they had trouble tracking the base runners. Has anyone heard about this before? I'll do some research as well.
    Last edited by Honus Wagner Rules; 09-13-2013, 01:01 PM.
    Strikeouts are boring! Besides that, they're fascist. Throw some ground balls - it's more democratic.-Crash Davis

  • #2
    Originally posted by Honus Wagner Rules View Post
    An radio interview I heard yesterday inspired this thread. We can use this thread document the history of baseball statistics and sabermetrics. I was listening to Brian Kenny on the Tom Tolbert/Eric Byrnes Show (KNBR 680 AM-San Francisco). There were discussing Sabermetrics and boxing. During the interview Kenny was talking about how for a long time certain plays on the field don't give any statistical credit to a batter. He specifically mentioned when a batter, with a man on first, doubles to advance the lead runner to third. The next batter could hit a dribbler to the first baseman and get an RBI while the batter who hit the double gets nothing. Kenny then mentions that back "around 1910" they attempted to give credit to a batter for advancing runners. He said it was called it "base runners advanced". Kenny further stated that it didn't catch on because they had trouble tracking the base runners. Has anyone heard about this before? I'll do some research as well.
    Yes!, and I'm very much sold on the entire concept and don't know why it's been so overlooked. It's based on attributing bases achieved, rather than model-based attribution of runs achieved. [Some refer to it as a "bases produced" approach]. It's a cleaner and more direct approach to run estimation, and to attribution of runs to individuals, than the various existing run estimators are. It's also an excellent predictor of win percentage. I've read the same thing about how the ideas originated early on--around that 1910 date, maybe even earlier, discussed in Baseball Magazine in 1913 I think it was. Unfortunately there aren't play by play accounts for a lot of early games, at least in available form.

    This describes some of the basic ideas: http://people.umass.edu/gmhwww/pdf/H...ce-Average.pdf

    Comment


    • #3
      Originally posted by Jim Bouldin View Post
      Yes!, and I'm very much sold on the entire concept and don't know why it's been so overlooked. It's based on attributing bases achieved, rather than model-based attribution of runs achieved. [Some refer to it as a "bases produced" approach]. It's a cleaner and more direct approach to run estimation, and to attribution of runs to individuals, than the various existing run estimators are. It's also an excellent predictor of win percentage. I've read the same thing about how the ideas originated early on--around that 1910 date, maybe even earlier, discussed in Baseball Magazine in 1913 I think it was. Unfortunately there aren't play by play accounts for a lot of early games, at least in available form.

      This describes some of the basic ideas: http://people.umass.edu/gmhwww/pdf/H...ce-Average.pdf
      The concepts of Base Advance Average and Win Tracking Rate look very interesting.
      Strikeouts are boring! Besides that, they're fascist. Throw some ground balls - it's more democratic.-Crash Davis

      Comment


      • #4
        An Alan Schwarz 2004 article with a time line of baseball statistics.

        ************************************************** ***************************************
        A numbers revolution
        By Alan Schwarz, Thursday, July 8, 2004


        So you think the timeline of baseball's obsession with statistics begins in the 1980s, with computers and Bill James? Think again.

        Baseball and its numbers fix have existed for more than 150 years -- from the primordial ooze from which statistics developed in the 1860s, to the earliest sabermetricians from 1910-1930, to post-war military scientists writing scholarly articles on them. Assembling a full history of this fascination would take an entire book. As it turns out I did just write that book: "The Numbers Game," which came out a few weeks ago and, thanks to the generous folks at ESPN.com, is being featured on the site right now.

        The following, for those who want a tease of the amazingly potent history baseball has with statistics, is a timeline of some of that history's most important moments:


        1837 -- Constitution of the Olympic Ball Club of Philadelphia, which played an early ancestor of baseball called "town ball," mandates that a scorebook be kept to record runs scored by all players.

        1845 -- First box score appears in New York Morning News. Batters' columns include only runs and outs.

        1858 -- Box scores continue expansion by including nine more columns per player, including foul outs and times catching a ball on one bounce (which at the time counted as an out).

        1867 -- To reward batters who hit their way on base but do not score, New York writer Henry Chadwick begins awarding such batters a "base hit" in those situations. Other Chadwick innovations
        around this time include "total bases" and "unearned runs."

        1872 -- With "hits per game" and "total bases per game" the favored method to rate batters, a fan from Washington, H.A. Dobson, writes to Chadwick and maintains this unfairly favors leadoff
        hitters, who bat more often each game. He proposes a new system: hits per times at bat. This proved so popular, so fast, it became known simply as "batting average."

        1872 -- A Mr. Reed, scorer for the Philadelphia Athletics, rates fielders not by errors, but by plays successfully made per game. Metric never catches on until Bill James resurrects it more than 100 years later as "range factor."

        1879 -- National League keeps "reached first base" as official statistic. Discards it after one year and tries "bases touched." Scraps that quickly, too.

        1883 -- And you thought today's stats were weird? American Association awards pitching championship to Tim Keefe of the New York Metropolitans because of his league-low .0362 earned-run-per-at-bat ratio.

        1887 -- In spooky harkening of today's on-base revolution, NL and American Association decide that batting average should count walks as full-fledged hits. After St. Louis' Tip O'Neill bats .492, the one-year experiment ends.

        1893 -- Pitching rules change, sending moving hurlers back approximately 5-6 feet to their present 60 feet, 6 inches away. Taking advantage of pitchers' uneasiness with the new distance, Boston's Hugh Duffy bats .438.

        1905 -- Sensing a growing interest in relief pitching, annual Reach guide counts "times taken out," the opposite of today's "complete games."

        1907 -- New York Press sports editor Ernie Lanigan begins his own logs of "runs batted in"; statistic doesn't become official until 1920. (RBI had actually been kept as early as 1879 by a Buffalo newspaper.)

        1910 -- St. Louis Browns infield plays deep to allow popular Nap Lajoie of Cleveland to go 8-for-9 in doubleheader on last day of the season, in hopes he'll beat out Ty Cobb for the prestigious batting average championship. (No one knew who was actually leading at the time, because the official statistics were not released during the season, only after.) After weeks of controversy and machinations, Cobb still wins the title, .385 to .384. Maybe. (See 1981.)

        1912 -- Watching fewer starting pitchers complete games, National League president John Heydler scraps "earned runs per game" and replaces it with a new measure, earned runs per nine innings pitched. You know it as "earned run average."

        1912 -- "Who's Who in Baseball" debuts, with first-ever seasonal register of active players' batting and fielding averages.

        1914 -- Baseball's first attempt at a comprehensive record book, "Balldom," is published by Pittsburgh stat freak George Moreland. Includes vital list, "Eight Games in Which First Basemen Made No Putouts."

        1916 -- Baseball Magazine's F.C. Lane, perhaps baseball's first true sabermetrician, begins all-out assault on worthlessness of batting average. Assigns higher weights to doubles, triples and home runs, while also proposing new respect for walks, which he calls "the orphan child of the dope sheets."

        1918 -- Stat-crazed brothers Al Munro and Walter Elias, who had started a business selling baseball statistics to newspapers and New York billiard parlors, are hired by the National League to keep the loop's official numbers. Outfit evolves into what we know today as the Elias Sports Bureau.

        1919-1921 -- Babe Ruth shatters Ned Williamson's record of 27 home runs with 29, then 54, then 59. Fans start focusing on individuals' statistics, rather than the scores of the games, more than ever before. Beyond Ruth's example of the promise of power, Ray Chapman's fatal beaning causes leagues to use cleaner balls, helping offensive statistics skyrocket.

        1922 -- In very fishy episode after the season, AL president Ban Johnson overrules an official scoring decision to give Ty Cobb an extra hit and change his average from .399 to .401.

        1941 -- Former major leaguer Ethan Allen invents All-Star Baseball, a tabletop game that allows kids to stage major league-type contests with a spinner atop circular discs, whose circumference is sectioned off according to players' real-life statistics. (If Joe DiMaggio was a .361 hitter who hit home runs ever 17 at-bats, his disc would over time mimic this performance.) All-Star Baseball becomes instant sensation, sells thousands of sets.

        1947 -- Brooklyn Dodgers boss Branch Rickey hires Allan Roth as team statistician. Roth proceeds to keep all sorts of new statistics to rate players, including an early form of on-base percentage, batting average with runners in scoring position, performance in different ball-strike counts, and more. (Even Roth was not unique. Rickey had retained a statistician named Travis Hoke while running the Browns in the 1930s.)

        1948 -- Harold Richman, an 11-year-old from Long Island, invents a new tabletop statistics game using dice. He later publishes it commercially in 1961 under the name Strat-O-Matic.

        1951 -- Hy Turkin, a sportswriter for the New York Daily News, and Cy Thompson, a Broadway musician and statistics fan, publish the "Official Encyclopedia of Baseball," the game's first historical register. Because data was so hard to find at that time, the only statistics included are games played, batting average for hitters and won-lost records for pitchers.

        1952 -- Topps adds full statistics lines on the back of their annual baseball cards.

        1953-63 -- George Lindsey, a Canadian military officer, spends 10 years blowing off his command and family life to apply sophisticated statistical analysis to baseball's numbers. Publishes two seminal articles in military journal Operations Research that examine the benefits and costs of various strategies (steals, sacrifices, intentional walks, etc.) through his unique base-out matrix.

        1960 -- Harvard University professor William Gamson begins a new baseball pool called Baseball Seminar, a forerunner of modern fantasy baseball.

        1964 -- Earnshaw Cook, a retired metallurgist and consultant in the development of the atom bomb, publishes the first full-length sabermetric book, "Percentage Baseball."

        1969 -- "The Baseball Encyclopedia," baseball's first comprehensive historical register, debuts on Aug. 28. The 2,338-page behemoth includes at least 17 statistics for each player each year -- dating all the way back to 1876. The New York Times raves, "It's the book I would take with me to prison." A massive technological undertaking, it is the first trade book in the United States entirely typeset by computers.

        1970 -- Harlan Mills, a software engineer, and his brother Eldon invent a new statistic called "player win average," based on how everything a hitter, pitcher or fielder does affects the probability of his team eventually winning a game. The Mills brothers release their method in an obscure book, "Player Win Averages."

        1971 -- Society for American Baseball Research forms in Cooperstown, N.Y.

        1977 -- Unknown Kansan Bill James self-publishes his first "Baseball Abstract."

        1979 -- Astros president Tal Smith, a statistics buff from childhood, hires Steve Mann as baseball's first modern stat analyst for a major league club.

        1980 -- New York writer Dan Okrent devises the rules for The Rotisserie League Baseball Association. Feature on it in Inside Sports magazine in May 1981 kick-starts fantasy craze.

        1981 -- STATS Inc. begins operation by developing "Edge 1.000" computer system to help clubs keep their own specialized statistics.

        1981 -- The Sporting News reports that SABR member Pete Palmer has discovered that Ty Cobb was awarded two too many hits in 1910, giving him just 4,189 for his career (and meaning he should have lost the AL batting race to Nap Lajoie). With Pete Rose in the midst of his Cobb pursuit, MLB ignores overwhelming proof that a mistake was made.

        1981 -- Dan Okrent's profile of Bill James in Sports Illustrated introduces sabermetrics to the masses. Ballantine wins bidding war to nationally publish James' "Baseball Abstract," which soon becomes an annual bestseller.

        1982 -- San Francisco NPR radio voice Eric Walker publishes "The Sinister First Baseman," a collection of baseball essays, many of them building a new statistical philosophy based on power and on-base percentage. This catches the eye of young Oakland executive Sandy Alderson, who ultimately hires Walker as a consultant. The A's philosophy is born.

        1982 -- USA Today begins publication in September with express purpose of bringing more statistics to sports fans.

        1983 -- Oakland's Steve Boros becomes the first manager to openly praise computer data for helping him make strategic decisions.

        1984 -- Prompted by the Pete Palmer's and John Thorn's "The Hidden Game of Baseball," a comprehensive analysis of new statistics, The New York Times starts publishing a weekly box with a new statistic: On-base plus slugging percentage (OPS).

        1985 -- Facing bankruptcy, STATS Inc. shifts focus from developing software for teams to keeping and distributing statistics for the public.

        1989 -- "Total Baseball" debuts as rival to "The Baseball Encyclopedia."

        1989 -- Retrosheet begins massive compilation and online publishing of old box scores and play-by-plays, allowing droves of historical research never before possible. Current holdings have swooned to almost every game from the mid-'60s onward, and thousands more before that.

        1990 -- USA Today, with STATS Inc., overhauls box score to include all batters' walks, strikeouts, men left on base, and updated batting average. Also included: pitch counts and a new statistic called "holds."

        1990 -- Journeyman Billy Beane hired as advance scout by A's, where he soon reads Eric Walker's on-base manifesto, "Winning Baseball." Never looks at baseball the same way again.

        1994 -- STATS Inc. revolutionizes statistics delivery by updating its AOL box scores and statistics during games, pitch-by-pitch. Real-time statistics delivery ultimately draws lawsuit from NBA and other sports leagues, claiming they violate broadcast licenses.

        1996 -- Baseball Prospectus begins publication of annual book and Web site; soon introduces statistics community to Value Over Replacement Level, PECOTA, Pitcher Abuse points and more.

        1997 -- After losing decision of U.S. District Court, STATS wins appeal of NBA lawsuit and secures right to disseminate statistics in real time.

        1999 -- After 22 years of fighting among SABR members, MLB and the Elias Sports Bureau, Bud Selig announces that Hack Wilson's record RBI total from 1930 is being officially changed from 190 to 191.

        2000 -- John Dewan sells STATS Inc. to News Corp. for $45 million.

        2001 -- Voros McCracken publishes his "Defense Independent Pitching Stats" system, which suggests that pitchers have little influence over whether batted balls fall in for hits or are turned into outs by their defense. Hired following year by Red Sox as statistical analyst.

        2001 -- Bill James unveils Win Shares system. Hired following year by Red Sox as baseball-operations advisor.

        2003 -- MLB.com decides to begin outfitting stadiums with sophisticated camera systems to capture pitch and throw velocities, runner speeds, batted-ball trajectories and more, in part to build an entire new set of fielding statistics, ETA 2006.

        2004 -- Alan Schwarz writes "The Numbers Game," the first full-length history of baseball statistics. Hope you like it.
        Alan Schwarz is the senior writer of Baseball America and a regular contributor to ESPN.com. His new book, "The Numbers Game: Baseball's Lifelong Fascination With Statistics," is published by St. Martin's Press and can be ordered on Alan's Web site.
        Strikeouts are boring! Besides that, they're fascist. Throw some ground balls - it's more democratic.-Crash Davis

        Comment


        • #5
          Thanks for that info. Ohhh, the hours I played All Star Baseball, c. 1941.

          Comment


          • #6
            Originally posted by leewileyfan View Post
            Thanks for that info. Ohhh, the hours I played All Star Baseball, c. 1941.
            Is this what you are referring to, leewileyfan? It looks like loads of fun.

            http://baseballgames.dreamhosters.com/CadacoASB.htm
            Last edited by Honus Wagner Rules; 09-13-2013, 06:35 PM.
            Strikeouts are boring! Besides that, they're fascist. Throw some ground balls - it's more democratic.-Crash Davis

            Comment


            • #7
              Originally posted by Honus Wagner Rules View Post
              Is this what you are referring to, leewileyfan? It looks like loads of fun.

              http://baseballgames.dreamhosters.com/CadacoASB.htm
              YES!!! YES!!! YES!!! That is the game I wasted years of my teens playing!!! The 1989 version with the pictures on them! It came with useless pitcher discs too, but they were only figure heads and couldn't factor into the outcome at all. Anyway, I loved that game and seriously played it hours at a time, keeping the stats of my players I would mix up and divide into various teams. Oh wow. I want it again. I can't, I have to finish my degree and I am so close. If I had that, I would never see my wife and son again and have to drop out of school. I'd be living with nothing but my motorcycle, baseball gear, and that game under a bridge somewhere in Austin. Doesn't actually sound all that bad now that I say it...
              "It ain't braggin' if you can do it." Dizzy Dean

              Comment


              • #8
                Originally posted by Honus Wagner Rules View Post
                Is this what you are referring to, leewileyfan? It looks like loads of fun.

                http://baseballgames.dreamhosters.com/CadacoASB.htm
                Yep. But I seem to recall a squarish version, with a spin arrow and color-coded enties around the perimeter. Each card was a player; and the perimeter areas were consistent with player performance.

                In fact, we were inspired by these games to create our own home versions. I recall assembling my own as follows:

                1. Get Dad's dress shirt [lightly starched] from the laundry.

                2. Get that cardboard insert that preserved the length and width of the laundered shirt. That was to be cut-to-suit a MLB playing surface. You cut an arc to describe the outfield, having already drawn the infield area.

                3. Now get one [two, if you're into grand stadium design], of the curved cardboard inserts in Dad's laundered shirt collar. Once you have completed entering variegated drawn forms to cover the playing center for batted ball events of all sorts, including sacrifice flies and bunts, errors and stolen bases [if the situation allowed], you were now ready to glue that vertical arc to make your outfield wall. If we got ambitious, that would be green [Crayola] with yellow [Crayola] distance markers.

                4. Play was reallt evolved and sophisticated. An Eberhart No.2 pencil [or your index finger] would "bat" the ball [a bit of eraser; I always found a modest round paper spitball was "truer"] - landing in a game event. If it landed "on-the-line, it was a strike. If it kept landing on the line after two strikes, well, it was like Luke Appling spoiling the ones he didn't like. We had separate slots marked K.

                We got very well practised at this. Our leagues and games and scores and stats were every bit like MLB results as the published games.

                [I guess we were 8-13 doing this, when not playing sandlot ball for real].

                Thanks for the memories.

                Comment


                • #9
                  Originally posted by Honus Wagner Rules View Post
                  An Alan Schwarz 2004 article with a time line of baseball statistics.
                  Great--thanks for this! I've got to get my hands on Schwarz's book. Has anybody here read it?

                  Comment


                  • #10
                    Originally posted by Jim Bouldin View Post
                    Great--thanks for this! I've got to get my hands on Schwarz's book. Has anybody here read it?
                    Yes, I've read the book. It has a treasure trove of historical info starting with Henry Chadwick known as The Father of Baseball.
                    Strikeouts are boring! Besides that, they're fascist. Throw some ground balls - it's more democratic.-Crash Davis

                    Comment


                    • #11
                      Just went on ebay and saw they have a good deal of these All Star Game sets from Cadaco. Lots of the 1960s sets, that appear to have most if not all of the discs, one from 1970, and my old one from 1989! I just may buy that '89 one, just to have to play with my son when he gets a few years older. Of course, my dumb arse may end up buying a bunch of these from over the years and have games of stars from different eras! Oh no. Not going to finish my degree now. I knew it. Never should have looked at that link HWR provided. Thanks man. I'll have a blast living under that overpass by downtown Austin with my bike, ball gear, and these games!
                      "It ain't braggin' if you can do it." Dizzy Dean

                      Comment


                      • #12
                        Originally posted by Honus Wagner Rules View Post
                        1953-63 -- George Lindsey, a Canadian military officer, spends 10 years blowing off his command and family life to apply sophisticated statistical analysis to baseball's numbers.
                        Whammo!! Looks like it's not opinion-free...

                        Comment


                        • #13
                          Originally posted by Jim Bouldin View Post
                          Whammo!! Looks like it's not opinion-free...
                          I spent over a decade on active duty here, and wonder how he was able to pull off the "blowing off his command" part of that to mess around with statistical analysis! Awesome!
                          "It ain't braggin' if you can do it." Dizzy Dean

                          Comment


                          • #14
                            The origins of the boxscore. (Baseball Magazine 1925)

                            http://www.npr.org/templates/story/s...ryId=106891539

                            Chadwick boxscore.jpg
                            Last edited by Honus Wagner Rules; 09-18-2013, 07:52 PM.
                            Strikeouts are boring! Besides that, they're fascist. Throw some ground balls - it's more democratic.-Crash Davis

                            Comment


                            • #15
                              Originally posted by Honus Wagner Rules View Post
                              An radio interview During the interview Kenny was talking about how for a long time certain plays on the field don't give any statistical credit to a batter. He specifically mentioned when a batter, with a man on first, doubles to advance the lead runner to third. The next batter could hit a dribbler to the first baseman and get an RBI while the batter who hit the double gets nothing. Kenny then mentions that back "around 1910" they attempted to give credit to a batter for advancing runners. He said it was called it "base runners advanced". Kenny further stated that it didn't catch on because they had trouble tracking the base runners. Has anyone heard about this before? I'll do some research as well.
                              This is the entire purpose of Linear Weights, covered by Palmer-Thorn in The Hidden Game of Baseball. On pages 64-67, the runner advancement values of LWTS are spelled out as resulting from Palmer's 1978 simulation of all gamed played between 1901 and 1978.

                              In recent years, some have argued for Base Runs [BsR]; while with today's powerful computers, the actual application of BEFORE and AFTER each plate appearance via the 24 Grid Base-Out situations should be a piece-of-cake for a capable programer.

                              Comment

                              Ad Widget

                              Collapse
                              Working...
                              X