Statistics play an important role in summarizing baseball performance and evaluating players in the sport.
Since the flow of a baseball game has natural breaks to it, and normally players act individually rather than performing in clusters, the sport lends itself to easy record-keeping and statistics. Statistics have been kept for professional baseball since the creation of the National League and American League, now part of Major League Baseball.
The practice of keeping records of player achievements was started in the 19th century by Henry Chadwick.^{[1]} Based on his experience with the sport called cricket, Chadwick devised the predecessors to modern day statistics including batting average, runs scored, and runs allowed.
Traditionally, statistics such as batting average (the number of hits divided by the number of at bats) and earned run average (the average number of earned runs allowed by a pitcher per nine innings) have dominated attention in the statistical world of baseball. However, the recent advent of sabermetrics has created statistics drawing from a greater breadth of player performance measures and playing field variables. Sabermetrics and comparative statistics attempt to provide an improved measure of a player's performance and contributions to his team from year to year, frequently against a statistical performance average.
Comprehensive, historical baseball statistics were difficult for the average fan to access until 1951, when researcher Hy Turkin published The Complete Encyclopedia of Baseball. In 1969, Macmillan Publishing printed its first Baseball Encyclopedia, using a computer to compile statistics for the first time. Known as "Big Mac", the encyclopedia became the standard baseball reference until 1988, when Total Baseball was released by Warner Books using more sophisticated technology. The publication of Total Baseball led to the discovery of several "phantom ballplayers", such as Lou Proctor, who did not belong in official record books and were removed.^{[2]}
Use of statistics[edit]
Throughout modern baseball, a few core statistics have been traditionally referenced – batting average, RBI, and home runs. To this day, a player who leads the league in all of these three statistics earns the "Triple Crown." For pitchers, wins, ERA, and strikeouts are the most often-cited statistics, and a pitcher leading his league in these statistics may also be referred to as a "Triple Crown" winner. General managers and baseball scouts have long used the major statistics, among other factors and opinions, to understand player value. Managers, catchers and pitchers use the statistics of batters of opposing teams to develop pitching strategies and set defensive positioning on the field. Managers and batters study opposing pitcher performance and motions in attempting to improve hitting.
Some sabermetric statistics have entered the mainstream baseball world that measure a batter's overall performance including on-base plus slugging, commonly referred to as OPS. OPS adds the hitter's on-base percentage (number of times reached base by any means divided by total plate appearances) to his slugging percentage (total bases divided by at-bats). Some argue that the OPS formula is flawed and that more weight should be shifted towards OBP (on-base percentage).^{[2]} The statistic wOBA (weighted on-base average) attempts to correct for this.
OPS is also useful when determining a pitcher's level of success. "Opponent On-base Plus Slugging" (OOPS) is becoming a popular tool to evaluate a pitcher's actual performance. When analyzing a pitcher's statistics, some useful categories to consider include K/9IP (strikeouts per nine innings), K/BB (strikeouts per walk), HR/9 (Home runs per nine innings), WHIP (walks plus hits per inning pitched) and OOPS (opponent on-base plus slugging).
However, since 2001, more emphasis has been placed on Defense-Independent Pitching Statistics, including Defense-Independent ERA (dERA), in an attempt to evaluate a pitcher's performance regardless of the strength of the defensive players behind him.
All of the above statistics may be used in certain game situations. For example, a certain hitter's ability to hit left-handed pitchers might incline a manager to increase his opportunities to face left-handed pitchers. Other hitters may have a history of success against a given pitcher (or vice versa), and the manager may use this information to create a favorable match-up. Broadcast commentators often refer to this as "playing the percentages".
Commonly used statistics[edit]
Most of these terms also apply to softball. Commonly used statistics with their abbreviations are explained here. The explanations below are for quick reference and do not fully or completely define the statistic; for the strict definition, see the linked article for each statistic.
Batting statistics[edit]
1B – Single: hits on which the batter reaches first base safely without the contribution of a fielding error.
2B – Double: hits on which the batter reaches second base safely without the contribution of a fielding error.
3B – Triple: hits on which the batter reaches third base safely without the contribution of a fielding error.
AB – At bat: Plate appearances, not including bases on balls, being hit by pitch, sacrifices, interference, or obstruction.
BA – Batting average (also abbreviated AVG): hits divided by at bats (H/AB)
BB – Base on balls (also called a "walk"): hitter not swinging at four pitches called out of the strike zone and awarded first base.
BABIP – Batting average on balls in play: frequency at which a batter reaches a base after putting the ball in the field of play. Also a pitching category.
GPA – Gross Production Average: 1.8 times on-base percentage plus slugging percentage, divided by four
GS – Grand Slam: a home run with the bases loaded, resulting in four runs scoring, and four RBI credited to the batter.
H – Hits: times reached base because of a batted, fair ball without error by the defense
HBP – Hit by pitch: times touched by a pitch and awarded first base as a result
HR – Home runs: hits on which the batter successfully touched all four bases, without the contribution of a fielding error.
ITPHR – Inside-the-park home run:hits on which the batter successfully touched all four bases, without the contribution of a fielding error or the ball going outside the ball park.
IBB – Intentional base on balls: times awarded first base on balls (see BB above) deliberately thrown by the pitcher. Also known as IW (intentional walk).
ISO – Isolated power: a hitter's ability to hit for extra bases, calculated by subtracting batting average from slugging percentage
K – Strike out (also abbreviated SO): number of times that a third strike is taken or swung at and missed, or bunted foul. Catcher must catch the third strike or batter may attempt to run to first base.
LOB – Left on base: number of runners neither out nor scored at the end of an inning.
OBP – On-base percentage: times reached base (H + BB + HBP) divided by at bats plus walks plus hit by pitch plus sacrifice flies (AB + BB + HBP + SF).
SB% – Stolen base percentage: the percentage of bases stolen successfully. (SB) divided by (SBA)(stolen bases attempted).
SF – Sacrifice fly: Fly balls hit to the outfield which although caught for an out, allow a baserunner to advance
SH – Sacrifice hit: number of sacrifice bunts which allow runners to advance on the basepaths
SLG – Slugging average: total bases achieved on hits divided by at-bats (TB/AB)
TA – Total average: total bases, plus walks, plus hit by pitch, plus steals, minus caught stealing divided by at bats, minus hits, plus caught stealing, plus grounded into double plays [(TB + BB + HBP + SB - CS)/(AB - H + CS + GIDP)]
TB – Total bases: one for each single, two for each double, three for each triple, and four for each home run [H + 2B + (2 * 3B) + (3 * HR)] or [1B + (2 * 2B) + (3 * 3B) + (4 * HR)]
TOB – Times on base: times reaching base as a result of hits, walks, and hit-by-pitches (H + BB + HBP)
XBH – Extra base hits: total hits greater than singles (2B + 3B + HR)
SB – Stolen base: number of bases advanced by the runner while the ball is in the possession of the defense.
CS – Caught stealing: times tagged out while attempting to steal a base
SBA/ATT – Stolen base attempts: total number of times the player has attempted to steal a base (SB+CS)
DI – Defensive Indifference: if the catcher does not attempt to throw out a runner (usually because the base would be insignificant), the runner is not awarded a steal. Scored as a fielder's choice.
R – Runs scored: times reached home plate legally and safely
UBR – Ultimate base running: a metric that assigns linear weights to every individual baserunning event in order to measure the impact of a player's baserunning skill
Pitching statistics[edit]
BB – Base on balls (also called a "walk"): times pitching four balls, allowing the batter to take first base
BK – Balk: number of times pitcher commits an illegal pitching action while in contact with the pitching rubber as judged by umpire, resulting in baserunners advancing one base
BS – Blown save: number of times entering the game in a save situation, and being charged the run (earned or not) which eliminates his team's lead
CERA – Component ERA: an estimate of a pitcher's ERA based upon the individual components of his statistical line (K, H, 2B, 3B, HR, BB, HBP)
CG – Complete game: number of games where player was the only pitcher for his team
DICE – Defense-Independent Component ERA: an estimate of a pitcher's ERA based upon the defense-independent components of his statistical line (K, HR, BB, HBP)
ER – Earned run: number of runs that did not occur as a result of errors or passed balls
ERA – Earned run average: total number of earned runs (see "ER" above), multiplied by 9, divided by innings pitched
ERA+ – Adjusted ERA+: earned run average adjusted for the ballpark and the league average
FIP – Fielding independent pitching: a metric, scaled to resemble an ERA, that focuses on events within the pitcher's control – home runs, walks, and strikeouts
xFIP: This variant substitutes a pitcher's own home run percentage with the league average
FPOM – First Pitch Outs Made: Number of outs earned where the batter ground or flies out on the first pitch.
G – Games (AKA "appearances"): number of times a pitcher pitches in a season
GF – Games finished: number of games pitched where player was the final pitcher for his team as a relief pitcher
L – Loss: number of games where pitcher was pitching while the opposing team took the lead, never lost the lead, and went on to win
LOB% – Left-on-base percentage: LOB% represents the percentage of baserunners a pitcher does not allow to score. LOB% tends to regress toward 70–72% over time, so unusually high or low percentages could indicate that pitcher's ERA could be expected to rise or lower in the future. An occasional exception to this logic is a pitcher with a very high strikeout rate.^{[3]}
SV – Save: number of games where the pitcher enters a game led by the pitcher's team, finishes the game without surrendering the lead, is not the winning pitcher, and either (a) the lead was three runs or fewer when the pitcher entered the game; (b) the potential tying run was on base, at bat, or on deck; or (c) the pitcher pitched three or more innings
SVO – Save opportunity: When a pitcher 1) enters the game with a lead of three or fewer runs and pitches at least one inning, 2) enters the game with the potential tying run on base, at bat, or on deck, or 3) pitches three or more innings with a lead and is credited with a save by the official scorer
W – Win: number of games where pitcher was pitching while his team took the lead and went on to win, also the starter needs to pitch at least 5 innings of work (also related: winning percentage)
whiff rate: a term, usually used in reference to pitchers, that divides the number of pitches swung at and missed by the total number of swings in a given sample. If a pitcher throws 100 pitches at which batters swing, and the batters fail to make contact on 26 of them, the pitcher's whiff rate is 26%.
WP – Wild pitches: charged when a pitch is too high, low, or wide of home plate for the catcher to field, thereby allowing one or more runners to advance or score
Fielding statistics[edit]
A – Assists: number of outs recorded on a play where a fielder touched the ball, except if such touching is the putout
CI – Catcher's Interference (e.g., catcher makes contact with bat)
DP – Double plays: one for each double play during which the fielder recorded a putout or an assist.
E – Errors: number of times a fielder fails to make a play he should have made with common effort, and the offense benefits as a result
FP – Fielding percentage: total plays (chances minus errors) divided by the number of total chances
INN – Innings: number of innings that a player is at one certain position
PB – Passed ball: charged to the catcher when the ball is dropped and one or more runners advance
PO – Putout: number of times the fielder tags, forces, or appeals a runner and he is called out as a result
RF – Range factor: 9*(putouts + assists)/innings played. Used to determine the amount of field that the player can cover
TC – Total chances: assists plus putouts plus errors
TP – Triple play: one for each triple play during which the fielder recorded a putout or an assist
UZR – Ultimate zone rating: the ability of a player to defend an assigned "zone" of the field compared to an average defensive player at his position
Overall player value[edit]
VORP – Value over replacement player: a statistic that calculates a player's overall value in comparison to a "replacement-level" player. There are separate formulas for players and pitchers
Win shares: a complex metric that gauges a player's overall contribution to his team's wins
WAR – Wins above replacement: a non-standard formula to calculate the number of wins a player contributes to his team over a "replacement-level player"
General statistics[edit]
G – Games played: number of games where the player played, in whole or in part
It is difficult to determine quantitatively what is considered to be a "good" value in a certain statistical category, and qualitative assessments may lead to arguments. Using full-season statistics available at the Official Site of Major League Baseball^{[4]} for the 2004 through 2011 seasons, the following tables show top ranges in various statistics, in alphabetical order. For each statistic, two values are given:
Top5: the top five players bettered this value in all of the reported seasons
Best: this is the best of all of the players for all of the reported seasons
^Palmer, Pete; Paul Adomites; David Nemec; Matthew D. Greenberger; Dan Schlossberg; Dick Johnson; Mike Tully (2001). "Birth of the Game". Cooperstown: Hall of Fame Players. Lincolnwood, Illinois: Publications International. p. 21. ISBN0-7853-4530-2.
^ ^{a}^{b}Pete Palmer and Gary Gillette, ed. (2005). "Introduction". The 2005 ESPN Baseball Encyclopedia (1st Edition ed.). New York: Sterling. ISBN1-4027-2568-X.
Albert, Jim, and Jay M. Bennett. Curve Ball: Baseball, Statistics, and the Role of Chance in the Game. New York: Copernicus Books, 2001. ISBN 0-387-98816-5. A book on new statistics for baseball.
Alan Schwarz, The Numbers Game: Baseball's Lifelong Fascination with Statistics (New York: St. Martin's, 2005). ISBN 0-312-32223-2.