{"id":497,"date":"2021-05-01T12:00:25","date_gmt":"2021-05-01T12:00:25","guid":{"rendered":"https:\/\/falk.syr.edu\/sport-analytics\/?p=497"},"modified":"2021-06-22T01:22:07","modified_gmt":"2021-06-22T01:22:07","slug":"analyzing-trades-in-major-league-baseball-using-war","status":"publish","type":"post","link":"https:\/\/falk.syr.edu\/sport-analytics\/2021\/05\/01\/analyzing-trades-in-major-league-baseball-using-war\/","title":{"rendered":"Analyzing Trades in Major League Baseball Using WAR"},"content":{"rendered":"<h2 class=\"wp-block-heading\" id=\"s:by-cameron-mitchell-syracuse-university\">By Cameron Mitchell &#8211; Syracuse University &#8217;21<span class=\"anchor\" aria-hidden=\"true\"><a href=\"#s:by-cameron-mitchell-syracuse-university\" aria-label=\"Link to section 'By Cameron Mitchell &#8211; Syracuse University &#8217;21'\" tabindex=\"-1\"> Link<\/a><\/span><\/h2>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"s:abstract\">Abstract<span class=\"anchor\" aria-hidden=\"true\"><a href=\"#s:abstract\" aria-label=\"Link to section 'Abstract'\" tabindex=\"-1\"> Link<\/a><\/span><\/h2>\n\n\n\n<p>Every year in Major League Baseball, dozens upon dozens of players are traded from team to team. Teams contending for a championship may add a veteran player on an expiring contract, while teams who are struggling may try to acquire young talent to build for the future.<\/p>\n\n\n\n<p>In this research, I analyzed all trades involving major league players from 2009-2020. In total, close to 300 trades were analyzed. WAR models were created for batters, starting pitchers, and pitchers who are relievers. These models were then used to create predicted WAR values for each player involved in a trade, and trades were then assessed based on both WAR and predicted WAR obtained by each team involved.<\/p>\n\n\n\n<p>All of the data obtained in this study came from the leaderboards on FanGraphs. 
A few additional metrics were derived from existing FanGraphs stats.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"s:introduction\">Introduction<span class=\"anchor\" aria-hidden=\"true\"><a href=\"#s:introduction\" aria-label=\"Link to section 'Introduction'\" tabindex=\"-1\"> Link<\/a><\/span><\/h2>\n\n\n\n<p>The overall goal of my research was to determine which teams have been the best in baseball at \u201cwinning\u201d trades over the last decade-plus. To do this, I created a predicted WAR metric for every player in my dataset. WAR, which stands for Wins Above Replacement, is widely regarded as one of the best metrics for measuring the overall value of a baseball player. It estimates the number of wins a player adds to his team compared with a replacement-level player.<\/p>\n\n\n\n<p>To assess the different types of baseball players adequately, I built three models: one for batters, one for starting pitchers, and one for relievers. Each model used WAR from FanGraphs as the dependent variable. Each model also had a minimum playing-time requirement that a player had to meet to be included in the analysis. This was done to eliminate low-sample outliers whose stats could have skewed the results of the model. 
In particular, pitchers batting and position players pitching were the two key groups that needed to be excluded. The specifics of each model are listed below.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"s:models\">Models<span class=\"anchor\" aria-hidden=\"true\"><a href=\"#s:models\" aria-label=\"Link to section 'Models'\" tabindex=\"-1\"> Link<\/a><\/span><\/h2>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"s:batter-model-can-be-seen-below\">Batter Model (can be seen below as an example)<span class=\"anchor\" aria-hidden=\"true\"><a href=\"#s:batter-model-can-be-seen-below\" aria-label=\"Link to section 'Batter Model (can be seen below as an example)'\" tabindex=\"-1\"> Link<\/a><\/span><\/h3>\n\n\n\n<p><strong>Minimum requirement:<\/strong>&nbsp;162 plate appearances (about 1 per game)<\/p>\n\n\n\n<p><strong>Win Probability Added (WPA):<\/strong>&nbsp;The sum of the FanGraphs win probability added across all of a batter\u2019s plate appearances in a season. This was included to assess how clutch a batter was throughout the season.<\/p>\n\n\n\n<p><strong>Ground Ball\/Fly Ball Ratio (GB_FB):&nbsp;<\/strong>A batter\u2019s total ground balls divided by his total fly balls throughout the season. This was included to assess a batter\u2019s contact quality.<\/p>\n\n\n\n<p><strong>Speed (Spd):&nbsp;<\/strong>FanGraphs&#8217;s speed rating for a player. This was included to assess a player\u2019s speed.<\/p>\n\n\n\n<p><strong>Walk\/Strikeout Ratio (BB_K):&nbsp;<\/strong>A batter\u2019s total walks divided by his total strikeouts throughout the season. This was included to assess a batter\u2019s plate discipline.<\/p>\n\n\n\n<p><strong>Fielding (FLD):<\/strong>&nbsp;FanGraphs&#8217;s fielding rating for a player. This was included to assess a player\u2019s fielding ability. 
Note that since the FLD rating can be negative, a player who did not play in the field throughout the season was given a 0 for FLD.<\/p>\n\n\n\n<p><strong>Previous Season WAR (prev_WAR):<\/strong>&nbsp;The player\u2019s WAR in the previous season. This was included to control for how good a player was the previous season.<\/p>\n\n\n\n<p><strong>Age:&nbsp;<\/strong>Player\u2019s age, included to control for a player\u2019s age.<\/p>\n\n\n\n<p><strong>Age Squared (Age_sq):&nbsp;<\/strong>Player\u2019s age squared, included to control for the steep decline in player ability with age.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"s:starter-model\">Starter Model<span class=\"anchor\" aria-hidden=\"true\"><a href=\"#s:starter-model\" aria-label=\"Link to section 'Starter Model'\" tabindex=\"-1\"> Link<\/a><\/span><\/h3>\n\n\n\n<p><strong>Minimum requirement:<\/strong>&nbsp;15 innings pitched (about 3 starts)<\/p>\n\n\n\n<p><strong>Leverage Index (pLi):<\/strong>&nbsp;A starter\u2019s average leverage index throughout a given start. This was included to assess a starter\u2019s overall ability; a low leverage index indicates the pitcher did not face many stressful situations during his outing, which suggests he did not allow many baserunners.<\/p>\n\n\n\n<p><strong>Walks\/9 (BB_9):<\/strong>&nbsp;A starter\u2019s walks per 9 innings pitched. This was included to assess a starter\u2019s control.<\/p>\n\n\n\n<p><strong>Strikeouts\/9 (K_9):&nbsp;<\/strong>A starter\u2019s strikeouts per 9 innings pitched. This was included to assess a starter\u2019s ability to strike batters out and remove the luck involved when a ball is put into play.<\/p>\n\n\n\n<p><strong>Home runs\/9 (HR_9):&nbsp;<\/strong>A starter\u2019s home runs allowed per 9 innings pitched. This was included to evaluate a starter\u2019s ability to limit home runs allowed.<\/p>\n\n\n\n<p><strong>Complete Games (CG):&nbsp;<\/strong>A starter\u2019s total number of complete games throughout the season. 
This was included to assess a starter\u2019s ability to pitch deep into games.<\/p>\n\n\n\n<p><strong>Innings Pitched (Start_IP):&nbsp;<\/strong>A starter\u2019s total innings pitched as a starter throughout the season. This was included to assess a pitcher\u2019s ability to stay healthy throughout the season.<\/p>\n\n\n\n<p><strong>Previous Season WAR (prev_WAR):<\/strong>&nbsp;The player\u2019s WAR in the previous season. This was included to control for how good a player was the previous season.<\/p>\n\n\n\n<p><strong>Age:&nbsp;<\/strong>Player\u2019s age, included to control for a player\u2019s age.<\/p>\n\n\n\n<p><strong>Age Squared (Age_sq):&nbsp;<\/strong>Player\u2019s age squared, included to control for the steep decline in player ability with age.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"s:reliever-model\">Reliever Model<span class=\"anchor\" aria-hidden=\"true\"><a href=\"#s:reliever-model\" aria-label=\"Link to section 'Reliever Model'\" tabindex=\"-1\"> Link<\/a><\/span><\/h3>\n\n\n\n<p><strong>Minimum requirement:<\/strong>&nbsp;10 innings pitched (about 10 appearances)<\/p>\n\n\n\n<p><strong>Difference in Leverage Index (diff_LI):<\/strong>&nbsp;A reliever\u2019s leverage index when he enters the game minus his leverage index when he exits the game. This was used to evaluate a reliever&#8217;s ability to enter a game and calm down situations, as a large difference in leverage index indicates that a reliever took a stressful situation and got his team out of it.<\/p>\n\n\n\n<p><strong>Strikeout\/Walk Ratio (K_BB):&nbsp;<\/strong>A reliever&#8217;s total strikeouts divided by his total walks throughout the season. This was included to assess a reliever\u2019s control.<\/p>\n\n\n\n<p><strong>Runners Stranded (RS):&nbsp;<\/strong>The total number of runners a reliever left on base throughout the season. 
This was included to assess a reliever&#8217;s ability to limit damage when he comes into a game by leaving inherited runners and his own runners on base.<\/p>\n\n\n\n<p><strong>Saves (SV):<\/strong>&nbsp;A player\u2019s total saves throughout the season. This was included to assess a player\u2019s ability to close out close games.<\/p>\n\n\n\n<p><strong>Holds (HLD):<\/strong>&nbsp;A player\u2019s total holds throughout the season. This was included to assess a player\u2019s ability to pitch in the late innings of close games.<\/p>\n\n\n\n<p><strong>Home runs\/9 (HR_9):<\/strong>&nbsp;A reliever&#8217;s home runs allowed per 9 innings pitched. This was included to evaluate a reliever&#8217;s ability to limit home runs allowed.<\/p>\n\n\n\n<p><strong>Previous Season WAR (prev_WAR):<\/strong>&nbsp;The player\u2019s WAR in the previous season. This was included to control for how good a player was the previous season.<\/p>\n\n\n\n<p><strong>Age:&nbsp;<\/strong>Player\u2019s age, included to control for a player\u2019s age.<\/p>\n\n\n\n<p><strong>Age Squared (Age_sq):&nbsp;<\/strong>Player\u2019s age squared, included to control for the steep decline in player ability with age.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"s:results\">Results<span class=\"anchor\" aria-hidden=\"true\"><a href=\"#s:results\" aria-label=\"Link to section 'Results'\" tabindex=\"-1\"> Link<\/a><\/span><\/h2>\n\n\n\n<figure 
class=\"wp-block-table\"><table><tbody><tr><td>Variable<\/td><td>Estimate<\/td><td>P-Value<\/td><\/tr><tr><td>WPA<\/td><td>0.697<\/td><td>&lt;2e-16***<\/td><\/tr><tr><td>GB_FB<\/td><td>-0.179<\/td><td>5.61e-10***<\/td><\/tr><tr><td>SPD<\/td><td>0.116<\/td><td>&lt;2e-16***<\/td><\/tr><tr><td>BB_K<\/td><td>0.560<\/td><td>1.41e-12***<\/td><\/tr><tr><td>FLD<\/td><td>0.100<\/td><td>&lt;2e-16***<\/td><\/tr><tr><td>Prev_WAR<\/td><td>0.198<\/td><td>&lt;2e-16***<\/td><\/tr><tr><td>Age<\/td><td>0.027<\/td><td>0.534<\/td><\/tr><tr><td>Age_sq<\/td><td>-0.001<\/td><td>0.097*<\/td><\/tr><\/tbody><\/table><figcaption>Table 1: Batter Model<\/figcaption><\/figure>\n\n\n\n<p>After generating the predicted WAR for each player, I assessed each trade by calculating the total predicted WAR obtained by each team in that trade. Team totals and averages were then calculated and analyzed across the full span of the dataset. The amount that each team received in total and gave away in total, both in actuality and according to my prediction, can be seen below. Teams are color-coded by their primary team color.<\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"817\" src=\"https:\/\/falk.syr.edu\/sport-analytics\/wp-content\/uploads\/sites\/12\/2021\/06\/Screen-Shot-2021-06-09-at-4.02.11-PM-1024x817.png\" alt=\"\" class=\"wp-image-498\" srcset=\"https:\/\/falk.syr.edu\/sport-analytics\/wp-content\/uploads\/sites\/12\/2021\/06\/Screen-Shot-2021-06-09-at-4.02.11-PM-1024x817.png 1024w, https:\/\/falk.syr.edu\/sport-analytics\/wp-content\/uploads\/sites\/12\/2021\/06\/Screen-Shot-2021-06-09-at-4.02.11-PM-300x239.png 300w, https:\/\/falk.syr.edu\/sport-analytics\/wp-content\/uploads\/sites\/12\/2021\/06\/Screen-Shot-2021-06-09-at-4.02.11-PM-768x613.png 768w, https:\/\/falk.syr.edu\/sport-analytics\/wp-content\/uploads\/sites\/12\/2021\/06\/Screen-Shot-2021-06-09-at-4.02.11-PM.png 1298w\" 
sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/figure><\/div>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"s:discussion\">Discussion<span class=\"anchor\" aria-hidden=\"true\"><a href=\"#s:discussion\" aria-label=\"Link to section 'Discussion'\" tabindex=\"-1\"> Link<\/a><\/span><\/h2>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1008\" height=\"696\" src=\"https:\/\/falk.syr.edu\/sport-analytics\/wp-content\/uploads\/sites\/12\/2021\/06\/Screen-Shot-2021-06-09-at-4.02.24-PM.png\" alt=\"\" class=\"wp-image-499\" srcset=\"https:\/\/falk.syr.edu\/sport-analytics\/wp-content\/uploads\/sites\/12\/2021\/06\/Screen-Shot-2021-06-09-at-4.02.24-PM.png 1008w, https:\/\/falk.syr.edu\/sport-analytics\/wp-content\/uploads\/sites\/12\/2021\/06\/Screen-Shot-2021-06-09-at-4.02.24-PM-300x207.png 300w, https:\/\/falk.syr.edu\/sport-analytics\/wp-content\/uploads\/sites\/12\/2021\/06\/Screen-Shot-2021-06-09-at-4.02.24-PM-768x530.png 768w\" sizes=\"(max-width: 1008px) 100vw, 1008px\" \/><\/figure><\/div>\n\n\n\n<p>One of the more interesting results of my research is the relationship between a team\u2019s winning percentage and the difference between the WAR they were predicted to receive and the WAR they were predicted to trade away. As can be seen in the graph above, as a team\u2019s predicted WAR difference increased, their winning percentage also increased. To me, this suggests that my models did a good job of assessing players\u2019 trade values. As for which teams did the best overall, both in reality and according to my models, the Los Angeles Dodgers did the best with a predicted difference of 44.89 WAR and an actual difference of 31.1 WAR. 
As for the worst team, again both in reality and according to my models, it was the Miami Marlins, with an actual difference of -31.1 WAR and a predicted difference of -42.86 WAR.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"s:conclusions\">Conclusions<span class=\"anchor\" aria-hidden=\"true\"><a href=\"#s:conclusions\" aria-label=\"Link to section 'Conclusions'\" tabindex=\"-1\"> Link<\/a><\/span><\/h2>\n\n\n\n<p>I believe my research shows that teams that do well in trades generally do well on the field, as displayed by the graph above. Additionally, I feel that my models did an outstanding job of modeling teams\u2019 success in trades, based on important variables for each of three different player categories. The variables chosen for each position group each represent an important characteristic of that group that MLB teams should always consider. Finally, an extension of this project that I am definitely interested in is expanding my trade dataset to include prospects. Unfortunately, the metrics that I included were challenging to find for prospects, so I could only evaluate trades that involved major league players. Prospect evaluation is an extremely important part of baseball for teams, as prospects represent the future of the sport and teams are often just as concerned about their future as they are about their present. Overall, however, my research still answers my research questions and determines which teams are the best \u201ctraders\u201d in baseball.<\/p>","protected":false},"excerpt":{"rendered":"<p>By Cameron Mitchell &#8211; Syracuse University &#8217;21 Link Abstract Link Every year in Major League Baseball, dozens upon dozens of players are traded from team to team. 
Teams contending for a championship may add a veteran player on an expiring contract, while teams who are struggling may try to acquire young talent to build for&hellip;<\/p>\n","protected":false},"author":52,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[11],"tags":[],"_links":{"self":[{"href":"https:\/\/falk.syr.edu\/sport-analytics\/wp-json\/wp\/v2\/posts\/497"}],"collection":[{"href":"https:\/\/falk.syr.edu\/sport-analytics\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/falk.syr.edu\/sport-analytics\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/falk.syr.edu\/sport-analytics\/wp-json\/wp\/v2\/users\/52"}],"replies":[{"embeddable":true,"href":"https:\/\/falk.syr.edu\/sport-analytics\/wp-json\/wp\/v2\/comments?post=497"}],"version-history":[{"count":2,"href":"https:\/\/falk.syr.edu\/sport-analytics\/wp-json\/wp\/v2\/posts\/497\/revisions"}],"predecessor-version":[{"id":501,"href":"https:\/\/falk.syr.edu\/sport-analytics\/wp-json\/wp\/v2\/posts\/497\/revisions\/501"}],"wp:attachment":[{"href":"https:\/\/falk.syr.edu\/sport-analytics\/wp-json\/wp\/v2\/media?parent=497"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/falk.syr.edu\/sport-analytics\/wp-json\/wp\/v2\/categories?post=497"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/falk.syr.edu\/sport-analytics\/wp-json\/wp\/v2\/tags?post=497"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}