<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Beatpaths &#187; Beatpath Tech</title>
	<atom:link href="http://beatpaths.com/category/beatpath-tech/feed/" rel="self" type="application/rss+xml" />
	<link>http://beatpaths.com</link>
	<description>The Winning Ways of Winners</description>
	<lastBuildDate>Sun, 27 Nov 2011 04:35:00 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>Backtesting, Revised</title>
		<link>http://beatpaths.com/2009/09/24/backtesting-revised/</link>
		<comments>http://beatpaths.com/2009/09/24/backtesting-revised/#comments</comments>
		<pubDate>Fri, 25 Sep 2009 06:54:24 +0000</pubDate>
		<dc:creator>ThunderThumbs</dc:creator>
				<category><![CDATA[Beatpath Tech]]></category>
		<category><![CDATA[NFL]]></category>

		<guid isPermaLink="false">http://beatpaths.com/?p=371</guid>
		<description><![CDATA[I messed up a bit on the previous bit of backtesting, although the way I messed up was informative. The previous round of backtesting tested the beatflukes variant (not vanilla), and each year had all the game results of the previous seasons in my dataset. So, 2006&#8242;s results was using a beatpath graph based off [...]]]></description>
			<content:encoded><![CDATA[<p>I messed up a bit on the previous bit of backtesting, although the way I messed up was informative. The previous round of backtesting tested the beatflukes variant (not vanilla), and each year had all the game results of the previous seasons in my dataset. So, 2006&#8242;s results was using a beatpath graph based off of game outcomes from 2003 &#8211; 2006.</p>
<p>I&#8217;ve managed to configure the backtesting script a bit more correctly now, and I&#8217;ve also included a first stab of the tiebreaker method that uses paths in and paths out, which is what Moose uses for his non-weighted rankings. Below is the grid for the vanilla variant.</p>
<p></p>
<table border="1" cellpadding="1" cellspacing="1">
<tbody>
<tr>
<td>Year</td>
<td>Random</td>
<td>uNet</td>
<td>Prev. Week</td>
<td>Beatloop Str.</td>
<td>Fractional</td>
<td>BPower/Win/Loop</td>
<td>Str. of Beatwins</td>
<td>Bucklin</td>
<td>UPower</td>
<td>UPower/Loop</td>
<td>UNet-Lookahead</td>
<td>Paths In/Out</td>
</tr>
<tr>
<td>2004 (50-33)</td>
<td>149-118</td>
<td>160-107</td>
<td>153-114</td>
<td>158-109</td>
<td>153-114</td>
<td>162-105</td>
<td>158-109</td>
<td>156-111</td>
<td>153-114</td>
<td>161-106</td>
<td>153-114</td>
<td>157-110</td>
</tr>
<tr>
<td>2005 (71-30)</td>
<td>160-107</td>
<td>167-100</td>
<td>167-100</td>
<td>164-103</td>
<td>165-102</td>
<td>162-105</td>
<td>169-98</td>
<td>168-99</td>
<td>165-102</td>
<td>162-105</td>
<td>167-100</td>
<td>167-100</td>
</tr>
<tr>
<td>2006 (56-46)</td>
<td>147-120</td>
<td>154-113</td>
<td>151-116</td>
<td>150-117</td>
<td>154-113</td>
<td>156-111</td>
<td>155-112</td>
<td>154-113</td>
<td>154-113</td>
<td>153-114</td>
<td>151-116</td>
<td>150-117</td>
</tr>
<tr>
<td>2007 (89-35)</td>
<td>166-101</td>
<td>170-97</td>
<td>160-107</td>
<td>162-105</td>
<td>168-99</td>
<td>170-97</td>
<td>166-101</td>
<td>170-97</td>
<td>168-99</td>
<td>169-98</td>
<td>160-107</td>
<td>173-94</td>
</tr>
<tr>
<td>2008 (73-44)</td>
<td>156-110</td>
<td>152-114</td>
<td>157-109</td>
<td>158-108</td>
<td>152-114</td>
<td>147-119</td>
<td>154-112</td>
<td>159-107</td>
<td>152-114</td>
<td>148-118</td>
<td>157-109</td>
<td>152-114</td>
</tr>
<tr>
<td>TOTAL: 64.33%</td>
<td>58.32%</td>
<td>60.19%</td>
<td>59.07%</td>
<td>59.37%</td>
<td>59.37%</td>
<td>59.74%</td>
<td>60.12%</td>
<td>60.49%</td>
<td>59.37%</td>
<td>59.45%</td>
<td>59.07%</td>
<td>59.90%</td>
</tr>
</tbody>
</table>
<p>And here are all the tiebreaker variants for single-year data sets using beatflukes graphs:</p>
<table border="1" cellpadding="1" cellspacing="1">
<tbody>
<tr>
<td>Year</td>
<td>Random</td>
<td>uNet</td>
<td>Prev. Week</td>
<td>Beatloop Str.</td>
<td>Fractional</td>
<td>BPower/Win/Loop</td>
<td>Str. of Beatwins</td>
<td>Bucklin</td>
<td>UPower</td>
<td>UPower/Loop</td>
<td>UNet-Lookahead</td>
<td>Paths In/Out</td>
</tr>
<tr>
<td>2004 (54-34)</td>
<td>151-116</td>
<td>157-110</td>
<td>152-115</td>
<td>161-106</td>
<td>156-111</td>
<td>161-106</td>
<td>159-108</td>
<td>156-111</td>
<td>156-111</td>
<td>158-109</td>
<td>152-115</td>
<td>158-109</td>
</tr>
<tr>
<td>2005 (76-35)</td>
<td>152-115</td>
<td>169-98</td>
<td>168-99</td>
<td>168-99</td>
<td>168-99</td>
<td>165-102</td>
<td>172-95</td>
<td>166-101</td>
<td>168-99</td>
<td>162-105</td>
<td>168-99</td>
<td>165-102</td>
</tr>
<tr>
<td>2006 (67-53)</td>
<td>146-121</td>
<td>145-122</td>
<td>151-116</td>
<td>148-119</td>
<td>148-119</td>
<td>153-114</td>
<td>146-121</td>
<td>148-119</td>
<td>148-119</td>
<td>150-117</td>
<td>151-116</td>
<td>147-120</td>
</tr>
<tr>
<td>2007 (89-37)</td>
<td>161-106</td>
<td>172-95</td>
<td>166-101</td>
<td>164-103</td>
<td>167-100</td>
<td>175-92</td>
<td>166-101</td>
<td>167-100</td>
<td>167-100</td>
<td>171-96</td>
<td>166-101</td>
<td>173-94</td>
</tr>
<tr>
<td>2008 (80-49)</td>
<td>154-112</td>
<td>156-110</td>
<td>159-107</td>
<td>161-105</td>
<td>156-110</td>
<td>153-113</td>
<td>159-107</td>
<td>165-101</td>
<td>156-110</td>
<td>152-114</td>
<td>159-107</td>
<td>155-111</td>
</tr>
<tr>
<td>TOTAL: 63.76%</td>
<td>57.27%</td>
<td>59.90%</td>
<td>59.67%</td>
<td>60.12%</td>
<td>59.60%</td>
<td>60.49%</td>
<td>60.12%</td>
<td>60.12%</td>
<td>59.60%</td>
<td>59.45%</td>
<td>59.67%</td>
<td>59.82%</td>
</tr>
</tbody>
</table>
<p>So, conclusions?  First it&#8217;s odd and a bit discouraging that the accuracy for a season is pretty much indistinguishable from accuracy based off of multiple seasons of games all mixed together.  As for the accuracy levels, I found one resource saying that Isaacson-Tarbell was 158-97-1 in 2008 &#8211; in the regular season.  That&#8217;s 61.9%.  None of the variants beat that, but the actual beatpath picks (a subset of all the games) do beat that.  But, who&#8217;s to say that Isaacson-Tarbell wouldn&#8217;t pick that same subset even more accurately?  At any rate, further investigation is required to see if some combination of beatpaths picks, better-record, and home team could beat Isaacson-Tarbell.</p>
]]></content:encoded>
			<wfw:commentRss>http://beatpaths.com/2009/09/24/backtesting-revised/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Backtesting</title>
		<link>http://beatpaths.com/2009/09/22/backtesting/</link>
		<comments>http://beatpaths.com/2009/09/22/backtesting/#comments</comments>
		<pubDate>Wed, 23 Sep 2009 02:23:52 +0000</pubDate>
		<dc:creator>ThunderThumbs</dc:creator>
				<category><![CDATA[Beatpath Tech]]></category>
		<category><![CDATA[NFL]]></category>

		<guid isPermaLink="false">http://beatpaths.com/?p=366</guid>
		<description><![CDATA[I&#8217;ve managed to fire up my old wonky backtesting system. It doesn&#8217;t have a lot of ability yet, but it gives some interesting information. Here&#8217;s what I&#8217;ve found. Year Random uNet Prev. Week Beatloop Str. Fractional BPower/Win/Loop Str. of Beatwins Bucklin UPower UPower/Loop UNet-Lookahead 2004 (84-57) 150-117 166-101 157-110 160-107 164-103 167-100 166-101 161-106 164-103 [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve managed to fire up my old wonky backtesting system. It doesn&#8217;t have a lot of ability yet, but it gives some interesting information. Here&#8217;s what I&#8217;ve found.</p>
<table border="1" cellpadding="1" cellspacing="1">
<tbody>
<tr>
<td>Year</td>
<td>Random</td>
<td>uNet</td>
<td>Prev. Week</td>
<td>Beatloop Str.</td>
<td>Fractional</td>
<td>BPower/Win/Loop</td>
<td>Str. of Beatwins</td>
<td>Bucklin</td>
<td>UPower</td>
<td>UPower/Loop</td>
<td>UNet-Lookahead</td>
</tr>
<tr>
<td>2004 (84-57)</td>
<td>150-117</td>
<td>166-101</td>
<td>157-110</td>
<td>160-107</td>
<td>164-103</td>
<td>167-100</td>
<td>166-101</td>
<td>161-106</td>
<td>164-103</td>
<td>161-106</td>
<td>157-110</td>
</tr>
<tr>
<td>2005 (91-45)</td>
<td>162-105</td>
<td>152-115</td>
<td>152-115</td>
<td>162-105</td>
<td>156-111</td>
<td>160-107</td>
<td>151-116</td>
<td>156-111</td>
<td>156-111</td>
<td>157-110</td>
<td>152-115</td>
</tr>
<tr>
<td>2006 (94-52)</td>
<td>156-111</td>
<td>163-104</td>
<td>159-108</td>
<td>161-106</td>
<td>163-104</td>
<td>167-100</td>
<td>164-103</td>
<td>164-103</td>
<td>163-104</td>
<td>161-106</td>
<td>159-108</td>
</tr>
<tr>
<td>2007 (89-43)</td>
<td>162-105</td>
<td>164-103</td>
<td>169-98</td>
<td>162-105</td>
<td>165-102</td>
<td>167-100</td>
<td>158-109</td>
<td>166-101</td>
<td>165-102</td>
<td>171-96</td>
<td>169-98</td>
</tr>
<tr>
<td>2008 (80-48)</td>
<td>158-108</td>
<td>159-107</td>
<td>160-106</td>
<td>156-110</td>
<td>163-103</td>
<td>159-107</td>
<td>160-106</td>
<td>160-106</td>
<td>163-103</td>
<td>161-105</td>
<td>160-106</td>
</tr>
<tr>
<td>TOTAL: 64.13%</td>
<td>59.03%</td>
<td>60.22%</td>
<td>59.70%</td>
<td>60.00%</td>
<td>60.75%</td>
<td>61.42%</td>
<td>59.85%</td>
<td>60.45%</td>
<td>60.75%</td>
<td>60.75%</td>
<td>59.70%</td>
</tr>
</tbody>
</table>
<p>
So, there&#8217;s some food for thought. Here&#8217;s how to read it &#8211; the records in parentheses are the actual beatpaths records, meaning, the games that have actual beatpath relationships. All the other methods in the table are tiebreaker methods; meaning ways to rank teams that don&#8217;t have beatpath relationships. All are based off of the same beatpath graph, and so all start from the records in the left column. We wouldn&#8217;t expect those methods to have a higher win percentage than the beatpath win percentage. These are various tiebreaker methods that I have tried out in the past. I haven&#8217;t tried out Weighted or Moose&#8217;s other ranking system &#8211; I&#8217;d expect Weighted to not perform very well, and the other ranking system to perform very well. This is also the vanilla method of finding beatpath relationships &#8211; not beatflukes or iterative.</p>
<p>I am curious what the historical win percentage of Isaacson-Tarbell is, at least over the same five-year period. <a href="http://fspi.blogspot.com/2008/03/historical-performance-of-isaacson.html">This page</a> indiciates Isaacson-Tarbell&#8217;s win percentage long-term is 62.29%, which would indicate that perhaps the best strategy to follow is 1) pick the team with a beatpath relationship to another team, 2) If there isn&#8217;t one, pick the team with the winning record, 3) if they&#8217;re tied, pick either the home team or the one selected by the beatpaths tiebreaker.</p>
<p>Hopefully in the future I&#8217;ll be able to look at how the beatpaths percentage (64.13%) compares to beatflukes, iterative, or whatever other kind of beatloop resolution scheme we come up with.</p>
]]></content:encoded>
			<wfw:commentRss>http://beatpaths.com/2009/09/22/backtesting/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>Isaacson-Tarbell</title>
		<link>http://beatpaths.com/2009/09/16/isaacson-tarbell/</link>
		<comments>http://beatpaths.com/2009/09/16/isaacson-tarbell/#comments</comments>
		<pubDate>Wed, 16 Sep 2009 21:48:58 +0000</pubDate>
		<dc:creator>ThunderThumbs</dc:creator>
				<category><![CDATA[Beatpath Tech]]></category>
		<category><![CDATA[NFL]]></category>

		<guid isPermaLink="false">http://beatpaths.com/?p=354</guid>
		<description><![CDATA[Here&#8217;s a review of Isaacson-Tarbell, which we should compare our picking schemes against this season. Isaacson-Tarbell is &#8220;picking for idiots&#8221;. And it&#8217;s pretty bulletproof &#8211; it seems most pick experts can&#8217;t outperform it. The picking rules are: Pick the team with the better win-loss record. If they&#8217;re tied, pick the home team. Isaacson-Tarbell was 9-7 [...]]]></description>
			<content:encoded><![CDATA[<p>Here&#8217;s a review of Isaacson-Tarbell, which we should compare our picking schemes against this season.</p>
<p>Isaacson-Tarbell is &#8220;picking for idiots&#8221;. And it&#8217;s pretty bulletproof &#8211; it seems most pick experts can&#8217;t outperform it. The picking rules are: Pick the team with the better win-loss record. If they&#8217;re tied, pick the home team.</p>
<p>Isaacson-Tarbell was 9-7 for week one. I am thinking of using a variant of Isaacson-Tarbell, perhaps called Isaacson-Tarbell-Beatpaths (ITB) for this season. This is &#8211; pick the team with the better win-loss record. If they&#8217;re tied, pick the team that is ranked higher according to beatpaths. Last season, beatpaths outperformed the home-team-wins approach, so I&#8217;m hoping I&#8217;ll be able to beat Isaacson-Tarbell this season.</p>
<p>For week 1 (where no one had a better win-loss record), I relied on beatpaths rankings, so we&#8217;re consistent with this approach. Beatpaths had an 11-5 record, or two better than Isaacson-Tarbell.</p>
]]></content:encoded>
			<wfw:commentRss>http://beatpaths.com/2009/09/16/isaacson-tarbell/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>In Review&#8230;</title>
		<link>http://beatpaths.com/2009/09/16/in-review/</link>
		<comments>http://beatpaths.com/2009/09/16/in-review/#comments</comments>
		<pubDate>Wed, 16 Sep 2009 21:43:58 +0000</pubDate>
		<dc:creator>ThunderThumbs</dc:creator>
				<category><![CDATA[Beatpath Tech]]></category>
		<category><![CDATA[NFL]]></category>

		<guid isPermaLink="false">http://beatpaths.com/?p=352</guid>
		<description><![CDATA[I thought I&#8217;d take the opportunity to trace through the general strategy here, both to get my brain clearer for the season, but also to review for whoever is reading. We start out with all the wins and losses for the season so far. The first design choice I made was to ignore points, and [...]]]></description>
			<content:encoded><![CDATA[<p>I thought I&#8217;d take the opportunity to trace through the general strategy here, both to get my brain clearer for the season, but also to review for whoever is reading.</p>
<p>We start out with all the wins and losses for the season so far. The first design choice I made was to ignore points, and any other stats other than wins and losses. This is the conceit of the site and the idea. We focus only on wins, losses, and who beat who &#8211; and then, usually around Week 4, we start seeing beatloops, like A-&gt;B-&gt;C-&gt;A.</p>
<p>It may be that the initial beatloops are very long, or very short.</p>
<p>The next design choice that I made long ago was to remove the smallest beatloops first. I did this for both philosophical and expedient reasons. First, the smallest beatloop is a season split &#8211; where two teams beat each other. What this says to me is that we are unable to determine which team is better by looking at only those two game outcomes, so we throw them out. And so after all team splits are removed, I remove remaining 3-team beatloops, and then the remaining 4-team beatloops, etc.</p>
<p>This is arguable, as you may have A-&gt;B-&gt;C-&gt;A, and A-&gt;B-&gt;D-&gt;E-&gt;A, and C-&gt;A might be spectacularly flukey. Removing the smallest first would yield B-&gt;D-&gt;E-&gt;A. Removing the flukey game and the longer beatloop would retain B-&gt;C. If C-&gt;A is flukey, I think you can make a stronger case that B-&gt;C than B&#8211;&gt;A. But even then, the question is how do you determine that C-&gt;A is flukey?</p>
<p>The above paragraph is a good example of the pretzels we enjoy tying our brains up in during the NFL season.</p>
<p>The expedient reason was that removing smaller beatloops first tends to straighten out the longer beatloops. If you don&#8217;t remove the smaller ones first, things get so circular with so many different routes, that any other scheme to prioritize loop removal becomes about even more ambiguous tradeoffs.</p>
<p>The other main tradeoff with this approach is that one game can be in several shared beatloops, thereby eliminating several other game outcomes. This has been the source of much of our commentary in previous seasons &#8211; whether we see these games as flukey games or key games. I have never been convinced that a game being in more shared beatloops means that is is flukey. But we have developed a few theories (check the comments <a href="http://beatpaths.com/2009/01/07/retroactive-pick-record/">here</a> for instance) on how to identify actual links (game outcomes) to remove, thereby de-emphasizing loop removal in favor of breaking them into segments that we keep.</p>
<p>At any rate, all this discussion is to get down to one data structure, a directed acyclic graph (DAG).</p>
<p>From there, it&#8217;s about finding a power ranking. The general approach is to examine all teams that don&#8217;t have a beatloss, apply a tie-breaker, pick one, remove it from the graph, and then examine all remaining teams that don&#8217;t have a beatloss (which may include new teams if its only beatloss was the team that got removed in the previous step). We&#8217;ve also discussed many tiebreaker methods, from referring to the previous week&#8217;s rankings, to other more complex methods.</p>
<p>My principles for the system have always been to keep things simple enough that the method can be explained to a sports layperson. There are articles every year on how Podunk University should be ranked ahead of University of Miami, because of a chain that is long beatpath, so the concept itself is fun and interesting. So far the approach has been to not try and identify single games as flukey and remove them, because what seems like a fluke could instead be a massive clue or harbinger. Instead, the approach has been to instead merely remove ambiguities, and rely on the more clarified parts of the season to attempt to fill in the blanks.</p>
<p>The final part of the system has been to have fun using the system in various ways to come up with weekly picks. I&#8217;d love the commenters to come up with their own picking systems that use beatpaths either in full or in part. I&#8217;m thinking of a variant of the Isaacson-Tarbell predictor that relies on beatpaths just a tad.</p>
<p>One statistic I&#8217;m curious about &#8211; the historical predictive pick record for matchups that actually have beatpaths, versus ones that don&#8217;t.</p>
]]></content:encoded>
			<wfw:commentRss>http://beatpaths.com/2009/09/16/in-review/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>2008 NFL Week 17 Inter/Intra Beatpaths Graph</title>
		<link>http://beatpaths.com/2008/12/29/2008-nfl-week-17-interintra-beatpaths-graph/</link>
		<comments>http://beatpaths.com/2008/12/29/2008-nfl-week-17-interintra-beatpaths-graph/#comments</comments>
		<pubDate>Mon, 29 Dec 2008 19:00:44 +0000</pubDate>
		<dc:creator>ThunderThumbs</dc:creator>
				<category><![CDATA[Beatpath Tech]]></category>
		<category><![CDATA[NFL]]></category>

		<guid isPermaLink="false">http://beatpaths.com/?p=324</guid>
		<description><![CDATA[Here&#8217;s a different beatpath graph for end of the 2008 season. I&#8217;m surprised at how clear this looks. This is based off of my brainstorming post last week and the ensuing discussion. What I did for this graph is I simply excluded all matches this season that were intra-conference or inter-conference seeding matches. Every team [...]]]></description>
			<content:encoded><![CDATA[<p>Here&#8217;s a different beatpath graph for end of the 2008 season.  </p>
<p>I&#8217;m surprised at how clear this looks.  This is based off of my brainstorming post last week and the ensuing discussion.  What I did for this graph is I simply excluded all matches this season that were intra-conference or inter-conference seeding matches.  Every team had two games excluded; a home game and an away game.</p>
<p><img src="http://beatpaths.com/img/2008-17-nfl-ic-clean.png" height="523" width="480" border="1" hspace="4" vspace="4" alt="2008-17-Nfl-Ic-Clean" /></p>
]]></content:encoded>
			<wfw:commentRss>http://beatpaths.com/2008/12/29/2008-nfl-week-17-interintra-beatpaths-graph/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Beatloop Removal Approaches</title>
		<link>http://beatpaths.com/2008/12/23/beatloop-removal-approaches/</link>
		<comments>http://beatpaths.com/2008/12/23/beatloop-removal-approaches/#comments</comments>
		<pubDate>Wed, 24 Dec 2008 03:23:39 +0000</pubDate>
		<dc:creator>ThunderThumbs</dc:creator>
				<category><![CDATA[Beatpath Tech]]></category>

		<guid isPermaLink="false">http://beatpaths.com/?p=317</guid>
		<description><![CDATA[Zooming out as I&#8217;m apt to do, another quick/reminder exploration of the various algorithms. This is restricted entirely to the graph methods. The goal here has always been to create a pecking order of the teams, based off of wins and losses and nothing else. This means a DAG &#8211; a directed acyclic graph. Acyclic [...]]]></description>
			<content:encoded><![CDATA[<p>Zooming out as I&#8217;m apt to do, another quick/reminder exploration of the various algorithms.  This is restricted entirely to the graph methods.</p>
<p>The goal here has always been to create a pecking order of the teams, based off of wins and losses and nothing else.  This means a DAG &#8211; a directed acyclic graph.  Acyclic means no beatloops.</p>
<p>Beatloops exist and there&#8217;s no way around that.  The goal is to remove them.  The question was what a beatloop means.  Does it mean the teams in a beatloop are tied?  The thought here was to reject that &#8211; a beatloop doesn&#8217;t mean that the teams are tied; it just means that the relationship between the teams is ambiguous.</p>
<p>This frees us up a bit &#8211; all we have to do is remove ambiguity and just rely on the rest of the graph to sort itself out into a rough pecking order.</p>
<p>We want to retain clear win/loss relationships and reject ambiguous data.  Which means rejecting the most &#8220;clearly ambiguous&#8221; data.  To me, this has always meant, the smallest set of data.  I&#8217;ve had it linked in my head that the smallest set of ambiguous data by definition meant the most tightly linked set of ambiguity.  It&#8217;s clear that in the NFL, this is not necessarily true.  I think it&#8217;s true that a four-team beatloop can be more tightly ambiguous than a three-team beatloop.  But more on that later.</p>
<p>But I think the general principle is to remove the smallest amount of ambiguity, in order to retain other clear relationships.</p>
<p>So one approach was just always to resolve the smaller beatloops first.  At first, this makes sense &#8211; two-team-beatloops (home-and-home season series splits) mean that it&#8217;s hard to say which team is better than the other just based off of the wins between the two teams.  That&#8217;s intuitive.  That&#8217;s a clear example of &#8220;tight ambiguity&#8221;.  It&#8217;s an easy decision to remove season splits.  Those links in there can lead to an absolute ton of beatloops, and it was gratifying to see how much simpler the graph got after removing even only the season splits.</p>
<p>Next was again the question of what is the best way to fairly remove the smallest set of ambiguous data?  My approach here was &#8211; when there&#8217;s a question, punt &#8211; as a way to avoid as much subjectivity as possible.  (Leaving aside the semantic discussion of how even choosing to create this website was an exercise in subjectivity.  <img src='http://beatpaths.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />   )</p>
<p>Punting meant &#8211; rather than trying to determine how to split apart the collection of three-team beatloops into several families, to just remove all of them at once, no matter how much they overlapped.</p>
<p>This has been the base system from the beginning.  I like it because it&#8217;s simple enough that I believe anyone can grasp the approach without forgetting the details.  Map all the victories.  Remove the season splits.  Remove the beatloops, smallest first.  Done.</p>
<p>So what are the subjective choices I drew here?  First, there was the desire to actually end up with a DAG (direct acyclic graph).  We can even challenge that one and just choose to determine rankings based off of a graph with loops in it, but I want to avoid that direction for now.  It&#8217;s not visual enough and doesn&#8217;t strike me as intuitive to a casual graph viewer.</p>
<p>So the first subjective choice was to remove smallest loops first.  We already have one open challenge there &#8211; Boga sketched out a quick algorithm that I&#8217;d like to explore (if I can just find the comment again).  That would be to work on resolving all loops at once, I think (or it might have just been how to rank teams before the loops are taken out).  </p>
<p>And I&#8217;m also pretty convinced at this point that resolving smallest loops first is more sound only if the teams all play each other, as they do in the NBA and MLB.  In the NFL, a four-team beatloop can exist and it doesn&#8217;t necessarily mean it shouldn&#8217;t be removed at the same time as some other three-team beatloops, and this is specifically because of the rarity of intra-conference games, which is not a team&#8217;s fault.</p>
<p>The other subjective choice was to just remove <b>all</b> n-sized beatloops at once, no matter how much they overlapped.  The thought here was to avoid the subjectivity of choosing to remove 3-team beatloops before removing other 3-team beatloops.  But in a sense I am already introducing a similar subjectivity by removing 3-team beatloops before removing 4-team beatloops.</p>
<p>So while one direction is resolving all beatloops at once, the other direction is to choose to split out (e.g.) 3-team beatloops into multiple families, and resolve some of them first, and then the others.</p>
<p>There are some easy ways to chip away at this that would be noncontroversial.  The problem is not a team being in several beatloops, it&#8217;s of a single segment (TeamA=>TeamB) being in several beatloops.  So you could first remove every beatloop where none of its single segments are in other beatloops.  If I were to do this and then resolve the remaining beatloops with current rules, I&#8217;m pretty sure I&#8217;d end up with the exact same result.  But it could introduce some greater clarity in terms of examining each step of the process.  There is room to explore in terms of choosing how to remove some beatloops with overlapping segments, before removing other ones.</p>
<p>Iterative is an example of an approach that seeks to remove segments, not loops &#8211; it identifies actual single-match outcomes that should be ignored in the initial data set, as opposed to entire loops.  Doktarr, please correct me if I&#8217;m wrong on this.  We&#8217;ve had one other similar approach of ignoring actual segments; the beatfluke approach, which obliterates beatloop segments that are directly contradicted by the resultant DAG, thereby restoring the rest of the beatloop to the graph.  There is one other obvious segment-removal approach we haven&#8217;t explored yet, and that is the one that merely seeks to find the fewest number of matches in a season that can be excluded in order to create a DAG.  (For ties, for instance, five ways to remove only seven game outcomes, we&#8217;d isolate down by removing the game outcomes that appear in all scenarios, and then rank the rest of the candidates through some set of objective-as-possible standards.)  Doktarr has been very patient with my reticence to get back into segment-removal.  <img src='http://beatpaths.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p>So I&#8217;ve got a couple of questions here.</p>
<p>1) Doktarr, is it possible to restate iterative logically, in a way that doesn&#8217;t use ratios or fractional numbers?  Is it as simple as just removing NYJ->MIA first because that link appears &#8220;most&#8221; (5) in all the 3-team beatloops?  I think someone actually did restate it this way once, but I might be confusing it with Boga and MOOSE&#8217;s discussion.</p>
<p>2) Also, is it possible to restate iterative in terms of full beatloops being removed?</p>
<p>3) Finally, more thoughts from anyone appreciated on how to judge removing beatloops if multiple sizes of beatloops are considered at once.  There&#8217;s obviously a problem with this, in that by the time every team has at least one win and one loss, the entire league is one big beatloop.  The only thing that we can REALLY say at this point is that Detroit is definitely worse than every team they have directly played.  But every other team can draw a circuitous beatpath to any team that has beaten them.</p>
]]></content:encoded>
			<wfw:commentRss>http://beatpaths.com/2008/12/23/beatloop-removal-approaches/feed/</wfw:commentRss>
		<slash:comments>12</slash:comments>
		</item>
		<item>
		<title>Call For Volunteers</title>
		<link>http://beatpaths.com/2008/11/12/call-for-volunteers/</link>
		<comments>http://beatpaths.com/2008/11/12/call-for-volunteers/#comments</comments>
		<pubDate>Wed, 12 Nov 2008 22:45:49 +0000</pubDate>
		<dc:creator>ThunderThumbs</dc:creator>
				<category><![CDATA[Beatpath Tech]]></category>
		<category><![CDATA[News and Notices]]></category>

		<guid isPermaLink="false">http://beatpaths.com/?p=284</guid>
		<description><![CDATA[As part of the upcoming redesign for Beatpaths, I&#8217;m looking for people who might be interested in posting commentary on the actual front page of the site, as opposed to simply leaving comments. We&#8217;ve had a fair amount of interest here from people suggesting new methods, ways to rank the confidence of picks, etc. New [...]]]></description>
			<content:encoded><![CDATA[<p>As part of the upcoming redesign for Beatpaths, I&#8217;m looking for people who might be interested in posting commentary on the actual front page of the site, as opposed to simply leaving comments.  We&#8217;ve had a fair amount of interest here from people suggesting new methods, ways to rank the confidence of picks, etc.  New content can be for other sports, as well.  The new design will allow a more collaborative feel and the new tool will allow dynamic graph generation.</p>
<p>If you&#8217;re interested in representing a slice of the content &#8211; whether it&#8217;s a posting once a week or something more irregular, drop me a note and make a proposal.  I can set up a mailing list of us to do some coordination.</p>
<p>For those who don&#8217;t know, the backend system is written in perl &#8211; I may be interested in allowing others to collaborate on the programming if there&#8217;s enough interest.</p>
<p>If you&#8217;re interested, contact me through <a href="http://curtsiffert.com/feedback">this webform</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://beatpaths.com/2008/11/12/call-for-volunteers/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Beatpaths Nomenclature</title>
		<link>http://beatpaths.com/2008/11/04/beatpaths-nomenclature/</link>
		<comments>http://beatpaths.com/2008/11/04/beatpaths-nomenclature/#comments</comments>
		<pubDate>Tue, 04 Nov 2008 20:47:08 +0000</pubDate>
		<dc:creator>ThunderThumbs</dc:creator>
				<category><![CDATA[Beatpath Tech]]></category>

		<guid isPermaLink="false">http://beatpaths.com/?p=281</guid>
		<description><![CDATA[Discussion on different methods are ramping up &#8211; I&#8217;d like to make one suggestion on nomenclature. Graph Method: Methods that have to do with determining what teams have beatpaths and beatwins to other teams. Examples are the current Standard graph method on this site, the Iterative graph method, and the Beatflukes graph method. Power Method: [...]]]></description>
			<content:encoded><![CDATA[<p>Discussion on different methods are ramping up &#8211; I&#8217;d like to make one suggestion on nomenclature.</p>
<p><b>Graph Method:</b> Methods that have to do with determining what teams have beatpaths and beatwins to other teams.  Examples are the current Standard graph method on this site, the Iterative graph method, and the Beatflukes graph method.</p>
<p><b>Power Method:</b> Methods that take the Graph methods and then determine sequential rankings from them.  Examples are the current strength-of-beatpower power method on this site, the Weighted power method, and MOOSE&#8217;s (what&#8217;s the name?) Total Paths power method.</p>
<p>I might have already misstated something, because I think I&#8217;m still unclear what power method the Iterative graph method uses.</p>
]]></content:encoded>
			<wfw:commentRss>http://beatpaths.com/2008/11/04/beatpaths-nomenclature/feed/</wfw:commentRss>
		<slash:comments>13</slash:comments>
		</item>
		<item>
		<title>2008 NFL Week 3 Discussion</title>
		<link>http://beatpaths.com/2008/09/21/2008-nfl-week-3-discussion/</link>
		<comments>http://beatpaths.com/2008/09/21/2008-nfl-week-3-discussion/#comments</comments>
		<pubDate>Mon, 22 Sep 2008 06:33:35 +0000</pubDate>
		<dc:creator>ThunderThumbs</dc:creator>
				<category><![CDATA[Beatpath Tech]]></category>
		<category><![CDATA[NFL]]></category>

		<guid isPermaLink="false">http://beatpaths.com/?p=257</guid>
		<description><![CDATA[Miscellaneous observations: Last week&#8217;s power rankings are 10-5 9-6 so far this week. The only two teams with beatpaths to every other team in their division: Denver and Arizona. #31 beat #1, with some awesome direct snap misdirection. And it&#8217;s time to start thinking of tiebreaker and rank strategies. There are two stages to determining [...]]]></description>
			<content:encoded><![CDATA[<p>Miscellaneous observations:</p>
<p>Last week&#8217;s power rankings are <strike>10-5</strike> 9-6 so far this week.</p>
<p>The only two teams with beatpaths to every other team in their division: Denver and Arizona.</p>
<p>#31 beat #1, with some awesome direct snap misdirection.</p>
<p>And it&#8217;s time to start thinking of tiebreaker and rank strategies.</p>
<p>There are two stages to determining rankings.  First is the beatloop resolution strategy.  That is pretty stable, although doktarr and moose have written about possible ways to enhance it.  The general principle of beatloops is not to imply that the teams in a beatloop are tied &#8211; it&#8217;s more just that it is the smallest set of data that can be seen as ambiguous/confusing, and thus should be removed.   That way we rely on the rest of the graph to imply rankings.  I think that trying to divine too much data from a beatloop just introduces too many judgment calls into a graph.  We always remove smallest beatloops first, starting with splits, and then recalculate.</p>
<p>We&#8217;ve tried some methods to bust beatloops here in the past.  One that I was fond of was called the beatfluke method, defined as:  If Team A&#8217;s loss to Team B was beatlooped away, and Team A also has an entirely different remaining alternate beatpath to Team B, then it contradicts Team A&#8217;s loss, and thus the A-beats-C-beats-B part of the beatloop can be restored to the graph.</p>
<p>I found this made the graph more vertical, and also slightly more accurate, but I didn&#8217;t like how it would lead to more dramatic shifts in the power rankings each week.  It made the graphs vary more from week to week.  Perhaps if it were combined with a more stabilizing tiebreaker, it could be used again.</p>
<p>The other two approaches of busting beatloops were doktarr&#8217;s &#8220;iterative&#8221; method, and Moose&#8217;s score method.  The &#8220;iterative&#8221; method breaks shared beatloops at their shared link, like if one game is responsible for the existence of several beatloops.  It is another effort to try to identify one link of a beatloop (a game outcome) as flukey.  I do have trouble justifying that one intuitively, though &#8211; I feel like I need another reason to believe that link actually is flukey, other than it just being part of several beatloops.  The other is a weighted system having to do with score differentials.  I believe this ended up accurate and perhaps superior, although I&#8217;m trying to keep the main system here free of extra data like points (as opposed to just wins and losses).</p>
<p>After that, there&#8217;s how to determine rankings from the resultant beatpath graph.  So far this season, I&#8217;ve been breaking ties based off of the rankings of the previous week.  But the usual tiebreaker for later in the season is to compare the strength of the teams&#8217; direct beatwins &#8211; for instance, if every team in a tied set has at least three beatwins, it averages the strength of the top three beatwins of each of those teams, and picks the top team.  Finally, I think Moose came up with a tiebreaker having to do with counting all the links in a resultant beatpath graph.  This is somewhat similar to what I used in the first and second year here, which counted number of teams above and below each team, but it yields more information in that it counts every link of every possible path, thereby giving extra weight to stronger paths.  I probably have this explanation wrong but Moose will correct me in the comments.  This is a good candidate to apply as a tiebreaker to the official rankings this season.</p>
]]></content:encoded>
			<wfw:commentRss>http://beatpaths.com/2008/09/21/2008-nfl-week-3-discussion/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Beatpaths!  Beatpaths!  Beatpaths!</title>
		<link>http://beatpaths.com/2008/09/09/beatpaths-beatpaths-beatpaths/</link>
		<comments>http://beatpaths.com/2008/09/09/beatpaths-beatpaths-beatpaths/#comments</comments>
		<pubDate>Tue, 09 Sep 2008 18:23:16 +0000</pubDate>
		<dc:creator>ThunderThumbs</dc:creator>
				<category><![CDATA[Beatpath Tech]]></category>
		<category><![CDATA[NFL]]></category>
		<category><![CDATA[News and Notices]]></category>

		<guid isPermaLink="false">http://beatpaths.com/?p=248</guid>
		<description><![CDATA[For those that are new visitors this year, here&#8217;s a quick description of what we&#8217;re all about. A few years back, I was reading one of those websites that purports to power-rank sports teams based off of every single in-game stat in the book. And, being a Broncos homer, I was peeved that the Broncos [...]]]></description>
			<content:encoded><![CDATA[<p>For those that are new visitors this year, here&#8217;s a quick description of what we&#8217;re all about.</p>
<p>A few years back, I was reading one of those websites that purports to power-rank sports teams based off of every single in-game stat in the book.  And, being a Broncos homer, I was peeved that the Broncos were ranked so low.</p>
<p>My gut-based thought &#8211; completely lacking in any evidence and data &#8211; was that wins aren&#8217;t always linked to stats.  Sometimes it&#8217;s heart!  And guts!  Or the intelligence of a team&#8217;s scheme &#8211; sometimes they just play the matchup game better.  But really, I was just thinking about how it all comes down to wins and losses, and damn the rest of the data.</p>
<p>The advanced ranking schemes pay attention to a lot of data &#8211; all the data except for wins and losses.  So my perverse idea was to do the inverse.  Focus only on wins and losses.</p>
<p>Turns out there are other schemes that also only pay attention to wins and losses &#8211; they use big words and probability and math and statistical theory.</p>
<p>I decided to use graphs.  Here&#8217;s how the idea came about.</p>
<p>I started graphing the NFL season based off of one simple rule &#8211; if a team beats another, you draw a line from the winner to the loser.  The End.</p>
<p>Soon, you start seeing long sequences of wins and losses &#8211; The Broncos beat the Steelers, who beat the Patriots, who beat the Giants &#8212; oh wait, they didn&#8217;t beat the Giants.  Anyway, those sequences of wins become what I call a <b>beatpath</b>.</p>
<p>Now, early in the season, you see sportswriters try in vain to structure their own power rankings by always having teams ranked ahead of other teams they&#8217;ve beaten.</p>
<p>But soon, you notice that contradictions start appearing.  Every year, there&#8217;s some silly article over in college football land about how Captain Doohickey Tech is better than Ohio State, since they beat Team A, who beat Team B, who beat Team C&#8230;. who beat Team X, who upset Ohio State back in week 2.  When you know just as well that Ohio State has demolished someone who has in turn demolished the Raiders, I mean, Captain Doohickey Tech.</p>
<p>That contradiction is called a <b>beatloop</b>.  </p>
<p>And so we, that is the Royal We, not to be confused with Eddie Royal, worked out a system to resolve these beatloops and come up with automatic power rankings.  And surprisingly, it&#8217;s ended up weirdly accurate.  Well, sometimes just weird.  But a hell of a lot of fun.</p>
<p>Every week I&#8217;ll post graphs that are a snapshot of the NFL season so far.  And the automatic power rankings that come from the graph.  And we&#8217;ll be continuing to discuss ways to tweak the algorithm &#8211; commenters often come up with variants to the algorithm that might even be more accurate.  And we&#8217;ll have arguments about whether the beatpath algorithm is actually predictive (when it&#8217;s doing well), or merely descriptive of the season so far (when there are lots of upsets).  And there will probably be some subtle Broncos homerism littered throughout, until someone calls me out on it.</p>
<p>Welcome to the site (redesign coming soon!), and enjoy your stay!</p>
]]></content:encoded>
			<wfw:commentRss>http://beatpaths.com/2008/09/09/beatpaths-beatpaths-beatpaths/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
	</channel>
</rss>

