![]() |
|
|
#1 | ||
|
n00b
Join Date: Aug 2008
|
Testing a Game v. a Season
Question for the statistics people:
Has anyone thought about the implications of testing a single game as opposed to a season? It is a lot of work, but I thought it may help solve the defensive puzzle. Let's say you want to test what bumping the SLB blitz by 10% does. If you ran that for the same season 10 times you would have 160 samples. The problem is though that the computer changes the offence for each team. If however, you ran the same game 100 times you just might have a good sample of the 10% increase in the SLB blitz against the "type" of team you are playing against. Any thoughts would be greatly appreciated. |
||
|
|
|
|
|
#2 |
|
n00b
Join Date: Mar 2008
Location: Alberta
|
I think that is a great idea, as I noticed it in the other thread on Def GPs and it got me to thinking. The chore of it is taking down the stats for each game, since I know nothing about parsing programs, but may be someone has a good idea on how to do this easily.
I think that running the single game a 100 times would give you a very good idea of your performance in a specific situation, and comparing that to other games run multiple times against different types of teams might lead to some kind of revelation about defensive gameplanning.
__________________
Owner/GM/Head Coach and Towel Boy for the St. Louis Rams.of the FOWL Owner of the Green Bay Packers of the PFL. First Response Coordinator of Public Relations Disasters for the Balzac Ticklers of the FOOLX. (retired) Owner/GM of the soon to be awesome Fort Worth Fury of the IHOF |
|
|
|
|
|
#3 |
|
"Dutch"
Join Date: Oct 2000
Location: Tampa, FL
|
Sounds like a promising idea.
|
|
|
|
|
|
#4 | |
|
n00b
Join Date: May 2007
|
Quote:
My thoughts, having re-read the other Defensive Gameplan string and this one: A. What are you going to measure? Changes in the number/success of SLB blitzes? Changes in the number of sacks? Changes in points allowed? Changes in yards allowed? Given the amount of work required to gather any useful data, this is a crucial question to answer up-front. B. The study, as proposed, has limitations. In addition to being limited by the type of offense the team runs, it is also limited by the strengths and weaknesses of the personnel on the opposition. C. If you wish to pursue this (i.e. repeatedly running the same game), would recommend the following:
I suspect that within Quicksands multiple runs of a single season, there are valuable pieces of information that could be extracted with the proper statistical analysis techniques. |
|
|
|
|
|
|
#5 |
|
Pro Starter
Join Date: Nov 2000
Location: Cary, NC, USA
|
Running a single game - saving out the info each time - 100x would be excruciatingly tedious.
![]() What may be more easily doable is setting up a MP league, giving all 32 teams the same offensive and defensive gameplans. Then running that season 10x... so you're looking at 16 games x 32 teams x 10 seasons = a lot of data with the same offense v. defense (obviously different personnel). |
|
|
|
|
|
#6 |
|
n00b
Join Date: May 2007
|
I like that plan. Personnel differences should wash out over the course of a season, and rather than repeating the season, would recommend running consecutive seasons to further reduce any personnel effects.
|
|
|
|
|
|
#7 | |
|
n00b
Join Date: Aug 2008
|
Quote:
Well, I guess I got my answer, yikes! Wish I read that before running a test. FYI I had a team with good Bump and Run numbers. So I altered my pass defence by adding 10% to BnR and taking away 10% from MfM -- however I never let MfM drop below 3% so that the opposing QB would have a hard time locking on. To keep it simple I looked at Wins, Points For, and Points againts. I did 50 samples. Most of the data was inconclusive, however there was a 33% increase in games that my opponent was held to less that 21 points. So, if I limit it to saying that against the "type" of talent I was playing against, does it mean anything? |
|
|
|
|
|
|
#8 | |
|
n00b
Join Date: May 2007
|
Quote:
It's hard to say what, if anything you've found. What is significant about 21 points? Is that an average number of points scored in this test? The average number of points scored in general in FOF? What happened in the other 67% of games? These are pieces of information that can help determine the value of what you've found. |
|
|
|
|
|
|
#9 | |
|
lolzcat
Join Date: Oct 2000
Location: Annapolis, Md
|
Quote:
My honest answer is that I fear this would still be too small a sample size to tell much, especially for something as minor as the 10% tweak you're talking about. There might be some value in testing an entirely different defensive strategy this way, but I just don't know what you're going to get out of all this labor if all you test is a small thing like a modest bump in blitzing percentage. I confess that I'm somewhat interested in the results or something like this, but I fear that what you'd see would end up being something that looks an awful lot like white noise. One team posts a PR% of 21.4, and the other a 21.8, and you end up saying "who knows?" or (possibly more harmful) you end up being drawn to a conclusion that B is better than A when in fact the difference was simply a function of the still-present randomness at work. |
|
|
|
|
|
|
#10 |
|
lolzcat
Join Date: Oct 2000
Location: Annapolis, Md
|
|
|
|
|
|
|
#11 | |
|
n00b
Join Date: Aug 2008
|
Quote:
I think I found.... nothing thus returning to my former conclustion that I wish I read your post before conducting the test. |
|
|
|
|
|
|
#12 | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
n00b
Join Date: Aug 2008
|
Here are the points against numbers FYI
Last edited by damelloman888 : 08-26-2008 at 09:50 AM. |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
|
|
|
|
#13 |
|
High School Varsity
Join Date: Mar 2006
Location: Woodstock, GA
|
If I was going to do this kind of test, I would set up the player file with all 50's/mid level/what have you. So that all players for every position are rated the same (except for the values we cannot control and the hidden values).
Then I would set up each defensive team a little different and have every offense running the same offense. Then again, I don't have that kind of time. ![]() |
|
|
|
|
|
#14 | |
|
n00b
Join Date: Aug 2008
|
Quote:
this is the last week that I will have this kind of time for years (please don't ask) |
|
|
|
|
|
|
#15 | |
|
n00b
Join Date: May 2007
|
Quote:
This is where statistical analysis tools becomes crucial. A difference between 21.4 and 21.8 could be statistically significant - even w/ a small sample size if the effect is particularly robust. Of course statistical significance can still have little to no real world relevance, but at least it becomes possible to assign a confidence interval regarding the impact of randomness (i.e. error). After thinking about this issue a little more, I'm wondering if the cart isn't being placed before the carrot. In another thread, it was noted that there was a lack of clarity regarding the impact/importance/value of double coverage. This has led me to think that before tweaking the defense (i.e. double coverage, blitz %, etc.) it would be useful to know to what degree these variables result in positive plays for the defense. That is, what % of sacks involve a LB or DB blitzing or what is the completion % for double coverage v. single v. zone, etc. I haven't spent much (any) time looking at game logs, so I'm unfamilar w/ the data available and therefore don't know if this type of analysis would be possible. |
|
|
|
|
|
|
#16 | |
|
n00b
Join Date: May 2007
|
Quote:
If I'm understanding the data you posted: Independent Variable (IV) = pass defence increased 10% to BnR and decreased 10% from MfM - never let MfM drop below 3% Dependent Variable (DV) = points allowed (each number represents the points allowed in a single game correct?) Comp = points allowed before applying IV Change = points allowed after applying IV Based on these assumptions, I ran a quick analysis of the data and found the following: Comp AVG = 24.6 (SD = 9.31) Change AVG = 23.22 (SD = 9.39) Difference = 1.38 I next conducted a t-test, which yielded a p = 0.231, which is generally considered to be non-significant. What all of this means (if I correctly understand the data you provided) is that: "adding 10% to BnR and taking away 10% from MfM" and "never let[ting] MfM drop below 3%" resulted in an average 1.38 reduction in points allowed. The probablility of this difference being due to chance (random or other factors) is less than 23% (i.e. we can say with only 77% degree of confidence the changes in defense were responsible for the 1.38 point reduction). Possible causes for these findings:
In the end, I have to agree (unfortunately) that you've probably found nothing; however, without running the above sort of analysis, there is no way to really know with any degree of confidence, and perhaps with a different DV and a few more runs (e.g. 100 as originally proposed) something of interest might pop-up. |
|
|
|
|
|
|
#17 | |
|
n00b
Join Date: Aug 2008
|
Another idea....
Quote:
Wow, thank you for the info -- the whole thing lead to another idea. Once again with 50 samples. IV = human coaching (in this case me using everything I know so far) DV = Win percentage Comp = WP before applying IV Change = WP after applying IV Comp = 54% Change = 66% also FYI DV = Points Comp = 26.32 Change = 23.8 DV = Points allowed Comp = 24.6 Change = 18.54 It is important to note that I did not fix the Red Zone defense bug in either case. How much difference would be significant? Could I have stumbled on a good way for a rookie coach to "practice"? Thank you for working through this with me, it sure is interesting. |
|
|
|
|
|
|
#18 | |
|
n00b
Join Date: May 2007
|
Quote:
No problem, this is interesting to me. I frequently don't have the time to do the data collection and the analysis I would like and I enjoy thinking this through. RE: Win%, I've got to think a little on how to test this, but eyeballing it, you went from winning around 1/2 of the time (a coin toss = not much better than pure chance) to about 2/3 of the time. Without testing, that looks significant. If you could provide me the raw data (like in the previous post) either in the string or PM, I could work it up pretty quick and see what's going on. Based on the earlier results, I think it will be significant, but would have to see. Did you make any offensive adjustments? Interesting to note that Comp pts allowed = 24.6 - did you run it again, or use the old data? |
|
|
|
|
|
|
#19 |
|
n00b
Join Date: Aug 2008
|
Raw Data
Me coaching make all kinds of offensive and defensive adjustments.
Us Them Win 34 14 1 24 21 1 30 14 1 20 17 1 12 17 0 30 14 1 29 17 1 31 9 1 27 30 0 24 19 1 37 7 1 12 10 1 41 26 1 24 34 0 10 17 0 27 7 1 19 20 0 38 28 0 24 27 0 34 21 1 17 34 0 30 23 1 23 20 1 29 3 1 10 28 0 23 15 1 28 20 1 23 15 1 20 16 1 20 27 0 20 10 1 26 17 1 30 17 1 26 3 1 17 20 0 17 20 0 13 20 0 13 17 0 26 17 1 16 6 1 27 24 1 19 16 1 24 21 1 21 6 1 34 10 1 25 35 0 23 20 1 17 30 0 24 27 0 22 21 1 Last edited by damelloman888 : 08-27-2008 at 11:55 AM. |
|
|
|
|
|
#20 |
|
n00b
Join Date: Aug 2008
|
Raw Data
Computer Coaching (same as before, I'm hopeing that 100 samples are not nessisary)
Us Them Win 24 38 0 24 27 0 28 21 1 14 27 0 28 23 1 29 32 0 17 20 0 17 13 1 20 21 0 34 16 1 31 28 1 21 24 0 33 9 1 38 36 1 26 23 1 41 24 1 45 51 0 34 35 0 27 21 1 41 14 1 34 24 1 10 31 0 38 24 1 26 24 1 35 37 0 31 21 1 24 31 0 34 17 1 9 35 0 31 32 0 20 42 0 24 13 1 21 24 0 31 30 1 30 7 1 16 34 0 34 20 1 37 17 1 9 24 0 7 24 0 26 6 1 37 31 1 13 7 1 26 35 0 24 20 1 20 34 0 17 24 0 37 16 1 16 20 0 27 23 1 Last edited by damelloman888 : 08-27-2008 at 11:54 AM. |
|
|
|
|
|
#21 |
|
n00b
Join Date: May 2007
|
Looking at the data:
CONTROL (Computer Running the Show) AVG PF=26.32 SD=9.22 AVG PA=24.6 SD=9.31 AVG Point Differential=1.72 SD=12.8 TEST (User Changes) AVG PF=23.8 SD=7.23 AVG PA=18.54 SD=7.88 AVG Point Differential=5.26 SD=11.15 These numbers suggest that while your scoring went down with your changes, so did your opponent's. Additionally, there was less variability in scoring by both teams. That is there was greater consistency. I assume, that none of the TEST games used the same offensive or defensive gameplan. T-Test (Comparing CONTROL to TEST) PF p=.1121 PA p=.0005* AVG Point Differential p=.0785 *Significant <.05 Conclusions: The difference in Points For (PF) and AVG Point Differential between the two tests are not statistically significant (generally the "p" value should be <0.05). Said another way, there is an 11% probability that the change in PF occurred by chance and there's a 7% chance that the change in point differential was due to chance. The change in PA is statistically significant with a less than 1% (0.05%) chance that the change was due to chance alone (i.e. there is a 99.95% probability that the difference in PA is due to gameplan changes). Problematic is the fact that changes were made to both offensive and defensive gameplans, That is as QuickSand has pointed out, how an offense performs often dictates what the defense has to face. Therefore, the overall changes in game plan appear to be effective at reducing PA in this particualy situation (i.e. against this one team's offensive talent and style). At this point, a description of the oppositon would be useful (the offense/defense they seemed to use,if known, their personnel, etc.) and the game plan changes made would be informative. Additionally, it might be useful to take this as preliminary data, look at changes made, pick some, and run repeated tests. If defensive gameplanning is the test you wish to look at (vice overall gameplanning) then I'd recommend returning to doing nothing on offense and tinkering w/ the defense. Also, 50 runs seems to be enough to detect changes when they're robust. |
|
|
|
|
|
#22 |
|
Resident Alien
Join Date: Jun 2001
|
Anyone else think that it is ridiculous that one would have to go to this level of effort to try to figure out defense in a game that is supposed to be played for fun? This is not a slam on the researchers -- kudos to them. This is a slam on the unintuitiveness of the game as far as defense goes. Sure, put in stuff that people can dig around to gain a little bit of an advantage. But don't make the game so that NOBODY knows what they are doing on defense.
__________________
Indiana Hoosiers Football - 2025-26 National Champs The FOFC Ladder History thread |
|
|
|
|
|
#23 | |
|
n00b
Join Date: Aug 2008
|
I agree
Quote:
Kodos I agree, I have a few days left with this kind of time and hoped to be able to help everyone out. It is silly the amount of time it takes though, however if I find anything I will be happy to share. |
|
|
|
|
|
|
#24 |
|
n00b
Join Date: Aug 2008
|
Back To The Whole Season
What about in a season:
Comp = computer game plan Change = human game plan Sample a whole season twice and find a team that has an average of 8 wins (.500 record) Then try the same seaon with your game plan. I think 13 wins are significant. Over two season I think you need 23 wins. Is that correct? |
|
|
|
|
|
#25 |
|
lolzcat
Join Date: Oct 2000
Location: Annapolis, Md
|
Seriously, if you're trying to gather information from testing, at least gather enough information that you have something to work with. That means more than wins.
Here's a tool that you might find useful: Excel-Based QuikTest Template - Front Office Football Central I know it got edited a couple of times with various upgrades, but I think you may find it useful -- at least this way you'll have data on things like points allowed per game, yards per carry, yards per attempt, and the like. |
|
|
|
|
|
#26 | |
|
n00b
Join Date: May 2007
|
Quote:
And even more seriously, once you have the data, analyze it correctly. |
|
|
|
|
|
|
#27 |
|
Morgado's Favorite Forum Fascist
Join Date: Oct 2000
Location: Greensboro, NC
|
And even more seriously, it would be wise to take into account the advice of those who have played this and other iterations of FOF for 10ish years.
GUIDE TO DEFENSIVE SUCCESS IN FOF2007 1. Acquire players on defense with lots of big, red bars. 2. Don't do anything extreme with your game plan. 3. Don't rex, specifically because of the overuse of goal line, and the switched columns bug. 4. Do well on offense. 5. Don't waste your time testing one defensive game plan versus another. I can put this guide in PDF form for printing if anyone needs it that way. ![]()
__________________
The media don't understand the kinds of problems and pressures 54 million come wit'! Last edited by Ben E Lou : 09-02-2008 at 07:33 AM. |
|
|
|
|
|
#28 | |
|
n00b
Join Date: May 2007
|
Quote:
This is the best advice we get from 10ish years of experience? 1. What if you can't acquire the necessary number of players w/ big red bars? 2. Extreme means what? 3. So basically just follow #2, that helps how? 4. What if you have a team that won't/can't do well on offense? (Do I have to have both 1 & 4 or will one or the other suffice?) 5. Why, because after 10ish years nobody else has succeeded? The fact that in an earlier post, the Point differential was statistically significant at around the 0.07 and the Points Against Differential was statistically significant suggests that whatever monkeying around damelloman888 was doing, he may have been having some pretty good success tweaking his defense.. All of the above suggestions regarding defense suggest that it is essentially random and/or out of the player's control - is this what the game designer had in mind when he designed the defensive aspect of the game? |
|
|
|
|
|
|
#29 | ||||||
|
Morgado's Favorite Forum Fascist
Join Date: Oct 2000
Location: Greensboro, NC
|
Quote:
Quote:
Quote:
Quote:
Quote:
Quote:
Look, I don't mean to throw a wet blanket over testing. I am *THRILLED* to see some new faces doing detailed tests. I appreciate the work you and others have put into it, and I hope you'll stay involved in unraveling some of the mysteries of FOF. I'd just hate for y'all to burn out testing something that I'm fairly certain is going to yield little/no return. I've spent a good bit of time on this (and usually when I spend a good bit of time testing out game plans, a patch or new version ends up coming out to nerf the stuff I discover ,) and I'm convinced that:1. You can improve your defensive performance only *very* marginally by game planning "well." 2. You can screw up your defensive performance significantly by doing too much of any one thing. (Again I'd refer you to the defensive familiarity Q&A for guidelines.) 3. A "good" game plan is most useful against a team with an offensive game plan that's somewhat imbalanced. Against a balanced offense, it's virtually useless.
__________________
The media don't understand the kinds of problems and pressures 54 million come wit'! Last edited by Ben E Lou : 09-02-2008 at 10:06 AM. |
||||||
|
|
|
|
|
#30 | |
|
Grizzled Veteran
Join Date: Oct 2000
Location: Syracuse, NY
|
Quote:
Fixed. ![]() |
|
|
|
|
|
|
#31 |
|
Morgado's Favorite Forum Fascist
Join Date: Oct 2000
Location: Greensboro, NC
|
So saith yet another person whose game plan testing has caused code changes.
![]()
__________________
The media don't understand the kinds of problems and pressures 54 million come wit'! |
|
|
|
|
|
#32 | |
|
Coordinator
Join Date: May 2003
Location: Utah
|
Quote:
This to me and what everyone has been saying about it...really honestly is frustrating. However, I am able to create a defense without the big red bars and do well against other teams (except Ben's) and I feel that the right players and skill sets at the right position can make or break a defense. However, if that is not the case, so be it...this game is flawed to me more than ever then and if Gary's DDS: PF is better at all, even a shred...I will move that way immediately.
__________________
"forgetting what is in the past, I strive for the future" Last edited by MacroGuru : 09-02-2008 at 02:04 PM. |
|
|
|
|
|
|
#33 |
|
Morgado's Favorite Forum Fascist
Join Date: Oct 2000
Location: Greensboro, NC
|
Is it flawed, or does the NFL work that way? (i.e. Is it all about talent and mixing/disguising schemes in the NFL, rather than defensive schemes?)
__________________
The media don't understand the kinds of problems and pressures 54 million come wit'! |
|
|
|
|
|
#34 | |
|
Coordinator
Join Date: May 2003
Location: Utah
|
Quote:
Depends on the route you take. But everyone is saying if I don't have a great offense and great talent on defense I can't do a thing on defense. Talent wise a decent defense with the proper scheme can be devastating, even in the NFL. I should be able to game plan, scheme and do more defensively in order to succeed than what I am being told here. Get a lot of players with red bars have an offense that can score don't rex (which I agree with) don't do anything extreme to me, that's a flawed system, obviously if I have the best talent on defense, I should do well. It's about being able to win with what you have, and I am being told regardless of what I have, if they do not have the bars it's not going to work.
__________________
"forgetting what is in the past, I strive for the future" |
|
|
|
|
|
|
#35 |
|
Pro Starter
Join Date: Oct 2000
Location: Cary, NC
|
FWIW, you can Rex on defense with success (and I do all the time), you just need to manually correct its egregious abuses, mostly fixing the swapped columns bug, and toning down its tendency to call 4-deep zone.
But nothing beats the big red bars...
__________________
-- Greg -- Author of various FOF utilities |
|
|
|
|
|
#36 | |
|
Pro Starter
Join Date: Oct 2000
Location: Cary, NC
|
Quote:
But you're actually describing having the correct bars in the correct places. If you don't have talent, how do you expect to succeed? You don't need all-around players, but you do need overall talent in key places. For example, I could care less if my DTs can pass rush, and I could care less if my CBs can defend the run. If they do, it's a bonus, but they don't need to to have success. But if my DTs suck at run defense and my CBs can't cover man-to-man, no amount of scheming is going to cover that up, either in the NFL or in FOF.
__________________
-- Greg -- Author of various FOF utilities |
|
|
|
|
|
|
#37 | |
|
Mascot
Join Date: Jun 2007
|
Quote:
Leans to the flawed side. Example: Tampa/Cover 2 teams play Tampa 2 on most downs in the NFL. The Monte Kiffins of the world believe the simplicity of their scheme gives them an advantage. If you tried to run Cover 2 80% of the time in FOF then you would fail miserably. It would be a nice improvement if you could replicate that or any other type of defensive philosophy within FOF and target talent designed to fit the defensive system whether it be Cover 2, Zone blitzing (LeBeau), Bellichik, Johnson, etc... |
|
|
|
|
|
|
#38 | |
|
n00b
Join Date: May 2007
|
Quote:
Do big bars let you "cheat" more in this direction? That is, if I want to go with a more specialized defense, will having the big bars allow me get away with over-using certain strategies...even a little? |
|
|
|
|
|
|
#39 | |
|
n00b
Join Date: Apr 2006
|
Quote:
In my opinion as Ben and others have stated never go to the extreme. However i have seen success when a defense has big red bars in many areas can go to a more specialized defense more successfully. I recall playing a team recently where my offense on numerous occasions had very favorable matchups where i would be running the ball against an aggressive pass defense yet still struggled to run the ball and my team was a successful running team, the opposing defense had a lot of talent especially the front seven at stopping the run. |
|
|
|
|
|
|
#40 |
|
n00b
Join Date: May 2007
|
Well that certainly speaks to the notion of having the right people in the right schemes. Begins to make me wonder if what has been perceived as a lack of effective control in defensive gameplanning can't partially be explained as a mismatch between scheming/planning and personnel? Personally, as a fan, I know less in general about defense and this ignorance is further fueled by a lack of robust stats to explain what happens on defense.
|
|
|
|
![]() |
| Currently Active Users Viewing This Thread: 1 (0 members and 1 guests) | |
| Thread Tools | |
|
|