Part 2 - I suck on wednesday - Toying with SC2 replay stats

In my previous (and first) blog post a week ago, I discussed some interesting statistics that I had compiled from my personal set of 1208 1v1 games played in the last 18 months. While my overall win-rate of 50.7% was very close to the 50% that the Blizzard matchmaking contraption aims for, things looked different when I broke the results down per weekday.

On wednesday, my win-rate was only 39.4%, which is very low. I then went on to discuss the statistical concept of "standard deviation" (expressed by the sigma variable), which is a measure of how much variation there is in a data-set. In the case of my wednesday data, my win-rate deviated from the expected outcome by 2.5 sigma. This means that there is only about a 1% probability of getting such an outcome by pure chance.

So the conclusion I ended with was that it seemed unlikely that pure chance was behind the sucky wednesdays and that there had to be a deeper cause. In this post I will take the analysis a bit further and see if my original conclusion holds up under further testing.

The definition of a day
I use SC2Gears to mass-analyze my replays and it does a lovely job at it. Immediately after seeing my wednesday win-rate in the SC2Gears multi-replay-analysis screen, I started to think about what could cause it. One of the first things that popped into my mind was a very simple idea.

SC2Gears considers a day to start and end at midnight. This is a reasonable thing to do, since it's pretty much what everyone does. However, in the context of this analysis it may not be the right way to go about it. If there is some mental condition that is affecting my play on wednesday, it does not magically disappear at midnight. Nor does it magically appear at midnight on tuesday. All games played past midnight count towards the next day. Makes sense when you look at the formal concept of day, running from midnight to midnight, but not so much when you look at my 24-hour cycle where the sleeping period is the divider between two days.

Going back to the statistics, I noticed that my latest game was logged somewhere at hour 3 and my earliest at hour 8 (SC2Gears breaks the results down by the hour as well, so this info was easy to find). A natural way to go about it would be to place the day-divider somewhere between 3 and 8. I posted in the SC2Gears thread with a feature request for a custom day-divider but this request was not fulfilled unfortunately.

So I let it rest for a while until the numbers started to gnaw at me again. I wrote a short script to run over my replay database and compile a list of win-rates for each day of the week, but now with a "day" having the right beginning and end point.
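A minimal sketch of what such a script could look like, not the exact one used here. The cutoff hour of 5 is an assumption (any hour between 3 and 8 fits the data described above), and the input format of (timestamp, won) pairs is hypothetical; reading the replays themselves is left out.

```python
from collections import defaultdict
from datetime import datetime, timedelta

WEEKDAYS = ["Monday", "Tuesday", "Wednesday", "Thursday",
            "Friday", "Saturday", "Sunday"]

# Assumed day-divider: a new "playing day" starts at 05:00 local time.
DAY_START_HOUR = 5


def playing_day(timestamp, day_start_hour=DAY_START_HOUR):
    """Return the weekday a game belongs to when days start at day_start_hour."""
    # Shifting the clock back makes games played shortly after midnight
    # fall on the previous calendar day.
    shifted = timestamp - timedelta(hours=day_start_hour)
    return WEEKDAYS[shifted.weekday()]


def win_rates_by_day(games, day_start_hour=DAY_START_HOUR):
    """games: iterable of (datetime, won) pairs.

    Returns a dict mapping weekday name to (games played, win-rate).
    """
    wins = defaultdict(int)
    totals = defaultdict(int)
    for timestamp, won in games:
        day = playing_day(timestamp, day_start_hour)
        totals[day] += 1
        if won:
            wins[day] += 1
    return {day: (totals[day], wins[day] / totals[day]) for day in totals}
```

With this divider, a game played at 01:30 on a thursday morning is counted as a wednesday game, which is exactly the shift described above.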

The results are as follows:

Monday - 58.2% (4.8%) (1.7)
Tuesday - 44.7% (8.1%) (0.65)
Wednesday - 43.3% (3.7%) (2.1)
Thursday - 49.1% (4.9%) (0.18)
Friday - 52.5% (3.9%) (0.64)
Saturday - 50.6% (2.8%) (0.21)
Sunday - 54.3% (2.9%) (1.5)

The first number on each line is the win-rate for that "day". The second number, the percentage between brackets, is the expected standard deviation of the win-rate if we assume that the chance to win a game is 50%. This value becomes smaller as more games are played. The third number is how many standard deviations the observed win-rate lies from the expected value of 50%.
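For those who want to reproduce the second and third numbers, here is a small sketch under the assumption that each game is an independent win/loss with a fixed win chance, so that the standard deviation of the win-rate over n games is sqrt(p(1-p)/n). The function names are made up for illustration.

```python
import math


def expected_sigma(games_played, win_chance=0.5):
    """Expected standard deviation of the win-rate over games_played games."""
    return math.sqrt(win_chance * (1.0 - win_chance) / games_played)


def sigma_deviation(win_rate, games_played, win_chance=0.5):
    """How many standard deviations the observed win-rate lies from win_chance."""
    return abs(win_rate - win_chance) / expected_sigma(games_played, win_chance)
```

As a hypothetical example, 100 games at a 40% win-rate give a standard deviation of 5% and therefore a deviation of 2 sigma. The win_chance parameter can be set to something other than 50%, for instance your own overall win-rate.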

The first observation is that the wednesday anomaly has been reduced significantly by applying this shift in the data. The odds of a 2.1 sigma deviation happening by chance are 3.6%, against the 1% for the 2.6 sigma deviation we had before. But there's more to it than this.
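The step from a sigma value to a probability can be sketched as follows, assuming the win-rate is approximately normally distributed and counting deviations in both directions (above and below 50%). The function name is made up for illustration.

```python
import math


def two_sided_probability(sigma_deviation):
    """Chance that a normally distributed value deviates from its mean
    by at least sigma_deviation standard deviations, in either direction."""
    return math.erfc(sigma_deviation / math.sqrt(2.0))


print(f"2.1 sigma: {two_sided_probability(2.1):.1%}")
print(f"2.6 sigma: {two_sided_probability(2.6):.1%}")
```

Under these assumptions, 2.1 sigma comes out at about 3.6% and 2.6 sigma at just under 1%.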

The look-elsewhere-effect
Suppose you test whether a coin is fair by flipping it 100 times and counting the number of heads and tails. If the result is not too far (say less than 2 sigma) from the expected value, you call the coin fair. This is a very reasonable way of determining whether a coin is fair or not. The problem is that once you start testing multiple coins, you're bound to run into one that is fair, but does not pass the test. In fact, about one in twenty fair coins will have a 2 sigma (or more) deviation from a 50/50 distribution.
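The "one in twenty" figure can be checked for the 100-flip example with a short calculation. This sketch uses the exact binomial distribution rather than the normal approximation; for 100 flips of a fair coin the standard deviation is 5 heads, so a 2 sigma deviation means 40 or fewer, or 60 or more heads. The function name is made up for illustration.

```python
import math


def chance_fair_coin_fails(flips=100, sigmas=2.0):
    """Chance that a fair coin lands at least `sigmas` standard deviations
    away from the expected number of heads."""
    expected_heads = flips / 2.0
    sigma = math.sqrt(flips) / 2.0  # standard deviation of heads for a fair coin
    outcomes = 2 ** flips
    failing = sum(
        math.comb(flips, heads)
        for heads in range(flips + 1)
        if abs(heads - expected_heads) >= sigmas * sigma
    )
    return failing / outcomes


print(f"{chance_fair_coin_fails():.1%}")
```

The result lands close to 5%, so roughly one in twenty perfectly fair coins would be flagged by this test.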

Even though the analysis of a single data-set may be solid, the more data-sets you analyze, the larger the probability of finding an anomalous result that is purely due to chance. This phenomenon is known as the Look-Elsewhere-Effect (among other names). The term gained some popularity after it was used in some reports on the search for the Higgs boson at the LHC particle accelerator.

It applies to this case too. If we assume that there's nothing special about wednesday, then the anomaly we see there may as well have happened on any other day. This means that the odds of having a deviation of this magnitude are much higher than we've previously assumed. Without bothering you with the equation (it's not difficult), the chance of at least one day having at least a 2.1 sigma deviation is 22.5%.
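For the curious, the calculation can be sketched like this. It assumes the seven days are independent and that each has the same chance p of showing a deviation of 2.1 sigma or more, so the chance that at least one day does is 1 - (1 - p)^7. The function names are made up for illustration.

```python
import math


def two_sided_probability(sigma_deviation):
    """Chance of a deviation of at least sigma_deviation in either direction,
    using the normal approximation."""
    return math.erfc(sigma_deviation / math.sqrt(2.0))


def chance_of_any_anomaly(sigma_deviation, data_sets=7):
    """Chance that at least one of data_sets independent data-sets shows
    a deviation of at least sigma_deviation."""
    single = two_sided_probability(sigma_deviation)
    return 1.0 - (1.0 - single) ** data_sets


print(f"{chance_of_any_anomaly(2.1):.1%}")
```

With 2.1 sigma and seven days this gives the 22.5% quoted above.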

In conclusion
There we have it. The final number to measure the weirdness of this result. 22.5% is the chance that a deviation this big occurs due to pure chance. It's not a big probability, but it's not so small that pure chance can be excluded as the cause of it. There may not be something wrong with me on wednesdays. The only way to know for sure is to get more statistics by playing more.

On a related note, the analyses in this blogpost and the previous one demonstrate how careful one must be when using statistics. The naive result, using just the raw data from SC2Gears, suggested a very large deviation, which could (with 99% confidence) not be due to chance. However, after more careful evaluation of the data, the outcome changed dramatically.

RaiderRob

Netherlands, 377 Posts

March 15 2012 11:19 GMT

Not statistical analysis but for me it's common (anecdotal?) knowledge that players in any game I've played on average are better in the middle of the week than during the weekend. During the week and late at night it's mostly the more dedicated everyday players while during the weekend and especially Sunday afternoons you get all the weekend warriors playing their 5-10 games of SC2 for the week. So maybe the cause isn't that you're worse but you are running into better than average opponents on Wednesday and on Sunday you get weaker than average, on the whole it balances out.

Cyber_Cheese

Australia, 3615 Posts

March 15 2012 11:19 GMT

Tuesday seems completely random :O

Alejandrisha

United States, 6565 Posts

March 15 2012 12:07 GMT

i did some of my own calculations
n = 2151

sigmas and z scores:
Tues 2.74%, .6569
Mon 2.74%, .6204
Sat 3.03%, .0275
Wed 2.66%, -.3205
Fri 2.77%, -.3876
Thur 2.90%, -.4606
Sun 2.92%, -.4947

nothing significant but still pretty cool to look at

Rannasha

Netherlands, 2398 Posts

March 15 2012 12:32 GMT

On March 15 2012 20:19 RaiderRob wrote:
Not statistical analysis but for me it's common (anecdotal?) knowledge that players in any game I've played on average are better in the middle of the week than during the weekend. During the week and late at night it's mostly the more dedicated everyday players while during the weekend and especially Sunday afternoons you get all the weekend warriors playing their 5-10 games of SC2 for the week. So maybe the cause isn't that you're worse but you are running into better than average opponents on Wednesday and on Sunday you get weaker than average, on the whole it balances out.

The match-making system should remove most of this effect though. If in the middle of the week the average player is better, then the distribution of players within the ranks/leagues should be shifted towards the higher end (more master than bronze online) while in weekends there'll be more lower league players online. The matchmaking system will always try to match you against an equal opponent though, so unless the number of players is so low that this isn't possible, I don't expect to see a significant difference in the skill of my opponents. Of course, this is impossible to measure since Blizzard doesn't release MMR.

@Cyber_cheese: I don't have many games played on tuesday, as I'm often doing other things on that day (WoW raiding mostly). Consequently, the standard deviation is really high. Saturday and sunday are my most active days on the other hand.

Alejandrisha

United States, 6565 Posts

March 15 2012 12:41 GMT

i used my own average instead of 50% for the expected average. that's fine, right?

Dakota_Fanning

Hungary, 2347 Posts

March 15 2012 12:54 GMT

Wednesday is not just like any other day.

Blizzard Patches and Battle.net maintenance is Tuesday-Wednesday (depends on your location)...

More lag and game changes might affect you and everyone else differently. This also explains the lower number of games.

Rannasha

Netherlands, 2398 Posts

March 15 2012 14:07 GMT

On March 15 2012 21:54 Dakota_Fanning wrote:
Wednesday is not just like any other day.

Blizzard Patches and Battle.net maintenance is Tuesday-Wednesday (depends on your location)...

More lag and game changes might affect you and everyone else differently. This also explains the lower number of games.

I'm on the EU servers, so my maintenance is wednesday morning. This is a time I rarely play though, as I usually don't play between getting up and going to work. Never really noticed any after-effects of maintenance on wednesday evening. Game-changes happen so rarely (and often don't affect me at the plat/diamond level that much) that they shouldn't play a role.

My game-count on wednesday is quite average for a weekday. It's tuesday that's very low, but that's easily explained by the planning of my non-SC2 activities.

Right now, after having redone the counting with the hours past midnight being added to the previous day as well as having compensated for the look-elsewhere-effect, I think that my wednesday-anomaly is due to chance rather than any external factors. I'll keep playing games to see where the numbers go.

htn2481

Vietnam, 117 Posts

March 15 2012 15:02 GMT

Just read the "My roommate + Men" thread then jumped to next thread "I suck on Wednesdays" hahaha

Kurr

Canada, 2338 Posts

March 15 2012 15:15 GMT

#10

Well, it was Steak and blowjob day yesterday and it was a Wednesday. Makes sense I guess!

Yeah, I read it the same way as htn2481 lol.

29 fps

United States, 5724 Posts

March 15 2012 15:28 GMT

#11

On March 16 2012 00:02 htn2481 wrote:
Just read the "My roommate + Men" thread then jumped to next thread "I suck on Wednesdays" hahaha

same here...

Rannasha

Netherlands, 2398 Posts

March 15 2012 15:56 GMT

#12

Unfortunately the subtitle "Toying with SC2 replay stats" didn't fit anymore if I wanted to also include the "part 2". Once I had posted it and saw the listing I realized it wasn't the best title ever.

Oh well, not the first time I've lured an audience in with a slightly deceptive title.