ZvP is imbalanced - Page 34

Post a Reply

Prev 1 32 33 34 35 36 47 Next All

Holgerius

Sweden16951 Posts

October 20 2009 16:46 GMT

#661

How did this turn into a math discussion?

zulu_nation8

China26351 Posts

October 20 2009 16:47 GMT

#662

yes but it shouldnt be 50% since we are only looking at progaming zvps, so we need to use the historical mean of zvp as the null and not 50%.

zulu_nation8

China26351 Posts

October 20 2009 16:55 GMT

#663

and im pretty sure 59% over 800 games falls pretty inside the standard deviation and will not be significant at all once the real SD is figured out.

Muirhead

United States556 Posts

October 20 2009 16:57 GMT

#664

Figure out the mean for progaming ZvPs, and when I get home I will calculate my weighted coin scenario with that mean instead of 47% for you! I will tell the exact chance that given your mean protoss will have won as many games as they did or less, not even a statistical approximation.

Black Gun

Germany4482 Posts

October 20 2009 17:00 GMT

#665

On October 21 2009 01:43 zulu_nation8 wrote:
can you explain to me why 1-P0 is the SD

short answer: because the number of zerg wins (lets call it X) follows a binomial distribution and the estimator for the variance of the winning percentage (X/n, where X = number of wins) is p*(1-p)/n.

(veeery) long answer: in statistical hypothesis testing we compute how likely or unlikely the observed outcome would be if the null-hypothesis was true. obviously, to do that we need to know the cumulative distribution function (cdf) of the null-distribution.

to provide a unifying framework and for historical reasons (times when ppl had no support of powerful computers and had to do things by hand), a normal approximation is used so that instead of the "true null distribition" we can use the well-known cdf of a standard normal distribution to compute the p-value. to do that, we need to know the variance/standard deviation under the null-hypothesis.

the decisive thing about all that: we only use the normal distribution for the p-value, but the standard deviation still is that of the null-distribution. and the sd of the null distribution (which is scaled binomial here) is sqrt(p0*(1-p0)/n).

edit: read my previous post guys, i already conducted the test with the correct sd for a null-hypothesis of a historical zvp percentage of 55% and still it is highly significant!

during the last 7 months the zvp winning percentage is highly significant above 55%!

zulu_nation8

China26351 Posts

October 20 2009 17:00 GMT

#666

I'm working on getting access to TLPD, also remember there are 30k games on TLPD and only recently did it record offline and prelim games, so the overall # of games played in progaming is much larger. I don't think 800 games would be significant in this case, unless we remove the offline prelim games and only count tv games, actually even then the sample size would probably still be too small.

motbob

United States12546 Posts

October 20 2009 17:03 GMT

#667

On October 21 2009 01:14 zulu_nation8 wrote:

Show nested quote +

motbob i think its pretty obvious a standard deviation of 50% is wrong, the sooner you realize this and drop the im an econ major i know stats attitude, the faster we can move on.

A win is not 100%, and a loss is not 0%, that would be the standard deviation if brood war had like 80% half wins or something, even then that would not make sense since there would be no statistical significance since EVERYTHING would fall under the range of 0 and 1, thats why your numbers are so messed up.

You are making a fool out of yourself. Stop trying to pretend you know more about stats than me. You're tripped over your terminology and your methodology ever since we started having this discussion. You continually make basic mistakes like confusing standard deviation with standard error. I'm going to say it again: either do a statistical test of your own or stop trying to criticize my methods.

Muirhead

United States556 Posts

October 20 2009 17:03 GMT

#668

Black gun I believe you by the way. I'm just trying to explain in a way that doesn't involve as much knowledge.

JWD

United States12607 Posts

October 20 2009 17:03 GMT

#669

motbob I'm going to try to explain exactly why your standard deviation is wrong, since zulu won't do it. I think what you did is to calculate the standard deviation of the variable zerg win, which is a binary variable you defined (I picked the name for exposition's sake) that equals 1 if Zerg wins a ZvP and 0 if Protoss wins. You correctly calculated the standard deviation of this variable — we'd expect it to be near .5 because the mean of zerg win is about .5, and so each instance of zerg win is about .5 from that mean.

However, this standard deviation is not the standard deviation relevant to your test for determining whether the recent Z>P trend is significantly anomalous. The variable you are examining in that test is not zerg win, but ZvP balance over a several-month period, another variable which I'll call balance. Therefore the standard deviation you must use in your test is the standard deviation of balance—that is, the error of several-month ZvP balance from the mean several-month ZvP balance. You can NOT use the standard deviation of zerg win, which has no place in your calculation.

Black Gun

Germany4482 Posts

October 20 2009 17:11 GMT

#670

On October 21 2009 02:03 JWD wrote:
motbob I'm going to try to explain exactly why your standard deviation (technically it's a standard error, since standard deviation is a "true", unascertainable value and we are just estimating it) is wrong, since zulu won't do it. I think what you did is to calculate the standard error of the variable zerg win, which is a binary variable you defined (I picked the name for exposition's sake) that equals 1 if Zerg wins a ZvP and 0 if Protoss wins. You correctly calculated the standard error of this variable — we'd expect it to be near .5 because the mean of zerg win is about .5, and so each instance of zerg win is about .5 from that mean.

However, this standard error is not the standard error relevant to your test for determining whether the recent Z>P trend is significantly anomalous. The variable you are examining in that test is not zerg win, but ZvP balance over a several-month period, another variable which I'll call balance. Therefore the standard error you must use in your test is the standard error of balance—that is, the error of several-month ZvP balance from the mean several-month ZvP balance. You can NOT use the standard error of zerg win, which has no place in your calculation.

u should read my last 2 posts. in the first one i conducted the correct tests. in the second one i explained in detail why the standard error i was using is the correct one. and no, it is not hard to compute the sd of "balance". once we get a certain zvp winning percentage as the "historical balance", the sd needed in our test is simply sqrt[p*(1-p)/n]. i already tried it with 55%, so if the historical zvp stats are not higher than 55%, then the outcome of the last 7 months differs significantly.

im a statistics major close to graduating, so u can believe me

zulu_nation8

China26351 Posts

October 20 2009 17:12 GMT

#671

On October 21 2009 02:03 motbob wrote:

Show nested quote +

first of all motbob, i think you should calm down. I learned this stuff in a psych 101 class 3 years ago so I assume it's pretty basic stuff and nothing to get worked up about. You plugged in .49 for standard deviation, not standard deviation, SE = SD/root(885). I had no idea how you got the number but just by common sense you should see that a SD of 50% for a null of 50% is wrong. I plan to do the test when I get access to TLPD. But you really should calm down or else this will go nowhere.

JWD

United States12607 Posts

October 20 2009 17:15 GMT

#672

On October 21 2009 02:11 Black Gun wrote:

Show nested quote +

Yep, your calculations look good. Bottom line is yes, the recent ZvP trend is statistically significant. This shouldn't be very surprising.

motbob

United States12546 Posts

October 20 2009 17:16 GMT

#673

On October 21 2009 02:12 zulu_nation8 wrote:

Show nested quote +

dude

it's binary data, the data points are either a 1 or 0. Of course the stdev is going to be ~.5! Punch a bunch of binary data into excel and use excel to get the stdev of that data. It will give you a value of about .5

JWD

United States12607 Posts

October 20 2009 17:18 GMT

#674

On October 21 2009 02:16 motbob wrote:

Show nested quote +

On October 21 2009 02:12 zulu_nation8 wrote:

On October 21 2009 02:03 motbob wrote:

On October 21 2009 01:14 zulu_nation8 wrote:

On October 20 2009 15:33 motbob wrote:

On October 20 2009 15:22 zulu_nation8 wrote:
Standard deviation means how far the mean % from other samples of 800 games in the history of progaming can deviate from the null hypothesis. Which should be something like .05 or .1. What your test proved was that basically your numbers are wrong.

Go into excel and use the command stdev on a bunch of numbers. That's the standard deviation I'm talking about. You plug that into this equation (for omega):

$[image loading]$

Please don't criticize my methods again until you do a statistical test of your own. After all, you said you would.

motbob read my post, you calculated the figure correctly, but that's not the proper figure for your test.

citi.zen

2509 Posts

October 20 2009 17:18 GMT

#675

I would just note the number of "zvp is imba" posts in the past 3 months. Over the same period I haven't seen any posts claiming the opposite. I am not saying that having many posts on a topic makes it a valid argument, but at the very least in terms of perception something is certainly going on.

motbob

United States12546 Posts

October 20 2009 17:18 GMT

#676

...I don't have a choice as to which SE I use in my test. SE is SD (of my data) divided by sqrt(n). I can't change it.

I can change my null hypothesis though... are you saying my null hypothesis should be the historical winrate instead of 50%?

JWD

United States12607 Posts

October 20 2009 17:19 GMT

#677

On October 21 2009 02:18 motbob wrote:

Show nested quote +

yeah I confused standard error and stdev, and just edited to fix that…no that's not what I'm saying.

I'm too rusty on stats to make any further useful contributions to this thread, but I'm pretty sure I explained your problem right motbob

zulu_nation8

China26351 Posts

October 20 2009 17:21 GMT

#678

On October 21 2009 02:11 Black Gun wrote:

Show nested quote +

I'm having trouble believing that the trend is significant not because I don't trust your math but just by what I remember. I'm very confident there has been similar trends in the past over similar samples, and if we were to look at the stats of other matchups, something like 59% over 7 months really shouldnt be very surprising.

Also, can you explain what 885 means in the equation? Like if the overall games are 30k+, is there a way to include the size of the sample?

Muirhead

United States556 Posts

October 20 2009 17:26 GMT

#679

It really doesn't matter how many games were played in the past Zulu_nation.

Like if you roll a dice 3 billion times you can get a good idea of how its weighted.

If you change something and then roll it a million times you can still get a good idea of whether the weighting significantly changed, even though a million is a tiny fraction of 3 billion.

zulu_nation8

China26351 Posts

October 20 2009 17:27 GMT

#680

On October 21 2009 02:19 JWD wrote:

Show nested quote +

motbob just think of it like this, how can the MEAN of zerg win% ever be 100% over similar samples? Surely theres never been a period in progaming when zerg has won every game vs toss over 800 games?

Prev 1 32 33 34 35 36 47 Next All

Please or register to reply.

ZvP is imbalanced - Page 34

Completed

Ongoing

Upcoming