ZvP is imbalanced - Page 35

Post a Reply

Prev 1 33 34 35 36 37 47 Next All

motbob

United States12546 Posts

October 20 2009 17:27 GMT

#681

On October 21 2009 02:21 zulu_nation8 wrote:

Show nested quote +

I'm having trouble believing that the trend is significant not because I don't trust your math but just by what I remember. I'm very confident there has been similar trends in the past over similar samples, and if we were to look at the stats of other matchups, something like 59% over 7 months really shouldnt be very surprising.

Also, can you explain what 885 means in the equation? Like if the overall games are 30k+, is there a way to include the size of the sample?

885 is the number of ZvPs played since March 1st. We plug it into the z test equation for n.

motbob

United States12546 Posts

October 20 2009 17:28 GMT

#682

On October 21 2009 02:27 zulu_nation8 wrote:

Show nested quote +

motbob just think of it like this, how can the MEAN of zerg win% ever be 100% over similar samples? Surely theres never been a period in progaming when zerg has won every game vs toss over 800 games?

Uh yeah this is true but I dunno why it's relevant. Again, I think you're confusing the standard error involved in a statistical test with the standard deviation of a population.

zulu_nation8

China26351 Posts

October 20 2009 17:32 GMT

#683

On October 21 2009 02:26 Muirhead wrote:
It really doesn't matter how many games were played in the past Zulu_nation.

Like if you roll a dice 3 billion times you can get a good idea of how its weighted.

If you change something and then roll it a million times you can still get a good idea of whether the weighting significantly changed, even though a million is a tiny fraction of 3 billion.

ok let me relearn this stuff and I'll get back to you.

Black Gun

Germany4482 Posts

October 20 2009 17:36 GMT

#684

On October 21 2009 02:21 zulu_nation8 wrote:

Show nested quote +

i was refering to the figures from the last page, which are the figures for the last 7 months, progaming only. there we had 524 zerg wins to 361 toss wins. thats 885 games played. thats the sample from which we estimate the winning percentage of the last 7 months. in the tests conducted so far, we assume that the historical balance with which we compare the last 7 months is given. if u want to include the uncertainty involved because we estimate this historical balance zvp winning ratio from data, u would have to conduct a two-sample-test. but as u said, there are like several thousands of games from which we would estimate the historical percentage, thus the uncertainty involved in this estimation is negligible.

but u are right in one point: the metagame has always been shifting, it goes up and down. comparing with the average of these up and down movements can be misleading. the decisive question is if the current shift in metagame in favour of zerg is more severe than previous shifts. in other words: is it even worse than the previous bad times for protoss, or is it comparable to them?

zulu_nation8

China26351 Posts

October 20 2009 17:36 GMT

#685

On October 21 2009 02:28 motbob wrote:

Show nested quote +

On October 21 2009 02:27 zulu_nation8 wrote:

On October 21 2009 02:19 JWD wrote:

On October 21 2009 02:18 motbob wrote:

On October 21 2009 02:03 JWD wrote:
motbob I'm going to try to explain exactly why your standard deviation (technically it's a standard error, since standard deviation is a "true", unascertainable value and we are just estimating it) is wrong, since zulu won't do it. I think what you did is to calculate the standard error of the variable zerg win, which is a binary variable you defined (I picked the name for exposition's sake) that equals 1 if Zerg wins a ZvP and 0 if Protoss wins. You correctly calculated the standard error of this variable — we'd expect it to be near .5 because the mean of zerg win is about .5, and so each instance of zerg win is about .5 from that mean.

However, this standard error is not the standard error relevant to your test for determining whether the recent Z>P trend is significantly anomalous. The variable you are examining in that test is not zerg win, but ZvP balance over a several-month period, another variable which I'll call balance. Therefore the standard error you must use in your test is the standard error of balance—that is, the error of several-month ZvP balance from the mean several-month ZvP balance. You can NOT use the standard error of zerg win, which has no place in your calculation.

...I don't have a choice as to which SE I use in my test. SE is SD (of my data) divided by sqrt(n). I can't change it.

I can change my null hypothesis though... are you saying my null hypothesis should be the historical winrate instead of 50%?

yeah I confused standard error and stdev, and just edited to fix that…no that's not what I'm saying.

I'm too rusty on stats to make any further useful contributions to this thread, but I'm pretty sure I explained your problem right motbob

motbob just think of it like this, how can the MEAN of zerg win% ever be 100% over similar samples? Surely theres never been a period in progaming when zerg has won every game vs toss over 800 games?

Uh yeah this is true but I dunno why it's relevant. Again, I think you're confusing the standard error involved in a statistical test with the standard deviation of a population.

no im not, like JWD says, your standard deviation is the mean of zerg winning, so of course zerg can only win or not win, and not 80% win. What we're actually calculating is the mean of the zerg win RATIO, in that case the SD would be something like how far on average does the zvp % deviate from the overall historical mean of say 55% over similar 855 game samples.

motbob

United States12546 Posts

October 20 2009 17:39 GMT

#686

On October 21 2009 02:36 zulu_nation8 wrote:

Show nested quote +

On October 21 2009 02:28 motbob wrote:

On October 21 2009 02:27 zulu_nation8 wrote:

On October 21 2009 02:19 JWD wrote:

On October 21 2009 02:18 motbob wrote:

motbob just think of it like this, how can the MEAN of zerg win% ever be 100% over similar samples? Surely theres never been a period in progaming when zerg has won every game vs toss over 800 games?

Uh yeah this is true but I dunno why it's relevant. Again, I think you're confusing the standard error involved in a statistical test with the standard deviation of a population.

no im not, like JWD says, your standard deviation is the mean of zerg winning

no it isn't lol

zulu_nation8

China26351 Posts

October 20 2009 17:44 GMT

#687

let me explain that, your SD is how far your data deviates from the possible outcomes, out of win or not win, for a zerg progamer when he enters a 1v1 starcraft game, whereas the rest of your equation is calculating the win ratio of a certain number of games, and basically not binary data.

zulu_nation8

China26351 Posts

October 20 2009 17:49 GMT

#688

motbob honestly i probably havent used a calculator in three years and Im also not an econ major so I understand why you wouldnt believe me but please, just use common sense, how can the standard deviation be 50% in this case? How is that possible? The SD is a mean.

EtherealDeath

United States8366 Posts

October 20 2009 17:50 GMT

#689

Should just get all the TLPD data and calcaute the probability of a P's win chance being 48-52% (or some other arbitrary range) at each time period, using data from x months to the left and right of said time period given the win% in that time period, but I guess zulu is working on getting the data for that.

EtherealDeath

United States8366 Posts

October 20 2009 17:53 GMT

#690

On October 21 2009 02:49 zulu_nation8 wrote:
motbob honestly i probably havent used a calculator in three years and Im also not an econ major so I understand why you wouldnt believe me but please, just use common sense, how can the standard deviation be 50% in this case? How is that possible? The SD is a mean.

On October 21 2009 01:29 Black Gun wrote:

Show nested quote +

the variable we are discussing here is binary, hence the estimator of the mean is the proportion p = 524/(524+361) = 0.592. the sample size is large enough to use a normal approximation.

if we assume a null-hypothesis of a balanced winrate of p0 = 50%, then in the corresponding test we need to use this p0 and not p in the formula for the standard deviation!

the test statistic then is:

Z = sqrt(n)*(p - p0)/sqrt[p0*(1-p0)] = sqrt(885)*(0.5921 - 0.5)/sqrt(0.5*(1-0.5)) = 5.479 -> highly significant.

if we assume a null-hypothesis of p0 = 0.55, then we obtain a Z of 2.517 -> p-value of 0.0059, ie significant even on a confidence level of 99%.

so the ZvP-winrate during that timeframe significantly exceeds 55%.

zulu_nation8

China26351 Posts

October 20 2009 17:54 GMT

#691

On October 21 2009 02:50 EtherealDeath wrote:
Should just get all the TLPD data and calcaute the probability of a P's win chance being 48-52% (or some other arbitrary range) at each time period, using data from x months to the left and right of said time period given the win% in that time period, but I guess zulu is working on getting the data for that.

Im gonna use # of games instead of time period, and the # of games would be a number roughly equivalent to the number of games in a season.

Black Gun

Germany4482 Posts

October 20 2009 17:54 GMT

#692

if u get that data, could u plz send it to me aswell? id like to draw some graphs to see the development over time^^

motbob

United States12546 Posts

October 20 2009 17:55 GMT

#693

On October 21 2009 02:44 zulu_nation8 wrote:
let me explain that, your SD is how far your data deviates from the possible outcomes, out of win or not win, for a zerg progamer when he enters a 1v1 starcraft game, whereas the rest of your equation is calculating the win ratio of a certain number of games, and basically not binary data.

No it isn't, that's the standard error. The standard error is the SD divided by sqrt(n). That's what I've been trying to tell you.

EDIT: and the SD is not a mean, wtf.

EtherealDeath

United States8366 Posts

October 20 2009 17:59 GMT

#694

On October 21 2009 02:54 Black Gun wrote:

Show nested quote +

if u get that data, could u plz send it to me aswell? id like to draw some graphs to see the development over time^^

Well my web programming is nonexistent, but I'd imagine that you would direct your code to http://www.teamliquid.net/tlpd/games/ and grab data from winner/loser, and then since each the data in each field happens to have race encoded, that would work. But yea, someone do it plz =)

Day[9]

United States7366 Posts

October 20 2009 18:08 GMT

#695

wait what happened?

Day[9]

United States7366 Posts

October 20 2009 18:08 GMT

#696

wait zvp is what?

zulu_nation8

China26351 Posts

October 20 2009 18:09 GMT

#697

i really wish i was better at math so i can explain what everyone already knows by common sense.

motbob

United States12546 Posts

October 20 2009 18:11 GMT

#698

On October 21 2009 03:09 zulu_nation8 wrote:
i really wish i was better at math so i can explain what everyone already knows by common sense.

I wish you were better at statistics so that you could realize that your efforts are misguided.

zulu_nation8

China26351 Posts

October 20 2009 18:14 GMT

#699

ok motbob you win, the standard deviation is clearly 50% from a mean of 55%.

infinity2k9

United Kingdom2397 Posts

October 20 2009 18:17 GMT

#700

This statistics talk is really over the top and silly. You only need to WATCH THE GAMES to get an idea of this topic. The point shouldn't be trying to prove things with pointless statistics but actual game analysis, we all know Zerg is doing better lately its plainly obvious but just point out the reasons why and what can change it.

Prev 1 33 34 35 36 37 47 Next All

Please or register to reply.

ZvP is imbalanced - Page 35

Completed

Ongoing

Upcoming