• Log InLog In
  • Register
Liquid`
Team Liquid Liquipedia
EDT 09:47
CEST 15:47
KST 22:47
  • Home
  • Forum
  • Calendar
  • Streams
  • Liquipedia
  • Features
  • Store
  • EPT
  • TL+
  • StarCraft 2
  • Brood War
  • Smash
  • Heroes
  • Counter-Strike
  • Overwatch
  • Liquibet
  • Fantasy StarCraft
  • TLPD
  • StarCraft 2
  • Brood War
  • Blogs
Forum Sidebar
Events/Features
News
Featured News
Classic Games #3: Rogue vs Serral at BlizzCon0[ASL20] Ro16 Preview Pt1: Ascent9Maestros of the Game: Week 1/Play-in Preview12[ASL20] Ro24 Preview Pt2: Take-Off7[ASL20] Ro24 Preview Pt1: Runway13
Community News
Weekly Cups (Sept 1-7): MaxPax rebounds & Clem saga continues22LiuLi Cup - September 2025 Tournaments3Weekly Cups (August 25-31): Clem's Last Straw?39Weekly Cups (Aug 18-24): herO dethrones MaxPax6Maestros of The Game—$20k event w/ live finals in Paris76
StarCraft 2
General
Team Liquid Map Contest #21 - Presented by Monster Energy #1: Maru - Greatest Players of All Time Weekly Cups (Sept 1-7): MaxPax rebounds & Clem saga continues Classic Games #3: Rogue vs Serral at BlizzCon What happened to Singapore/Brazil servers?
Tourneys
LiuLi Cup - September 2025 Tournaments Maestros of The Game—$20k event w/ live finals in Paris WardiTV Mondays Sparkling Tuna Cup - Weekly Open Tournament RSL: Revival, a new crowdfunded tournament series
Strategy
Custom Maps
External Content
Mutation # 490 Masters of Midnight Mutation # 489 Bannable Offense Mutation # 488 What Goes Around Mutation # 487 Think Fast
Brood War
General
The Korean Terminology Thread Recommended FPV games (post-KeSPA) [ASL20] Ro16 Preview Pt1: Ascent FlaSh on ACS Winners being in ASL ASL20 General Discussion
Tourneys
[ASL20] Ro16 Group B [ASL20] Ro16 Group A Is there English video for group selection for ASL BWCL Season 63 Announcement
Strategy
Simple Questions, Simple Answers Muta micro map competition Fighting Spirit mining rates [G] Mineral Boosting
Other Games
General Games
Stormgate/Frost Giant Megathread Iron Harvest: 1920+ General RTS Discussion Thread Diablo IV S10 Infernal Tides Guide Nintendo Switch Thread
Dota 2
Official 'what is Dota anymore' discussion
League of Legends
Heroes of the Storm
Simple Questions, Simple Answers Heroes of the Storm 2.0
Hearthstone
Heroes of StarCraft mini-set
TL Mafia
TL Mafia Community Thread
Community
General
US Politics Mega-thread Russo-Ukrainian War Thread Things Aren’t Peaceful in Palestine The Games Industry And ATVI UK Politics Mega-thread
Fan Clubs
The Happy Fan Club!
Media & Entertainment
[Manga] One Piece Anime Discussion Thread Movie Discussion! [\m/] Heavy Metal Thread
Sports
2024 - 2026 Football Thread Formula 1 Discussion MLB/Baseball 2023 TeamLiquid Health and Fitness Initiative For 2023
World Cup 2022
Tech Support
Linksys AE2500 USB WIFI keeps disconnecting Computer Build, Upgrade & Buying Resource Thread High temperatures on bridge(s)
TL Community
The Automated Ban List
Blogs
Collective Intelligence: Tea…
TrAiDoS
A very expensive lesson on ma…
Garnet
hello world
radishsoup
Lemme tell you a thing o…
JoinTheRain
RTS Design in Hypercoven
a11
Evil Gacha Games and the…
ffswowsucks
INDEPENDIENTE LA CTM
XenOsky
Customize Sidebar...

Website Feedback

Closed Threads



Active: 1231 users

ZvP is imbalanced - Page 37

Forum Index > BW General
Post a Reply
Prev 1 35 36 37 38 39 47 Next All
EtherealDeath
Profile Blog Joined July 2007
United States8366 Posts
October 20 2009 19:01 GMT
#721
On October 21 2009 03:55 Black Gun wrote:
Show nested quote +
On October 21 2009 03:46 EtherealDeath wrote:
Would be cool though to get the race matchup elo of each player in each game and then use that to determine the probability of the zerg winning each game, and then use that to determine the probability of zerg winning at least as many games as they did. If only there were a simple to use automated process for this...



still, there would remain the question how to scale elo differences. same elo means winning percentage of 50%, obviously. but how strong is the effect of lets say 50 points difference in elo? and does the effect of elo differences change for different elo regions?

to tackle this, u would need to perform a logistic regression

that would surely be interesting, but i dont think many guys would understand it, especially if the effect of elo differences is not constant over time, ie would have to be modelled nonparametrically.


ELO Difference Calculation

Well, about the thought concerning graphing player elos instead. Presumably the charts should look pretty similar if ZvP were balanced. Now, since we don't have enough pro games to do this well, what if we took every A level and above ICCUP game as well? Don't know how closely that skill level compares to the average progamer though.
EtherealDeath
Profile Blog Joined July 2007
United States8366 Posts
October 20 2009 19:05 GMT
#722
On October 21 2009 03:55 zulu_nation8 wrote:
Show nested quote +
On October 21 2009 03:53 EtherealDeath wrote:
On October 21 2009 03:44 zulu_nation8 wrote:
On October 21 2009 03:40 Black Gun wrote:
On October 21 2009 03:34 Day[9] wrote:
i'm reading so much about standard deviation

what happened to null hypothesis tests? : [



they were performed

the result: the zvp winning percentage of the last 7 months significantly exceeds 55%. so even if the historical race imbalance would be as high as 55% zerg wins, the recent trend would still be much higher than that, so that it cant be explained as a fluke.


if i was to plot the data of zvp win % over every 800 games in the history of bw, and find the standard deviation. And then plug that into a z test for the current 800 game period and have the null be 55%, would that be a better test to explain if the current trend is significant?



I'd imagine that taking games in a 400 game radius around each game, and plotting the win % in that range continuously would be better. That way, we have ~30k data points.


right so if the current sample comes out as insignificant what would that mean? And what does the test black gun did mean over a sample of 800 games?



If it were insignificant, then almost certainly the historical chance of a zerg beating a toss is pretty high, more so than you would expect if the matchup were balanced. The test black gun did was to determine the probability that zerg wins at least as much as they do, assuming they have an expected win % of 55%. That probability turned out to be just under 0.6%, which means for the usual significance levels, the null hypothesis of 55% must be rejected, and replaced by something higher.
zulu_nation8
Profile Blog Joined May 2005
China26351 Posts
October 20 2009 19:10 GMT
#723
On October 21 2009 04:05 EtherealDeath wrote:
Show nested quote +
On October 21 2009 03:55 zulu_nation8 wrote:
On October 21 2009 03:53 EtherealDeath wrote:
On October 21 2009 03:44 zulu_nation8 wrote:
On October 21 2009 03:40 Black Gun wrote:
On October 21 2009 03:34 Day[9] wrote:
i'm reading so much about standard deviation

what happened to null hypothesis tests? : [



they were performed

the result: the zvp winning percentage of the last 7 months significantly exceeds 55%. so even if the historical race imbalance would be as high as 55% zerg wins, the recent trend would still be much higher than that, so that it cant be explained as a fluke.


if i was to plot the data of zvp win % over every 800 games in the history of bw, and find the standard deviation. And then plug that into a z test for the current 800 game period and have the null be 55%, would that be a better test to explain if the current trend is significant?



I'd imagine that taking games in a 400 game radius around each game, and plotting the win % in that range continuously would be better. That way, we have ~30k data points.


right so if the current sample comes out as insignificant what would that mean? And what does the test black gun did mean over a sample of 800 games?



If it were insignificant, then almost certainly the historical chance of a zerg beating a toss is pretty high, more so than you would expect if the matchup were balanced. The test black gun did was to determine the probability that zerg wins at least as much as they do, assuming they have an expected win % of 55%. That probability turned out to be just under 0.6%, which means for the usual significance levels, the null hypothesis of 55% must be rejected, and replaced by something higher.


ok but im still having trouble understanding how, if zerg is expected to win 55% of games, and they win 59% over 800 chances, theres a 0.6% chance of that happening. It would seem that it happens all the time in BW. Of course if you were to have a two sided coin and say heads is expected to come 55% of the time, it would see very improbable to get 59% over 800 times. So what would this mean to BW? That each game can not be counted as a separate event?
motbob
Profile Blog Joined July 2008
United States12546 Posts
October 20 2009 19:11 GMT
#724
I don't think using ZvP ELOs to do this would work Intuitively, it seems that correcting for the ELO difference would "cancel out" any difference from the ZvP winrate from 50%... I dunno.
ModeratorGood content always wins.
motbob
Profile Blog Joined July 2008
United States12546 Posts
October 20 2009 19:15 GMT
#725
On October 21 2009 04:10 zulu_nation8 wrote:
Show nested quote +
On October 21 2009 04:05 EtherealDeath wrote:
On October 21 2009 03:55 zulu_nation8 wrote:
On October 21 2009 03:53 EtherealDeath wrote:
On October 21 2009 03:44 zulu_nation8 wrote:
On October 21 2009 03:40 Black Gun wrote:
On October 21 2009 03:34 Day[9] wrote:
i'm reading so much about standard deviation

what happened to null hypothesis tests? : [



they were performed

the result: the zvp winning percentage of the last 7 months significantly exceeds 55%. so even if the historical race imbalance would be as high as 55% zerg wins, the recent trend would still be much higher than that, so that it cant be explained as a fluke.


if i was to plot the data of zvp win % over every 800 games in the history of bw, and find the standard deviation. And then plug that into a z test for the current 800 game period and have the null be 55%, would that be a better test to explain if the current trend is significant?



I'd imagine that taking games in a 400 game radius around each game, and plotting the win % in that range continuously would be better. That way, we have ~30k data points.


right so if the current sample comes out as insignificant what would that mean? And what does the test black gun did mean over a sample of 800 games?



If it were insignificant, then almost certainly the historical chance of a zerg beating a toss is pretty high, more so than you would expect if the matchup were balanced. The test black gun did was to determine the probability that zerg wins at least as much as they do, assuming they have an expected win % of 55%. That probability turned out to be just under 0.6%, which means for the usual significance levels, the null hypothesis of 55% must be rejected, and replaced by something higher.

So what would this mean to BW? That each game can not be counted as a separate event?

Sorry... not sure what this means
ModeratorGood content always wins.
zulu_nation8
Profile Blog Joined May 2005
China26351 Posts
October 20 2009 19:18 GMT
#726
On October 21 2009 04:15 motbob wrote:
Show nested quote +
On October 21 2009 04:10 zulu_nation8 wrote:
On October 21 2009 04:05 EtherealDeath wrote:
On October 21 2009 03:55 zulu_nation8 wrote:
On October 21 2009 03:53 EtherealDeath wrote:
On October 21 2009 03:44 zulu_nation8 wrote:
On October 21 2009 03:40 Black Gun wrote:
On October 21 2009 03:34 Day[9] wrote:
i'm reading so much about standard deviation

what happened to null hypothesis tests? : [



they were performed

the result: the zvp winning percentage of the last 7 months significantly exceeds 55%. so even if the historical race imbalance would be as high as 55% zerg wins, the recent trend would still be much higher than that, so that it cant be explained as a fluke.


if i was to plot the data of zvp win % over every 800 games in the history of bw, and find the standard deviation. And then plug that into a z test for the current 800 game period and have the null be 55%, would that be a better test to explain if the current trend is significant?



I'd imagine that taking games in a 400 game radius around each game, and plotting the win % in that range continuously would be better. That way, we have ~30k data points.


right so if the current sample comes out as insignificant what would that mean? And what does the test black gun did mean over a sample of 800 games?



If it were insignificant, then almost certainly the historical chance of a zerg beating a toss is pretty high, more so than you would expect if the matchup were balanced. The test black gun did was to determine the probability that zerg wins at least as much as they do, assuming they have an expected win % of 55%. That probability turned out to be just under 0.6%, which means for the usual significance levels, the null hypothesis of 55% must be rejected, and replaced by something higher.

So what would this mean to BW? That each game can not be counted as a separate event?

Sorry... not sure what this means


it means youre ignoring that not every game is played under the same conditions like a coin flip, if the test is to become more accurate, factors such as map, time, tv/non tv games would have to be taken under consideration, for example delete all the non official games from the 855 games and do another z-test and see what comes out.
EtherealDeath
Profile Blog Joined July 2007
United States8366 Posts
October 20 2009 19:21 GMT
#727
On October 21 2009 04:10 zulu_nation8 wrote:
Show nested quote +
On October 21 2009 04:05 EtherealDeath wrote:
On October 21 2009 03:55 zulu_nation8 wrote:
On October 21 2009 03:53 EtherealDeath wrote:
On October 21 2009 03:44 zulu_nation8 wrote:
On October 21 2009 03:40 Black Gun wrote:
On October 21 2009 03:34 Day[9] wrote:
i'm reading so much about standard deviation

what happened to null hypothesis tests? : [



they were performed

the result: the zvp winning percentage of the last 7 months significantly exceeds 55%. so even if the historical race imbalance would be as high as 55% zerg wins, the recent trend would still be much higher than that, so that it cant be explained as a fluke.


if i was to plot the data of zvp win % over every 800 games in the history of bw, and find the standard deviation. And then plug that into a z test for the current 800 game period and have the null be 55%, would that be a better test to explain if the current trend is significant?



I'd imagine that taking games in a 400 game radius around each game, and plotting the win % in that range continuously would be better. That way, we have ~30k data points.


right so if the current sample comes out as insignificant what would that mean? And what does the test black gun did mean over a sample of 800 games?



If it were insignificant, then almost certainly the historical chance of a zerg beating a toss is pretty high, more so than you would expect if the matchup were balanced. The test black gun did was to determine the probability that zerg wins at least as much as they do, assuming they have an expected win % of 55%. That probability turned out to be just under 0.6%, which means for the usual significance levels, the null hypothesis of 55% must be rejected, and replaced by something higher.


ok but im still having trouble understanding how, if zerg is expected to win 55% of games, and they win 59% over 800 chances, theres a 0.6% chance of that happening. It would seem that it happens all the time in BW. Of course if you were to have a two sided coin and say heads is expected to come 55% of the time, it would see very improbable to get 59% over 800 times. So what would this mean to BW? That each game can not be counted as a separate event?



Well, one problem I see with using all the data in TLPD and then constructing a probability distribution out of that are the map changes, and the changes in the metagame. It's not quite the same coin that we keep flipping, it changes over time. Though, if we consider that the metagame is mostly influenced by the maps (eh may or may not be true, but simplifies analysis), we could use that data to get the effect that maps have on the game. Of course, if the data were pretty consistent over the entire TLPD history, then ZvP is really balanced regardless of the maps, but I think we all know that's not true.

It's one of the reasons I prefer using just this 7 month period, with more or less consistent maps. Now we are flipping the same coin, and it seems to be zerg tilted.

One thing you can use though, using all the games, is to calculate the probability of zerg's win % being under 50% at any point in time, and compare that to toss's. Off the top of my head, they should be relatively close if it the chance/magnitude of map imbalance were not tilted toward zerg, but I'd have to be more careful about that before saying anything conclusive. Too many possible pitfalls.
EtherealDeath
Profile Blog Joined July 2007
United States8366 Posts
October 20 2009 19:22 GMT
#728
On October 21 2009 04:18 zulu_nation8 wrote:
Show nested quote +
On October 21 2009 04:15 motbob wrote:
On October 21 2009 04:10 zulu_nation8 wrote:
On October 21 2009 04:05 EtherealDeath wrote:
On October 21 2009 03:55 zulu_nation8 wrote:
On October 21 2009 03:53 EtherealDeath wrote:
On October 21 2009 03:44 zulu_nation8 wrote:
On October 21 2009 03:40 Black Gun wrote:
On October 21 2009 03:34 Day[9] wrote:
i'm reading so much about standard deviation

what happened to null hypothesis tests? : [



they were performed

the result: the zvp winning percentage of the last 7 months significantly exceeds 55%. so even if the historical race imbalance would be as high as 55% zerg wins, the recent trend would still be much higher than that, so that it cant be explained as a fluke.


if i was to plot the data of zvp win % over every 800 games in the history of bw, and find the standard deviation. And then plug that into a z test for the current 800 game period and have the null be 55%, would that be a better test to explain if the current trend is significant?



I'd imagine that taking games in a 400 game radius around each game, and plotting the win % in that range continuously would be better. That way, we have ~30k data points.


right so if the current sample comes out as insignificant what would that mean? And what does the test black gun did mean over a sample of 800 games?



If it were insignificant, then almost certainly the historical chance of a zerg beating a toss is pretty high, more so than you would expect if the matchup were balanced. The test black gun did was to determine the probability that zerg wins at least as much as they do, assuming they have an expected win % of 55%. That probability turned out to be just under 0.6%, which means for the usual significance levels, the null hypothesis of 55% must be rejected, and replaced by something higher.

So what would this mean to BW? That each game can not be counted as a separate event?

Sorry... not sure what this means


it means youre ignoring that not every game is played under the same conditions like a coin flip, if the test is to become more accurate, factors such as map, time, tv/non tv games would have to be taken under consideration, for example delete all the non official games from the 855 games and do another z-test and see what comes out.


We could just do say... HBR only, or Desti only and see. I don't have any stats software with me though ;/
Pretty sure the stats will come out heavily zerg favored though.
zulu_nation8
Profile Blog Joined May 2005
China26351 Posts
October 20 2009 19:25 GMT
#729
On October 21 2009 04:21 EtherealDeath wrote:
Show nested quote +
On October 21 2009 04:10 zulu_nation8 wrote:
On October 21 2009 04:05 EtherealDeath wrote:
On October 21 2009 03:55 zulu_nation8 wrote:
On October 21 2009 03:53 EtherealDeath wrote:
On October 21 2009 03:44 zulu_nation8 wrote:
On October 21 2009 03:40 Black Gun wrote:
On October 21 2009 03:34 Day[9] wrote:
i'm reading so much about standard deviation

what happened to null hypothesis tests? : [



they were performed

the result: the zvp winning percentage of the last 7 months significantly exceeds 55%. so even if the historical race imbalance would be as high as 55% zerg wins, the recent trend would still be much higher than that, so that it cant be explained as a fluke.


if i was to plot the data of zvp win % over every 800 games in the history of bw, and find the standard deviation. And then plug that into a z test for the current 800 game period and have the null be 55%, would that be a better test to explain if the current trend is significant?



I'd imagine that taking games in a 400 game radius around each game, and plotting the win % in that range continuously would be better. That way, we have ~30k data points.


right so if the current sample comes out as insignificant what would that mean? And what does the test black gun did mean over a sample of 800 games?



If it were insignificant, then almost certainly the historical chance of a zerg beating a toss is pretty high, more so than you would expect if the matchup were balanced. The test black gun did was to determine the probability that zerg wins at least as much as they do, assuming they have an expected win % of 55%. That probability turned out to be just under 0.6%, which means for the usual significance levels, the null hypothesis of 55% must be rejected, and replaced by something higher.


ok but im still having trouble understanding how, if zerg is expected to win 55% of games, and they win 59% over 800 chances, theres a 0.6% chance of that happening. It would seem that it happens all the time in BW. Of course if you were to have a two sided coin and say heads is expected to come 55% of the time, it would see very improbable to get 59% over 800 times. So what would this mean to BW? That each game can not be counted as a separate event?



Well, one problem I see with using all the data in TLPD and then constructing a probability distribution out of that are the map changes, and the changes in the metagame. It's not quite the same coin that we keep flipping, it changes over time. Though, if we consider that the metagame is mostly influenced by the maps (eh may or may not be true, but simplifies analysis), we could use that data to get the effect that maps have on the game. Of course, if the data were pretty consistent over the entire TLPD history, then ZvP is really balanced regardless of the maps, but I think we all know that's not true.

It's one of the reasons I prefer using just this 7 month period, with more or less consistent maps. Now we are flipping the same coin, and it seems to be zerg tilted.

One thing you can use though, using all the games, is to calculate the probability of zerg's win % being under 50% at any point in time, and compare that to toss's. Off the top of my head, they should be relatively close if it the chance/magnitude of map imbalance were not tilted toward zerg, but I'd have to be more careful about that before saying anything conclusive. Too many possible pitfalls.


Lots of factors would have to be limited I agree. A test would have to be done by time, and another by maps I guess. Also we have to set the standard for tv/non tv games, minor league/non minor league games, prelim/no prelim, etc.
motbob
Profile Blog Joined July 2008
United States12546 Posts
October 20 2009 19:25 GMT
#730
On October 21 2009 04:18 zulu_nation8 wrote:
Show nested quote +
On October 21 2009 04:15 motbob wrote:
On October 21 2009 04:10 zulu_nation8 wrote:
On October 21 2009 04:05 EtherealDeath wrote:
On October 21 2009 03:55 zulu_nation8 wrote:
On October 21 2009 03:53 EtherealDeath wrote:
On October 21 2009 03:44 zulu_nation8 wrote:
On October 21 2009 03:40 Black Gun wrote:
On October 21 2009 03:34 Day[9] wrote:
i'm reading so much about standard deviation

what happened to null hypothesis tests? : [



they were performed

the result: the zvp winning percentage of the last 7 months significantly exceeds 55%. so even if the historical race imbalance would be as high as 55% zerg wins, the recent trend would still be much higher than that, so that it cant be explained as a fluke.


if i was to plot the data of zvp win % over every 800 games in the history of bw, and find the standard deviation. And then plug that into a z test for the current 800 game period and have the null be 55%, would that be a better test to explain if the current trend is significant?



I'd imagine that taking games in a 400 game radius around each game, and plotting the win % in that range continuously would be better. That way, we have ~30k data points.


right so if the current sample comes out as insignificant what would that mean? And what does the test black gun did mean over a sample of 800 games?



If it were insignificant, then almost certainly the historical chance of a zerg beating a toss is pretty high, more so than you would expect if the matchup were balanced. The test black gun did was to determine the probability that zerg wins at least as much as they do, assuming they have an expected win % of 55%. That probability turned out to be just under 0.6%, which means for the usual significance levels, the null hypothesis of 55% must be rejected, and replaced by something higher.

So what would this mean to BW? That each game can not be counted as a separate event?

Sorry... not sure what this means


it means youre ignoring that not every game is played under the same conditions like a coin flip, if the test is to become more accurate, factors such as map, time, tv/non tv games would have to be taken under consideration, for example delete all the non official games from the 855 games and do another z-test and see what comes out.

I don't see any evidence that playing on TV/ not playing on TV would have any impact to skew the results towards zerg or towards protoss. In the absence of any such evidence, there's no reason to redo the test.
ModeratorGood content always wins.
zulu_nation8
Profile Blog Joined May 2005
China26351 Posts
October 20 2009 19:29 GMT
#731
On October 21 2009 04:22 EtherealDeath wrote:
Show nested quote +
On October 21 2009 04:18 zulu_nation8 wrote:
On October 21 2009 04:15 motbob wrote:
On October 21 2009 04:10 zulu_nation8 wrote:
On October 21 2009 04:05 EtherealDeath wrote:
On October 21 2009 03:55 zulu_nation8 wrote:
On October 21 2009 03:53 EtherealDeath wrote:
On October 21 2009 03:44 zulu_nation8 wrote:
On October 21 2009 03:40 Black Gun wrote:
On October 21 2009 03:34 Day[9] wrote:
i'm reading so much about standard deviation

what happened to null hypothesis tests? : [



they were performed

the result: the zvp winning percentage of the last 7 months significantly exceeds 55%. so even if the historical race imbalance would be as high as 55% zerg wins, the recent trend would still be much higher than that, so that it cant be explained as a fluke.


if i was to plot the data of zvp win % over every 800 games in the history of bw, and find the standard deviation. And then plug that into a z test for the current 800 game period and have the null be 55%, would that be a better test to explain if the current trend is significant?



I'd imagine that taking games in a 400 game radius around each game, and plotting the win % in that range continuously would be better. That way, we have ~30k data points.


right so if the current sample comes out as insignificant what would that mean? And what does the test black gun did mean over a sample of 800 games?



If it were insignificant, then almost certainly the historical chance of a zerg beating a toss is pretty high, more so than you would expect if the matchup were balanced. The test black gun did was to determine the probability that zerg wins at least as much as they do, assuming they have an expected win % of 55%. That probability turned out to be just under 0.6%, which means for the usual significance levels, the null hypothesis of 55% must be rejected, and replaced by something higher.

So what would this mean to BW? That each game can not be counted as a separate event?

Sorry... not sure what this means


it means youre ignoring that not every game is played under the same conditions like a coin flip, if the test is to become more accurate, factors such as map, time, tv/non tv games would have to be taken under consideration, for example delete all the non official games from the 855 games and do another z-test and see what comes out.


We could just do say... HBR only, or Desti only and see. I don't have any stats software with me though ;/
Pretty sure the stats will come out heavily zerg favored though.


It would come out zerg favored but I highly doubt Desti would be called an "imbalanced map." Since the standard deviation for all maps with a minimum of like 30 games played would be much higher than 5% or whatever Desti is from 55%.
ix
Profile Joined July 2003
United Kingdom184 Posts
October 20 2009 19:29 GMT
#732
zulu_nation8 you're not adding anything to the discussion with your random comments. Read up on some basic statistics if you want to try to disprove what people have done so far.
EtherealDeath
Profile Blog Joined July 2007
United States8366 Posts
October 20 2009 19:30 GMT
#733
On October 21 2009 04:29 zulu_nation8 wrote:
Show nested quote +
On October 21 2009 04:22 EtherealDeath wrote:
On October 21 2009 04:18 zulu_nation8 wrote:
On October 21 2009 04:15 motbob wrote:
On October 21 2009 04:10 zulu_nation8 wrote:
On October 21 2009 04:05 EtherealDeath wrote:
On October 21 2009 03:55 zulu_nation8 wrote:
On October 21 2009 03:53 EtherealDeath wrote:
On October 21 2009 03:44 zulu_nation8 wrote:
On October 21 2009 03:40 Black Gun wrote:
[quote]


they were performed

the result: the zvp winning percentage of the last 7 months significantly exceeds 55%. so even if the historical race imbalance would be as high as 55% zerg wins, the recent trend would still be much higher than that, so that it cant be explained as a fluke.


if i was to plot the data of zvp win % over every 800 games in the history of bw, and find the standard deviation. And then plug that into a z test for the current 800 game period and have the null be 55%, would that be a better test to explain if the current trend is significant?



I'd imagine that taking games in a 400 game radius around each game, and plotting the win % in that range continuously would be better. That way, we have ~30k data points.


right so if the current sample comes out as insignificant what would that mean? And what does the test black gun did mean over a sample of 800 games?



If it were insignificant, then almost certainly the historical chance of a zerg beating a toss is pretty high, more so than you would expect if the matchup were balanced. The test black gun did was to determine the probability that zerg wins at least as much as they do, assuming they have an expected win % of 55%. That probability turned out to be just under 0.6%, which means for the usual significance levels, the null hypothesis of 55% must be rejected, and replaced by something higher.

So what would this mean to BW? That each game can not be counted as a separate event?

Sorry... not sure what this means


it means youre ignoring that not every game is played under the same conditions like a coin flip, if the test is to become more accurate, factors such as map, time, tv/non tv games would have to be taken under consideration, for example delete all the non official games from the 855 games and do another z-test and see what comes out.


We could just do say... HBR only, or Desti only and see. I don't have any stats software with me though ;/
Pretty sure the stats will come out heavily zerg favored though.


It would come out zerg favored but I highly doubt Desti would be called an "imbalanced map." Since the standard deviation for all maps with a minimum of like 30 games played would be much higher than 5% or whatever Desti is from 55%.


Eh yeah I suppose the p-value might not be low enough to reject the 55% hypothesis using any reasonable alpha.
zulu_nation8
Profile Blog Joined May 2005
China26351 Posts
October 20 2009 19:30 GMT
#734
On October 21 2009 04:25 motbob wrote:
Show nested quote +
On October 21 2009 04:18 zulu_nation8 wrote:
On October 21 2009 04:15 motbob wrote:
On October 21 2009 04:10 zulu_nation8 wrote:
On October 21 2009 04:05 EtherealDeath wrote:
On October 21 2009 03:55 zulu_nation8 wrote:
On October 21 2009 03:53 EtherealDeath wrote:
On October 21 2009 03:44 zulu_nation8 wrote:
On October 21 2009 03:40 Black Gun wrote:
On October 21 2009 03:34 Day[9] wrote:
i'm reading so much about standard deviation

what happened to null hypothesis tests? : [



they were performed

the result: the zvp winning percentage of the last 7 months significantly exceeds 55%. so even if the historical race imbalance would be as high as 55% zerg wins, the recent trend would still be much higher than that, so that it cant be explained as a fluke.


if i was to plot the data of zvp win % over every 800 games in the history of bw, and find the standard deviation. And then plug that into a z test for the current 800 game period and have the null be 55%, would that be a better test to explain if the current trend is significant?



I'd imagine that taking games in a 400 game radius around each game, and plotting the win % in that range continuously would be better. That way, we have ~30k data points.


right so if the current sample comes out as insignificant what would that mean? And what does the test black gun did mean over a sample of 800 games?



If it were insignificant, then almost certainly the historical chance of a zerg beating a toss is pretty high, more so than you would expect if the matchup were balanced. The test black gun did was to determine the probability that zerg wins at least as much as they do, assuming they have an expected win % of 55%. That probability turned out to be just under 0.6%, which means for the usual significance levels, the null hypothesis of 55% must be rejected, and replaced by something higher.

So what would this mean to BW? That each game can not be counted as a separate event?

Sorry... not sure what this means


it means youre ignoring that not every game is played under the same conditions like a coin flip, if the test is to become more accurate, factors such as map, time, tv/non tv games would have to be taken under consideration, for example delete all the non official games from the 855 games and do another z-test and see what comes out.

I don't see any evidence that playing on TV/ not playing on TV would have any impact to skew the results towards zerg or towards protoss. In the absence of any such evidence, there's no reason to redo the test.


do it with only official games and you'll see that the Z score will be much lower, theres your evidence that its a factor to be considered, actually i'll do it when I have time since youre still not getting why your SD is wrong.
motbob
Profile Blog Joined July 2008
United States12546 Posts
October 20 2009 19:32 GMT
#735
On October 21 2009 04:29 zulu_nation8 wrote:
Show nested quote +
On October 21 2009 04:22 EtherealDeath wrote:
On October 21 2009 04:18 zulu_nation8 wrote:
On October 21 2009 04:15 motbob wrote:
On October 21 2009 04:10 zulu_nation8 wrote:
On October 21 2009 04:05 EtherealDeath wrote:
On October 21 2009 03:55 zulu_nation8 wrote:
On October 21 2009 03:53 EtherealDeath wrote:
On October 21 2009 03:44 zulu_nation8 wrote:
On October 21 2009 03:40 Black Gun wrote:
[quote]


they were performed

the result: the zvp winning percentage of the last 7 months significantly exceeds 55%. so even if the historical race imbalance would be as high as 55% zerg wins, the recent trend would still be much higher than that, so that it cant be explained as a fluke.


if i was to plot the data of zvp win % over every 800 games in the history of bw, and find the standard deviation. And then plug that into a z test for the current 800 game period and have the null be 55%, would that be a better test to explain if the current trend is significant?



I'd imagine that taking games in a 400 game radius around each game, and plotting the win % in that range continuously would be better. That way, we have ~30k data points.


right so if the current sample comes out as insignificant what would that mean? And what does the test black gun did mean over a sample of 800 games?



If it were insignificant, then almost certainly the historical chance of a zerg beating a toss is pretty high, more so than you would expect if the matchup were balanced. The test black gun did was to determine the probability that zerg wins at least as much as they do, assuming they have an expected win % of 55%. That probability turned out to be just under 0.6%, which means for the usual significance levels, the null hypothesis of 55% must be rejected, and replaced by something higher.

So what would this mean to BW? That each game can not be counted as a separate event?

Sorry... not sure what this means


it means youre ignoring that not every game is played under the same conditions like a coin flip, if the test is to become more accurate, factors such as map, time, tv/non tv games would have to be taken under consideration, for example delete all the non official games from the 855 games and do another z-test and see what comes out.


We could just do say... HBR only, or Desti only and see. I don't have any stats software with me though ;/
Pretty sure the stats will come out heavily zerg favored though.


It would come out zerg favored but I highly doubt Desti would be called an "imbalanced map." Since the standard deviation for all maps with a minimum of like 30 games played would be much higher than 5% or whatever Desti is from 55%.

z-test on desti since March 1st! BRB
ModeratorGood content always wins.
motbob
Profile Blog Joined July 2008
United States12546 Posts
October 20 2009 19:34 GMT
#736
On October 21 2009 04:30 zulu_nation8 wrote:
Show nested quote +
On October 21 2009 04:25 motbob wrote:
On October 21 2009 04:18 zulu_nation8 wrote:
On October 21 2009 04:15 motbob wrote:
On October 21 2009 04:10 zulu_nation8 wrote:
On October 21 2009 04:05 EtherealDeath wrote:
On October 21 2009 03:55 zulu_nation8 wrote:
On October 21 2009 03:53 EtherealDeath wrote:
On October 21 2009 03:44 zulu_nation8 wrote:
On October 21 2009 03:40 Black Gun wrote:
[quote]


they were performed

the result: the zvp winning percentage of the last 7 months significantly exceeds 55%. so even if the historical race imbalance would be as high as 55% zerg wins, the recent trend would still be much higher than that, so that it cant be explained as a fluke.


if i was to plot the data of zvp win % over every 800 games in the history of bw, and find the standard deviation. And then plug that into a z test for the current 800 game period and have the null be 55%, would that be a better test to explain if the current trend is significant?



I'd imagine that taking games in a 400 game radius around each game, and plotting the win % in that range continuously would be better. That way, we have ~30k data points.


right so if the current sample comes out as insignificant what would that mean? And what does the test black gun did mean over a sample of 800 games?



If it were insignificant, then almost certainly the historical chance of a zerg beating a toss is pretty high, more so than you would expect if the matchup were balanced. The test black gun did was to determine the probability that zerg wins at least as much as they do, assuming they have an expected win % of 55%. That probability turned out to be just under 0.6%, which means for the usual significance levels, the null hypothesis of 55% must be rejected, and replaced by something higher.

So what would this mean to BW? That each game can not be counted as a separate event?

Sorry... not sure what this means


it means youre ignoring that not every game is played under the same conditions like a coin flip, if the test is to become more accurate, factors such as map, time, tv/non tv games would have to be taken under consideration, for example delete all the non official games from the 855 games and do another z-test and see what comes out.

I don't see any evidence that playing on TV/ not playing on TV would have any impact to skew the results towards zerg or towards protoss. In the absence of any such evidence, there's no reason to redo the test.


do it with only official games and you'll see that the Z score will be much lower, theres your evidence that its a factor to be considered, actually i'll do it when I have time since youre still not getting why your SD is wrong.

I'm not getting it cause you're not doing your own test and showing me. Until you do that I'll never understand.
ModeratorGood content always wins.
Black Gun
Profile Blog Joined July 2009
Germany4482 Posts
Last Edited: 2009-10-20 19:37:19
October 20 2009 19:36 GMT
#737
with a complete dataset from tlpd, one could perform a logistic regression with elo, maps and time as covariates. thus we could distinguish the effects of these factors. time would account for the metagame shifts.

edit: with the corresponding dataset i could easily carry out this stuff, i got access to stats software and know what im doing and so on
"What am I supposed to do against this?" - "Lose!" :-]
ix
Profile Joined July 2003
United Kingdom184 Posts
October 20 2009 19:43 GMT
#738
How would you use ELO exactly? Wouldn't you need to compare PvZ ELO with the other matchup ELOs to get some idea of the historic differences, otherwise a shift in the PvZ meta game will be hidden by the apparently higher ELOs of the Zerg players.
motbob
Profile Blog Joined July 2008
United States12546 Posts
Last Edited: 2009-10-20 19:55:50
October 20 2009 19:54 GMT
#739
I am going to do this step by step so that there's no question that I'm doing it right. Here we go! Remember, this is a test to see whether the zerg winrate on Destination since March 1st is significantly different from a) the historical zerg win rate against P or b) a hypothetical 50% winrate.

Wikipedia has a great screenshot of the formula for a z-test:
[image loading]


In this equation, we're looking for a z value that has an absolute value greater than 2. In that case, the p-value would be less than 0.05, which is what most statisticians take to be statistically significant.

The x with the bar over it is the mean of our data. Our data is a bunch of zeros and ones corresponding to the wins and losses of zergs against protoss. Therefore, the mean of our data will be equal to the winrate of ZvP. Let's head over to excel.

In Excel, I punch in our data points. We have one data point for each game: a 1 if the zerg wins, and a 0 if the zerg loses. That means that we're going to have 110+72=182 rows in the column we're using. 110 of those rows will contain a 1, and 72 of the rows will contain a 0. It doesn't matter what order the numbers are in.

Now I'm going to type in the formula "=average(A1:A182)" to get the average of this data. The average is 0.604395604. I have put it in cell B182.

μ0 is the hypothesized population mean. This is where we plug in the null hypothesis. Do I want to use 50% or the historical zerg winrate? I'll do a test with both, and I'll assume the historical zerg winrate to be 53%. I have put 0.5 in cell C182 and 0.53 in cell D182.

σ is the population standard deviation. Getting this is very simple now that we have our data plugged into excel. I'm going to type in the formula "=stdev(A1:A182)" to get the population standard deviation. It's 0.490329033, and it's in cell E182.

n is the number of data points, or 182. I put 182 in cell F182.

Now that we have all the varibles, we can do the test! I'm going to type in the formula at the very top of this post into excel: "=(B182-C182)/(E182/SQRT(F182))"

Excel spits out a result of 2.872303303, which gives a p-value of less than 0.05.

If we use the historical estimate of the zerg winrate, excel spits out a z-value of 2.046894038, which is still statistically significant.

If we were to use a historical zerg winrate of 55%, excel would NOT give us a statistically significant result.
ModeratorGood content always wins.
Heyoka
Profile Blog Joined March 2008
Katowice25012 Posts
October 20 2009 20:46 GMT
#740
YOUR STANDARD DEVIATION CANNOT BE 49%

THAT IS NOT YOUR VARIATION

YOU ARE LOOKING AT ONE DATA POINT

THERE IS NO OTHER WAY I CAN SAY THIS SIMPLY, UNLESS I DO IT IN CAPS PLUS BOLD
@RealHeyoka | ESL / DreamHack StarCraft Lead
Prev 1 35 36 37 38 39 47 Next All
Please log in or register to reply.
Live Events Refresh
Kung Fu Cup
12:00
SC:EVO Monthly
MaxPax vs CreatorLIVE!
ByuN vs Classic
SteadfastSC233
IndyStarCraft 128
Liquipedia
Replay Cast
10:05
Maestros of the Game Ro24 B
Zoun vs ByuNLIVE!
CranKy Ducklings195
LiquipediaDiscussion
[ Submit Event ]
Live Streams
Refresh
StarCraft 2
mouzHeroMarine 279
SteadfastSC 233
Harstem 182
IndyStarCraft 128
StarCraft: Brood War
Britney 62139
Rain 3475
Shuttle 3095
Bisu 2874
Flash 1760
BeSt 1461
Light 1060
Larva 751
firebathero 617
Mini 455
[ Show more ]
Stork 427
EffOrt 412
hero 363
actioN 352
Barracks 327
Soma 215
Last 197
sSak 178
Rush 125
Mong 117
Snow 103
Mind 100
PianO 90
Backho 60
sorry 51
Nal_rA 31
Movie 28
Sharp 27
Yoon 26
Sacsri 26
scan(afreeca) 16
TY 14
yabsab 13
Terrorterran 13
soO 12
Noble 11
HiyA 10
Icarus 9
Shine 3
Dota 2
Gorgc5155
qojqva2757
Dendi1209
BananaSlamJamma208
League of Legends
Reynor47
Counter-Strike
zeus514
byalli341
edward40
Super Smash Bros
Chillindude2
Other Games
singsing1718
B2W.Neo1229
olofmeister769
hiko769
crisheroes427
DeMusliM404
Hui .208
ArmadaUGS193
Liquid`VortiX86
QueenE59
FunKaTv 32
SortOf25
Organizations
Other Games
gamesdonequick825
StarCraft 2
Blizzard YouTube
StarCraft: Brood War
BSLTrovo
sctven
[ Show 14 non-featured ]
StarCraft 2
• AfreecaTV YouTube
• intothetv
• Kozan
• IndyKCrew
• LaughNgamezSOOP
• Migwel
• sooper7s
StarCraft: Brood War
• FirePhoenix2
• BSLYoutube
• STPLYoutube
• ZZZeroYoutube
Dota 2
• WagamamaTV434
League of Legends
• Nemesis4706
• Jankos1249
Upcoming Events
OSC
2h 13m
Moja vs Babymarine
Solar vs TBD
sOs vs goblin
Nice vs INexorable
sebesdes vs Iba
Nicoract vs TBD
NightMare vs TBD
OSC
10h 13m
ReBellioN vs PAPI
Spirit vs TBD
Percival vs TBD
TriGGeR vs TBD
Shameless vs UedSoldier
Cham vs TBD
Harstem vs TBD
RSL Revival
20h 13m
Cure vs SHIN
Reynor vs Zoun
Kung Fu Cup
22h 13m
The PondCast
23h 13m
RSL Revival
1d 20h
Classic vs TriGGeR
ByuN vs Maru
Online Event
1d 22h
Kung Fu Cup
1d 22h
BSL Team Wars
2 days
Team Bonyth vs Team Dewalt
BSL Team Wars
2 days
[ Show More ]
RSL Revival
2 days
Maestros of the Game
3 days
ShoWTimE vs Classic
Clem vs herO
Serral vs Bunny
Reynor vs Zoun
Cosmonarchy
3 days
Bonyth vs Dewalt
[BSL 2025] Weekly
3 days
RSL Revival
3 days
Maestros of the Game
4 days
BSL Team Wars
4 days
Afreeca Starleague
4 days
Snow vs Sharp
Jaedong vs Mini
Sparkling Tuna Cup
5 days
Afreeca Starleague
5 days
Light vs Speed
Larva vs Soma
LiuLi Cup
6 days
Liquipedia Results

Completed

Copa Latinoamericana 4
SEL Season 2 Championship
HCC Europe

Ongoing

BSL 20 Team Wars
KCM Race Survival 2025 Season 3
BSL 21 Points
ASL Season 20
CSL 2025 AUTUMN (S18)
LASL Season 20
RSL Revival: Season 2
Maestros of the Game
Chzzk MurlocKing SC1 vs SC2 Cup #2
BLAST Open Fall 2025
BLAST Open Fall Qual
Esports World Cup 2025
BLAST Bounty Fall 2025
BLAST Bounty Fall Qual
IEM Cologne 2025
FISSURE Playground #1

Upcoming

2025 Chongqing Offline CUP
BSL Polish World Championship 2025
BSL Season 21
BSL 21 Team A
EC S1
BLAST Rivals Fall 2025
IEM Chengdu 2025
PGL Masters Bucharest 2025
MESA Nomadic Masters Fall
Thunderpick World Champ.
CS Asia Championships 2025
ESL Pro League S22
StarSeries Fall 2025
FISSURE Playground #2
TLPD

1. ByuN
2. TY
3. Dark
4. Solar
5. Stats
6. Nerchio
7. sOs
8. soO
9. INnoVation
10. Elazer
1. Rain
2. Flash
3. EffOrt
4. Last
5. Bisu
6. Soulkey
7. Mini
8. Sharp
Sidebar Settings...

Advertising | Privacy Policy | Terms Of Use | Contact Us

Original banner artwork: Jim Warren
The contents of this webpage are copyright © 2025 TLnet. All Rights Reserved.