ZvP is imbalanced - Page 33

Elite00fm
Profile Joined January 2008
United States548 Posts
October 20 2009 05:46 GMT
#641
also, there is a very high chance you miscalculated the standard deviation
motbob
Profile Blog Joined July 2008
United States12546 Posts
October 20 2009 05:47 GMT
#642
On October 20 2009 14:46 Elite00fm wrote:
also, there is a very high chance you miscalculated the standard deviation

OK

tell me what it actually is and how you got it, please.
Moderator
Good content always wins.
Elite00fm
Profile Joined January 2008
United States548 Posts
October 20 2009 05:48 GMT
#643
On October 20 2009 14:46 motbob wrote:
On October 20 2009 14:44 Elite00fm wrote:
set null hypothesis to the winrate zerg had for the past 5 years or so before march 1st of this year

Getting that data would be pure hell. No thanks.


Yeah good point, I guess we could estimate it at like 53-55% though
zulu_nation8
Profile Blog Joined May 2005
China26351 Posts
October 20 2009 05:48 GMT
#644
On October 20 2009 14:44 Elite00fm wrote:
set null hypothesis to the winrate zerg had for the past 5 years or so before march 1st of this year


should it be that or the average of the zvp stats over every 7 month period ever in progaming? Since it should be the same sample sizes?
zulu_nation8
Profile Blog Joined May 2005
China26351 Posts
October 20 2009 05:54 GMT
#645
or the same amount of games, not time.
motbob
Profile Blog Joined July 2008
United States12546 Posts
Last Edited: 2009-10-20 05:55:33
October 20 2009 05:55 GMT
#646
On October 20 2009 14:48 zulu_nation8 wrote:
On October 20 2009 14:44 Elite00fm wrote:
set null hypothesis to the winrate zerg had for the past 5 years or so before march 1st of this year


should it be that or the average of the zvp stats over every 7 month period ever in progaming? Since it should be the same sample sizes?

No. Elite's suggestion is OK because it compares the recent win rate to the historical ZvP winrate, which might actually serve the purposes of this thread better. My method compares the recent winrate to a rate of 50%. But your suggestion doesn't make that much sense... a rate is a rate.
Moderator
Good content always wins.
Heyoka
Profile Blog Joined March 2008
Katowice25012 Posts
October 20 2009 05:59 GMT
#647
Is a rate still a rate when you estimate the average at 50% but then go on to say your expected variance is from 1% to 99%?
@RealHeyoka | ESL / DreamHack StarCraft Lead
motbob
Profile Blog Joined July 2008
United States12546 Posts
October 20 2009 06:02 GMT
#648
On October 20 2009 14:59 heyoka wrote:
Is a rate still a rate when you estimate the average at 50% but then go on to say your expected variance is from 1% to 99%?

Uh, yeah, that's the nature of binary data.
Moderator
Good content always wins.
zulu_nation8
Profile Blog Joined May 2005
China26351 Posts
Last Edited: 2009-10-20 06:26:24
October 20 2009 06:22 GMT
#649
Standard deviation means how far the mean % from other samples of 800 games in the history of progaming can deviate from the null hypothesis. Which should be something like .05 or .1. What your test proved was that basically your numbers are wrong.
Elite00fm
Profile Joined January 2008
United States548 Posts
Last Edited: 2009-10-20 06:29:59
October 20 2009 06:27 GMT
#650
Wouldn't you solve this by finding the standard deviation of win % between these games, then finding the z score using this value?

edit: i'm a little rusty when it comes to stats rofl
motbob
Profile Blog Joined July 2008
United States12546 Posts
October 20 2009 06:33 GMT
#651
On October 20 2009 15:22 zulu_nation8 wrote:
Standard deviation means how far the mean % from other samples of 800 games in the history of progaming can deviate from the null hypothesis. Which should be something like .05 or .1. What your test proved was that basically your numbers are wrong.

Go into excel and use the command stdev on a bunch of numbers. That's the standard deviation I'm talking about. You plug that into this equation (for omega):

[image loading]


Please don't criticize my methods again until you do a statistical test of your own. After all, you said you would.
Moderator
Good content always wins.
Matrijs
Profile Joined May 2009
United States147 Posts
October 20 2009 06:44 GMT
#652
If the purpose is to prove that Zergs have had the advantage over Protoss players the last 3 months or whatever time period, why isn't a null hypothesis of winrate = 50% ideal for that purpose?

If we use the historical average, we invite the argument that Zergs have had a historical advantage over Protoss players, which would corrupt our test. The goal for mapmakers should be 50% winrates over time for each race in all three matchups - why shouldn't we measure their results against that goal?
Elite00fm
Profile Joined January 2008
United States548 Posts
October 20 2009 06:51 GMT
#653
On October 20 2009 15:44 Matrijs wrote:
If the purpose is to prove that Zergs have had the advantage over Protoss players the last 3 months or whatever time period, why isn't a null hypothesis of winrate = 50% ideal for that purpose?

If we use the historical average, we invite the argument that Zergs have had a historical advantage over Protoss players, which would corrupt our test. The goal for mapmakers should be 50% winrates over time for each race in all three matchups - why shouldn't we measure their results against that goal?


Because the game has always been slightly T>Z>P>T, and this sort of equilibrium has been deemed balanced. It is already assumed that Zergs have had a historical advantage over Protoss. What we are trying to determine is whether the Zerg winrate over the past 7 months is so much higher than the historical figure that the probability of this occurring due to variance is very small, and whether an imbalance has in fact emerged in the matchup.
Matrijs
Profile Joined May 2009
United States147 Posts
Last Edited: 2009-10-20 07:19:00
October 20 2009 07:08 GMT
#654
On October 20 2009 15:51 Elite00fm wrote:
On October 20 2009 15:44 Matrijs wrote:
If the purpose is to prove that Zergs have had the advantage over Protoss players the last 3 months or whatever time period, why isn't a null hypothesis of winrate = 50% ideal for that purpose?

If we use the historical average, we invite the argument that Zergs have had a historical advantage over Protoss players, which would corrupt our test. The goal for mapmakers should be 50% winrates over time for each race in all three matchups - why shouldn't we measure their results against that goal?


Because the game has always been slightly T>Z>P>T, and this sort of equilibrium has been deemed balanced. It is already assumed that Zergs have had a historical advantage over Protoss. What we are trying to determine is whether the Zerg winrate over the past 7 months is so much higher than the historical figure that the probability of this occurring due to variance is very small, and whether an imbalance has in fact emerged in the matchup.


I don't see this as a particularly compelling argument. If maps are sufficient to significantly alter, and even reverse, the T>Z>P>T historical pattern of imbalance, why should we accept that imbalance? Why shouldn't we aim for T=Z=P=T?

Ignoring that, it seems to me that the proper test is a one-tailed one-proportion z-test:
http://www.acastat.com/Statbook/ztest1.htm

The null hypothesis would be that the Zerg winrate over the sampled period equals the historical rate, which we will approximate conservatively as 55%.

The alternative hypothesis would be that the Zerg winrate over the sampled period exceeds the historical rate.

By my calculation, that test gives us a z-value around 2.5, which is easily high enough to conclude that the current Zerg winrate exceeds the historical rate, even given the conservative assumption of a 55% historical winrate.

Edit: Including my calculations so others can check my work:
Standard error = sqrt((.55)(.45)/885) ~ .01672
Z-value = (.5921-.55)/.01672 ~ 2.52
Elite00fm
Profile Joined January 2008
United States548 Posts
Last Edited: 2009-10-20 07:22:56
October 20 2009 07:21 GMT
#655
How did you get that standard error? The formula is stdev/sqrt(n) isn't it?
Matrijs
Profile Joined May 2009
United States147 Posts
Last Edited: 2009-10-20 07:45:25
October 20 2009 07:24 GMT
#656
On October 20 2009 16:21 Elite00fm wrote:
How did you get that standard error?


The formula's in the link I posted.

SE = square root (p(q)/n)

where p = population proportion (here, the estimated historical Zerg winrate, .55),
q = (1-p)
and n = sample size (885 games sampled, i.e. 524 + 361)
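
The two standard error formulas raised in this exchange, stdev/sqrt(n) and sqrt(p(q)/n), are the same thing for win/loss data. The sketch below is only an illustration: it assumes each game is coded as 1 (Zerg win) or 0 (Protoss win) and uses the 524-361 tally quoted in this thread; the variable names are made up for the example. It plugs in the observed win rate, whereas the z-test itself plugs in the hypothesized rate. Note that the standard deviation of the raw 0/1 data comes out near 0.5, while the standard error of the win rate is far smaller.

```python
import math
import statistics

# Assumption: one entry per game, 1 = Zerg win, 0 = Protoss win (524-361 tally).
wins, losses = 524, 361
games = [1] * wins + [0] * losses

n = len(games)
p = sum(games) / n   # observed proportion of Zerg wins
q = 1 - p

# Standard deviation of the raw 0/1 outcomes (population form, divisor n).
sd_raw = statistics.pstdev(games)
# For binary data this is exactly sqrt(p*q), so it sits near 0.5 when p is near 50%.
sd_formula = math.sqrt(p * q)

# Standard error of the win rate, computed both ways.
se_from_sd = sd_raw / math.sqrt(n)
se_from_pq = math.sqrt(p * q / n)

print(f"stdev of 0/1 data: {sd_raw:.4f}   sqrt(p*q): {sd_formula:.4f}")
print(f"stdev/sqrt(n): {se_from_sd:.5f}   sqrt(p*q/n): {se_from_pq:.5f}")
```

Excel's stdev command divides by n-1 rather than n, so it would differ very slightly from `pstdev` here; at 885 games the difference is negligible.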

So, it seems to me that, yes, something has changed recently. I see several possibilities:
1) Metagame shift. Protoss players may be struggling to find a good counter for the current popular 3 hatch spire to 5 hatch hydra build. This could be either a temporary effect, which will disappear or reverse itself once Protoss players discover an effective counter, or it could be a permanent effect, if the matchup is sufficiently "played out" strategically.
2) Maps. The new maps may be more Zerg-favored in this matchup than previous maps.
3) Mechanics. No one denies that the mechanics of modern pro players are vastly superior to those in the past. It may be that improved mechanics have more of a positive impact on a Zerg's effectiveness than they do on that of a Protoss.

The bottom line, it seems to me, is that unless we see a reversal of the trend over the next few months, tournament and league organizers should start looking at ways to tweak the existing map pool to bring the matchup back into balance, regardless of the cause. A 60% win rate for one race over another is just bad for the game at the competitive level.
zulu_nation8
Profile Blog Joined May 2005
China26351 Posts
Last Edited: 2009-10-20 16:15:58
October 20 2009 16:14 GMT
#657
On October 20 2009 15:33 motbob wrote:
On October 20 2009 15:22 zulu_nation8 wrote:
Standard deviation means how far the mean % from other samples of 800 games in the history of progaming can deviate from the null hypothesis. Which should be something like .05 or .1. What your test proved was that basically your numbers are wrong.

Go into excel and use the command stdev on a bunch of numbers. That's the standard deviation I'm talking about. You plug that into this equation (for omega):

[image loading]


Please don't criticize my methods again until you do a statistical test of your own. After all, you said you would.


motbob, I think it's pretty obvious a standard deviation of 50% is wrong. The sooner you realize this and drop the "I'm an econ major, I know stats" attitude, the faster we can move on.

A win is not 100%, and a loss is not 0%. That would be the standard deviation if Brood War had like 80% half wins or something. Even then it would not make sense, since there would be no statistical significance: EVERYTHING would fall within the range of 0 to 1. That's why your numbers are so messed up.
Black Gun
Profile Blog Joined July 2009
Germany4482 Posts
Last Edited: 2009-10-20 16:35:56
October 20 2009 16:29 GMT
#658
On October 20 2009 13:26 motbob wrote:
OK I just found a much easier way to compile map matchup data! So when I get access to Stata, I'll have better data. I'll do this for all stats since March 1st, 2009.

Byzantium 3: 25-13
Byzantium 2: 30-11
Tears of the Moon: 1-0
New Autumn Wind: 3-1
Medusa: 34-23
Tau Cross: 7-7
Carthage 2: 2-4
Carthage: 0-1
Battle Royale: 4-5
Holy World: 4-3
Shades of Twilight: 1-3
Colosseum II: 2-4
Andromeda: 7-19 (?????)
Neo Harmony: 5-0
God's Garden: 56-44
Carthage 3: 1-0
Outsider: 41-27
Neo Medusa: 34-25
Return of the King: 47-22
Eye of the Storm: 1-1
El Niño: 1-1
Destination: 110-72 (this changed significantly since the time of the OP... EVER OSL prelims used it)
Tornado: 5-1
Outsider SE: 2-0
Moon Glaive: 2-3
Match Point: 3-4
Heartbreak Ridge: 90-64
Fighting Spirit: 6-3

Overall: 524-361, or 59.21%



the variable we are discussing here is binary, hence the estimator of the mean is the proportion p = 524/(524+361) = 0.592. the sample size is large enough to use a normal approximation.

if we assume a null-hypothesis of a balanced winrate of p0 = 50%, then in the corresponding test we need to use this p0 and not p in the formula for the standard deviation!

the test statistic then is:

Z = sqrt(n)*(p - p0)/sqrt[p0*(1-p0)] = sqrt(885)*(0.5921 - 0.5)/sqrt(0.5*(1-0.5)) = 5.479 -> highly significant.


if we assume a null-hypothesis of p0 = 0.55, then we obtain a Z of 2.517 -> p-value of 0.0059, ie significant even on a confidence level of 99%.

so the ZvP-winrate during that timeframe significantly exceeds 55%.
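
For anyone who wants to check these numbers, here is a minimal sketch of the one-tailed one-proportion z-test described above. The function name `one_proportion_z_test` is made up for this example; it uses only the Python standard library, takes the 524-361 tally from the quoted map list, and relies on the normal approximation mentioned in the post.

```python
import math

def one_proportion_z_test(wins, losses, p0):
    """One-tailed z-test of H0: winrate == p0 against H1: winrate > p0."""
    n = wins + losses
    p_hat = wins / n
    # Standard error under the null: uses p0, not the observed p_hat.
    se = math.sqrt(p0 * (1 - p0) / n)
    z = (p_hat - p0) / se
    # Upper-tail probability of a standard normal via the complementary error function.
    p_value = 0.5 * math.erfc(z / math.sqrt(2))
    return z, p_value

for p0 in (0.50, 0.55):
    z, p_value = one_proportion_z_test(524, 361, p0)
    print(f"p0 = {p0:.2f}: z = {z:.3f}, one-tailed p-value = {p_value:.2g}")
```

Changing `p0` lets you try other guesses for the historical rate, such as the 53% end of the 53-55% range suggested earlier in the thread.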
"What am I supposed to do against this?" - "Lose!" :-]
zulu_nation8
Profile Blog Joined May 2005
China26351 Posts
October 20 2009 16:43 GMT
#659
can you explain to me why 1-P0 is the SD
Muirhead
Profile Blog Joined October 2007
United States556 Posts
Last Edited: 2009-10-20 16:45:00
October 20 2009 16:44 GMT
#660
If you guys still doubt the significance, ask yourself why we even need statistics in such a simple case. These numbers are small enough and the question simple enough that we don't need any fancy approximations or limits of the binomial distribution.

ZvP stats are 524-361

If you flip a coin 885 times, the chance of heads coming up 361 times or less is

(Sum(i=0 to 361) (885 C i))/(2^885)

If the coin has, say, a historical 47% chance of heads, then the chance of heads coming up 361 times or less is

Sum(i=0 to 361) (885 C i) * (.47)^i*(.53)^(885-i)

Someone can figure these out in 10 seconds with their TI-89 or Mathematica... unfortunately I can't right now. No need to hide behind fancy stats here!
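
The two sums above can also be evaluated exactly without a calculator. This is a rough sketch, assuming Python 3.8 or later for `math.comb`; `binom_cdf` is a hypothetical helper name, and "heads" stands for a Protoss win, as in the coin-flip framing above. Exact fractions are used so that no approximation of the binomial distribution is involved.

```python
from fractions import Fraction
from math import comb

def binom_cdf(k, n, p):
    """Exact P(X <= k) for X ~ Binomial(n, p), with p given as a Fraction."""
    q = 1 - p
    return sum(comb(n, i) * p**i * q**(n - i) for i in range(k + 1))

n, protoss_wins = 885, 361

# Fair coin: chance of 361 or fewer heads in 885 flips.
fair = binom_cdf(protoss_wins, n, Fraction(1, 2))
# Coin with a 47% chance of heads, as in the second sum.
historical = binom_cdf(protoss_wins, n, Fraction(47, 100))

print(f"fair coin: {float(fair):.3g}")
print(f"47% coin:  {float(historical):.3g}")
```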
starleague.mit.edu