Statistical Analysis of StarCraft 2 Balance - Page 3

Apokilipse
Profile Joined April 2011
United States2 Posts
May 05 2011 05:46 GMT
#41
Very interesting to read! Most discussions about balance are simply unproductive rants, and it's fascinating to see someone take a scientific approach to documenting Starcraft balance.
Audi > Peugeot
d_ijk_stra
Profile Joined March 2011
United States36 Posts
May 05 2011 05:48 GMT
#42
arbitrageur/ it was based on the parameter I estimated via statistical inference. Of course it was not a significantly large value, but there was a slight indication. I did not extrapolate ladder data. It is based on GSL statistics, but instead of using mere ZvP statistics, I took each player's skill into account.

You can still question the adequacy of my model anyway, and thus further question the adequacy of my estimated parameters. But at least those values are from data, not from my personal understanding of the game. Actually, I personally think P > Z, T if skills are equal, but this is what I got.
My Life for IU!
Thrombozyt
Profile Blog Joined June 2010
Germany1269 Posts
Last Edited: 2011-05-05 05:59:01
May 05 2011 05:57 GMT
#43
I guess it would be better to use a different data set, as the game has changed vastly since October 2010, when Steppes of War and Delta Quadrant were still in the map pool and many balance changes were not yet in place (roach range increase, anyone?).

You cannot really group different patches together, as potential 'imbalance' from a former patch will reflect on current patches. Also by using only current data (say March 2011 and onwards) but drawing from more tourneys you actually reduce the number of maps played and therefore the number of parameters you have to determine (as each map carries 3 beta values for the matchups) from a limited set of data.

Edit:
Changing the data set would also improve the quality of the analysis, because you wouldn't have to make the assumption that the Korean style is the 'gold standard' and rather take data from all over the world avoiding local bias.
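
To make the parameter-count argument concrete, here is a rough sketch. It assumes the model has one skill parameter per player plus three beta values per map, as described above; the function name and the pool sizes are made up purely for illustration and are not taken from the paper.

```python
def parameter_count(n_players: int, n_maps: int, matchups_per_map: int = 3) -> int:
    """Free parameters: one skill per player plus one beta per (map, matchup).

    matchups_per_map=3 corresponds to the three non-mirror matchups
    (TvZ, ZvP, PvT) each map carries a beta for.
    """
    return n_players + n_maps * matchups_per_map


# Hypothetical pool sizes, chosen only to illustrate the point.
all_patches = parameter_count(n_players=100, n_maps=20)
recent_only = parameter_count(n_players=100, n_maps=8)
print(all_patches, recent_only)
```

With the same number of players, every map dropped from the pool removes three parameters that would otherwise have to be estimated from the same limited set of games.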
Primadog
Profile Blog Joined April 2010
United States4411 Posts
Last Edited: 2011-05-05 05:59:36
May 05 2011 05:57 GMT
#44
On May 05 2011 14:26 Nontrivial wrote:
Although I'm no math major, I'm quite impressed with what I do understand. I do have one question, though: how close is this to what the balance team talked about at BlizzCon?

Here is the link to what I'm referring to: Link



This paper's approach differs from the balance team's.

d_ijk_stra's approach is to create a statistical model for competitive StarCraft that uses only two variables: (1) player skill and (2) map racial bias. He then shows that the model is a good fit for the GSL data. Finally, he asks whether the model demonstrates any strong racial bias (using an average of the map racial bias variable) and concludes that no significant bias has been observed thus far.

What is significant here is that his approach uses competitive play data, which the community generally considers a better indicator of game balance than the ladder. Secondarily, he created a model that separates player skill from map racial preference and fits this data, which is important for studying whether there is an imbalance in the game.
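
For anyone who wants to see what a two-variable model of this kind looks like, here is a minimal sketch. It assumes a logistic (Bradley-Terry style) form in which the skill difference and the map's racial bias are simply added together; the paper's exact parameterization is not reproduced in this thread, so the function name, the additive form, and the numbers are all assumptions.

```python
import math


def win_probability(skill_a: float, skill_b: float, map_bias: float) -> float:
    """P(player A wins) under an assumed logistic model.

    skill_a, skill_b: latent skill of each player (higher is better).
    map_bias: the map's bias toward A's race against B's race;
              0 means the map is balanced for that matchup.
    """
    return 1.0 / (1.0 + math.exp(-(skill_a - skill_b + map_bias)))


# Hypothetical values for illustration only.
print(win_probability(1.2, 1.2, 0.0))  # equal skill, neutral map: 0.5
print(win_probability(1.2, 1.2, 0.3))  # equal skill, map favors A's race
```

The point of separating the two terms is that a lopsided win rate on a map can come either from stronger players or from the map itself, and the model attributes it to whichever explanation fits the whole set of games better.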
Thank God and gunrun.
palanq
Profile Blog Joined December 2004
United States761 Posts
May 05 2011 05:59 GMT
#45
this is great stuff.

are you going to do more, or was this just for a class or something? if so, you should scrape TLPD for broodwar proleague games or something, which would give you a lot more data, enough to do multi-period analysis and see how the parameter estimates change over time. plus you don't have as many inter-game dependencies that there are with best-of-X series.
time flies like an arrow; fruit flies like a banana
aksfjh
Profile Joined November 2010
United States4853 Posts
May 05 2011 06:01 GMT
#46
I really appreciate your work on the subject. It was done with the intent of academic integrity, and succeeded in that.

The only "beef" I have with it is the fact that it covers a rather volatile period of SC2 (with frequent patches completely changing matchups), along with a region that has been predominantly Terran-based since release. Not only that, but the Protoss from that region have also failed to perform on an individual basis in individual matches.
space_yes
Profile Joined April 2010
United States548 Posts
May 05 2011 06:03 GMT
#47
An interesting read though I'm skeptical of your approach given that you're taking games from different patches and each patch changed the rules of the game. Aside from suggesting that each patch may in fact represent a different population (given that each patch is technically a different game) sampling across the patches should significantly impact the limitations described in your model (particularly conditional independence and the interactions between the players).
space_yes
Profile Joined April 2010
United States548 Posts
May 05 2011 06:05 GMT
#48
I will add that it is nice to see someone actually doing statistics, I'm fucking tired of these "here are some numbers/graphs, now this is what I think type threads." These threads should be closed by mods and the users warned imo.
d_ijk_stra
Profile Joined March 2011
United States36 Posts
Last Edited: 2011-05-05 06:06:19
May 05 2011 06:05 GMT
#49
On May 05 2011 14:57 Thrombozyt wrote:
I guess it would be better to use a different data set, as the game has changed vastly since October 2010, when Steppes of War and Delta Quadrant were still in the map pool and many balance changes were not yet in place (roach range increase, anyone?).

You cannot really group different patches together, as potential 'imbalance' from a former patch will reflect on current patches. Also by using only current data (say March 2011 and onwards) but drawing from more tourneys you actually reduce the number of maps played and therefore the number of parameters you have to determine (as each map carries 3 beta values for the matchups) from a limited set of data.

Edit:
Changing the data set would also improve the quality of the analysis, because you wouldn't have to make the assumption that the Korean style is the 'gold standard' and rather take data from all over the world avoiding local bias.


I strongly agree with your comments and space_yes's. At the time I was conducting the analysis, it was March and I didn't have a good understanding of tournaments other than GSL. Moreover, gamers in GSL were isolated from others. But I didn't have enough GSL games per patch, so I had to aggregate them all. I also feel very uncomfortable about this.

Now the situation is a little different. There are many ongoing "global" leagues like NASL/TSL, which I also enjoy watching, so I have a larger number of games worldwide and it might be enough to conduct a valid analysis. I hope I can do a follow-up analysis soon!
My Life for IU!
slyboogie
Profile Blog Joined March 2011
United States3423 Posts
May 05 2011 06:17 GMT
#50
Good read! The regression hammer comes to SC2 =) I'd like to see a larger sample size, but the methodology is fine and the interpretation is sound. Thanks for the work!
"We dug coal together." Boyd Crowder, Justified
Valroth
Profile Joined January 2011
New Zealand28 Posts
May 05 2011 06:24 GMT
#51
A lot of effort for a fundamentally flawed analysis. You say that you've taken player skill into account, which is something that cannot be measured statistically in matches between different races. Measuring player skill based on mirror matches and then using that to add/reduce weight to balance statistics in matches between different races is logically misleading. I found it interesting anyway.
GhettoSheep
Profile Joined August 2008
United States150 Posts
May 05 2011 06:29 GMT
#52
I like how you admit that your results aren't statistically significant.
TheRabidDeer
Profile Blog Joined May 2003
United States3806 Posts
Last Edited: 2011-05-05 06:32:09
May 05 2011 06:30 GMT
#53
On May 05 2011 15:29 GhettoSheep wrote:
I like how you admit that your results aren't statistically significant.

There is nothing to admit, it's stating a fact. Saying he admits to something makes it sound like it's something bad.

Anyway, look forward to the next one! GL with all of your coursework!

EDIT: Or, I think maybe you misunderstood what statistical significance is?
d_ijk_stra
Profile Joined March 2011
United States36 Posts
May 05 2011 06:37 GMT
#54
On May 05 2011 15:24 Valroth wrote:
A lot of effort for a fundamentally flawed analysis. You say that you've taken player skill into account, which is something that cannot be measured statistically in matches between different races. Measuring player skill based on mirror matches and then using that to add/reduce weight to balance statistics in matches between different races is logically misleading. I found it interesting anyway.


This is a good point, but I don't think the analysis is fundamentally flawed.

This model assumes that each player's skill is the same for every match. That may not be true, as we know from BW that some gamers are really good vs. a specific race and suck vs. another. But I think most gamers show a consistent level of skill between games, so the overall analysis may not be that misleading. Yes, actually, without such an assumption it's impossible to quantify the balance between two races...

You may still disagree with this and reject the results. Every statistical model makes assumptions to overcome data scarcity, and I think whether the assumption is valid or not is a constructive discussion. I think the assumption is not that strong... But it's reasonable to question it. I have some ideas about more sophisticated models to account for this... Hope I can show results soon
My Life for IU!
han_han
Profile Joined October 2010
United States205 Posts
May 05 2011 06:43 GMT
#55
Wow, scholarly articles on Starcraft II? I am TOTALLY diggin' this.
Primadog
Profile Blog Joined April 2010
United States4411 Posts
May 05 2011 06:50 GMT
#56
On May 05 2011 15:24 Valroth wrote:
A lot of effort for a fundamentally flawed analysis. You say that you've taken player skill into account, which is something that cannot be measured statistically in matches between different races. Measuring player skill based on mirror matches and then using that to add/reduce weight to balance statistics in matches between different races is logically misleading. I found it interesting anyway.


There's not enough data points available to estimate every player's skill level in particular match-ups, but the tests he used showed that his model fits the dataset well despite this flaw. You also mischaracterized how skill is measured and used in the first place.

When you make a statistics model, you have to make certain assumptions that may not completely reflect reality. It's the nature of dealing with any large set of data. If you believe an assumption is incorrect, create a better model and demonstrate that it better fits the data. Believing that making assumptions somehow discredits a model simply shows that you have absolutely no idea how Statistics as a hard science works.
Thank God and gunrun.
Techno
Profile Joined June 2010
1900 Posts
Last Edited: 2011-05-05 14:51:53
May 05 2011 14:47 GMT
#57
On May 05 2011 13:54 d_ijk_stra wrote:
Techno/ Well, this is what is called the 'latent variable' method, which enables you to model what cannot be observed. It need not be defined or observed, although it's convenient to 'interpret' it that way. Actually, the latent variable method is a very popular technique these days, although it is not covered in basic statistics courses (even at the graduate level).

I think you confused it with the random effects / hierarchical model in ANOVA. You don't really need to assume the latent variable follows a normal distribution. Of course, without any regularization it will overfit the data, and assuming a normal distribution is a good way to regularize your parameters. But you can also use other types of regularization... I used an L1 penalty for other reasons. However, I guess you may not want to discuss this much technical detail
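
As a rough illustration of what L1 regularization of latent skills means in practice, here is a sketch of a penalized objective. It assumes a logistic win model and penalizes both the skill and the map-bias parameters; which parameters the paper actually penalizes is not stated in this thread, so treat the function name, the data layout, and `lam` as hypothetical.

```python
import math


def penalized_nll(games, skills, betas, lam):
    """Negative log-likelihood of game outcomes plus an L1 penalty.

    games:  list of (player_a, player_b, map_matchup_key, a_won) tuples
    skills: dict mapping player -> latent skill
    betas:  dict mapping map_matchup_key -> racial bias on that map
    lam:    penalty strength (hypothetical tuning parameter)
    """
    nll = 0.0
    for a, b, key, a_won in games:
        z = skills[a] - skills[b] + betas[key]
        p = 1.0 / (1.0 + math.exp(-z))  # assumed logistic win probability
        nll -= math.log(p if a_won else 1.0 - p)
    # L1 penalty: sum of absolute values, which pushes small parameters to exactly zero.
    penalty = lam * (sum(abs(s) for s in skills.values())
                     + sum(abs(b) for b in betas.values()))
    return nll + penalty
```

Minimizing this instead of the plain likelihood keeps the many per-player and per-map parameters from chasing noise when each of them is supported by only a handful of games.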

I really think it would have been better if you had used win rates of certain leagues, assuming skill is either not present or normally distributed, as it is debatable that skill even exists outside of winning. And if you do include skill, you should include variables like:

- Skill's effect on racial performance
- Skill's effect on this map
- Skill's effect on this strategy (perhaps strategy is a part of skill, perhaps not)


I feel like skill is a very abstract concept, that cannot be precisely defined by even God. I feel like it has no place in statistical analyses. I may be wrong, but that's just my thoughts. I mean no disrespect to your report, in fact I respect it.
Hell, its awesome to LOSE to nukes!
Primadog
Profile Blog Joined April 2010
United States4411 Posts
Last Edited: 2011-05-05 20:05:14
May 05 2011 20:04 GMT
#58
Skill as a normally distributed variable that influences win rate is the fundamental part of games and sports ratings, dating back to the beginnings of chess Elo. Every Elo, TrueSkill, or computerized/holistic ranking system you see on major sports and gaming sites is based on the concept of skill as a measurable variable. There's nothing innovative or surprising about this assumption.
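
For reference, this is roughly how a rating system turns "skill" into a win probability. The sketch uses the widely used logistic form of the Elo expected-score formula rather than the normal-distribution version; the K-factor of 32 is a common choice and an assumption here, not something taken from the paper.

```python
def elo_expected_score(rating_a: float, rating_b: float) -> float:
    """Expected score of player A against player B (logistic Elo form)."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def elo_update(rating_a: float, rating_b: float, score_a: float, k: float = 32.0) -> float:
    """New rating for A after one game; score_a is 1 for a win, 0 for a loss."""
    expected = elo_expected_score(rating_a, rating_b)
    return rating_a + k * (score_a - expected)


print(elo_expected_score(1500, 1500))  # evenly matched: 0.5
print(elo_update(1500, 1500, 1.0))     # win against an equal opponent: 1516.0
```

Skill is never observed directly in any of these systems. It is whatever number makes the observed wins and losses most plausible.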
Thank God and gunrun.
awesomoecalypse
Profile Joined August 2010
United States2235 Posts
May 05 2011 20:12 GMT
#59
On May 06 2011 05:04 Primadog wrote:
Skill as a normally distributed variable that influences win rate is the fundamental part of games and sports ratings, dating back to the beginnings of chess Elo. Every Elo, TrueSkill, or computerized/holistic ranking system you see on major sports and gaming sites is based on the concept of skill as a measurable variable. There's nothing innovative or surprising about this assumption.


this is true, but all these assumptions correlate winrate to skill, which is something some players dispute. a guy like IdrA would argue that cheesy players are "unskilled" even when they win, something the formula would clearly dispute.

But, as someone who thinks that mindset is counterproductive nonsense, and that a win is a win, I'm all for this system.
He drone drone drone. Me win. - ogsMC
hypnobean
Profile Joined October 2010
89 Posts
May 05 2011 20:20 GMT
#60
Anyone notice the paper identifies Jinro's race as Protoss?