• Log InLog In
  • Register
Liquid`
Team Liquid Liquipedia
EDT 13:41
CEST 19:41
KST 02:41
  • Home
  • Forum
  • Calendar
  • Streams
  • Liquipedia
  • Features
  • Store
  • EPT
  • TL+
  • StarCraft 2
  • Brood War
  • Smash
  • Heroes
  • Counter-Strike
  • Overwatch
  • Liquibet
  • Fantasy StarCraft
  • TLPD
  • StarCraft 2
  • Brood War
  • Blogs
Forum Sidebar
Events/Features
News
Featured News
[ASL19] Finals Recap: Standing Tall10HomeStory Cup 27 - Info & Preview18Classic wins Code S Season 2 (2025)16Code S RO4 & Finals Preview: herO, Rogue, Classic, GuMiho0TL Team Map Contest #5: Presented by Monster Energy6
Community News
Firefly given lifetime ban by ESIC following match-fixing investigation9$25,000 Streamerzone StarCraft Pro Series announced4Weekly Cups (June 30 - July 6): Classic Doubles6[BSL20] Non-Korean Championship 4x BSL + 4x China9Flash Announces Hiatus From ASL66
StarCraft 2
General
Firefly given lifetime ban by ESIC following match-fixing investigation TL Team Map Contest #4: Winners Weekly Cups (June 30 - July 6): Classic Doubles The SCII GOAT: A statistical Evaluation The GOAT ranking of GOAT rankings
Tourneys
FEL Cracov 2025 (July 27) - $8000 live event $25,000 Streamerzone StarCraft Pro Series announced Sparkling Tuna Cup - Weekly Open Tournament RSL: Revival, a new crowdfunded tournament series WardiTV Mondays
Strategy
How did i lose this ZvP, whats the proper response Simple Questions Simple Answers
Custom Maps
[UMS] Zillion Zerglings
External Content
Mutation # 481 Fear and Lava Mutation # 480 Moths to the Flame Mutation # 479 Worn Out Welcome Mutation # 478 Instant Karma
Brood War
General
BGH Auto Balance -> http://bghmmr.eu/ i aint gon lie to u bruh... ASL20 Preliminary Maps [G] Progamer Settings [ASL19] Finals Recap: Standing Tall
Tourneys
[Megathread] Daily Proleagues [BSL20] Non-Korean Championship 4x BSL + 4x China [BSL20] Grand Finals - Sunday 20:00 CET CSL Xiamen International Invitational
Strategy
Simple Questions, Simple Answers I am doing this better than progamers do.
Other Games
General Games
Nintendo Switch Thread Stormgate/Frost Giant Megathread Path of Exile What do you want from future RTS games? Beyond All Reason
Dota 2
Official 'what is Dota anymore' discussion
League of Legends
Positive Thoughts on Setting Up a Dual-Caliber FX
Heroes of the Storm
Simple Questions, Simple Answers Heroes of the Storm 2.0
Hearthstone
Heroes of StarCraft mini-set
TL Mafia
TL Mafia Community Thread Vanilla Mini Mafia
Community
General
US Politics Mega-thread Things Aren’t Peaceful in Palestine Russo-Ukrainian War Thread Summer Games Done Quick 2025! Stop Killing Games - European Citizens Initiative
Fan Clubs
SKT1 Classic Fan Club! Maru Fan Club
Media & Entertainment
Anime Discussion Thread [Manga] One Piece [\m/] Heavy Metal Thread
Sports
Formula 1 Discussion 2024 - 2025 Football Thread NBA General Discussion TeamLiquid Health and Fitness Initiative For 2023 NHL Playoffs 2024
World Cup 2022
Tech Support
Computer Build, Upgrade & Buying Resource Thread
TL Community
The Automated Ban List
Blogs
momentary artworks from des…
tankgirl
Culture Clash in Video Games…
TrAiDoS
from making sc maps to makin…
Husyelt
StarCraft improvement
iopq
Customize Sidebar...

Website Feedback

Closed Threads



Active: 689 users

GSL Season 3 predictions using statistics

Forum Index > StarCraft 2 Tournaments
Post a Reply
1 2 3 4 Next All
Mip
Profile Joined June 2010
United States63 Posts
Last Edited: 2010-12-09 18:17:31
December 08 2010 09:44 GMT
#1
Hey all,

I'm working on my master's degree in statistics. I've wanted to do a project that was Starcraft related. So, for my semester project in my Bayesian Methods class, I thought I'd do a ranking system for the GSL players. I hope to expand on this later, but right now it just takes all the brackets, the win/loss numbers with the player names, processes them using Bayesian magic (I used a Bayesian hierarchical model with binomial prior, p = inverse logit link with 2 player skill parameters, and player skills are distributed Normal(0,sigma^2). I'll post the technical write-up later if anyone is interested), and spits out a skill parameter which can be translated into a probability statement about who will win in a game.

A lot of players didn't play enough games to be able to estimate their skill with very much confidence, Jinro and Choya are examples, but players with high uncertainty are pulled towards the group average.

Forgive the formatting.

Probabilities of each of the top 4 taking the advancing to the next round and also advancing and then winning the whole tournament (Based only on GSL Seasons 1-3 data):
+ Show Spoiler +

Remember these are only based on the data and are based on the chances of each player winning a Bo7 match against each of these other players. You should look at these predictions and say to yourself, "Those are almost all the same, based on just the data, this GSL could realistically go to any one of these 4 players." .

Names ProbWinNext ProbWinFinal
1 Rain - 0.5134 - 0.2443
2 HongUn - 0.4865 - 0.2256
3 MC - 0.4402 - 0.2183
4 Jinro - 0.5597 - 0.3115



Edit : Fixed Coding problem that made people with high variance skill's go incorrectly high.
Top Player Rankings (Final skill chosen to be mean - 2 * std. error)


Rankings on Google Spreadsheet

How to interpret the skill parameters :

The actual number is not important, only the distance between the numbers.

+ Show Spoiler +
To get the probability that one player will beat another, the formula is exp(skill1)/(exp(skill1)+exp(skill2)) where skill1 is the skill of the winner and skill2 is the skill of the loser.

So if you wanted to calculate FruitDealer vs NesTea and their skills were 1.48 and 1.32 respectively, calculate exp(1.48)/(exp(1.48)+exp(1.32)) = .5387. Meaning FruitDealer has a 53.87% chance of beating NesTea in a single game. This is not exactly how I did it, but this is a simple approximation.



If this is interesting to anyone, I'd happily provide more information.

I can generate hypothetical match-ups of anyone I have good data for (Code S players I have pretty good data on as a rule of thumb).

Disclaimer
+ Show Spoiler +
Take the predictions and rankings with a grain of salt, they are only preliminary and will in crease in accuracy over time. They are only based on GSL data and they also treat all seasons equally, assuming no skill change between seasons (not an assumption I wanted to make, there just isn't enough data). This makes the data more useful as a ranking tool than a prediction tool at this point. There also are other tournaments that I could potentially enter in, but gathering the data and getting it properly formatted is proving to be a pain in the butt.



Future Work
+ Show Spoiler +
I have additions I'd like to add to this analysis. If I can get some help with gathering and formatting data, I could adjust for and calculate race advantages or how much of an advantage a certain map gives to a certain race. Also, if I could just get the quantity of the data to increase, I could do a lot more calculation. I'd like to add in more tournaments besides the GSL. Using just the GSL is great to have a fair localized ranking system, but to predict future games, it's best to have as much data about each player as possible.

If I had hundreds of times more data, I could calculate, based on win/loss data, which build orders, strategies, transitions and such are more effective vs which others.

Right now my data looks like this :

NesTea 2
Jys 1

Vines 1
Sen 2

Goma 2
JookToJung 0

Maka 2
Sleep 0

etc.

No race information, no map information, nothing, I just copied it straight from Liquipedia and then organized it for analysis to look like this:

NesTea Jys
NesTea Jys
Jys NesTea
Vines Sen
Sen Vines
Sen Vines
etc.

After that it was just a matter of coding up the model and interpreting the results.


So leave comments if you found this interesting or want to know more or whatever. If you want help implementing something similar, pm me. If you can think of some interesting Starcraft related questions that can be answered through data, post them and I'll see about looking into it. Also, if you are interested in helping me find and/or format data pm me.
Consolidate
Profile Joined February 2010
United States829 Posts
December 08 2010 09:49 GMT
#2
Very cool stuff. I'd be one of those people interested in the technical write-up.
Creature posessed the the spirit of inquiry and bloodlust - Adventure Time
Wargizmo
Profile Joined March 2010
Australia1237 Posts
Last Edited: 2010-12-08 09:55:24
December 08 2010 09:53 GMT
#3
Well I'm no stats or math wiz but I think the fact that NEXLiveForever is 3rd on your rankings list means that your system could be improved a lot, clearly some weighting should be given to more recent results, and possibly qualifiers should be taken into account as well, as someone like Rainbow who can qualify for 3 GSLs and make a semi final and a finals appearance is clearly better than someone like NEXLiveForever who was only able to only qualify once and made one semi, or OGSTop who made one Ro16 and was unable to qualify after that.



Information is not knowledge. Knowledge is not wisdom. Wisdom is not truth. Truth is not beauty. Beauty is not love. Love is not music. Music is best. - Frank Zappa
McDonalds
Profile Joined March 2010
Liechtenstein2244 Posts
December 08 2010 10:03 GMT
#4
On December 08 2010 18:53 Wargizmo wrote:
Well I'm no stats or math wiz but I think the fact that NEXLiveForever is 3rd on your rankings list means that your system could be improved a lot, clearly some weighting should be given to more recent results, and possibly qualifiers should be taken into account as well, as someone like Rainbow who can qualify for 3 GSLs and make a semi final and a finals appearance is clearly better than someone like NEXLiveForever who was only able to only qualify once and made one semi, or OGSTop who made one Ro16 and was unable to qualify after that.

Didn't LF skip the second and third season to focus on school?
High five :---)
Mip
Profile Joined June 2010
United States63 Posts
December 08 2010 10:10 GMT
#5
I agree with your concern completely Wargizmo. I saw NEXLiveForever getting very highly ranked, and was like, "wtf". He's boosted, probably unfairly, by having taking out NesTea in his first set. The model, as it is, treats all games equally, regardless of round played in or which season it is in. At this point, however, with only the GSL data, trying to add in a time-effect isn't feasible. There just isn't enough data.

The biggest issue is cross-over, the single elimination bracket format is a statistical nightmare because it doesn't give you a lot of information how players would be doing against other opponents. A round robin into brackets or even double elimination would be so much better for an objective analysis.

Time effects are something I definitely have in mind for future use. I mean, it's pretty clear that a year from now, no one will care what happened in GSL Season 1 as far as predictions are concerned.

As far as fairness, at very worst, my ranking is as bad as the GomTV rankings with their arbitrary point system that doesn't take into account how difficult your bracket is. My ranking system is almost entirely based on the strength of your opponents in how much a win effects your ranking. With more data, this bayesian approach will be infinitely more fair and more reliable than a simple point system.
Mip
Profile Joined June 2010
United States63 Posts
December 08 2010 10:19 GMT
#6
Yeah, I was just looking at the brackets again on LiveForever and Top, they are sandwiched at a high ranking by who they beat and who they lost against.

Top beat Polt, who beat MC, who is getting very highly ranked by the current season, which drags Top up. Then he lost to Fruitdealer, which pushes him back down.

LiveForever knocked out NesTea, and lost to FruitDealer. Ro4 is nothing to scoff at though, he deserves to be fairly high ranked.

I'm sure if I had more data on these guys, they'd get dragged down somewhat. But LF took out bigger names on his way to Ro4 than did Jinro, for example. I think this is a positive thing of the ranking algorithm, and more data will sort out any strange results.
TheFinalWord
Profile Joined May 2010
Australia790 Posts
December 08 2010 10:24 GMT
#7
You should use the stats to vote for your next gsl liquibet and see how it goes. Cool stuff.
nkr
Profile Blog Joined November 2010
Sweden5451 Posts
December 08 2010 10:35 GMT
#8
Interesting stuff, keep improving it
ESPORTS ILLUMINATI
namedplayer
Profile Joined June 2010
844 Posts
December 08 2010 10:44 GMT
#9
Liveforever didn't come to GSL2,3.
You know what I'm talking about
greycubed
Profile Joined May 2010
United States615 Posts
December 08 2010 10:47 GMT
#10
On December 08 2010 18:53 Wargizmo wrote:
Well I'm no stats or math wiz but I think the fact that NEXLiveForever is 3rd on your rankings list means that your system could be improved a lot, clearly some weighting should be given to more recent results, and possibly qualifiers should be taken into account as well, as someone like Rainbow who can qualify for 3 GSLs and make a semi final and a finals appearance is clearly better than someone like NEXLiveForever who was only able to only qualify once and made one semi, or OGSTop who made one Ro16 and was unable to qualify after that.
Yeah that's the most glaring flaw so far. I mean Rain as 5th best? - -

Standardizing the scoring for someone who fails to qualify also has big problems (like for Tester who got knocked out in a qualifier by Foxer).

May have to separate it by season until there is more data.
http://i.imgur.com/N3ujB.png
MindTricks
Profile Joined November 2010
16 Posts
December 08 2010 10:53 GMT
#11
u rly got nothing better to do

User was warned for this post
Wargizmo
Profile Joined March 2010
Australia1237 Posts
Last Edited: 2010-12-08 11:02:57
December 08 2010 11:00 GMT
#12
On December 08 2010 19:19 Mip wrote:
Yeah, I was just looking at the brackets again on LiveForever and Top, they are sandwiched at a high ranking by who they beat and who they lost against.

Top beat Polt, who beat MC, who is getting very highly ranked by the current season, which drags Top up. Then he lost to Fruitdealer, which pushes him back down.

LiveForever knocked out NesTea, and lost to FruitDealer. Ro4 is nothing to scoff at though, he deserves to be fairly high ranked.

I'm sure if I had more data on these guys, they'd get dragged down somewhat. But LF took out bigger names on his way to Ro4 than did Jinro, for example. I think this is a positive thing of the ranking algorithm, and more data will sort out any strange results.


That makes sense, but having said that, there still needs to be some weighting for recent results, to account for a player improving over time. NesTea for example had only just switched over to Sc2 in GSL1 and if you watched those games against LiveForever you would see that he was an absolute newbie at the game back then.

Also I don't see why a player's performance in in GSL3 would determine his rating for a game that occurred in GSL1 in the first place, that seems kind of silly to me. I mean if someone else LiveForever beat ends up winning a GSL then you could potentially have this guy rising up the rankings even further without even playing a single game.

Information is not knowledge. Knowledge is not wisdom. Wisdom is not truth. Truth is not beauty. Beauty is not love. Love is not music. Music is best. - Frank Zappa
greycubed
Profile Joined May 2010
United States615 Posts
Last Edited: 2010-12-08 11:12:56
December 08 2010 11:07 GMT
#13
Using data from other tourneys would help as well.

Good place to start looking- though of course minor tourneys should be given a lot less weight- http://www.gosugamers.net/starcraft2/rankings
http://i.imgur.com/N3ujB.png
Nosferaziel
Profile Joined July 2010
France9 Posts
December 08 2010 11:12 GMT
#14
Could you give us the standard error of your probabilities ?
I'd like to know if they're relevant or not, obviously you don't have a lot of data with just GSL 1-3.

I like the idea though !
jyisvip
Profile Joined August 2010
Canada209 Posts
December 08 2010 11:18 GMT
#15
you have a high chance of winning the LB for gsl4! lol
kazansky
Profile Blog Joined February 2010
Germany931 Posts
Last Edited: 2010-12-08 11:26:01
December 08 2010 11:22 GMT
#16
My background: I am studying mathematics to diploma for 12 semesters now.
To be honest, I think your model should imply matchup deviation, therefor needs to be at least 3-dimensional. Of course, considering every map would be even better but then the sample pool will by far not return you any valueable information.
But unless you at least concern matchups I fear the model is too theoretical for any substance.
I sorted around the issue aswell when I wanted to do some Starcraft manager toy and wanted to use winning probability to generate match results.

Adressed to the point of your transititive conclusion (A beats B beats C), you can't nevertheless your amount of data, because as you already figured out with your approach, matches can be interpreted as weighed coinflips, but you cannot achieve any kind of transitive ordering in a competition (which is good, because thats the point of competition after alll *g*)

Nice stuff though, I love mathematical approaches to this.

Edit: If you want to generate forecasts, you should consider reading the book from the guy that wrote that baseball statistics book and developed the "on-base-percentage" stat. I can for heavens sake not recall his name, i'll look after it when i'm home, you americans though might know him right away, he was consulted by the Red Sox before winning MLB.
"Mathematicians don't understand mathematics, they get used to it." - Prof. Kredler || "That was more one-sided that a mobius strip." - Tasteless
Raysalis
Profile Joined July 2010
Malaysia1034 Posts
December 08 2010 11:36 GMT
#17
What is the error for the probability that you generated? Its not something ridiculous like +-0.5 right?
:)
MindTrick
Profile Joined December 2010
7 Posts
December 08 2010 12:14 GMT
#18
--- Nuked ---
LawnMower
Profile Joined February 2010
Sweden557 Posts
Last Edited: 2010-12-08 12:54:53
December 08 2010 12:20 GMT
#19
Look who's talking.
The trouble with the world is that the stupid are cocksure and the intelligent are full of doubt. - Bertrand Russel
borny
Profile Blog Joined April 2010
China481 Posts
Last Edited: 2010-12-08 12:54:58
December 08 2010 12:26 GMT
#20

I know trolls should not be fed but seriously, if someone put work and time into something concerning SC2 he should be credited and not patronized. You are the sad fuck with no life flaming people like him.
Naniwa . July . Morrow . ¯\_(ツ)_/¯ Go STARTALE!
1 2 3 4 Next All
Please log in or register to reply.
Live Events Refresh
WardiTV European League
16:00
Swiss Groups Day 3
MaxPax vs HarstemLIVE!
Scarlett vs Shameless
WardiTV649
TKL 242
Liquipedia
uThermal 2v2 Circuit
16:00
Warmup Cup 2
uThermal244
IndyStarCraft 205
SteadfastSC90
Liquipedia
[ Submit Event ]
Live Streams
Refresh
StarCraft 2
mouzHeroMarine 427
TKL 239
uThermal 237
IndyStarCraft 208
mcanning 164
SteadfastSC 90
UpATreeSC 89
BRAT_OK 74
MindelVK 27
StarCraft: Brood War
Bisu 1629
Shuttle 1067
EffOrt 863
Larva 431
firebathero 318
Mini 278
Dewaltoss 84
TY 69
soO 42
HiyA 35
[ Show more ]
Rock 19
GoRush 12
yabsab 10
IntoTheRainbow 10
sas.Sziky 8
Dota 2
Gorgc10354
qojqva2318
League of Legends
singsing1812
Dendi903
Counter-Strike
fl0m832
byalli548
flusha332
zeus204
Foxcn160
Heroes of the Storm
Liquid`Hasu367
Khaldor154
Other Games
FrodaN2151
Beastyqt682
KnowMe144
oskar128
QueenE55
Trikslyr45
Organizations
Other Games
gamesdonequick45299
StarCraft 2
angryscii 12
Blizzard YouTube
StarCraft: Brood War
BSLTrovo
sctven
[ Show 19 non-featured ]
StarCraft 2
• Reevou 7
• intothetv
• sooper7s
• Migwel
• AfreecaTV YouTube
• LaughNgamezSOOP
• IndyKCrew
• Kozan
StarCraft: Brood War
• FirePhoenix1
• STPLYoutube
• ZZZeroYoutube
• BSLYoutube
Dota 2
• C_a_k_e 2698
• masondota2661
League of Legends
• Nemesis5071
• TFBlade1122
• Jankos980
Other Games
• imaqtpie638
• Shiphtur469
Upcoming Events
Replay Cast
6h 19m
RSL Revival
16h 19m
ByuN vs SHIN
Clem vs Reynor
OSC
19h 19m
Replay Cast
1d 6h
RSL Revival
1d 16h
Classic vs Cure
FEL
1d 22h
OSC
2 days
RSL Revival
2 days
FEL
2 days
FEL
2 days
[ Show More ]
CSO Cup
2 days
BSL20 Non-Korean Champi…
3 days
Bonyth vs QiaoGege
Dewalt vs Fengzi
Hawk vs Zhanhun
Sziky vs Mihu
Mihu vs QiaoGege
Zhanhun vs Sziky
Fengzi vs Hawk
Sparkling Tuna Cup
3 days
RSL Revival
3 days
FEL
3 days
BSL20 Non-Korean Champi…
4 days
Bonyth vs Dewalt
QiaoGege vs Dewalt
Hawk vs Bonyth
Sziky vs Fengzi
Mihu vs Zhanhun
QiaoGege vs Zhanhun
Fengzi vs Mihu
Replay Cast
5 days
Liquipedia Results

Completed

Proleague 2025-07-07
HSC XXVII
Heroes 10 EU

Ongoing

JPL Season 2
BSL 2v2 Season 3
Acropolis #3
KCM Race Survival 2025 Season 2
CSL 17: 2025 SUMMER
Copa Latinoamericana 4
Jiahua Invitational
Championship of Russia 2025
RSL Revival: Season 1
Murky Cup #2
BLAST.tv Austin Major 2025
ESL Impact League Season 7
IEM Dallas 2025
PGL Astana 2025
Asian Champions League '25
BLAST Rivals Spring 2025
MESA Nomadic Masters
CCT Season 2 Global Finals
IEM Melbourne 2025

Upcoming

2025 ACS Season 2: Qualifier
CSLPRO Last Chance 2025
CSL Xiamen Invitational
CSL Xiamen Invitational: ShowMatche
2025 ACS Season 2
CSLPRO Chat StarLAN 3
K-Championship
uThermal 2v2 Main Event
SEL Season 2 Championship
FEL Cracov 2025
Esports World Cup 2025
Underdog Cup #2
StarSeries Fall 2025
FISSURE Playground #2
BLAST Open Fall 2025
BLAST Open Fall Qual
Esports World Cup 2025
BLAST Bounty Fall 2025
BLAST Bounty Fall Qual
IEM Cologne 2025
FISSURE Playground #1
TLPD

1. ByuN
2. TY
3. Dark
4. Solar
5. Stats
6. Nerchio
7. sOs
8. soO
9. INnoVation
10. Elazer
1. Rain
2. Flash
3. EffOrt
4. Last
5. Bisu
6. Soulkey
7. Mini
8. Sharp
Sidebar Settings...

Advertising | Privacy Policy | Terms Of Use | Contact Us

Original banner artwork: Jim Warren
The contents of this webpage are copyright © 2025 TLnet. All Rights Reserved.