• Log InLog In
  • Register
Liquid`
Team Liquid Liquipedia
EST 06:55
CET 12:55
KST 20:55
  • Home
  • Forum
  • Calendar
  • Streams
  • Liquipedia
  • Features
  • Store
  • EPT
  • TL+
  • StarCraft 2
  • Brood War
  • Smash
  • Heroes
  • Counter-Strike
  • Overwatch
  • Liquibet
  • Fantasy StarCraft
  • TLPD
  • StarCraft 2
  • Brood War
  • Blogs
Forum Sidebar
Events/Features
News
Featured News
RSL Season 3 - Playoffs Preview0RSL Season 3 - RO16 Groups C & D Preview0RSL Season 3 - RO16 Groups A & B Preview2TL.net Map Contest #21: Winners12Intel X Team Liquid Seoul event: Showmatches and Meet the Pros10
Community News
BGE Stara Zagora 2026 announced10[BSL21] Ro.16 Group Stage (C->B->A->D)4Weekly Cups (Nov 17-23): Solar, MaxPax, Clem win3RSL Season 3: RO16 results & RO8 bracket13Weekly Cups (Nov 10-16): Reynor, Solar lead Zerg surge2
StarCraft 2
General
BGE Stara Zagora 2026 announced SC: Evo Complete - Ranked Ladder OPEN ALPHA When will we find out if there are more tournament Weekly Cups (Nov 17-23): Solar, MaxPax, Clem win Weekly Cups (Nov 10-16): Reynor, Solar lead Zerg surge
Tourneys
RSL Revival: Season 3 Constellation Cup - Main Event - Stellar Fest Tenacious Turtle Tussle [Alpha Pro Series] Nice vs Cure $5,000+ WardiTV 2025 Championship
Strategy
Custom Maps
Map Editor closed ?
External Content
Mutation # 501 Price of Progress Mutation # 500 Fright night Mutation # 499 Chilling Adaptation Mutation # 498 Wheel of Misfortune|Cradle of Death
Brood War
General
BW General Discussion Which season is the best in ASL? A cwal.gg Extension - Easily keep track of anyone soO on: FanTaSy's Potential Return to StarCraft BGH Auto Balance -> http://bghmmr.eu/
Tourneys
[Megathread] Daily Proleagues [BSL21] RO16 Group B - Sunday 21:00 CET [BSL21] RO16 Group C - Saturday 21:00 CET Small VOD Thread 2.0
Strategy
Game Theory for Starcraft How to stay on top of macro? Current Meta PvZ map balance
Other Games
General Games
Nintendo Switch Thread The Perfect Game Stormgate/Frost Giant Megathread Beyond All Reason Should offensive tower rushing be viable in RTS games?
Dota 2
Official 'what is Dota anymore' discussion
League of Legends
Heroes of the Storm
Simple Questions, Simple Answers Heroes of the Storm 2.0
Hearthstone
Deck construction bug Heroes of StarCraft mini-set
TL Mafia
Mafia Game Mode Feedback/Ideas TL Mafia Community Thread
Community
General
Russo-Ukrainian War Thread US Politics Mega-thread The Big Programming Thread Things Aren’t Peaceful in Palestine Artificial Intelligence Thread
Fan Clubs
White-Ra Fan Club
Media & Entertainment
[Manga] One Piece Movie Discussion! Anime Discussion Thread
Sports
2024 - 2026 Football Thread Formula 1 Discussion NBA General Discussion MLB/Baseball 2023 TeamLiquid Health and Fitness Initiative For 2023
World Cup 2022
Tech Support
Computer Build, Upgrade & Buying Resource Thread
TL Community
Where to ask questions and add stream? The Automated Ban List
Blogs
Esports Earnings: Bigger Pri…
TrAiDoS
Thanks for the RSL
Hildegard
Saturation point
Uldridge
DnB/metal remix FFO Mick Go…
ImbaTosS
Customize Sidebar...

Website Feedback

Closed Threads



Active: 1142 users

Update: so I made a rating system (and a website) - Page 8

Forum Index > SC2 General
Post a Reply
Prev 1 6 7 8 9 Next All
TheBB
Profile Blog Joined July 2009
Switzerland5133 Posts
Last Edited: 2012-12-17 22:16:44
December 17 2012 22:14 GMT
#141
On December 18 2012 06:47 JohnAdams wrote:
TheBB : http://aligulac.com/players/9/period/73/

Yeah, like Conti said, the numbers listed next to each opponent is the rating of the opponent (in the given matchup) at the time the match was played.

Edit: The traffic stats skyrocket when this thread is bumped at a US-friendly time, lol.
http://aligulac.com || Barcraft Switzerland! || Zerg best race. || Stats-poster extraordinaire.
opterown *
Profile Blog Joined August 2011
Australia54784 Posts
December 17 2012 23:25 GMT
#142
ok hmm after looking at recent results i think you may have them a bit too volatile, haha
ModeratorRetired LR Bonjwa
TL+ Member
TheBB
Profile Blog Joined July 2009
Switzerland5133 Posts
Last Edited: 2012-12-18 12:26:40
December 18 2012 12:25 GMT
#143
On December 18 2012 08:25 opterown wrote:
ok hmm after looking at recent results i think you may have them a bit too volatile, haha

Well, do I have good news for you then.

I made some tweaks today and I think I can make it a bit less volatile without impacting the predictive power. There are four parameters:

– RD (rating deviation) decay. How fast does uncertainy grow when a player doesn't play. Currently 0.01.
– Initial RD. How uncertain is the rating of a new player. Currently set at 0.5.
– Minimal RD. Currently set at 0.13.
– Period length. Currently 14 days. I won't touch this one.

A player's rating changes quickly if his or her RD is high. Thus a large minimal RD will create volatility among "stable" players, a large RD decay will create volatility among players who play less frequently, and a large initial RD will create volatility amont totally new players.

Here is a plot showing the predictive power of the original system.

[image loading]


How did I make this? Well, I went through every game in the training data set (containing almost 50000 games), and computed the ratings at the time the game was played, and assigned it a "slot" corresponding to how certain it was that the assumed stronger player would win. The slots are ranges of probabilities, i.e. 50-55%, 55-60% and so on. This is the "predicted winrate" of the x-axis. The black jagged line shows the actual winrate for each slot, and the dashed black line (slanting the other way) shows the number of games that was associated to each slot.

The dashed blue line shows the linear fit weighted by number of games, and the dashed red line shows the "ideal," namely actual winrate=predicted winrate over the board.

So you can see that the system works pretty well already, but ok, so maybe it's too volatile. Can we fix that?

[image loading]


This uses a higher decay rate and a lower minimum. Essentially this means that we allow the ratings of the most frequently playing players to become "more certain" but that the information of their skill level decays faster when they don't play.

[image loading]


Here I have upped the initial RD to 0.6 to try to fix the slight offset. Right now I think it looks almost perfect.

So this is what will happen. In a week, when the time comes to publish the new list, I will recompute all ratings, using a minimal RD of 0.06, initial RD of 0.6 and RD decay of 0.04.

What you should see is that the ratings of the most frequent players will be much more stable, but the ratings of players who play rarely will become unstable faster than before. Additionally, new players will adjust somewhat quicker than before.

Also, Conti has added a ton of missing SPL games to the database, so hopefully that will help with the Kespa players.
http://aligulac.com || Barcraft Switzerland! || Zerg best race. || Stats-poster extraordinaire.
bittman
Profile Joined February 2011
Australia8759 Posts
December 18 2012 13:15 GMT
#144
Just properly flicked through the site for the first time and I really like the work that's going into it.
Mvp - Leenock - Dongraegu - MC - Gumiho - Keen - Polt - Squirtle - Jjakji - Genius - Seed - Life - sC - Dream || LG-IM - MVP - FXO
myk3
Profile Joined June 2010
Austria80 Posts
December 18 2012 13:46 GMT
#145
Awesome. Nice design too. Good job, thx!
Greenei
Profile Joined November 2011
Germany1754 Posts
December 22 2012 14:24 GMT
#146
is there any way of calculating the new rating and new predictions yourself? with new i mean the "Results for next list" games. could you tell us how to calculate those ratingchanges, so i can do it myself when i need to?
IMBA IMBA IMBA IMBA IMBA IMBA
Grovbolle
Profile Blog Joined July 2011
Denmark3811 Posts
December 22 2012 15:38 GMT
#147
On December 22 2012 23:24 Greenei wrote:
is there any way of calculating the new rating and new predictions yourself? with new i mean the "Results for next list" games. could you tell us how to calculate those ratingchanges, so i can do it myself when i need to?


Hi, I am one of the contributors of games to the site, and currently as far as I know it works in a way where we add data directly to his database, but I am not sure if the functionality/logic is located in an online version, obviously TheBB will be able to tell you, but since not all games are updated the second they are played, you will not have a "clean" rating because some games might not be added yet even though they have been played if you get to update the rating yourself .
Lies, damned lies and statistics: http://aligulac.com
Odoakar
Profile Joined May 2010
Croatia1837 Posts
December 22 2012 15:52 GMT
#148
Is there a way to see ELO for let's say top 10 players through 2 years on the same chart?
Grovbolle
Profile Blog Joined July 2011
Denmark3811 Posts
December 22 2012 16:06 GMT
#149
On December 23 2012 00:52 Odoakar wrote:
Is there a way to see ELO for let's say top 10 players through 2 years on the same chart?

Not currently, but I assume that it is something TheBB would implement when/if he has the time for it
Lies, damned lies and statistics: http://aligulac.com
Iodem
Profile Joined October 2010
United States1173 Posts
December 24 2012 01:38 GMT
#150
http://aligulac.com/periods/73/page-2/

CombatEX is ranked as the second best foreign Protoss (1st being Grubby) xD
If you don't like it, you can quit.
graan
Profile Joined May 2011
Germany589 Posts
December 24 2012 03:10 GMT
#151
im glad you got a nice shout out by tlo at hsc.. im a statistics mayor and love to see some mathematical work, dont get your model over saturated, just stick to your data and keep it simple.. for example the best football predictions are only based on market values, if you base your research on 'upsets' you might get specific results right, but overall it gets off very fast.
ZigguratOfUr
Profile Blog Joined April 2012
Iraq16955 Posts
December 24 2012 03:19 GMT
#152
Some feedback on the predictions:

-The results are much too onesided when comparing mid-tier players, and top players. For example, your prediction about Leenock vs Sting for Fight Club was overwhelmingly in Leenock's favour(97.6%). You know that Sting won, of course, and it was an upset, but not as much of an upset as your prediction made it sound. Starcraft 2 is a game where most top-tier, or mid-tier players can take games of each other seemingly at random. You should probably move the predictions towards the mean.

-The predictions don't seem to take into account head-to-head results, which can somehow defy the players rankings or win-rates in that watchup. For example Goody's win-loss record versus Stephano is 7W-9L, while the (generally considered) much better player, PuMa, is only 2W-6L.

Gfire
Profile Joined March 2011
United States1699 Posts
December 24 2012 03:51 GMT
#153
On December 24 2012 12:19 BrokenMirage wrote:
-The results are much too onesided when comparing mid-tier players, and top players. For example, your prediction about Leenock vs Sting for Fight Club was overwhelmingly in Leenock's favour(97.6%). You know that Sting won, of course, and it was an upset, but not as much of an upset as your prediction made it sound. Starcraft 2 is a game where most top-tier, or mid-tier players can take games of each other seemingly at random. You should probably move the predictions towards the mean.

It should be pushed towards 50/50 if there's a higher uncertainty for the players, shouldn't it?
all's fair in love and melodies
Roonweld
Profile Blog Joined October 2011
United States144 Posts
Last Edited: 2012-12-24 04:12:17
December 24 2012 04:10 GMT
#154
So, this is what a Nate Silver for Starcraft looks like. This will be awesome for my liquibet ranking!
Twitter @RoonSC
TheBB
Profile Blog Joined July 2009
Switzerland5133 Posts
December 25 2012 17:59 GMT
#155
On Thursday when the new list comes I will recompute all the ratings from the start using some different parameters. Hopefully this will help with many of your issues.

is there any way of calculating the new rating and new predictions yourself?

Yeah, but it involves a bit of programming. There is no closed form expression. This feature would be kinda cool to add to the site, I agree.

Is there a way to see ELO for let's say top 10 players through 2 years on the same chart?

Not yet.

The results are much too onesided when comparing mid-tier players, and top players. For example, your prediction about Leenock vs Sting for Fight Club was overwhelmingly in Leenock's favour(97.6%).

This is because the ratings adjust very quickly, so a player on a hot streak will be very highly rated. When the new ratings come on Thursday, they won't be so volatile, so presumably the top will be closer to the mid tier. Maybe.

I don't want to just adjust my predictions toward the mean based on gut feeling. Based on historical data, the assumed stronger player wins almost exactly as many games as he or she should according to the ratings, if not more in some cases.

It should be pushed towards 50/50 if there's a higher uncertainty for the players, shouldn't it?

Yes.

The predictions don't seem to take into account head-to-head results, which can somehow defy the players rankings or win-rates in that watchup.

That's right. There is a simple Bayesian model that can do this, but I need to work out a good way to weigh past results (recent ones vs. older).
http://aligulac.com || Barcraft Switzerland! || Zerg best race. || Stats-poster extraordinaire.
felisconcolori
Profile Blog Joined October 2011
United States6168 Posts
December 25 2012 18:37 GMT
#156
It TheBB the new stats bonjwa?

Great site, I like the layout and feel. Very useful so far while poking at it.
Yes, I email sponsors... to thank them. Don't post drunk, kids. My king, what has become of you?
StarGalaxy
Profile Joined March 2011
Germany744 Posts
December 25 2012 19:02 GMT
#157
wow this is really impressive.
i wish we would had a ladder like that.
Blizzard hire that guy and make it happen!
Cj hero | Zest
sitromit
Profile Joined June 2011
7051 Posts
December 25 2012 19:23 GMT
#158
On December 26 2012 02:59 TheBB wrote:
Show nested quote +
The predictions don't seem to take into account head-to-head results, which can somehow defy the players rankings or win-rates in that watchup.

That's right. There is a simple Bayesian model that can do this, but I need to work out a good way to weigh past results (recent ones vs. older).


I don't think it's a good idea to take head-to-head into consideration, because even though there do seem to be some players who struggle against a particular opponent in a match up where they do quite well otherwise (hello MKP vs Mvp :p), it doesn't seem to be a factor the majority of the time.
slowbacontron
Profile Joined October 2012
United States7722 Posts
December 25 2012 19:27 GMT
#159
On December 26 2012 04:02 OrbitalPlane wrote:
wow this is really impressive.
i wish we would had a ladder like that.
Blizzard hire that guy and make it happen!

Well, Blizzard's matchmaking system on ladder is already extremely good, isn't it?
jjakji fan
StarGalaxy
Profile Joined March 2011
Germany744 Posts
Last Edited: 2012-12-25 23:14:54
December 25 2012 23:09 GMT
#160
On December 26 2012 04:27 slowbacontron wrote:
Show nested quote +
On December 26 2012 04:02 OrbitalPlane wrote:
wow this is really impressive.
i wish we would had a ladder like that.
Blizzard hire that guy and make it happen!

Well, Blizzard's matchmaking system on ladder is already extremely good, isn't it?


the match making is great.
The rating system is horrible. (Even if you take out the bonus pool which inflates the rating.)
It's impossible to track your own development with the blizzard ranking.
Cj hero | Zest
Prev 1 6 7 8 9 Next All
Please log in or register to reply.
Live Events Refresh
Next event in 5m
[ Submit Event ]
Live Streams
Refresh
StarCraft 2
SortOf 282
MindelVK 41
Rex 39
TKL 29
StarCraft: Brood War
Britney 46448
Rain 2319
Horang2 1325
Mini 567
Hyuk 558
Stork 411
BeSt 395
firebathero 375
Larva 374
Last 228
[ Show more ]
Rush 203
ZerO 158
Soulkey 131
Pusan 128
Hyun 126
Zeus 125
hero 93
ggaemo 75
sorry 56
Sharp 49
Barracks 49
Backho 41
Mong 40
Shine 29
scan(afreeca) 12
Noble 11
IntoTheRainbow 9
Aegong 5
Dota 2
qojqva2934
XcaliburYe556
League of Legends
JimRising 438
Counter-Strike
byalli7950
zeus898
oskar117
edward65
Heroes of the Storm
Khaldor206
Other Games
summit1g12399
B2W.Neo846
Fuzer 248
Mew2King186
ArmadaUGS58
nookyyy 57
Pyrionflax42
ZerO(Twitch)12
Organizations
Dota 2
PGL Dota 2 - Main Stream461
StarCraft: Brood War
lovetv 11
StarCraft 2
WardiTV9
Blizzard YouTube
StarCraft: Brood War
BSLTrovo
sctven
[ Show 14 non-featured ]
StarCraft 2
• LUISG 45
• Adnapsc2 10
• AfreecaTV YouTube
• intothetv
• Kozan
• IndyKCrew
• LaughNgamezSOOP
• Migwel
• sooper7s
StarCraft: Brood War
• BSLYoutube
• STPLYoutube
• ZZZeroYoutube
Dota 2
• WagamamaTV220
• lizZardDota2154
Upcoming Events
WardiTV Korean Royale
5m
ByuN vs Cure
TBD vs NightMare
TBD vs Classic
TBD vs Solar
Zoun vs Creator
OSC
5h 5m
Sparkling Tuna Cup
22h 5m
WardiTV Korean Royale
1d
TBD vs SHIN
TBD vs Reynor
TBD vs herO
Replay Cast
1d 12h
Wardi Open
2 days
Monday Night Weeklies
2 days
StarCraft2.fi
2 days
Replay Cast
2 days
Wardi Open
3 days
[ Show More ]
StarCraft2.fi
3 days
PiGosaur Monday
3 days
Wardi Open
4 days
StarCraft2.fi
4 days
Replay Cast
4 days
The PondCast
4 days
Replay Cast
5 days
Korean StarCraft League
6 days
CranKy Ducklings
6 days
Liquipedia Results

Completed

SOOP Univ League 2025
RSL Revival: Season 3
Eternal Conflict S1

Ongoing

C-Race Season 1
IPSL Winter 2025-26
KCM Race Survival 2025 Season 4
YSL S2
BSL Season 21
CSCL: Masked Kings S3
Slon Tour Season 2
META Madness #9
SL Budapest Major 2025
ESL Impact League Season 8
BLAST Rivals Fall 2025
IEM Chengdu 2025
PGL Masters Bucharest 2025
Thunderpick World Champ.
CS Asia Championships 2025
ESL Pro League S22
StarSeries Fall 2025
FISSURE Playground #2

Upcoming

BSL 21 Non-Korean Championship
Acropolis #4
IPSL Spring 2026
Bellum Gens Elite Stara Zagora 2026
HSC XXVIII
RSL Offline Finals
WardiTV 2025
PGL Cluj-Napoca 2026
IEM Kraków 2026
BLAST Bounty Winter 2026
BLAST Bounty Winter 2026: Closed Qualifier
eXTREMESLAND 2025
TLPD

1. ByuN
2. TY
3. Dark
4. Solar
5. Stats
6. Nerchio
7. sOs
8. soO
9. INnoVation
10. Elazer
1. Rain
2. Flash
3. EffOrt
4. Last
5. Bisu
6. Soulkey
7. Mini
8. Sharp
Sidebar Settings...

Advertising | Privacy Policy | Terms Of Use | Contact Us

Original banner artwork: Jim Warren
The contents of this webpage are copyright © 2025 TLnet. All Rights Reserved.