Update: so I made a rating system (and a website) - Page 8

TheBB
Profile Blog Joined July 2009
Switzerland5133 Posts
Last Edited: 2012-12-17 22:16:44
December 17 2012 22:14 GMT
#141
On December 18 2012 06:47 JohnAdams wrote:
TheBB : http://aligulac.com/players/9/period/73/

Yeah, like Conti said, the number listed next to each opponent is the rating of that opponent (in the given matchup) at the time the match was played.

Edit: The traffic stats skyrocket when this thread is bumped at a US-friendly time, lol.
http://aligulac.com || Barcraft Switzerland! || Zerg best race. || Stats-poster extraordinaire.
opterown *
Profile Blog Joined August 2011
Australia54784 Posts
December 17 2012 23:25 GMT
#142
ok hmm after looking at recent results i think you may have them a bit too volatile, haha
ModeratorRetired LR Bonjwa
TL+ Member
TheBB
Profile Blog Joined July 2009
Switzerland5133 Posts
Last Edited: 2012-12-18 12:26:40
December 18 2012 12:25 GMT
#143
On December 18 2012 08:25 opterown wrote:
ok hmm after looking at recent results i think you may have them a bit too volatile, haha

Well, do I have good news for you then.

I made some tweaks today and I think I can make it a bit less volatile without impacting the predictive power. There are four parameters:

– RD (rating deviation) decay. How fast does uncertainty grow when a player doesn't play? Currently 0.01.
– Initial RD. How uncertain is the rating of a new player? Currently set at 0.5.
– Minimal RD. Currently set at 0.13.
– Period length. Currently 14 days. I won't touch this one.

A player's rating changes quickly if his or her RD is high. Thus a large minimal RD will create volatility among "stable" players, a large RD decay will create volatility among players who play less frequently, and a large initial RD will create volatility among totally new players.
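
A rough sketch of how the three RD parameters could interact, for anyone who wants to play with the numbers. The update rule here is a Glicko-style assumption and not the site's actual code; `rd_after_idle`, `rd_after_playing` and the `shrink` factor are hypothetical names. Only the three constants come from the list above.

```python
import math

# Values quoted in the post; the update rules below are assumptions.
RD_DECAY = 0.01
INITIAL_RD = 0.5
MINIMAL_RD = 0.13


def rd_after_idle(rd, idle_periods, decay=RD_DECAY, initial_rd=INITIAL_RD):
    """Grow RD for every period without games (Glicko-style, assumed),
    capped at the RD of a brand-new player."""
    grown = math.sqrt(rd ** 2 + idle_periods * decay ** 2)
    return min(grown, initial_rd)


def rd_after_playing(rd, shrink, minimal_rd=MINIMAL_RD):
    """Shrink RD after a period with games, but never below the floor.
    `shrink` is a hypothetical factor in (0, 1]; in a real system it
    would depend on how many games were played and against whom."""
    return max(rd * shrink, minimal_rd)
```

With these rules a larger decay makes idle players uncertain faster, and a smaller floor lets very active players settle further before their ratings stop tightening.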

Here is a plot showing the predictive power of the original system.

[image loading]


How did I make this? Well, I went through every game in the training data set (containing almost 50000 games), computed the ratings at the time the game was played, and assigned it a "slot" corresponding to how certain it was that the assumed stronger player would win. The slots are ranges of probabilities, i.e. 50-55%, 55-60% and so on. This is the "predicted winrate" of the x-axis. The black jagged line shows the actual winrate for each slot, and the dashed black line (slanting the other way) shows the number of games that were associated with each slot.

The dashed blue line shows the linear fit weighted by number of games, and the dashed red line shows the "ideal," namely actual winrate=predicted winrate across the board.
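
The slotting and the weighted fit described above can be sketched roughly like this. It is a minimal illustration, not the script that produced the plot; `calibration_table` and `weighted_linear_fit` are made-up names, and the input is assumed to be a list of (predicted probability for the favourite, did the favourite win) pairs.

```python
import math
from collections import defaultdict


def calibration_table(games, slot_width=0.05):
    """games: iterable of (p_favourite, favourite_won), p_favourite >= 0.5.
    Returns a sorted list of (slot_midpoint, n_games, actual_winrate)."""
    n_slots = round(0.5 / slot_width)
    counts = defaultdict(int)
    wins = defaultdict(int)
    for p, favourite_won in games:
        # Slot 0 is 50-55%, slot 1 is 55-60%, ...; p == 1.0 goes in the last slot.
        # The small epsilon guards against float error at slot boundaries.
        idx = min(math.floor((p - 0.5) / slot_width + 1e-9), n_slots - 1)
        counts[idx] += 1
        wins[idx] += 1 if favourite_won else 0
    return [
        (0.5 + (idx + 0.5) * slot_width, counts[idx], wins[idx] / counts[idx])
        for idx in sorted(counts)
    ]


def weighted_linear_fit(xs, ys, ws):
    """Weighted least-squares line y = slope * x + intercept."""
    total = sum(ws)
    mean_x = sum(w * x for x, w in zip(xs, ws)) / total
    mean_y = sum(w * y for y, w in zip(ys, ws)) / total
    sxx = sum(w * (x - mean_x) ** 2 for x, w in zip(xs, ws))
    sxy = sum(w * (x - mean_x) * (y - mean_y) for x, y, w in zip(xs, ys, ws))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x
```

Feeding the slot midpoints, actual winrates and game counts from `calibration_table` into `weighted_linear_fit` gives the weighted line; a well-calibrated system would land close to slope 1 and intercept 0.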

So you can see that the system works pretty well already, but ok, so maybe it's too volatile. Can we fix that?

[image loading]


This uses a higher decay rate and a lower minimum. Essentially this means that we allow the ratings of the most frequently playing players to become "more certain" but that the information of their skill level decays faster when they don't play.

[image loading]


Here I have upped the initial RD to 0.6 to try to fix the slight offset. Right now I think it looks almost perfect.

So this is what will happen. In a week, when the time comes to publish the new list, I will recompute all ratings, using a minimal RD of 0.06, initial RD of 0.6 and RD decay of 0.04.

What you should see is that the ratings of the most frequent players will be much more stable, but the ratings of players who play rarely will become unstable faster than before. Additionally, new players will adjust somewhat quicker than before.

Also, Conti has added a ton of missing SPL games to the database, so hopefully that will help with the Kespa players.
http://aligulac.com || Barcraft Switzerland! || Zerg best race. || Stats-poster extraordinaire.
bittman
Profile Joined February 2011
Australia8759 Posts
December 18 2012 13:15 GMT
#144
Just properly flicked through the site for the first time and I really like the work that's going into it.
Mvp - Leenock - Dongraegu - MC - Gumiho - Keen - Polt - Squirtle - Jjakji - Genius - Seed - Life - sC - Dream || LG-IM - MVP - FXO
myk3
Profile Joined June 2010
Austria80 Posts
December 18 2012 13:46 GMT
#145
Awesome. Nice design too. Good job, thx!
Greenei
Profile Joined November 2011
Germany1754 Posts
December 22 2012 14:24 GMT
#146
is there any way of calculating the new rating and new predictions yourself? with new i mean the "Results for next list" games. could you tell us how to calculate those ratingchanges, so i can do it myself when i need to?
IMBA IMBA IMBA IMBA IMBA IMBA
Grovbolle
Profile Blog Joined July 2011
Denmark3813 Posts
December 22 2012 15:38 GMT
#147
On December 22 2012 23:24 Greenei wrote:
is there any way of calculating the new rating and new predictions yourself? with new i mean the "Results for next list" games. could you tell us how to calculate those ratingchanges, so i can do it myself when i need to?


Hi, I am one of the contributors of games to the site. As far as I know, we currently add data directly to his database, but I am not sure whether the functionality/logic is available in an online version; TheBB will be able to tell you. Also, since not all games are added the second they are played, a rating you update yourself will not be "clean": some played games might not be in the database yet.
Lies, damned lies and statistics: http://aligulac.com
Odoakar
Profile Joined May 2010
Croatia1837 Posts
December 22 2012 15:52 GMT
#148
Is there a way to see ELO for let's say top 10 players through 2 years on the same chart?
Grovbolle
Profile Blog Joined July 2011
Denmark3813 Posts
December 22 2012 16:06 GMT
#149
On December 23 2012 00:52 Odoakar wrote:
Is there a way to see ELO for let's say top 10 players through 2 years on the same chart?

Not currently, but I assume that it is something TheBB would implement when/if he has the time for it
Lies, damned lies and statistics: http://aligulac.com
Iodem
Profile Joined October 2010
United States1173 Posts
December 24 2012 01:38 GMT
#150
http://aligulac.com/periods/73/page-2/

CombatEX is ranked as the second best foreign Protoss (1st being Grubby) xD
If you don't like it, you can quit.
graan
Profile Joined May 2011
Germany589 Posts
December 24 2012 03:10 GMT
#151
im glad you got a nice shout out by tlo at hsc.. im a statistics major and love to see some mathematical work, dont get your model over saturated, just stick to your data and keep it simple.. for example the best football predictions are only based on market values, if you base your research on 'upsets' you might get specific results right, but overall it gets off very fast.
ZigguratOfUr
Profile Blog Joined April 2012
Iraq16955 Posts
December 24 2012 03:19 GMT
#152
Some feedback on the predictions:

-The results are much too one-sided when comparing mid-tier players and top players. For example, your prediction about Leenock vs Sting for Fight Club was overwhelmingly in Leenock's favour (97.6%). You know that Sting won, of course, and it was an upset, but not as much of an upset as your prediction made it sound. Starcraft 2 is a game where most top-tier or mid-tier players can take games off each other seemingly at random. You should probably move the predictions towards the mean.

-The predictions don't seem to take into account head-to-head results, which can somehow defy the players' rankings or win-rates in that matchup. For example, Goody's win-loss record versus Stephano is 7W-9L, while the (generally considered) much better player, PuMa, is only 2W-6L.

Gfire
Profile Joined March 2011
United States1699 Posts
December 24 2012 03:51 GMT
#153
On December 24 2012 12:19 BrokenMirage wrote:
-The results are much too one-sided when comparing mid-tier players and top players. For example, your prediction about Leenock vs Sting for Fight Club was overwhelmingly in Leenock's favour (97.6%). You know that Sting won, of course, and it was an upset, but not as much of an upset as your prediction made it sound. Starcraft 2 is a game where most top-tier or mid-tier players can take games off each other seemingly at random. You should probably move the predictions towards the mean.

It should be pushed towards 50/50 if there's a higher uncertainty for the players, shouldn't it?
all's fair in love and melodies
Roonweld
Profile Blog Joined October 2011
United States144 Posts
Last Edited: 2012-12-24 04:12:17
December 24 2012 04:10 GMT
#154
So, this is what a Nate Silver for Starcraft looks like. This will be awesome for my liquibet ranking!
Twitter @RoonSC
TheBB
Profile Blog Joined July 2009
Switzerland5133 Posts
December 25 2012 17:59 GMT
#155
On Thursday when the new list comes I will recompute all the ratings from the start using some different parameters. Hopefully this will help with many of your issues.

is there any way of calculating the new rating and new predictions yourself?

Yeah, but it involves a bit of programming. There is no closed form expression. This feature would be kinda cool to add to the site, I agree.

Is there a way to see ELO for let's say top 10 players through 2 years on the same chart?

Not yet.

The results are much too one-sided when comparing mid-tier players and top players. For example, your prediction about Leenock vs Sting for Fight Club was overwhelmingly in Leenock's favour (97.6%).

This is because the ratings adjust very quickly, so a player on a hot streak will be very highly rated. When the new ratings come on Thursday, they won't be so volatile, so presumably the top will be closer to the mid tier. Maybe.

I don't want to just adjust my predictions toward the mean based on gut feeling. Based on historical data, the assumed stronger player wins almost exactly as many games as he or she should according to the ratings, if not more in some cases.

It should be pushed towards 50/50 if there's a higher uncertainty for the players, shouldn't it?

Yes.
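
To illustrate the point with numbers: under an assumed normal performance model (not necessarily the formula the site uses), the two players' RDs widen the spread of the rating difference, so the same rating gap gives a prediction closer to 50/50 when the ratings are less certain. `win_probability` is a hypothetical helper, and the unit base spread is an arbitrary choice for the sketch.

```python
import math


def win_probability(rating_a, rating_b, rd_a, rd_b):
    """P(A beats B) under an assumed normal model.
    Ratings and RDs share one scale; larger RDs inflate the spread and
    pull the result toward 0.5."""
    spread = math.sqrt(1.0 + rd_a ** 2 + rd_b ** 2)
    z = (rating_a - rating_b) / spread
    # Standard normal CDF written with the error function.
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))
```

For a fixed rating gap, calling it with RDs of 0.6 instead of 0.06 returns a value nearer to 0.5, which is the behaviour asked about in the quote.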

The predictions don't seem to take into account head-to-head results, which can somehow defy the players' rankings or win-rates in that matchup.

That's right. There is a simple Bayesian model that can do this, but I need to work out a good way to weigh past results (recent ones vs. older).
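
One possible shape for such a model, purely as an illustration and not necessarily the one meant here: treat the rating-based prediction as a Beta prior and add past head-to-head games as evidence, with older games discounted exponentially. The function name, `prior_strength` and `half_life_days` are all assumptions picked for the sketch.

```python
def head_to_head_adjusted(p_rating, results, prior_strength=20.0,
                          half_life_days=180.0):
    """Blend a rating-based win probability with head-to-head history.

    p_rating: P(A beats B) from the ratings alone.
    results: list of (days_ago, a_won) for past games between A and B.
    prior_strength: pseudo-game count given to the rating prediction (assumed).
    half_life_days: age at which a game counts half as much (assumed).
    """
    alpha = p_rating * prior_strength
    beta = (1.0 - p_rating) * prior_strength
    for days_ago, a_won in results:
        weight = 0.5 ** (days_ago / half_life_days)
        if a_won:
            alpha += weight
        else:
            beta += weight
    # Posterior mean of the Beta distribution.
    return alpha / (alpha + beta)
```

The half-life is exactly the open question of how to weigh recent results against older ones: a short half-life reacts to the latest series, a long one treats the whole history almost equally.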
http://aligulac.com || Barcraft Switzerland! || Zerg best race. || Stats-poster extraordinaire.
felisconcolori
Profile Blog Joined October 2011
United States6168 Posts
December 25 2012 18:37 GMT
#156
Is TheBB the new stats bonjwa?

Great site, I like the layout and feel. Very useful so far while poking at it.
Yes, I email sponsors... to thank them. Don't post drunk, kids. My king, what has become of you?
StarGalaxy
Profile Joined March 2011
Germany744 Posts
December 25 2012 19:02 GMT
#157
wow this is really impressive.
i wish we had a ladder like that.
Blizzard hire that guy and make it happen!
Cj hero | Zest
sitromit
Profile Joined June 2011
7051 Posts
December 25 2012 19:23 GMT
#158
On December 26 2012 02:59 TheBB wrote:
The predictions don't seem to take into account head-to-head results, which can somehow defy the players' rankings or win-rates in that matchup.

That's right. There is a simple Bayesian model that can do this, but I need to work out a good way to weigh past results (recent ones vs. older).


I don't think it's a good idea to take head-to-head into consideration, because even though there do seem to be some players who struggle against a particular opponent in a match up where they do quite well otherwise (hello MKP vs Mvp :p), it doesn't seem to be a factor the majority of the time.
slowbacontron
Profile Joined October 2012
United States7722 Posts
December 25 2012 19:27 GMT
#159
On December 26 2012 04:02 OrbitalPlane wrote:
wow this is really impressive.
i wish we had a ladder like that.
Blizzard hire that guy and make it happen!

Well, Blizzard's matchmaking system on ladder is already extremely good, isn't it?
jjakji fan
StarGalaxy
Profile Joined March 2011
Germany744 Posts
Last Edited: 2012-12-25 23:14:54
December 25 2012 23:09 GMT
#160
On December 26 2012 04:27 slowbacontron wrote:
On December 26 2012 04:02 OrbitalPlane wrote:
wow this is really impressive.
i wish we had a ladder like that.
Blizzard hire that guy and make it happen!

Well, Blizzard's matchmaking system on ladder is already extremely good, isn't it?


the match making is great.
The rating system is horrible. (Even if you take out the bonus pool which inflates the rating.)
It's impossible to track your own development with the blizzard ranking.
Cj hero | Zest