Update: so I made a rating system (and a website) - Page 8

TheBB
Profile Blog Joined July 2009
Switzerland5133 Posts
Last Edited: 2012-12-17 22:16:44
December 17 2012 22:14 GMT
#141
On December 18 2012 06:47 JohnAdams wrote:
TheBB : http://aligulac.com/players/9/period/73/

Yeah, like Conti said, the number listed next to each opponent is the rating of that opponent (in the given matchup) at the time the match was played.

Edit: The traffic stats skyrocket when this thread is bumped at a US-friendly time, lol.
http://aligulac.com || Barcraft Switzerland! || Zerg best race. || Stats-poster extraordinaire.
opterown *
Profile Blog Joined August 2011
Australia54784 Posts
December 17 2012 23:25 GMT
#142
ok hmm after looking at recent results i think you may have them a bit too volatile, haha
ModeratorRetired LR Bonjwa
TL+ Member
TheBB
Profile Blog Joined July 2009
Switzerland5133 Posts
Last Edited: 2012-12-18 12:26:40
December 18 2012 12:25 GMT
#143
On December 18 2012 08:25 opterown wrote:
ok hmm after looking at recent results i think you may have them a bit too volatile, haha

Well, do I have good news for you then.

I made some tweaks today and I think I can make it a bit less volatile without impacting the predictive power. There are four parameters:

– RD (rating deviation) decay. How fast does uncertainty grow when a player doesn't play? Currently 0.01.
– Initial RD. How uncertain is the rating of a new player? Currently set at 0.5.
– Minimal RD. Currently set at 0.13.
– Period length. Currently 14 days. I won't touch this one.

A player's rating changes quickly if his or her RD is high. Thus a large minimal RD will create volatility among "stable" players, a large RD decay will create volatility among players who play less frequently, and a large initial RD will create volatility among totally new players.

Here is a plot showing the predictive power of the original system.

[Plot: actual winrate vs. predicted winrate for the original system]

How did I make this? Well, I went through every game in the training data set (containing almost 50,000 games), computed the ratings at the time the game was played, and assigned the game a "slot" corresponding to how certain it was that the assumed stronger player would win. The slots are ranges of probabilities, i.e. 50-55%, 55-60% and so on. This is the "predicted winrate" of the x-axis. The black jagged line shows the actual winrate for each slot, and the dashed black line (slanting the other way) shows the number of games that were associated with each slot.

The dashed blue line shows the linear fit weighted by number of games, and the dashed red line shows the "ideal," namely actual winrate = predicted winrate across the board.
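The procedure described above can be sketched roughly as follows. This is only an illustration, not the code behind the plot: the function names, the input format (pairs of predicted probability and outcome) and the tiny sample data are all assumptions made for the example.

```python
from collections import defaultdict


def calibration_slots(games, width=0.05):
    """Group games into predicted-winrate slots and compute actual winrates.

    `games` is an iterable of (p_favourite, favourite_won) pairs, where
    p_favourite >= 0.5 is the predicted win probability of the assumed
    stronger player. Returns (slot_midpoint, actual_winrate, n_games) triples.
    """
    wins = defaultdict(int)
    counts = defaultdict(int)
    n_slots = round(0.5 / width)
    for p, favourite_won in games:
        # Slot 0 covers 50-55%, slot 1 covers 55-60%, and so on.
        # The small epsilon guards against float error at slot boundaries.
        slot = min(int((p - 0.5) / width + 1e-9), n_slots - 1)
        counts[slot] += 1
        wins[slot] += 1 if favourite_won else 0
    return [
        (0.5 + (slot + 0.5) * width, wins[slot] / counts[slot], counts[slot])
        for slot in sorted(counts)
    ]


def weighted_linear_fit(points):
    """Weighted least-squares line y = a + b*x through (x, y, weight) triples."""
    total = sum(w for _, _, w in points)
    mean_x = sum(w * x for x, _, w in points) / total
    mean_y = sum(w * y for _, y, w in points) / total
    sxx = sum(w * (x - mean_x) ** 2 for x, _, w in points)
    sxy = sum(w * (x - mean_x) * (y - mean_y) for x, y, w in points)
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


# Made-up sample data, purely to show the call pattern.
games = [(0.52, True), (0.53, False), (0.58, True),
         (0.57, False), (0.56, True), (0.91, True)]
slots = calibration_slots(games)
intercept, slope = weighted_linear_fit(slots)
for midpoint, winrate, n in slots:
    print(f"predicted ~{midpoint:.3f}: actual {winrate:.3f} over {n} games")
```

A well-calibrated system gives a fitted line close to intercept 0 and slope 1, which is what the dashed red "ideal" line represents.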

So you can see that the system works pretty well already, but ok, so maybe it's too volatile. Can we fix that?

[Plot: actual winrate vs. predicted winrate with a higher RD decay and a lower minimal RD]

This uses a higher decay rate and a lower minimum. Essentially this means that we allow the ratings of the most frequently playing players to become "more certain" but that the information about their skill level decays faster when they don't play.

[Plot: actual winrate vs. predicted winrate with the initial RD raised to 0.6]

Here I have upped the initial RD to 0.6 to try to fix the slight offset. Right now I think it looks almost perfect.

So this is what will happen. In a week, when the time comes to publish the new list, I will recompute all ratings, using a minimal RD of 0.06, initial RD of 0.6 and RD decay of 0.04.

What you should see is that the ratings of the most frequent players will be much more stable, but the ratings of players who play rarely will become unstable faster than before. Additionally, new players will adjust somewhat quicker than before.
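To get a feel for the old and new parameter sets, here is a small sketch of how RD might grow for an inactive player. The post does not give the decay formula, so this assumes a Glicko-style rule where the decay is added in quadrature each period and the result is capped at the initial RD; the function name and dictionary layout are made up for the example.

```python
import math


def rd_after_inactivity(rd, decay, initial_rd, periods):
    """RD after `periods` rating periods without games (assumed Glicko-style)."""
    for _ in range(periods):
        # Assumption: uncertainty grows in quadrature, capped at the initial RD.
        rd = min(math.sqrt(rd ** 2 + decay ** 2), initial_rd)
    return rd


# Parameter values quoted in the post; the starting point is the minimal RD,
# i.e. a player who had been playing very frequently.
OLD = {"min_rd": 0.13, "initial_rd": 0.5, "decay": 0.01}
NEW = {"min_rd": 0.06, "initial_rd": 0.6, "decay": 0.04}

for periods in (0, 5, 10, 26):
    old = rd_after_inactivity(OLD["min_rd"], OLD["decay"], OLD["initial_rd"], periods)
    new = rd_after_inactivity(NEW["min_rd"], NEW["decay"], NEW["initial_rd"], periods)
    print(f"{periods:2d} idle periods: old RD {old:.3f}, new RD {new:.3f}")
```

Under this assumed rule, an active player starts out much more certain with the new parameters, but the new RD overtakes the old one after roughly nine idle periods (about four months at 14 days per period).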

Also, Conti has added a ton of missing SPL games to the database, so hopefully that will help with the Kespa players.
http://aligulac.com || Barcraft Switzerland! || Zerg best race. || Stats-poster extraordinaire.
bittman
Profile Joined February 2011
Australia8759 Posts
December 18 2012 13:15 GMT
#144
Just properly flicked through the site for the first time and I really like the work that's going into it.
Mvp - Leenock - Dongraegu - MC - Gumiho - Keen - Polt - Squirtle - Jjakji - Genius - Seed - Life - sC - Dream || LG-IM - MVP - FXO
myk3
Profile Joined June 2010
Austria80 Posts
December 18 2012 13:46 GMT
#145
Awesome. Nice design too. Good job, thx!
Greenei
Profile Joined November 2011
Germany1754 Posts
December 22 2012 14:24 GMT
#146
is there any way of calculating the new rating and new predictions yourself? by new i mean the "Results for next list" games. could you tell us how to calculate those rating changes, so i can do it myself when i need to?
IMBA IMBA IMBA IMBA IMBA IMBA
Grovbolle
Profile Blog Joined July 2011
Denmark3805 Posts
December 22 2012 15:38 GMT
#147
On December 22 2012 23:24 Greenei wrote:
is there any way of calculating the new rating and new predictions yourself? by new i mean the "Results for next list" games. could you tell us how to calculate those rating changes, so i can do it myself when i need to?


Hi, I am one of the contributors of games to the site. As far as I know, we currently add data directly to his database, but I am not sure whether the rating logic is available in an online version; TheBB will be able to tell you. Note that not all games are added the second they are played, so if you update the ratings yourself you will not get a "clean" rating, because some games that have been played might not be in the database yet.
Lies, damned lies and statistics: http://aligulac.com
Odoakar
Profile Joined May 2010
Croatia1836 Posts
December 22 2012 15:52 GMT
#148
Is there a way to see ELO for let's say top 10 players through 2 years on the same chart?
Grovbolle
Profile Blog Joined July 2011
Denmark3805 Posts
December 22 2012 16:06 GMT
#149
On December 23 2012 00:52 Odoakar wrote:
Is there a way to see ELO for let's say top 10 players through 2 years on the same chart?

Not currently, but I assume that it is something TheBB would implement when/if he has the time for it
Lies, damned lies and statistics: http://aligulac.com
Iodem
Profile Joined October 2010
United States1173 Posts
December 24 2012 01:38 GMT
#150
http://aligulac.com/periods/73/page-2/

CombatEX is ranked as the second best foreign Protoss (1st being Grubby) xD
If you don't like it, you can quit.
graan
Profile Joined May 2011
Germany589 Posts
December 24 2012 03:10 GMT
#151
im glad you got a nice shout out by tlo at hsc.. im a statistics major and love to see some mathematical work, dont get your model over saturated, just stick to your data and keep it simple.. for example the best football predictions are only based on market values, if you base your research on 'upsets' you might get specific results right, but overall it gets off very fast.
ZigguratOfUr
Profile Blog Joined April 2012
Iraq16955 Posts
December 24 2012 03:19 GMT
#152
Some feedback on the predictions:

-The results are much too one-sided when comparing mid-tier players and top players. For example, your prediction about Leenock vs Sting for Fight Club was overwhelmingly in Leenock's favour (97.6%). You know that Sting won, of course, and it was an upset, but not as much of an upset as your prediction made it sound. Starcraft 2 is a game where most top-tier or mid-tier players can take games off each other seemingly at random. You should probably move the predictions towards the mean.

-The predictions don't seem to take into account head-to-head results, which can somehow defy the players' rankings or win-rates in that matchup. For example Goody's win-loss record versus Stephano is 7W-9L, while the (generally considered) much better player, PuMa, is only 2W-6L.

Gfire
Profile Joined March 2011
United States1699 Posts
December 24 2012 03:51 GMT
#153
On December 24 2012 12:19 BrokenMirage wrote:
-The results are much too one-sided when comparing mid-tier players and top players. For example, your prediction about Leenock vs Sting for Fight Club was overwhelmingly in Leenock's favour (97.6%). You know that Sting won, of course, and it was an upset, but not as much of an upset as your prediction made it sound. Starcraft 2 is a game where most top-tier or mid-tier players can take games off each other seemingly at random. You should probably move the predictions towards the mean.

It should be pushed towards 50/50 if there's a higher uncertainty for the players, shouldn't it?
all's fair in love and melodies
Roonweld
Profile Blog Joined October 2011
United States144 Posts
Last Edited: 2012-12-24 04:12:17
December 24 2012 04:10 GMT
#154
So, this is what a Nate Silver for Starcraft looks like. This will be awesome for my liquibet ranking!
Twitter @RoonSC
TheBB
Profile Blog Joined July 2009
Switzerland5133 Posts
December 25 2012 17:59 GMT
#155
On Thursday when the new list comes I will recompute all the ratings from the start using some different parameters. Hopefully this will help with many of your issues.

is there any way of calculating the new rating and new predictions yourself?

Yeah, but it involves a bit of programming. There is no closed form expression. This feature would be kinda cool to add to the site, I agree.

Is there a way to see ELO for let's say top 10 players through 2 years on the same chart?

Not yet.

The results are much too one-sided when comparing mid-tier players and top players. For example, your prediction about Leenock vs Sting for Fight Club was overwhelmingly in Leenock's favour (97.6%).

This is because the ratings adjust very quickly, so a player on a hot streak will be very highly rated. When the new ratings come on Thursday, they won't be so volatile, so presumably the top will be closer to the mid tier. Maybe.

I don't want to just adjust my predictions toward the mean based on gut feeling. Based on historical data, the assumed stronger player wins almost exactly as many games as he or she should according to the ratings, if not more in some cases.

It should be pushed towards 50/50 if there's a higher uncertainty for the players, shouldn't it?

Yes.
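As a rough illustration of why higher uncertainty pushes a prediction towards 50/50, here is a sketch using a normal (probit) model in which both players' RDs widen the spread of the rating difference. The formula is an assumption made for the example, not something stated in this thread, and the ratings are hypothetical numbers on the same scale as the RD values.

```python
import math


def win_probability(rating_a, rd_a, rating_b, rd_b):
    """P(A beats B) under an assumed normal (probit) model."""
    # Assumption: each player's RD widens the spread of the rating difference.
    scale = math.sqrt(1.0 + rd_a ** 2 + rd_b ** 2)
    z = (rating_a - rating_b) / scale
    # Standard normal CDF written with the error function.
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


# Same rating gap, once with well-known players and once with uncertain ones.
confident = win_probability(1.0, 0.06, 0.0, 0.06)
uncertain = win_probability(1.0, 0.5, 0.0, 0.5)
print(f"low RD:  {confident:.3f}")
print(f"high RD: {uncertain:.3f}")
```

With the same rating gap, the high-RD pairing comes out closer to 50% than the low-RD pairing, which is the behaviour being asked about.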

The predictions don't seem to take into account head-to-head results, which can somehow defy the players' rankings or win-rates in that matchup.

That's right. There is a simple Bayesian model that can do this, but I need to work out a good way to weigh past results (recent ones vs. older).
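One possible shape for such a model, shown only as a sketch: treat the rating-based prediction as the mean of a Beta prior and add past head-to-head games as evidence, with older games weighted down exponentially. This is not the model TheBB has in mind (the thread does not describe it); the function name, `prior_strength` and `half_life` are all hypothetical.

```python
def head_to_head_adjusted(p_rating, results, prior_strength=10.0, half_life=180.0):
    """Blend a rating-based win probability with decayed head-to-head results.

    `results` is a list of (days_ago, a_won) pairs for past games between the
    two players. All parameter names and defaults are hypothetical.
    """
    # Beta prior whose mean equals the rating-based prediction.
    alpha = p_rating * prior_strength
    beta = (1.0 - p_rating) * prior_strength
    for days_ago, a_won in results:
        # Older games count less: the weight halves every `half_life` days.
        weight = 0.5 ** (days_ago / half_life)
        if a_won:
            alpha += weight
        else:
            beta += weight
    # Posterior mean of the Beta distribution.
    return alpha / (alpha + beta)


# A favourite at 70% who has recently lost three times to this opponent.
recent_losses = [(10, False), (20, False), (30, False)]
old_losses = [(700, False), (710, False), (720, False)]
print(f"recent losses: {head_to_head_adjusted(0.7, recent_losses):.3f}")
print(f"old losses:    {head_to_head_adjusted(0.7, old_losses):.3f}")
```

The `half_life` parameter is exactly the open question raised above: how quickly old results should stop mattering. A larger `prior_strength` keeps the prediction closer to what the ratings alone say.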
http://aligulac.com || Barcraft Switzerland! || Zerg best race. || Stats-poster extraordinaire.
felisconcolori
Profile Blog Joined October 2011
United States6168 Posts
December 25 2012 18:37 GMT
#156
Is TheBB the new stats bonjwa?

Great site, I like the layout and feel. Very useful so far while poking at it.
Yes, I email sponsors... to thank them. Don't post drunk, kids. My king, what has become of you?
StarGalaxy
Profile Joined March 2011
Germany744 Posts
December 25 2012 19:02 GMT
#157
wow this is really impressive.
i wish we had a ladder like that.
Blizzard hire that guy and make it happen!
Cj hero | Zest
sitromit
Profile Joined June 2011
7051 Posts
December 25 2012 19:23 GMT
#158
On December 26 2012 02:59 TheBB wrote:
The predictions don't seem to take into account head-to-head results, which can somehow defy the players' rankings or win-rates in that matchup.

That's right. There is a simple Bayesian model that can do this, but I need to work out a good way to weigh past results (recent ones vs. older).


I don't think it's a good idea to take head-to-head into consideration, because even though there do seem to be some players who struggle against a particular opponent in a match up where they do quite well otherwise (hello MKP vs Mvp :p), it doesn't seem to be a factor the majority of the time.
slowbacontron
Profile Joined October 2012
United States7722 Posts
December 25 2012 19:27 GMT
#159
On December 26 2012 04:02 OrbitalPlane wrote:
wow this is really impressive.
i wish we had a ladder like that.
Blizzard hire that guy and make it happen!

Well, Blizzard's matchmaking system on ladder is already extremely good, isn't it?
jjakji fan
StarGalaxy
Profile Joined March 2011
Germany744 Posts
Last Edited: 2012-12-25 23:14:54
December 25 2012 23:09 GMT
#160
On December 26 2012 04:27 slowbacontron wrote:
On December 26 2012 04:02 OrbitalPlane wrote:
wow this is really impressive.
i wish we had a ladder like that.
Blizzard hire that guy and make it happen!

Well, Blizzard's matchmaking system on ladder is already extremely good, isn't it?


the match making is great.
The rating system is horrible. (Even if you take out the bonus pool which inflates the rating.)
It's impossible to track your own development with the blizzard ranking.
Cj hero | Zest