• Log InLog In
  • Register
Liquid`
Team Liquid Liquipedia
EDT 15:29
CEST 21:29
KST 04:29
  • Home
  • Forum
  • Calendar
  • Streams
  • Liquipedia
  • Features
  • Store
  • EPT
  • TL+
  • StarCraft 2
  • Brood War
  • Smash
  • Heroes
  • Counter-Strike
  • Overwatch
  • Liquibet
  • Fantasy StarCraft
  • TLPD
  • StarCraft 2
  • Brood War
  • Blogs
Forum Sidebar
Events/Features
News
Featured News
[ASL19] Finals Recap: Standing Tall9HomeStory Cup 27 - Info & Preview18Classic wins Code S Season 2 (2025)16Code S RO4 & Finals Preview: herO, Rogue, Classic, GuMiho0TL Team Map Contest #5: Presented by Monster Energy6
Community News
Flash Announces Hiatus From ASL47Weekly Cups (June 23-29): Reynor in world title form?12FEL Cracov 2025 (July 27) - $8000 live event16Esports World Cup 2025 - Final Player Roster16Weekly Cups (June 16-22): Clem strikes back1
StarCraft 2
General
The SCII GOAT: A statistical Evaluation Statistics for vetoed/disliked maps Esports World Cup 2025 - Final Player Roster How does the number of casters affect your enjoyment of esports? Weekly Cups (June 23-29): Reynor in world title form?
Tourneys
RSL: Revival, a new crowdfunded tournament series [GSL 2025] Code S: Season 2 - Semi Finals & Finals $5,100+ SEL Season 2 Championship (SC: Evo) FEL Cracov 2025 (July 27) - $8000 live event HomeStory Cup 27 (June 27-29)
Strategy
How did i lose this ZvP, whats the proper response Simple Questions Simple Answers
Custom Maps
[UMS] Zillion Zerglings
External Content
Mutation # 480 Moths to the Flame Mutation # 479 Worn Out Welcome Mutation # 478 Instant Karma Mutation # 477 Slow and Steady
Brood War
General
Flash Announces Hiatus From ASL [ASL19] Finals Recap: Standing Tall BGH Auto Balance -> http://bghmmr.eu/ Player “Jedi” cheat on CSL Help: rep cant save
Tourneys
Small VOD Thread 2.0 [Megathread] Daily Proleagues [BSL20] GosuLeague RO16 - Tue & Wed 20:00+CET The Casual Games of the Week Thread
Strategy
Simple Questions, Simple Answers I am doing this better than progamers do.
Other Games
General Games
Stormgate/Frost Giant Megathread Nintendo Switch Thread Path of Exile What do you want from future RTS games? Beyond All Reason
Dota 2
Official 'what is Dota anymore' discussion
League of Legends
Heroes of the Storm
Simple Questions, Simple Answers Heroes of the Storm 2.0
Hearthstone
Heroes of StarCraft mini-set
TL Mafia
TL Mafia Community Thread Vanilla Mini Mafia
Community
General
Things Aren’t Peaceful in Palestine US Politics Mega-thread Russo-Ukrainian War Thread Trading/Investing Thread The Games Industry And ATVI
Fan Clubs
SKT1 Classic Fan Club! Maru Fan Club
Media & Entertainment
Anime Discussion Thread [Manga] One Piece [\m/] Heavy Metal Thread
Sports
2024 - 2025 Football Thread NBA General Discussion Formula 1 Discussion TeamLiquid Health and Fitness Initiative For 2023 NHL Playoffs 2024
World Cup 2022
Tech Support
Computer Build, Upgrade & Buying Resource Thread
TL Community
Blogs
Culture Clash in Video Games…
TrAiDoS
from making sc maps to makin…
Husyelt
Blog #2
tankgirl
StarCraft improvement
iopq
Trip to the Zoo
micronesia
Customize Sidebar...

Website Feedback

Closed Threads



Active: 650 users

Aligulac Feb 6 Update: Oops

Forum Index > SC2 General
Post a Reply
Normal
TheBB
Profile Blog Joined July 2009
Switzerland5133 Posts
Last Edited: 2013-02-07 10:11:38
February 07 2013 00:26 GMT
#1
What's this?

Aligulac.com is an ongoing statistical project and website in development started in December 2012. It offers a comprehensive database of games from the pro and semipro SC2 scene, as well as a unique rating system aimed at rating players and teams, and predicting games.

The FAQ might be able to answer your questions. If not, I'll be keeping an eye on this thread so you can ask away here.

Also, before we start, I want to quote Heartland from one of my previous threads who had this to share, better put into words than I ever could.

On February 03 2013 02:50 Heartland wrote:
I think what's cool and great about this work is that it does what statistics are good for. They give you the ability to create data and then to look at it critically. But maybe people in this thread confuse statistics with the Truth with a capital p (sic). That's not the way you should read statistics, whether in the morning paper or on TL. Rather statistics can make us think about deeper connections that we haven't seen before, twist and turn around concepts and play with them through statistical models. They're not meant to say "Scarlett should be in Code S." Obviously there are flaws or issues with these stats, but it's common for statistics everywhere. What you can do with that is to add or change some modifier, let it meet other forms of reasoning or to extrapolate on what we take for granted.

So yeah, tl;dr, lies, damn lies and statistics are the case with all stats but it's not the point of stats.

So, let's get on with the news.

Ratings bug

Aside from a lot of minor improvements, two significant things have happened. On Monday I got an e-mail from a certain Zomia, who had lots of constructive feedback to the site.

You see, I initially envisioned this site to be about ratings, but now it's more of a TLPD-like thing, which is to say it's a database of results, and most of the development lately has been related to that. But Zomia convinced me to go back and look at the rating system once more, and lo and behold, there was a bug.

Strictly speaking, the bug was not in the rating system, but in the code that I used analyze it and pick parameters. Until now I have basically been basing these numbers off of flawed information. No longer! As it turns out this is for everyone's benefit, because it allowed me to tweak the system to be more conservative (which a lot of people have been wanting), while giving it the predictive power that I thought it had, and which it now actually has.

+ Show Spoiler [Proof] +
[image loading]


This graph plots all 114k+ games, categorizing them by predicted winrate for the presumed strongest player on the horizontal axis, and actual winrate for same on the vertical axis. That's the thick black line. The blue line shows a weighted linear fit, while the red line is the ideal that we want. The system tends to slightly overestimate the strength of the strongest player up to a game winrate of 80%. From then on the overestimation might be significant. (Note that this is game winrate, not Bo3 or Bo5 winrate, which is higher.)

Hopefully this can convince everyone that it's working just fine now.

The upshot of this is that the rating list looks much more sensible, as does the Hall of Fame (which I've changed a bit how it works), which is now topped by people like MC, MKP and Mvp.

So to those of you who have been critiquing me, I owe you an apology.

Add results! Now easy even for grandma

Now, to the results database. We have 114 thousand games. That's about twice the size of sc2charts.net and about a third more than TLPD has, which makes us the most complete publically available pro/semipro SC2 database in the world (Blizzard's internal database is neither). Which is fine by me, I don't much care for sc2charts.net, and while TLPD is great the international database is not kept up to date very well any more.

Yet, I don't blame them. Creating this database with little more than just four people pulling the cart, I've learned that this shit's hard.

That is why we have opened for everyone to submit results. You can do that here: http://aligulac.com/add/

Publically submitted results will be subject to review by us before becoming visible. If you are interested, you can still PM me and get an admin account, which will allow you to:
  • submit results directly without all the review bother.
  • change, create and delete players, teams, matches...
  • review other people's submitted results.
  • sort matches into our events catalogue, which is still missing about 40% of the database.
  • mark the offline games as offline (some people wanted a separate rating for offline games, but that can't happen until this sorting is done).
  • bug me for feature requests (priority given to helpers).
  • just about anything else you can think of.
So, if your favourite player is rated too low, I suggest you go out and find some games that they won and add them for us.

(Kinda joking, but also kinda not.)

Thanks to the unknown guy out there who submitted IPTL this morning, and thus unknowingly became the first to use this system. (No idea who you are...)

Now, as usual the ratings have been updated to the latest two-week period. This one includes results from the two first days of GSL Code A and the preliminaries, most of GSL Code S Ro32, Proleague, IPL qualifiers, IPTL, Iron Squid finals, a handful of go4sc2s and ZOTACs and a bunch of other tournaments.

Zerg is still ahead in the top of the ladder, but this time it's Terran who is lagging behind, and not (as is usually the case) Protoss.

(I'm a little bit earlier this time so the list can be out in time for the GSL Ro16.)

Current top 10
  1. Life 2256 (+4 after Iron Squid)
  2. PartinG 2239 (+15 after Code S Ro32, MLG quals and FXO inv. playoffs)
  3. Leenock 2235 (no change after FXO Inv. playoffs)
  4. Bomber 2233 (+12 after Code S Ro32)
  5. Rain 2211 (-9 after Proleague and Code A prelims)
  6. DongRaeGu 2203 (+41 after IPL intl. regionals, Iron Squid, and Code S Ro32)
  7. TaeJa 2194 (-1 after Code S Ro32 and a delayed game from IPL intl. regional)
  8. RorO 2179 (+93 after Proleague, Code S Ro32 and MLG quals)
  9. Scarlett 2137 (no games)
  10. viOLet 2132 (no games)
Parting and Leenock have switched places, same with DongRaeGu and Taeja. Roro and Violet make their appearances while Hero and Last drop out. Roro and DRG make the biggest jumps upwards.

Foreigner top 10
  1. Scarlett 2137
  2. Stephano 2015
  3. VortiX 2014
  4. Snute 1973
  5. LucifroN 1947
  6. Sen 1909
  7. Kas 1886
  8. Fraer 1869
  9. TitaN 1868
  10. Nerchio 1826
The foreigner list is still Zerg heavy, but not as much as before maybe. Nobody in the top 10 gained any points, except Snute who shot up 136 after playing a ton of games.

Top 10 teams
  1. MVP 91.44%
  2. StarTale 91.11%
  3. SK Telecom T1 91.04%
  4. Incredible Miracle 90.14%
  5. AZUBU 89.08%
  6. Team Liquid 88.75%
  7. Prime 86.25%
  8. FXOpen e-Sports Korea 85.96%
  9. STX SouL 84.76%
  10. Evil Geniuses 84.44%
This is the allkill rank, and it's very close in the top. StarTale has lost their big advantage, and SKT T1 keeps making their name as the strongest Kespa team.

The proleague rank looks like this:
  1. SK Telecom T1 79.26%
  2. MVP 78.80%
  3. Incredible Miracle 74.91%
  4. AZUBU 74.71%
  5. StarTale 72.00%
  6. STX SouL 67.12%
  7. Team Liquid 66.52%
  8. Prime 62.50%
  9. Evil Geniuses 61.92%
  10. FXOpen e-Sports Korea 60.87%
Still close between SKT T1 and MVP, who might be the strongest teams (on paper) at the moment. StarTale is comparatively weaker in this format (their roster is topped out with Life and Bomber), while STX is stronger.

(Note team ranks are based on player ratings and rosters, not actual team matches.)

Thanks

To my team of trusted helpers: Conti, kiekaboe, Grovbolle, PhoenixVoid, Inflicted_ (new) and scisyhp (new). This project would never have been possible without you.

This week particularly to Zomia whose feedback led me to reconsider a few things.

Also to my academic advisor whose timely conference trip abroad allowed me the free time to waste.

Bonus

If you haven't seen the player connectivity graph yet you need to check it out. It's the kind of thing that makes this work truly worth it, I feel. http://www.teamliquid.net/forum/viewmessage.php?topic_id=396566

+ Show Spoiler [Awesomeness inside] +
[image loading]
http://aligulac.com || Barcraft Switzerland! || Zerg best race. || Stats-poster extraordinaire.
Promethelax
Profile Joined February 2012
Canada7089 Posts
February 07 2013 00:32 GMT
#2
You guys are awesome and I always love seeing your statistics. Not much I can add to the discussion but I think you do great work.
TL Mafia. Love it. Play it. http://www.teamliquid.net/forum/index.php?show_part=31 I find Kennigit really attractive. If anyone has a picture of him please feel free to PM it to me.
PhoenixVoid
Profile Blog Joined December 2011
Canada32740 Posts
Last Edited: 2013-02-07 00:45:14
February 07 2013 00:41 GMT
#3
Hahaha, the person who submitted the IPTL results was ME! The system reset, and I forgot I had to login again, which I only realized after seeing the message that showed I was not logged in.

Funnily enough, I still didn't do it properly, forgot the "Source".
I'm afraid of demented knife-wielding escaped lunatic libertarian zombie mutants
edwahn
Profile Joined March 2011
New Zealand121 Posts
February 07 2013 00:44 GMT
#4
Brilliant update, I really enjoy this website and hope to contribute some day
pPingu
Profile Joined September 2011
Switzerland2892 Posts
February 07 2013 00:47 GMT
#5
Small mistake here

2. StarTale 91.11%
3. SK Telecom T1 91.14%


Anyways great job as usual
opterown *
Profile Blog Joined August 2011
Australia54784 Posts
February 07 2013 00:49 GMT
#6
keep up the good work theBB!
ModeratorRetired LR Bonjwa
TL+ Member
TheBB
Profile Blog Joined July 2009
Switzerland5133 Posts
February 07 2013 00:49 GMT
#7
On February 07 2013 09:47 pPingu wrote:
Small mistake here
Show nested quote +
2. StarTale 91.11%
3. SK Telecom T1 91.14%

Anyways great job as usual

One day I will manage this without having to edit.
http://aligulac.com || Barcraft Switzerland! || Zerg best race. || Stats-poster extraordinaire.
Greenei
Profile Joined November 2011
Germany1754 Posts
Last Edited: 2013-02-07 01:01:58
February 07 2013 00:55 GMT
#8
hmm the 80%+ winrates seem to be really poorly predicted. but i guess that makes sense when new people have 1000 points and play against somewhat equally skilled players who have 2000 points. what really matters is the 50-70% region anyways. thx for the update, now i may FINALLY make some money with this thing :D

IMBA IMBA IMBA IMBA IMBA IMBA
TheBB
Profile Blog Joined July 2009
Switzerland5133 Posts
February 07 2013 00:57 GMT
#9
On February 07 2013 09:55 Greenei wrote:
hmm the 80%+ winrates seem to be really poorly predicted.

Also this could be because of the underlying model (maybe using normal distribution wasn't the best idea, logistic might be better after all... more on this in a later edition maybe), or because there really aren't that many games with 80%+ skill gap.
http://aligulac.com || Barcraft Switzerland! || Zerg best race. || Stats-poster extraordinaire.
mierin
Profile Joined August 2010
United States4943 Posts
February 07 2013 01:01 GMT
#10
On February 07 2013 09:49 opterown wrote:
keep up the good work theBB!

JD, Stork, Calm, Hyuk Fighting!
Greenei
Profile Joined November 2011
Germany1754 Posts
Last Edited: 2013-02-07 01:02:49
February 07 2013 01:02 GMT
#11
btw: i still think a timeframe independant model would be nice, because it would make ratingupdates quicker. i could understand it though if that's not possible. so here is my idea:

how about a 'predicted rating development'? as in 'if these games, which have been reviewed, were the only games played in this timeframe, this rating would be the result.
IMBA IMBA IMBA IMBA IMBA IMBA
Day[9]
Profile Blog Joined April 2003
United States7366 Posts
February 07 2013 01:07 GMT
#12
Aligulac is very nearly Caligula backwards. At least they anagram.
Whenever I encounter some little hitch, or some of my orbs get out of orbit, nothing pleases me so much as to make the crooked straight and crush down uneven places. www.day9.tv
EtherealDeath
Profile Blog Joined July 2007
United States8366 Posts
February 07 2013 01:11 GMT
#13
On February 07 2013 10:07 Day[9] wrote:
Aligulac is very nearly Caligula backwards. At least they anagram.

Whoa. You're right O.O
TheBB
Profile Blog Joined July 2009
Switzerland5133 Posts
February 07 2013 01:15 GMT
#14
On February 07 2013 10:07 Day[9] wrote:
Aligulac is very nearly Caligula backwards. At least they anagram.

It's not even that complicated. Just move the C to the front or the back.

It's a word I came up with as a kid :-P. I don't remember if Caligula was the inspiration. I guess he could've been.
http://aligulac.com || Barcraft Switzerland! || Zerg best race. || Stats-poster extraordinaire.
StarVe
Profile Joined June 2011
Germany13591 Posts
Last Edited: 2013-02-07 01:37:15
February 07 2013 01:36 GMT
#15
Just want to say that you're awesome, TheBB. It's great someone picked up the pieces after TLPD just kinda shattered and made something even better and more complete.

I've been using it quite a bit and it's now my go-to SC2 database site if I need to look up something, thanks a lot to you and all your helpers!

Also, I'm very happy with the foreigner rankings, ROX.KIS fighting!
LockeTazeline
Profile Blog Joined June 2012
2390 Posts
Last Edited: 2013-02-07 02:06:00
February 07 2013 02:05 GMT
#16
Awesome stuff, as always.

What are you going to do with HotS? I noticed you've already put in the MLG Showdowns.
Greenei
Profile Joined November 2011
Germany1754 Posts
February 07 2013 02:11 GMT
#17
On February 07 2013 10:02 Greenei wrote:
how about a 'predicted rating development'? as in 'if these games, which have been reviewed, were the only games played in this timeframe, this rating would be the result.


so what do you think about this? i just think it would be really cool stuff.
IMBA IMBA IMBA IMBA IMBA IMBA
docvoc
Profile Blog Joined July 2011
United States5491 Posts
February 07 2013 02:43 GMT
#18
On February 07 2013 10:07 Day[9] wrote:
Aligulac is very nearly Caligula backwards. At least they anagram.

He has been revealed O.o
User was warned for too many mimes.
idkfa
Profile Joined October 2011
United States77 Posts
February 07 2013 02:56 GMT
#19
Click for cool graphs, evolved!

I like how Sen is a geographic region all by himself!
If you use "literally" as a form of hyperbole, you will literally DIE! (Eventually.)
feanor1
Profile Blog Joined November 2010
United States1899 Posts
February 07 2013 03:05 GMT
#20
On February 07 2013 11:56 idkfa wrote:
Click for cool graphs, evolved!

I like how Sen is a geographic region all by himself!

Probably should make SEA "MoonGlade" lol
_Puppies
Profile Blog Joined February 2012
24 Posts
February 07 2013 03:10 GMT
#21
On February 07 2013 10:07 Day[9] wrote:
Aligulac is very nearly Caligula backwards. At least they anagram.

Text Twist Power!
Canucklehead
Profile Joined March 2011
Canada5074 Posts
February 07 2013 03:12 GMT
#22
Ok boys, someone put this to the test and start betting for a month, then report on your winnings!

http://www.pinnaclesports.com/League/E Sports/SC2 GSL/1/Lines.aspx
Top 10 favourite pros: MKP, MVP, MC, Nestea, DRG, Jaedong, Flash, Life, Creator, Leenock
Grovbolle
Profile Blog Joined July 2011
Denmark3805 Posts
February 07 2013 07:25 GMT
#23
It makes no sense for me to do a write-up on the kespa pros since a shit ton of old matches were added, meaning that everyones ranking changed so it's hard to compare between period 76 and 77.
Will do it for the next ranking. I feel like quoting my own skype log with TheBB because I said the whole "aligulac is like caligula" first :D
Lies, damned lies and statistics: http://aligulac.com
StarGalaxy
Profile Joined March 2011
Germany744 Posts
Last Edited: 2013-02-07 12:30:19
February 07 2013 12:28 GMT
#24
One of my favourite sc2 releated sites.

poor socke he lost the position as the best foreigner protoss. He isn't even in the top 10 Eu toss right now.
I guess he needs some more consistency to get up there again with the new ranking.^^
Cj hero | Zest
Grovbolle
Profile Blog Joined July 2011
Denmark3805 Posts
February 07 2013 12:34 GMT
#25
On February 07 2013 21:28 StarGalaxy wrote:
One of my favourite sc2 releated sites.

poor socke he lost the position as the best foreigner protoss. He isn't even in the top 10 Eu toss right now.
I guess he needs some more consistency to get up there again with the new ranking.^^


Consistency and lots of matches will boost him
Lies, damned lies and statistics: http://aligulac.com
WigglingSquid
Profile Joined August 2011
5194 Posts
February 07 2013 12:41 GMT
#26
We're just waiting for your "Quality Poster" star, that's all.
JustPassingBy
Profile Blog Joined January 2011
10776 Posts
February 07 2013 12:41 GMT
#27
Only 5 terrans in the top 40 non-Koreans now... ;;
Is there a reason why Demulsim does not show up in the non-Korean rankings, btw?
Targe
Profile Blog Joined February 2012
United Kingdom14103 Posts
February 07 2013 12:44 GMT
#28
Awesome as usual BB!
11/5/14 CATACLYSM | The South West's worst Falco main
Conti
Profile Joined November 2010
Germany2516 Posts
February 07 2013 12:51 GMT
#29
On February 07 2013 11:05 LockeTazeline wrote:
Awesome stuff, as always.

What are you going to do with HotS? I noticed you've already put in the MLG Showdowns.

There are already various HotS showmatches and tournaments in the database, and we (and anyone who wants to help out!) are in the process of categorizing them as HotS matches/tournaments.

As far as I know, there won't be a distinction made between WoL and HotS matches played in regards to the ratings. There might be some chaos during the launch of HotS because of this, but there's really no smart way to prevent this, and the ratings will adjust quickly enough.
Grovbolle
Profile Blog Joined July 2011
Denmark3805 Posts
Last Edited: 2013-02-07 12:56:28
February 07 2013 12:55 GMT
#30
On February 07 2013 21:41 JustPassingBy wrote:
Only 5 terrans in the top 40 non-Koreans now... ;;
Is there a reason why Demulsim does not show up in the non-Korean rankings, btw?


No matches played for 4 periods (8 weeks) meaning he is currently too hard to rate, his rating isn't deleted, just hidden until we again have an idea of his skill level.

It's explained here
http://aligulac.com/faq

"If a player has not played any games for four periods (eight weeks, or about two months), he or she is removed from the list. The entry is still there, and will be taken into account if the player plays a new game some time in the future. It's only kept from the published list to keep it from filling up with uncertain and possibly irrelevant data.

The same happens if your rating uncertainty goes over 200.

I'm aware that a strict limit of only two months may seem harsh, but as the game is so volatile, if a player doesn't play fairly frequently, the ratings quickly become very uncertain and useless. Remember, your rating is still there.
"
Lies, damned lies and statistics: http://aligulac.com
TheBB
Profile Blog Joined July 2009
Switzerland5133 Posts
February 07 2013 14:30 GMT
#31
On February 07 2013 11:11 Greenei wrote:
Show nested quote +
On February 07 2013 10:02 Greenei wrote:
how about a 'predicted rating development'? as in 'if these games, which have been reviewed, were the only games played in this timeframe, this rating would be the result.

so what do you think about this? i just think it would be really cool stuff.

Yeah, this is doable.
http://aligulac.com || Barcraft Switzerland! || Zerg best race. || Stats-poster extraordinaire.
dcemuser
Profile Joined August 2010
United States3248 Posts
Last Edited: 2013-02-07 14:54:30
February 07 2013 14:50 GMT
#32
I love Aligulac.

The -only- issue that I can think of is that there is a 'flaw' to the way it handles GSL/PL. When the Korean scene stays largely separate from the foreign scene, Aligulac has a hard time distinguishing "the gap". For example, the first few GSL seasons had almost no players who had played against foreigners in recorded matches.

Therefore, players who beat exclusively Koreans who did not play (and stomp) foreigners are not having their bar set high enough. This taints a lot of the 2010/early 2011 data. For example, look at NesTea in the Hall of Fame. He's below -Naniwa- and like 4 other foreign players, despite winning 3 GSLs in a 9 month time period.

Luckily, this probably isn't a major issue going forward, since MLG and other tournaments are bringing major Koreans to stomp foreigners, and then those Koreans go home and get beaten by better Koreans, which keeps things balanced. However, there was a once upon a time where the scenes rarely mixed.

TL;DR: GSL Code S is the highest skill tournament in the world, but to Aligulac, this is something that needs to be -continuously- proven, and therefore runs the risk of -not- being proven due to the said players not traveling and focusing entirely on GSL.

http://aligulac.com/periods/21/?sort=&race=ptzrs&nats=all is a big example of this. Some guy named Bubbles 3-0'd a bunch of foreigners (0 korean players) and became the #1 player in the world, lol.

I'm honestly not sure how you solve this, other than just mentioning it in the FAQ and admitting the early data is going to be kind of weird.
Conti
Profile Joined November 2010
Germany2516 Posts
February 07 2013 15:15 GMT
#33
That's precisely the reason why we have an international TLPD and a Korean TLPD (at least I think so. Do let me know if I'm wrong here). Since in Brood War, Koreans practically never played against foreigners, and if they would have combined the TLPD elo ranking back then, foreigners would rank evenly with Koreans. Which was even more laughable back then in Brood War than it is now in SC2. So having separate databases for TLPD made perfect sense.

Now, however, Koreans play foreigners all the time, so we can compare them. With the exception, as you say, of the early days of GSL. Another more prominent exception is Proleague, and it's even worse there, IMO. You can see the jumps in the rankings of most Kespa players (Classic, Last, Trap, etc.), which denotes the point where Kespa players started to play against non-Kespa Koreans. As you can see, the rankings have adjusted pretty quickly.

Unfortunately, there are still a lot of lesser known Kespa players who have almost solely played in Proleague so far, and their ranking is unrealistically low. Heck, some of the worst ranked players in the entire ranking are Kespa players. I don't think there's any real way to "fix" this, other than waiting it out until the Kespa ratings have adjusted to the overall ratings.
Grovbolle
Profile Blog Joined July 2011
Denmark3805 Posts
February 07 2013 15:20 GMT
#34
On February 07 2013 23:50 dcemuser wrote:
I love Aligulac.

The -only- issue that I can think of is that there is a 'flaw' to the way it handles GSL/PL. When the Korean scene stays largely separate from the foreign scene, Aligulac has a hard time distinguishing "the gap". For example, the first few GSL seasons had almost no players who had played against foreigners in recorded matches.

Therefore, players who beat exclusively Koreans who did not play (and stomp) foreigners are not having their bar set high enough. This taints a lot of the 2010/early 2011 data. For example, look at NesTea in the Hall of Fame. He's below -Naniwa- and like 4 other foreign players, despite winning 3 GSLs in a 9 month time period.

Luckily, this probably isn't a major issue going forward, since MLG and other tournaments are bringing major Koreans to stomp foreigners, and then those Koreans go home and get beaten by better Koreans, which keeps things balanced. However, there was a once upon a time where the scenes rarely mixed.

TL;DR: GSL Code S is the highest skill tournament in the world, but to Aligulac, this is something that needs to be -continuously- proven, and therefore runs the risk of -not- being proven due to the said players not traveling and focusing entirely on GSL.

http://aligulac.com/periods/21/?sort=&race=ptzrs&nats=all is a big example of this. Some guy named Bubbles 3-0'd a bunch of foreigners (0 korean players) and became the #1 player in the world, lol.

I'm honestly not sure how you solve this, other than just mentioning it in the FAQ and admitting the early data is going to be kind of weird.


It speaks for itself that it takes a lot of matches from a large group of players to determine their skills,and I agree that the early stages of the lists are somewhat dubious. The same goes for a lot of the Kespa pros, the more matches we get to see from them, and the more interaction there is in the entire scene, the better the lists and the predictions get. They all started out at 1000 rating, which isn't a lot since we knew that most of them would still be pretty good, but it is also hard to just say "well they are better than a 1000 for sure" when we have no definitive way of being sure.

Valid points/concerns though.
The system will get shaken a lot when HotS hits as well.
Lies, damned lies and statistics: http://aligulac.com
VirgilSC2
Profile Blog Joined June 2011
United States6151 Posts
February 07 2013 15:25 GMT
#35
At this point I'm starting to think TL should just buy rights to Aligulac and roll it in as TLPD....

I love this site man.
Clarity Gaming #1 Fan | Avid MTG Grinder | @VirgilSC2
Grovbolle
Profile Blog Joined July 2011
Denmark3805 Posts
February 07 2013 15:28 GMT
#36
On February 08 2013 00:25 VirgilSC2 wrote:
At this point I'm starting to think TL should just buy rights to Aligulac and roll it in as TLPD....

I love this site man.


Well they can always download the MySQL dump
http://aligulac.com/db/
Lies, damned lies and statistics: http://aligulac.com
twistedalpha
Profile Joined September 2010
Philippines139 Posts
February 07 2013 15:54 GMT
#37
For a second there I thought you're talking about this guy

[image loading]
JustPassingBy
Profile Blog Joined January 2011
10776 Posts
February 07 2013 16:27 GMT
#38
On February 07 2013 21:55 Grovbolle wrote:
Show nested quote +
On February 07 2013 21:41 JustPassingBy wrote:
Only 5 terrans in the top 40 non-Koreans now... ;;
Is there a reason why Demulsim does not show up in the non-Korean rankings, btw?


No matches played for 4 periods (8 weeks) meaning he is currently too hard to rate, his rating isn't deleted, just hidden until we again have an idea of his skill level.

It's explained here
http://aligulac.com/faq

"If a player has not played any games for four periods (eight weeks, or about two months), he or she is removed from the list. The entry is still there, and will be taken into account if the player plays a new game some time in the future. It's only kept from the published list to keep it from filling up with uncertain and possibly irrelevant data.

The same happens if your rating uncertainty goes over 200.

I'm aware that a strict limit of only two months may seem harsh, but as the game is so volatile, if a player doesn't play fairly frequently, the ratings quickly become very uncertain and useless. Remember, your rating is still there.
"


Ah, thanks for the info. That rules does make alot of sense to me.
KillerDucky
Profile Blog Joined July 2010
United States498 Posts
February 07 2013 17:16 GMT
#39
On February 07 2013 23:50 dcemuser wrote:
I love Aligulac.

The -only- issue that I can think of is that there is a 'flaw' to the way it handles GSL/PL. When the Korean scene stays largely separate from the foreign scene, Aligulac has a hard time distinguishing "the gap". For example, the first few GSL seasons had almost no players who had played against foreigners in recorded matches.

Therefore, players who beat exclusively Koreans who did not play (and stomp) foreigners are not having their bar set high enough. This taints a lot of the 2010/early 2011 data. For example, look at NesTea in the Hall of Fame. He's below -Naniwa- and like 4 other foreign players, despite winning 3 GSLs in a 9 month time period.

Luckily, this probably isn't a major issue going forward, since MLG and other tournaments are bringing major Koreans to stomp foreigners, and then those Koreans go home and get beaten by better Koreans, which keeps things balanced. However, there was a once upon a time where the scenes rarely mixed.

TL;DR: GSL Code S is the highest skill tournament in the world, but to Aligulac, this is something that needs to be -continuously- proven, and therefore runs the risk of -not- being proven due to the said players not traveling and focusing entirely on GSL.

http://aligulac.com/periods/21/?sort=&race=ptzrs&nats=all is a big example of this. Some guy named Bubbles 3-0'd a bunch of foreigners (0 korean players) and became the #1 player in the world, lol.

I'm honestly not sure how you solve this, other than just mentioning it in the FAQ and admitting the early data is going to be kind of weird.


It's possible to fix this, some papers I read call it parameter smoothing, using backward filtering to smooth the past ratings. See for example this paper: http://tennis-skill-rankings.googlecode.com/hg-history/c977c53a3af2913e780e39666fe1a272cc298319/links/glicko.pdf

MarineKingPrime Forever!
ZenithM
Profile Joined February 2011
France15952 Posts
Last Edited: 2013-02-07 17:21:07
February 07 2013 17:20 GMT
#40
On February 07 2013 10:15 TheBB wrote:
Show nested quote +
On February 07 2013 10:07 Day[9] wrote:
Aligulac is very nearly Caligula backwards. At least they anagram.

It's not even that complicated. Just move the C to the front or the back.

It's a word I came up with as a kid :-P. I don't remember if Caligula was the inspiration. I guess he could've been.

I thought it definitely was :D
When you released aligulac for the first time, I even searched into Caligula to find some kind of connection to your website. It wasn't very insightful. Except that he had decent macro, I guess.
TheBB
Profile Blog Joined July 2009
Switzerland5133 Posts
Last Edited: 2013-02-07 18:22:34
February 07 2013 18:06 GMT
#41
On February 08 2013 02:16 KillerDucky wrote:
Show nested quote +
On February 07 2013 23:50 dcemuser wrote:
I love Aligulac.

The -only- issue that I can think of is that there is a 'flaw' to the way it handles GSL/PL. When the Korean scene stays largely separate from the foreign scene, Aligulac has a hard time distinguishing "the gap". For example, the first few GSL seasons had almost no players who had played against foreigners in recorded matches.

Therefore, players who beat exclusively Koreans who did not play (and stomp) foreigners are not having their bar set high enough. This taints a lot of the 2010/early 2011 data. For example, look at NesTea in the Hall of Fame. He's below -Naniwa- and like 4 other foreign players, despite winning 3 GSLs in a 9 month time period.

Luckily, this probably isn't a major issue going forward, since MLG and other tournaments are bringing major Koreans to stomp foreigners, and then those Koreans go home and get beaten by better Koreans, which keeps things balanced. However, there was a once upon a time where the scenes rarely mixed.

TL;DR: GSL Code S is the highest skill tournament in the world, but to Aligulac, this is something that needs to be -continuously- proven, and therefore runs the risk of -not- being proven due to the said players not traveling and focusing entirely on GSL.

http://aligulac.com/periods/21/?sort=&race=ptzrs&nats=all is a big example of this. Some guy named Bubbles 3-0'd a bunch of foreigners (0 korean players) and became the #1 player in the world, lol.

I'm honestly not sure how you solve this, other than just mentioning it in the FAQ and admitting the early data is going to be kind of weird.

It's possible to fix this, some papers I read call it parameter smoothing, using backward filtering to smooth the past ratings. See for example this paper: http://tennis-skill-rankings.googlecode.com/hg-history/c977c53a3af2913e780e39666fe1a272cc298319/links/glicko.pdf

I thought about this (that's the paper I based my method on actually), but I didn't quite like the idea of past lists changing forever. When FIDE (chess) ratings are published they are set in stone, and you know for example that Kasparov's 2851 record from 1999 or Carlsen's 2872 at the moment will never be anything other than what they are. It makes it awkward for enthusiasts to track records. Not that I've noticed a lot of people tracking Aligulac records, since the pasts lists are changing anyway due to the expanding database (for the time being), but still, I wanted to give people the option.

Thoughts?

Edit: Just so it's clear, we're talking about basing ratings on both past and future results, so that the historical ratings look more correct in hindsight. It can fix some of the early problems by (for example) adjusting Koreans upwards because we now know that they have an average higher skill level.
http://aligulac.com || Barcraft Switzerland! || Zerg best race. || Stats-poster extraordinaire.
KillerDucky
Profile Blog Joined July 2010
United States498 Posts
Last Edited: 2013-02-07 18:54:43
February 07 2013 18:54 GMT
#42
Maybe just run smoothing once. Really as long as you start from around October 2012 (MvP matches) and smooth backwards from there, most of the problems would probably be fixed.
MarineKingPrime Forever!
MCXD
Profile Blog Joined February 2012
Australia2738 Posts
Last Edited: 2013-02-07 19:00:32
February 07 2013 18:58 GMT
#43
Just letting you know that I encountered an error when playing around with the prediction stuff:

+ Show Spoiler +
[image loading]


It seems like the round robin thing doesn't like large groups. (Was just an 8-man round robin w/ bo1 using the players shown)
EtherealDeath
Profile Blog Joined July 2007
United States8366 Posts
February 07 2013 19:03 GMT
#44
Lol funny rounding error that is.
TheBB
Profile Blog Joined July 2009
Switzerland5133 Posts
February 07 2013 19:11 GMT
#45
Ah yeah, you used a group that was so big it was forced to use Monte Carlo simulation. Thanks for the heads up.
http://aligulac.com || Barcraft Switzerland! || Zerg best race. || Stats-poster extraordinaire.
ACrow
Profile Joined October 2011
Germany6583 Posts
February 07 2013 19:56 GMT
#46
There is not a lot of difference between First and Last in the recent list.

Good job, always love your list! Glad you found a bug, it still seems a bit weird seeing Scarlett that high on the list, but w/e, math does not lie and it's only a model not the truth (whatever truth is).
Get off my lawn, young punks
Greenei
Profile Joined November 2011
Germany1754 Posts
February 07 2013 22:22 GMT
#47
what does the '+-30' in the matchuppoints nad general rating actually mean? does it mean ~100% of the time the rating is in that area? or is that 1 or 2 or 3 standarddeviations? or the maximum amount that the rating will shift?
IMBA IMBA IMBA IMBA IMBA IMBA
Grovbolle
Profile Blog Joined July 2011
Denmark3805 Posts
Last Edited: 2013-02-08 08:41:08
February 07 2013 23:05 GMT
#48
On February 08 2013 07:22 Greenei wrote:
what does the '+-30' in the matchuppoints nad general rating actually mean? does it mean ~100% of the time the rating is in that area? or is that 1 or 2 or 3 standarddeviations? or the maximum amount that the rating will shift?


Making a qualified guess, I would say it is +-3 standard deviations (meaning that 95% of the time, the actual rating falls within the confidence interval, i.e. Rating +- 3 St. Deviations.)

Edit: Of course 3 std's = 99% (I am retarded)
Lies, damned lies and statistics: http://aligulac.com
TheBB
Profile Blog Joined July 2009
Switzerland5133 Posts
Last Edited: 2013-02-07 23:43:28
February 07 2013 23:34 GMT
#49
On February 08 2013 07:22 Greenei wrote:
what does the '+-30' in the matchuppoints nad general rating actually mean? does it mean ~100% of the time the rating is in that area? or is that 1 or 2 or 3 standarddeviations? or the maximum amount that the rating will shift?

It's actually just one estimated standard deviation, so it's a pretty weak confidence interval.
http://aligulac.com || Barcraft Switzerland! || Zerg best race. || Stats-poster extraordinaire.
Greenei
Profile Joined November 2011
Germany1754 Posts
February 08 2013 02:52 GMT
#50
Making a qualified guess, I would say it is +-3 standard deviations (meaning that 95% of the time, the actual rating falls within the confidence interval, i.e. Rating +- 3 St. Deviations.)


3 stds would be ~99%.


On February 08 2013 08:34 TheBB wrote:
Show nested quote +
On February 08 2013 07:22 Greenei wrote:
what does the '+-30' in the matchuppoints nad general rating actually mean? does it mean ~100% of the time the rating is in that area? or is that 1 or 2 or 3 standarddeviations? or the maximum amount that the rating will shift?

It's actually just one estimated standard deviation, so it's a pretty weak confidence interval.


k thx. do you plan on making the database open source at any point? because i'd like to make some calculations of my own from time to time and there would be no point at all in starting an own database at this point.
IMBA IMBA IMBA IMBA IMBA IMBA
Conti
Profile Joined November 2010
Germany2516 Posts
February 08 2013 06:17 GMT
#51
On February 08 2013 11:52 Greenei wrote:
Show nested quote +
Making a qualified guess, I would say it is +-3 standard deviations (meaning that 95% of the time, the actual rating falls within the confidence interval, i.e. Rating +- 3 St. Deviations.)


3 stds would be ~99%.


Show nested quote +
On February 08 2013 08:34 TheBB wrote:
On February 08 2013 07:22 Greenei wrote:
what does the '+-30' in the matchuppoints nad general rating actually mean? does it mean ~100% of the time the rating is in that area? or is that 1 or 2 or 3 standarddeviations? or the maximum amount that the rating will shift?

It's actually just one estimated standard deviation, so it's a pretty weak confidence interval.


k thx. do you plan on making the database open source at any point? because i'd like to make some calculations of my own from time to time and there would be no point at all in starting an own database at this point.

You can download an SQL database dump at http://aligulac.com/db/.
Greenei
Profile Joined November 2011
Germany1754 Posts
February 08 2013 07:53 GMT
#52
On February 08 2013 15:17 Conti wrote:
Show nested quote +
On February 08 2013 11:52 Greenei wrote:
Making a qualified guess, I would say it is +-3 standard deviations (meaning that 95% of the time, the actual rating falls within the confidence interval, i.e. Rating +- 3 St. Deviations.)


3 stds would be ~99%.


On February 08 2013 08:34 TheBB wrote:
On February 08 2013 07:22 Greenei wrote:
what does the '+-30' in the matchuppoints nad general rating actually mean? does it mean ~100% of the time the rating is in that area? or is that 1 or 2 or 3 standarddeviations? or the maximum amount that the rating will shift?

It's actually just one estimated standard deviation, so it's a pretty weak confidence interval.


k thx. do you plan on making the database open source at any point? because i'd like to make some calculations of my own from time to time and there would be no point at all in starting an own database at this point.

You can download an SQL database dump at http://aligulac.com/db/.


ah thx, that was a bit hidden :D
IMBA IMBA IMBA IMBA IMBA IMBA
Grovbolle
Profile Blog Joined July 2011
Denmark3805 Posts
February 08 2013 08:41 GMT
#53
On February 08 2013 08:34 TheBB wrote:
Show nested quote +
On February 08 2013 07:22 Greenei wrote:
what does the '+-30' in the matchuppoints nad general rating actually mean? does it mean ~100% of the time the rating is in that area? or is that 1 or 2 or 3 standarddeviations? or the maximum amount that the rating will shift?

It's actually just one estimated standard deviation, so it's a pretty weak confidence interval.


Yeah ok, 68%
Lies, damned lies and statistics: http://aligulac.com
a3den
Profile Joined April 2012
704 Posts
Last Edited: 2013-02-08 13:10:08
February 08 2013 12:00 GMT
#54
As a stats buff, gotta say it really is a nice website, like a cleaner and better version of TLPD (or sc2charts, whatever floated your boat). Both infuriated me for the longest time because they had the data and did nothing with it. You on the other hand understand that a db is as good as what you do with it. I also love how well your data is historized.

Downloading that Db dump from work is so tempting...
maty
Profile Joined November 2010
Germany12 Posts
Last Edited: 2013-02-08 13:10:59
February 08 2013 13:10 GMT
#55
you could revisit the EG curse with those stats
MasterOfPuppets
Profile Blog Joined March 2011
Romania6942 Posts
February 10 2013 21:40 GMT
#56
So BB if you ever get particularly bored, could you make a prediction system for ProLeague/GSTL based on not only on player rating for both rosters but also maps? Or is it simply not going to be accurate enough to warrant the gargantuan effort involved in creating and implementing the system? xD
"my shaft scares me too" - strenx 2014
Conti
Profile Joined November 2010
Germany2516 Posts
Last Edited: 2013-02-10 22:14:22
February 10 2013 22:14 GMT
#57
On February 11 2013 06:40 MasterOfPuppets wrote:
So BB if you ever get particularly bored, could you make a prediction system for ProLeague/GSTL based on not only on player rating for both rosters but also maps? Or is it simply not going to be accurate enough to warrant the gargantuan effort involved in creating and implementing the system? xD

There's currently no map information saved in the database, only matches and results. So before any kind of predictive magic math can be applied, we'd need that information for >100.000 games. And we'd need a whole lot more volunteers for that.

Nudge. Nudge.
Grovbolle
Profile Blog Joined July 2011
Denmark3805 Posts
Last Edited: 2013-02-10 22:41:11
February 10 2013 22:34 GMT
#58
On February 11 2013 07:14 Conti wrote:
Show nested quote +
On February 11 2013 06:40 MasterOfPuppets wrote:
So BB if you ever get particularly bored, could you make a prediction system for ProLeague/GSTL based on not only on player rating for both rosters but also maps? Or is it simply not going to be accurate enough to warrant the gargantuan effort involved in creating and implementing the system? xD

There's currently no map information saved in the database, only matches and results. So before any kind of predictive magic math can be applied, we'd need that information for >100.000 games. And we'd need a whole lot more volunteers for that.

Nudge. Nudge.


Plus we (TheBB) had to rework how the entire database is configured because matches =/= games.

Plus it would be hard since a lot of LP-articles contain no mapinfo, even on big tournaments like MLG it is impossible to find map info for stuff like open bracket etc. So yeah, way too much work, whenever a new feature has to be "backtracked" as I like to call it, it literally takes our small team of 4-5 (TheBB, Conti, kiekaboe does a shit ton each and I + Inflicted does some as well) weeks, just look at this
http://aligulac.com/db/
"only" 64% is catalogued in the event hierarchy.
Lies, damned lies and statistics: http://aligulac.com
Epamynondas
Profile Joined September 2012
387 Posts
February 10 2013 23:09 GMT
#59
On February 08 2013 03:06 TheBB wrote:
+ Show Spoiler +
On February 08 2013 02:16 KillerDucky wrote:
Show nested quote +
On February 07 2013 23:50 dcemuser wrote:
I love Aligulac.

The -only- issue that I can think of is that there is a 'flaw' to the way it handles GSL/PL. When the Korean scene stays largely separate from the foreign scene, Aligulac has a hard time distinguishing "the gap". For example, the first few GSL seasons had almost no players who had played against foreigners in recorded matches.

Therefore, players who beat exclusively Koreans who did not play (and stomp) foreigners are not having their bar set high enough. This taints a lot of the 2010/early 2011 data. For example, look at NesTea in the Hall of Fame. He's below -Naniwa- and like 4 other foreign players, despite winning 3 GSLs in a 9 month time period.

Luckily, this probably isn't a major issue going forward, since MLG and other tournaments are bringing major Koreans to stomp foreigners, and then those Koreans go home and get beaten by better Koreans, which keeps things balanced. However, there was a once upon a time where the scenes rarely mixed.

TL;DR: GSL Code S is the highest skill tournament in the world, but to Aligulac, this is something that needs to be -continuously- proven, and therefore runs the risk of -not- being proven due to the said players not traveling and focusing entirely on GSL.

http://aligulac.com/periods/21/?sort=&race=ptzrs&nats=all is a big example of this. Some guy named Bubbles 3-0'd a bunch of foreigners (0 korean players) and became the #1 player in the world, lol.

I'm honestly not sure how you solve this, other than just mentioning it in the FAQ and admitting the early data is going to be kind of weird.

It's possible to fix this, some papers I read call it parameter smoothing, using backward filtering to smooth the past ratings. See for example this paper: http://tennis-skill-rankings.googlecode.com/hg-history/c977c53a3af2913e780e39666fe1a272cc298319/links/glicko.pdf


I thought about this (that's the paper I based my method on actually), but I didn't quite like the idea of past lists changing forever. When FIDE (chess) ratings are published they are set in stone, and you know for example that Kasparov's 2851 record from 1999 or Carlsen's 2872 at the moment will never be anything other than what they are. It makes it awkward for enthusiasts to track records. Not that I've noticed a lot of people tracking Aligulac records, since the pasts lists are changing anyway due to the expanding database (for the time being), but still, I wanted to give people the option.

Thoughts?


Maybe you could do some kind of backwards adjustement (or this "smoothing" you guys speak of) only on new players? Like, compute things normally for them for about 4 periods or something like that (or for a set amount of games played, i guess?), and then adjust their ratings retroactively, and then don't mess with their past ever again.


So imagine that I get a magical seed for Code S next season, and lose my first game of the group stages against Life (but only because i'm nervous). This doesn't give a lot of points to Life because I'm totally unknown at that point.

Then I proceed to stomp all competition and win Code S without dropping another map. Then your script readjusts my ratings and suddenly Life has a rating of like 3000 because he took a game off me.

And then pro players catch up to my silver strats and I don't win a game ever again.
Conti
Profile Joined November 2010
Germany2516 Posts
February 10 2013 23:11 GMT
#60
..and sorting matches into events is about a gazillion times faster to do than adding maps to matches would be.
Grovbolle
Profile Blog Joined July 2011
Denmark3805 Posts
February 11 2013 09:19 GMT
#61
On February 11 2013 08:09 Epamynondas wrote:
Show nested quote +
On February 08 2013 03:06 TheBB wrote:
+ Show Spoiler +
On February 08 2013 02:16 KillerDucky wrote:
Show nested quote +
On February 07 2013 23:50 dcemuser wrote:
I love Aligulac.

The -only- issue that I can think of is that there is a 'flaw' to the way it handles GSL/PL. When the Korean scene stays largely separate from the foreign scene, Aligulac has a hard time distinguishing "the gap". For example, the first few GSL seasons had almost no players who had played against foreigners in recorded matches.

Therefore, players who beat exclusively Koreans who did not play (and stomp) foreigners are not having their bar set high enough. This taints a lot of the 2010/early 2011 data. For example, look at NesTea in the Hall of Fame. He's below -Naniwa- and like 4 other foreign players, despite winning 3 GSLs in a 9 month time period.

Luckily, this probably isn't a major issue going forward, since MLG and other tournaments are bringing major Koreans to stomp foreigners, and then those Koreans go home and get beaten by better Koreans, which keeps things balanced. However, there was a once upon a time where the scenes rarely mixed.

TL;DR: GSL Code S is the highest skill tournament in the world, but to Aligulac, this is something that needs to be -continuously- proven, and therefore runs the risk of -not- being proven due to the said players not traveling and focusing entirely on GSL.

http://aligulac.com/periods/21/?sort=&race=ptzrs&nats=all is a big example of this. Some guy named Bubbles 3-0'd a bunch of foreigners (0 korean players) and became the #1 player in the world, lol.

I'm honestly not sure how you solve this, other than just mentioning it in the FAQ and admitting the early data is going to be kind of weird.

It's possible to fix this, some papers I read call it parameter smoothing, using backward filtering to smooth the past ratings. See for example this paper: http://tennis-skill-rankings.googlecode.com/hg-history/c977c53a3af2913e780e39666fe1a272cc298319/links/glicko.pdf


I thought about this (that's the paper I based my method on actually), but I didn't quite like the idea of past lists changing forever. When FIDE (chess) ratings are published they are set in stone, and you know for example that Kasparov's 2851 record from 1999 or Carlsen's 2872 at the moment will never be anything other than what they are. It makes it awkward for enthusiasts to track records. Not that I've noticed a lot of people tracking Aligulac records, since the pasts lists are changing anyway due to the expanding database (for the time being), but still, I wanted to give people the option.

Thoughts?


Maybe you could do some kind of backwards adjustement (or this "smoothing" you guys speak of) only on new players? Like, compute things normally for them for about 4 periods or something like that (or for a set amount of games played, i guess?), and then adjust their ratings retroactively, and then don't mess with their past ever again.


So imagine that I get a magical seed for Code S next season, and lose my first game of the group stages against Life (but only because i'm nervous). This doesn't give a lot of points to Life because I'm totally unknown at that point.

Then I proceed to stomp all competition and win Code S without dropping another map. Then your script readjusts my ratings and suddenly Life has a rating of like 3000 because he took a game off me.

And then pro players catch up to my silver strats and I don't win a game ever again.


Do you have a Code S seed that you haven't told anyone about?
:D
Lies, damned lies and statistics: http://aligulac.com
Epamynondas
Profile Joined September 2012
387 Posts
February 11 2013 16:06 GMT
#62
Negotations are still underway so I'm not supposed to talk about it.

;D
Grovbolle
Profile Blog Joined July 2011
Denmark3805 Posts
February 11 2013 16:37 GMT
#63
On February 12 2013 01:06 Epamynondas wrote:
Negotations are still underway so I'm not supposed to talk about it.

;D


In before EGEpamynondasRC
Lies, damned lies and statistics: http://aligulac.com
KingDime
Profile Blog Joined September 2010
Canada750 Posts
February 11 2013 17:05 GMT
#64
While it is definitely a rating based behind extremely high level players and seeing where they are ranked. I find it cool that I could even find some of my own results that I had completely forgotten about from like early 2011 etc. This is a great system for the high level professional players and also really useful for a lowly NA semi-pro.
Doom Guy
fezvez
Profile Blog Joined January 2011
France3021 Posts
February 11 2013 17:57 GMT
#65
I am a bit ashamed, studying statistics myself,but why would you consider your red line ideal?

I suppose the linear regression gives you the approximation with smallest prediction error. Why would you minimize/maximize something else?
Traceback
Profile Joined October 2010
United States469 Posts
February 11 2013 18:29 GMT
#66
On February 12 2013 02:57 fezvez wrote:
I am a bit ashamed, studying statistics myself,but why would you consider your red line ideal?

I suppose the linear regression gives you the approximation with smallest prediction error. Why would you minimize/maximize something else?

Because ideally you want the actual win rate to match your predicted win rate.
Grovbolle
Profile Blog Joined July 2011
Denmark3805 Posts
February 11 2013 18:31 GMT
#67
On February 12 2013 03:29 Traceback wrote:
Show nested quote +
On February 12 2013 02:57 fezvez wrote:
I am a bit ashamed, studying statistics myself,but why would you consider your red line ideal?

I suppose the linear regression gives you the approximation with smallest prediction error. Why would you minimize/maximize something else?

Because ideally you want the actual win rate to match your predicted win rate.


Ninja'd me
Lies, damned lies and statistics: http://aligulac.com
Grovbolle
Profile Blog Joined July 2011
Denmark3805 Posts
Last Edited: 2013-02-11 19:00:32
February 11 2013 18:33 GMT
#68
On February 12 2013 02:05 KingDime wrote:
While it is definitely a rating based behind extremely high level players and seeing where they are ranked. I find it cool that I could even find some of my own results that I had completely forgotten about from like early 2011 etc. This is a great system for the high level professional players and also really useful for a lowly NA semi-pro.

If you ever participated in any of the major tournaments, or as in what I assume is your case, NASL Season 4 and zotac cup #20 + what we scraped from TLPD, there is a good chance you are in the system

Edit: You are canadian not american, so your results are from WCS
Edit edit: I merged the players Dime (us) and Dime (ca). Please respond to my PM to you so I can get the results you submitted reviewed
Lies, damned lies and statistics: http://aligulac.com
metroid composite
Profile Joined February 2007
Canada231 Posts
February 11 2013 22:00 GMT
#69
On February 07 2013 09:57 TheBB wrote:
Show nested quote +
On February 07 2013 09:55 Greenei wrote:
hmm the 80%+ winrates seem to be really poorly predicted.

Also this could be because of the underlying model (maybe using normal distribution wasn't the best idea, logistic might be better after all... more on this in a later edition maybe), or because there really aren't that many games with 80%+ skill gap.


Logistic is what gets used in Chess ELO rankings specifically because it was found to be a better fit to the data. The only part the two functions handle significantly differently is the tail (so the 80% percentage gap cases).

I mean, specifically looking at the chart you posted:

+ Show Spoiler [Proof] +
[image loading]


Ignore the data noise for a moment and look at the fitted curve. The fitted curve starts dead even with the ideal curve, but slowly diverges (and from 50%-83% the actual game data very closely follows the fitted curve). This is what I'd expect to see if the probability distribution being used was collapsing too tightly.


The problem with the normal distribution is that it's not built for transitive operations.

If Life has an 84% chance of beating Sheth
and Sheth has an 84% chance of beating Artosis
Does this mean that Life has a 98% chance of beating Artosis?

Because this is what Normal distribution predicts when you combine two win percentages.

Two 31% chances combine into a 16% chance.
Two 23% chances combine into a 7% chance.
Two 16% chances combine into a 2% chance.
Two 7% chances combine into a 0.1% chance.

Which...doesn't feel right to me. If we take the above Life/Sheth/Artosis numbers, but make Sheth 50% less likely to win against Life, and Artosis 50% less likely to win against Sheth, suddenly Artosis is 95% less likely to win against Life? The odds against him really increase by a factor of 20, when the odds of the two intermediate matchups only get worse by a factor of 2?

Compare to the logistic curve. When you combine two win percentages to predict a more distant match...

Two 31% chances combine into a 17% chance
Two 23% chances combine into a 9% chance
Two 17% chances combine into a 4% chance
Two 9% chances combine into a 1% chance

Which is to say: if you take existing win percentages, and change them so that Sheth is 1/2 as likely to beat Life, and Artisis is 1/2 as likely to beat Sheth, then that makes Artosis is 1/4 as likely to beat Life (instead of 1/20 as likely).

Just intuitively, this just feels like a more reasonable way to combine percentages. If you told me with absolute certainty "Life beats sheth X% of the time" and "Sheth beats Artosis Y% of the time", and then asked me "What do you expect Life's winrate against Artosis to be?" My guess would be much closer to the logistic distribution than the normal distribution.
Cats land on their feet. Toast lands peanut butter side down. A cat with toast strapped to its back will hover above the ground in a state of quantum indecision
Heartland
Profile Blog Joined May 2012
Sweden24580 Posts
February 11 2013 22:21 GMT
#70
I got quoted in an OP? That's awesome.

Also, this is a cool project.
felisconcolori
Profile Blog Joined October 2011
United States6168 Posts
February 11 2013 22:23 GMT
#71
When I look at TheBB's posts, I see the gathering mass of a star being born.

This is insanely useful information, an excellent use of statistics, and I hope to (insert deity or otherworldly influence here) that you can get some academic use out of this project as well. (A paper, an essay, something.)
Yes, I email sponsors... to thank them. Don't post drunk, kids. My king, what has become of you?
metroid composite
Profile Joined February 2007
Canada231 Posts
Last Edited: 2013-02-11 23:32:26
February 11 2013 23:20 GMT
#72
On February 08 2013 04:56 ACrow wrote:
Good job, always love your list! Glad you found a bug, it still seems a bit weird seeing Scarlett that high on the list, but w/e, math does not lie and it's only a model not the truth (whatever truth is).


As a big Scarlett fan...yeah, I would not put her ahead of Hyun.

In general, the place the ratings feel a little wrong is when players play in an overly easy (or overly hard) group.

Take the list of best foreign Terrans:


http://aligulac.com/periods/77/?page=1&sort=&race=t&nats=foreigners


Actually, let me quickly note that MaSa is also not on this list (Aligulac has MaSa as Korean; Liquipedia has MaSa as Canadian; I believe Canadian is correct here).



But what I really want to point out here is.... Look at the 5th best foreign Terran. Bunny! Who is Bunny? Someone who participated in a Danish Starcraft tournament, and I guess got more wins than losses. Boom, 5th best foreigner Terran, apparently!

If you want to get highly rated by Aligulac, then play opponents weaker than yourself...which sums up a decent number of Scarlett's tournaments (WCS Canada, WCS North America, IPL qualifier for North America...).

Conversely, if you want a low rating on Aligulac, then play stronger opponents. (Most of the foreigners who participated in the MLG vs Proleague event had their ratings dip, usually by about 300 points right around October 2012. For example, look at the rating graph of qxc: http://aligulac.com/players/261/ ).

I don't really know if there's a good statistical way to fix this issue, however. If all the Danish people collectively decide to never play anyone outside of Denmark, some of them are going to end up with very high ratings, some of them are going to end up with very low ratings. Not a whole lot that can be done about it.
Cats land on their feet. Toast lands peanut butter side down. A cat with toast strapped to its back will hover above the ground in a state of quantum indecision
Grovbolle
Profile Blog Joined July 2011
Denmark3805 Posts
Last Edited: 2013-02-12 07:52:37
February 12 2013 07:52 GMT
#73
On February 12 2013 08:20 metroid composite wrote:
Show nested quote +
On February 08 2013 04:56 ACrow wrote:
Good job, always love your list! Glad you found a bug, it still seems a bit weird seeing Scarlett that high on the list, but w/e, math does not lie and it's only a model not the truth (whatever truth is).


As a big Scarlett fan...yeah, I would not put her ahead of Hyun.

In general, the place the ratings feel a little wrong is when players play in an overly easy (or overly hard) group.

Take the list of best foreign Terrans:


http://aligulac.com/periods/77/?page=1&sort=&race=t&nats=foreigners


Actually, let me quickly note that MaSa is also not on this list (Aligulac has MaSa as Korean; Liquipedia has MaSa as Canadian; I believe Canadian is correct here).



But what I really want to point out here is.... Look at the 5th best foreign Terran. Bunny! Who is Bunny? Someone who participated in a Danish Starcraft tournament, and I guess got more wins than losses. Boom, 5th best foreigner Terran, apparently!

If you want to get highly rated by Aligulac, then play opponents weaker than yourself...which sums up a decent number of Scarlett's tournaments (WCS Canada, WCS North America, IPL qualifier for North America...).

Conversely, if you want a low rating on Aligulac, then play stronger opponents. (Most of the foreigners who participated in the MLG vs Proleague event had their ratings dip, usually by about 300 points right around October 2012. For example, look at the rating graph of qxc: http://aligulac.com/players/261/ ).

I don't really know if there's a good statistical way to fix this issue, however. If all the Danish people collectively decide to never play anyone outside of Denmark, some of them are going to end up with very high ratings, some of them are going to end up with very low ratings. Not a whole lot that can be done about it.


Very true, I believe a few pages back someone posted an explanation of how "islands" within a ranking system affects this. But please remember, this isn't "THE TRUTH". Bunny has very few matches, the problem is always when someone new enters in a scene with a lot of "new" players (non-ranked players starting on 1000). And yes it can become a problem if a subcommunity only plays each other.
Lies, damned lies and statistics: http://aligulac.com
TheBB
Profile Blog Joined July 2009
Switzerland5133 Posts
February 12 2013 08:07 GMT
#74
On February 12 2013 07:00 metroid composite wrote:
Show nested quote +
On February 07 2013 09:57 TheBB wrote:
On February 07 2013 09:55 Greenei wrote:
hmm the 80%+ winrates seem to be really poorly predicted.

Also this could be because of the underlying model (maybe using normal distribution wasn't the best idea, logistic might be better after all... more on this in a later edition maybe), or because there really aren't that many games with 80%+ skill gap.

+ Show Spoiler +
Logistic is what gets used in Chess ELO rankings specifically because it was found to be a better fit to the data. The only part the two functions handle significantly differently is the tail (so the 80% percentage gap cases).

I mean, specifically looking at the chart you posted:

+ Show Spoiler [Proof] +
[image loading]


Ignore the data noise for a moment and look at the fitted curve. The fitted curve starts dead even with the ideal curve, but slowly diverges (and from 50%-83% the actual game data very closely follows the fitted curve). This is what I'd expect to see if the probability distribution being used was collapsing too tightly.


The problem with the normal distribution is that it's not built for transitive operations.

If Life has an 84% chance of beating Sheth
and Sheth has an 84% chance of beating Artosis
Does this mean that Life has a 98% chance of beating Artosis?

Because this is what Normal distribution predicts when you combine two win percentages.

Two 31% chances combine into a 16% chance.
Two 23% chances combine into a 7% chance.
Two 16% chances combine into a 2% chance.
Two 7% chances combine into a 0.1% chance.

Which...doesn't feel right to me. If we take the above Life/Sheth/Artosis numbers, but make Sheth 50% less likely to win against Life, and Artosis 50% less likely to win against Sheth, suddenly Artosis is 95% less likely to win against Life? The odds against him really increase by a factor of 20, when the odds of the two intermediate matchups only get worse by a factor of 2?

Compare to the logistic curve. When you combine two win percentages to predict a more distant match...

Two 31% chances combine into a 17% chance
Two 23% chances combine into a 9% chance
Two 17% chances combine into a 4% chance
Two 9% chances combine into a 1% chance

Which is to say: if you take existing win percentages, and change them so that Sheth is 1/2 as likely to beat Life, and Artisis is 1/2 as likely to beat Sheth, then that makes Artosis is 1/4 as likely to beat Life (instead of 1/20 as likely).

Just intuitively, this just feels like a more reasonable way to combine percentages. If you told me with absolute certainty "Life beats sheth X% of the time" and "Sheth beats Artosis Y% of the time", and then asked me "What do you expect Life's winrate against Artosis to be?" My guess would be much closer to the logistic distribution than the normal distribution.

Yes, I know all this now. But thanks for putting it into words anyway.

I will try and see what happens. I expect some improvement, too.
http://aligulac.com || Barcraft Switzerland! || Zerg best race. || Stats-poster extraordinaire.
TheBB
Profile Blog Joined July 2009
Switzerland5133 Posts
February 12 2013 08:16 GMT
#75
On February 12 2013 07:23 felisconcolori wrote:
This is insanely useful information, an excellent use of statistics, and I hope to (insert deity or otherworldly influence here) that you can get some academic use out of this project as well. (A paper, an essay, something.)

Thanks . I asked my advisor about whether or not the institute has a policy on publishing reports in topics that are outside the main area of research and he said it was fine as long as I found some statistician to look at it. (None of the people I usually work with know anything about statistics, lol.)
http://aligulac.com || Barcraft Switzerland! || Zerg best race. || Stats-poster extraordinaire.
TheBB
Profile Blog Joined July 2009
Switzerland5133 Posts
Last Edited: 2013-02-15 23:51:38
February 12 2013 19:03 GMT
#76
I've now converted everything to using the logistic distribution. You should see somewhat more conservative predictions now.

Updated prediction analysis:

[image loading]

It didn't help as much in the 80%+ regime as I thought it would. I'm thinking the problem is more related to the sudden arrival of new player pools (koreans in late 2010, kespa in 2012), and I may have to do something about that, such as one or more of:

- parameter smoothing over certain time periods
- use time-dependent parameters

At the moment I'm a bit tired of the mathematical part and I'll go back to working on the website for a few weeks.
http://aligulac.com || Barcraft Switzerland! || Zerg best race. || Stats-poster extraordinaire.
Boucot
Profile Blog Joined October 2011
France15997 Posts
February 14 2013 22:23 GMT
#77
Congrats for the article on ESFI World and all your job in general, well deserved !

http://www.esfiworld.com/interview-with-eivind-fonn-founder-of-starcraft-2-statistics-and-ranking-site-aligulac/
Former SC2 writer for Millenium - twitter.com/Boucot
StarVe
Profile Joined June 2011
Germany13591 Posts
February 14 2013 23:51 GMT
#78
Yeah, that's pretty fucking awesome, good luck with your thesis!
Greenei
Profile Joined November 2011
Germany1754 Posts
February 16 2013 18:55 GMT
#79
aaaah! as i see you have included my proposed feature, sweeeet!
IMBA IMBA IMBA IMBA IMBA IMBA
llIH
Profile Joined June 2011
Norway2143 Posts
February 16 2013 18:56 GMT
#80
Scarlett! wow!
TheBB
Profile Blog Joined July 2009
Switzerland5133 Posts
Last Edited: 2013-02-16 19:10:26
February 16 2013 19:09 GMT
#81
On February 17 2013 03:55 Greenei wrote:
aaaah! as i see you have included my proposed feature, sweeeet!

The live preview? Yeah sure. They should update every six hours now.

I so do love cron.

Also, plugging this:

On February 15 2013 07:23 Boucot wrote:
Congrats for the article on ESFI World and all your job in general, well deserved !

http://www.esfiworld.com/interview-with-eivind-fonn-founder-of-starcraft-2-statistics-and-ranking-site-aligulac/
http://aligulac.com || Barcraft Switzerland! || Zerg best race. || Stats-poster extraordinaire.
Boucot
Profile Blog Joined October 2011
France15997 Posts
February 16 2013 22:58 GMT
#82
I have a question about the way the results are updated on the website. Because I was amazed by how fast the results of Francophone Championship Season 2 were considered this afternoon. Did someone submitted them ?

And a second related question : when someine submit you a result, do you add them manually ?
Former SC2 writer for Millenium - twitter.com/Boucot
TheBB
Profile Blog Joined July 2009
Switzerland5133 Posts
February 16 2013 23:11 GMT
#83
On February 17 2013 07:58 Boucot wrote:
I have a question about the way the results are updated on the website. Because I was amazed by how fast the results of Francophone Championship Season 2 were considered this afternoon. Did someone submitted them ?

And a second related question : when someine submit you a result, do you add them manually ?

I think that was just kiekaboe being really fast. (No guarantees for the future.)

When someone non-admin submits games, we have to review them before they are properly added, yes.
http://aligulac.com || Barcraft Switzerland! || Zerg best race. || Stats-poster extraordinaire.
MCXD
Profile Blog Joined February 2012
Australia2738 Posts
Last Edited: 2013-02-16 23:34:20
February 16 2013 23:27 GMT
#84
Would it be possible for you to implement byes into the single elimination bracket prediction? Like if you write # or BYE or something. The ESF IPL6 seeding tourney going on at the moment, for example, has 7 players with one receiving a first round bye. Lots of other tourneys like the IEMs do it too, with 12-man playoffs.

In the case of 12-man, I assume you would format it like this:

Seed 1
BYE
Seed 8
Seed 9

Seed 4
BYE
Seed 5
Seed 12

Seed 2
BYE
Seed 7
Seed 10

Seed 3
BYE
Seed 6
Seed 11

Thus giving you:

1 vs. (8 vs. 9)
4 vs. (5 vs. 12)
2 vs. (7 vs. 10)
3 vs. (6 vs. 11)

Perhaps implementing BYE as a fake player with a rating of 0 in all match-ups?
TheBB
Profile Blog Joined July 2009
Switzerland5133 Posts
Last Edited: 2013-02-16 23:53:24
February 16 2013 23:35 GMT
#85
On February 17 2013 08:27 MCXD wrote:
Would it be possible for you to implement byes into the single elimination bracket prediction? Like if you write # or BYE or something. The ESF IPL6 seeding tourney going on at the moment, for example, has 7 players with one receiving a first round bye. Lots of other tourneys like the IEMs do it too, with 12-man playoffs.

Good point, that's another one of those features that the backend code already has but I haven't worked it into the frontend. It's on the list.

In the meantime you can use nemuke as a placeholder. He is currently the lowest ranked player (even counting the inactive ones), and so he will have the least impact. Poor guy. :D

There's no restriction on using the same player more than once, either.
http://aligulac.com || Barcraft Switzerland! || Zerg best race. || Stats-poster extraordinaire.
MCXD
Profile Blog Joined February 2012
Australia2738 Posts
Last Edited: 2013-02-16 23:47:19
February 16 2013 23:46 GMT
#86
As far as I can tell, putting in a dummy player then going down to the 'update the results' section and giving them a 2-0 win in their first match works too, it turns out. You can then just crop off the dummy player in the little code block you post on the forums too, for cleanliness.
Normal
Please log in or register to reply.
Live Events Refresh
Next event in 4h 31m
[ Submit Event ]
Live Streams
Refresh
StarCraft 2
mouzHeroMarine 802
BRAT_OK 159
JuggernautJason77
MindelVK 34
StarCraft: Brood War
Britney 17227
Dewaltoss 150
Aegong 45
sas.Sziky 39
soO 37
zelot 16
GoRush 10
IntoTheRainbow 6
yabsab 4
Stormgate
TKL 111
NightEnD7
Dota 2
Gorgc6784
qojqva3111
League of Legends
Dendi1365
JimRising 278
Counter-Strike
fl0m1728
flusha341
sgares205
Foxcn139
Super Smash Bros
Mew2King131
Heroes of the Storm
Liquid`Hasu488
Other Games
summit1g4679
FrodaN2395
ceh9743
elazer223
RotterdaM180
Pyrionflax86
Sick64
Trikslyr61
kaitlyn55
trigger1
Organizations
Other Games
BasetradeTV36
StarCraft 2
angryscii 27
Blizzard YouTube
StarCraft: Brood War
BSLTrovo
sctven
[ Show 18 non-featured ]
StarCraft 2
• LUISG 4
• intothetv
• IndyKCrew
• sooper7s
• Migwel
• AfreecaTV YouTube
• LaughNgamezSOOP
• Kozan
StarCraft: Brood War
• blackmanpl 23
• 80smullet 20
• STPLYoutube
• ZZZeroYoutube
• BSLYoutube
League of Legends
• Jankos2187
• Doublelift1864
• masondota2424
Other Games
• imaqtpie879
• Shiphtur279
Upcoming Events
Replay Cast
4h 31m
RSL Revival
14h 31m
herO vs SHIN
Reynor vs Cure
OSC
17h 31m
WardiTV European League
20h 31m
Scarlett vs Percival
Jumy vs ArT
YoungYakov vs Shameless
uThermal vs Fjant
Nicoract vs goblin
Harstem vs Gerald
FEL
20h 31m
Korean StarCraft League
1d 7h
CranKy Ducklings
1d 14h
RSL Revival
1d 14h
FEL
1d 20h
Sparkling Tuna Cup
2 days
[ Show More ]
RSL Revival
2 days
FEL
2 days
BSL: ProLeague
2 days
Dewalt vs Bonyth
Replay Cast
4 days
Replay Cast
4 days
The PondCast
5 days
Replay Cast
6 days
RSL Revival
6 days
Liquipedia Results

Completed

Proleague 2025-06-28
HSC XXVII
Heroes 10 EU

Ongoing

JPL Season 2
BSL 2v2 Season 3
BSL Season 20
Acropolis #3
KCM Race Survival 2025 Season 2
CSL 17: 2025 SUMMER
Copa Latinoamericana 4
Championship of Russia 2025
RSL Revival: Season 1
Murky Cup #2
BLAST.tv Austin Major 2025
ESL Impact League Season 7
IEM Dallas 2025
PGL Astana 2025
Asian Champions League '25
BLAST Rivals Spring 2025
MESA Nomadic Masters
CCT Season 2 Global Finals
IEM Melbourne 2025
YaLLa Compass Qatar 2025

Upcoming

CSLPRO Last Chance 2025
CSLPRO Chat StarLAN 3
K-Championship
uThermal 2v2 Main Event
SEL Season 2 Championship
FEL Cracov 2025
Esports World Cup 2025
StarSeries Fall 2025
FISSURE Playground #2
BLAST Open Fall 2025
BLAST Open Fall Qual
Esports World Cup 2025
BLAST Bounty Fall 2025
BLAST Bounty Fall Qual
IEM Cologne 2025
FISSURE Playground #1
TLPD

1. ByuN
2. TY
3. Dark
4. Solar
5. Stats
6. Nerchio
7. sOs
8. soO
9. INnoVation
10. Elazer
1. Rain
2. Flash
3. EffOrt
4. Last
5. Bisu
6. Soulkey
7. Mini
8. Sharp
Sidebar Settings...

Advertising | Privacy Policy | Terms Of Use | Contact Us

Original banner artwork: Jim Warren
The contents of this webpage are copyright © 2025 TLnet. All Rights Reserved.