Update: so I made a rating system (and a website)

TheBB

Switzerland5133 Posts

December 10 2012 15:59 GMT

This is a bit of an update to this blog post. I put this here instead since presumably it will interest a few more people.

I built a website to house my ratings, and it's now at least usable. (Don't worry, it's 100% non-profit. Not an ad in sight.)

Front page here: http://aligulac.com/
Current rating list: http://aligulac.com/periods/72/
All rating lists from almost two years back: http://aligulac.com/periods/

Click on a player for cool graphs.

I also have a FAQ. Constructive criticism is welcome, but I might already have answered your question there. Ideas on cool graphs and statistics to produce are definitely desired.

The current period ends on Wednesday, and I will publish a new list after that. Hopefully I can keep publishing new lists every two weeks and keep things relevant.

(I just had the domain lying around, the name doesn't mean anything.)
(Yes, I'm not exactly a web designer. Hopefully it's still tolerable.)

Edit: Looks like my server plan is not up for this, lol.

Edit 2: Hopefully it's a bit faster now.

krutopatkin

Germany2612 Posts

December 10 2012 16:02 GMT

Wow, this is really sick thanks a lot

y0su

Finland7871 Posts

December 10 2012 16:04 GMT

It's not loading... is TL overloading it already?

Can't wait to actually check it out - big fan of the ratings in LRs :D

HaXXspetten

Sweden15718 Posts

December 10 2012 16:05 GMT

Glad you're still going full force on the issue at hand ^^

always good to help innocents lern maths

Mortal

2943 Posts

December 10 2012 16:05 GMT

On December 11 2012 00:59 TheBB wrote:
Click on a player for cool graphs.

How did you know I love cool graphs?

TheBB

Switzerland5133 Posts

December 10 2012 16:05 GMT

On December 11 2012 01:04 y0su wrote:
It's not loading... is TL overloading it already?

Hope not. Maybe I have to upgrade.

HaXXspetten

Sweden15718 Posts

December 10 2012 16:06 GMT

On December 11 2012 01:05 TheBB wrote:

Show nested quote +

Hope not. Maybe I have to upgrade.

it is loading, just kinda slowly

y0su

Finland7871 Posts

December 10 2012 16:07 GMT

On December 11 2012 01:06 HaXXspetten wrote:

Show nested quote +

it is loading, just kinda slowly

Yeah, I'm seeing it now. Really nice (simple/refined) layout and look!

TheBB

Switzerland5133 Posts

December 10 2012 16:07 GMT

On December 11 2012 01:06 HaXXspetten wrote:

Show nested quote +

it is loading, just kinda slowly

Well, it was just fine before I posted it.

NeonFox

2373 Posts

December 10 2012 16:09 GMT

#10

Aaaaand it's dead

Hope it's back up online soon to check it out!

StarVe

Germany13591 Posts

December 10 2012 16:11 GMT

#11

STATISTICS OVERLOAD!

TheBB does it again, where do you go from here? Have you reached your peak in StarCraft statistics? What are your goals for 2013?

Thanks, by the way, probably looking neat, but I can't access it.

Dodgin

Canada39254 Posts

December 10 2012 16:12 GMT

#12

This is like when Tasteless tells everyone to go to thehandsomenerd.com

It's down.

edit: cancel that, it loaded, just very slowly.

Mortal

2943 Posts

December 10 2012 16:13 GMT

#13

On December 11 2012 01:12 Dodgin wrote:
This is like when Tasteless tells everyone to go to thehandsomenerd.com

It's down.

edit: cancel that, it loaded, just very slowly.

Shit, everyone go to thehandsomenerd.com!

Serimek

France2274 Posts

December 10 2012 16:17 GMT

#14

Slow but nice. Good job !

m0ck

4194 Posts

December 10 2012 16:45 GMT

#15

You're doing really interesting things, thanks!

TheBB

Switzerland5133 Posts

December 10 2012 17:24 GMT

#16

Ok, I think it should be a bit faster now.

BoB_KiLLeR

Spain620 Posts

December 10 2012 17:26 GMT

#17

TLPD ELO 2.0

love it

Yello

Germany7411 Posts

December 10 2012 17:30 GMT

#18

Nice. It's still a little bit slow, but not to bad. And it's looking good so far. I like the simple design

Pandemona

Charlie Sheens House51493 Posts

December 10 2012 17:34 GMT

#19

Has it got BW matches in it? Cuz how the hell is Innovation so high?

ToKoreaWithLove

Norway10161 Posts

December 10 2012 17:35 GMT

#20

Loving it! Very nice work!

Faust852

Luxembourg4004 Posts

December 10 2012 17:35 GMT

#21

Why is Stephano 130th, behind players like Krass. He just beat 4 of the best code S players :o

Anyway, nice work, too bad it's so slow.

Butterednuts

United States859 Posts

December 10 2012 17:39 GMT

#22

Having some loading issues, mind if I ask who you're hosting with?

TheBB

Switzerland5133 Posts

December 10 2012 17:39 GMT

#23

On December 11 2012 02:39 Butterednuts wrote:
Having some loading issues, mind if I ask who you're hosting with?

These guys: www.server4you.net

I wouldn't mind some profiling help, actually.

TheBB

Switzerland5133 Posts

December 10 2012 17:41 GMT

#24

On December 11 2012 02:35 Faust852 wrote:
Why is Stephano 130th, behind players like Krass. He just beat 4 of the best code S players :o

The latest list is updated as of November 28th. There is a new one every two weeks. If you're referring to the Korea vs. The World games, they aren't included yet.

Martyrc

217 Posts

December 10 2012 17:43 GMT

#25

LSC2 isn't in it either, I guess? :o Really surprised to see stephano so low, since it's clear he's still probably the best foreigner out there.

EDIT: This site is sick!! Oo, love the stats and all, much better than TLPD, that's for sure :D

speknek

758 Posts

December 10 2012 17:48 GMT

#26

Maybe instead of the OP/UP summary you could post the matchup stats.

with maybe some kind of coefficient that indicates how often weaker players beat stronger players

y0su

Finland7871 Posts

December 10 2012 17:50 GMT

#27

Definitely loads faster now!

m0ck

4194 Posts

December 10 2012 18:09 GMT

#28

Stephano fell from 1st to 130th in four weeks (which includes winning LSC2 & and streaking against the Korean team at IPL 5 and falling out in group-stages in WCS & Dreamhack).

I think your system may need some tinkering, Though I agree that SC2 is indeed quite volatile, I think your system for determining current strength is overly reliant on the most recent results.

*I just saw that it does not include games from IPL 5

00Visor

4337 Posts

December 10 2012 18:17 GMT

#29

I love your approach.

Does the rating for one period regard other periods at all?
Regarding the "Stephano issue". Would you really take Stephanos actual rating (#130) for your new predictions or something different?

Soulriser

United States192 Posts

December 10 2012 18:21 GMT

#30

i enjoy how when i went on here, all of leenock's scores were going down, but his overall ranking was going up. lolol

Toxi78

966 Posts

December 10 2012 18:22 GMT

#31

finally!
congratulations!

Godwrath

Spain10132 Posts

December 10 2012 18:27 GMT

#32

Best vs. Protoss
Bomber (2849)

It's working :D

TheBB

Switzerland5133 Posts

December 10 2012 18:28 GMT

#33

On December 11 2012 03:17 00Visor wrote:
Does the rating for one period regard other periods at all?
Regarding the "Stephano issue". Would you really take Stephanos actual rating (#130) for your new predictions or something different?

1. Yes, the method for computing the rating in one period depends on the rating from the preceding period. There is no forward dependence.

2. I agree that it looks funny that Stephano is so low, but yes, I would use it. I don't think there's a place for making subjective tweaks to individual players.

jalstar

United States8198 Posts

December 10 2012 18:31 GMT

#34

So sick, I've been wanting a combined Korea/international system for a while for my fantasy pinnacle bets.

Nyarly

France1030 Posts

December 10 2012 18:35 GMT

#35

Nice stuff !
I love stats <3

m0ck

4194 Posts

December 10 2012 18:35 GMT

#36

On December 11 2012 03:28 TheBB wrote:

Show nested quote +

You definitely shouldn't be making tweaks to fit individual players. After all, it could be our judgment that is lagging behind

You're using the database from sc2charts/mystarcraft, correct?

bpgbcg

United States74 Posts

December 10 2012 18:38 GMT

#37

This looks awesome

Bookmarked.

Pimpmuckl

Germany528 Posts

December 10 2012 18:39 GMT

#38

Pretty cool man!
Fact is, you just can't make a tournament perfect, like you would need to give ipl5 as one of the most stacked tournaments in history more points than the gsl code s which would be counter intuitive. So i'm fine with your way of doing it <3

TheBB

Switzerland5133 Posts

December 10 2012 18:40 GMT

#39

On December 11 2012 03:35 m0ck wrote:

Show nested quote +

You're using the database from sc2charts/mystarcraft, correct?

Yeah, it's the most complete and up-to-date I could find. Still not perfect though, I had to clean it up a bit and remove some duplicate entries. I'm also a bit concerned because, for example, they still don't have the Proleague games in there.

Kazeyonoma

United States2912 Posts

December 10 2012 18:46 GMT

#40

nice! could this be our new power rank? what's the difference between your ranking the the TLPD's ELO calculations?

Mattidute

Netherlands232 Posts

December 10 2012 18:52 GMT

#41

Protoss sure had a nice streak in the UP rating from May 5th 2011 till June 13rd 2012 on the Periods page.

OrangeNinja21

United States84 Posts

December 10 2012 19:05 GMT

#42

My brain exploded upon opening the webpage, great work good sir

Uncultured

United States1340 Posts

December 10 2012 19:14 GMT

#43

Poor protoss

Hero totally being awesome though. Love this stuff.

Butterednuts

United States859 Posts

December 10 2012 19:14 GMT

#44

On December 11 2012 02:39 TheBB wrote:

Show nested quote +

These guys: www.server4you.net

I wouldn't mind some profiling help, actually.

Having spent time researching providers for my own personal sites, I have never ran across this site before. After reading a few reviews, it looks like they have really poor customer service and poorly managed servers. I highly suggest finding a new provider, these guys don't seem very responsible nor is their pricing all that competitive.

hp.Shell

United States2527 Posts

December 10 2012 19:18 GMT

#45

This is a cool system. I'd love to see a ratings table that includes only the top 25~ players, and only their games against each other.

WillS

61 Posts

December 10 2012 19:31 GMT

#46

I can lend you some help profiling this and can help you with some hosting options. Throw me a PM.

opterown

Australia54784 Posts

December 11 2012 02:31 GMT

#47

http://aligulac.com/periods/55/

my favourite :p

Best overall MarineKing (2891)
Best vs. ProtossMarineKing (3052)
Best vs. Terran MarineKing (2745)
Best vs. Zerg MarineKing (2875)

Glenn313

United States475 Posts

December 11 2012 02:35 GMT

#48

Cool. Using your talents for SC2.

mDuo13

United States307 Posts

December 11 2012 02:47 GMT

#49

I'm impressed.

playa

United States1284 Posts

December 11 2012 02:49 GMT

#50

I don't get Ranged being the most specialized against T, when, according to TLPD, P vs T is his worst mu.

lazyitachi

1043 Posts

December 11 2012 02:53 GMT

#51

Is there a definition of the Best? Most specialised somewhere?
Can't seem to find any such description in FAQ.

Best overall
Best vs. [Race]
Most specialised v Race

Implies most wins? Wins/ losses against higher rated players? Appreciate clarification on this.. Thanks!
(Edit: Hmm.. you mentioned Glicko. So I assume best vs should be "most improved after the new round of match calculation in terms of Glicko rating" Not sure what most specialised means tho....)

Anyway, from the way you are segmenting it (i.e. period by period) the rating is very short term focused i.e. difficult to see a long term rating for a player as it depends on their participation in tourneys to be listed.

Is there anyway to extend to multiple period rating calculation? Would be great for those who wants to manipulate or look up specific time frame (e.g. patch xxx time frame ELO/ WR)

Alryk

United States2718 Posts

December 11 2012 02:54 GMT

#52

This is awesome!

We should bring back the sad zealot

nmetasch

United States600 Posts

December 11 2012 02:57 GMT

#53

Naniwas 136? Sorry, but that just seems way too incorrect to me..

rrwrwx

United States247 Posts

December 11 2012 02:59 GMT

#54

Seems weird that Leenock now has a higher rating than Mvp ever did.

Incanus

Canada695 Posts

December 11 2012 03:06 GMT

#55

Very cool stuff, although IMHO it's a bit too volatile.

ZigguratOfUr

Iraq16955 Posts

December 11 2012 03:19 GMT

#56

Incredible site!!! Now I can get to quantify to my friends what I mean about protoss being the most underpowered race throughout sc2 history (see May 5, 2011 to May 31, 2012 in UP[underpowered] section).

iTzSnypah

United States1738 Posts

December 11 2012 03:24 GMT

#57

On December 11 2012 12:19 BrokenMirage wrote:
Incredible site!!! Now I can get to quantify to my friends what I mean about protoss being the most underpowered race throughout sc2 history (see May 5, 2011 to May 31, 2012 in UP[underpowered] section).

UP stands for underperforming, not underpowered.

Shellshock

United States97276 Posts

December 11 2012 03:26 GMT

#58

We missed your stats in Code A yesterday

Will they be making a return tonight?

zerious

Canada3803 Posts

December 11 2012 03:28 GMT

#59

On December 11 2012 11:59 rrwrwx wrote:
Seems weird that Leenock now has a higher rating than Mvp ever did.

Mvp plays mostly GSL

rrwrwx

United States247 Posts

December 11 2012 03:34 GMT

#60

On December 11 2012 12:28 zerious wrote:

Show nested quote +

Mvp plays mostly GSL

In a 1 year period, Leenock got 2 MLG wins, 1 IPL win, and 1 GSL and 1 MLG second place
In a 1 year period, Mvp got 3 GSL wins, a GSL second place, and won an MLG, Blizzcon, and WCG.

Even if Mvp plays mostly GSL, he still achieved more results than Leenock in an equivalent amount of time.

opterown

Australia54784 Posts

December 11 2012 03:38 GMT

#61

On December 11 2012 12:34 rrwrwx wrote:

Show nested quote +

these ratings are mostly over small week periods, and leenock's had an intense leenockvember for sure.

Daogin

Canada2308 Posts

December 11 2012 03:38 GMT

#62

Awesome Contribution BB always cool seeing your stats in the LR's and how accurate they are sometimes :O

mucker

United States1120 Posts

December 11 2012 03:47 GMT

#63

On December 11 2012 03:40 TheBB wrote:

Show nested quote +

That's really a shame there isn't a better source for you, Proleague is essential. Good luck finding additional sources, maybe you can find a way to work with tlpd?

I'd love to see you get graphs like financial websites have for stocks, all slick with the zooms and compares and whatnot.
I could procrastinate like crazy on a site with that.

GenoPsydE

Canada88 Posts

December 11 2012 03:51 GMT

#64

Very well done.

I hope you keep on top of this and develop it even further (as your time and potential volunteers allow).

clever_us

United States329 Posts

December 11 2012 04:24 GMT

#65

Best vs. Zerg: HyuN

Not after the last couple of days xD

furggy

United States3 Posts

December 11 2012 04:36 GMT

#66

the only problem is that Flash is not rated over 9000.

Everything else works great.

Blisse

Canada3710 Posts

December 11 2012 04:36 GMT

#67

Why Python 2.5.2, can I ask?

Cool site! I actually think the theme of your site fits what you're doing very well, so yeah. Good job!

crbox

Canada1180 Posts

December 11 2012 04:43 GMT

#68

Sick, I hope this gets spotlighted.

Supercilium

Sweden308 Posts

December 11 2012 04:52 GMT

#69

Really nice, good job!

Just a thought; since the ratings fluctuate quite much, is it possible to have an average rating for each player? For example the average rating for the past 3 or 6 months?

furggy

United States3 Posts

December 11 2012 04:54 GMT

#70

one suggestion is to use Google Visualization API to get the load off your poor server for graphing and table rendering.

examples:
https://developers.google.com/chart/interactive/docs/gallery

the advantage is that you put all the visualization on the client side, meaning you don't have the render PNG charts or send large HTML/XML data tables over the wire (JSON is much faster, especially when its compressed)

StarStruck

25339 Posts

December 11 2012 05:00 GMT

#71

Very nice; excellent work.

On second glance I really don't understand how JD is so high.

WombaT

Northern Ireland26096 Posts

December 11 2012 05:07 GMT

#72

On December 11 2012 11:49 playa wrote:
I don't get Ranged being the most specialized against T, when, according to TLPD, P vs T is his worst mu.

TLPD is really behind I think with their data. The other possibility is that he's on a real hot streak in vT, or that he's doing ok invT and terrible in the other matchups perhaps?

DOUDOU

Wales2940 Posts

December 11 2012 05:10 GMT

#73

the cursed trend of stephano is scary, he should hit gold league by the end of the year

RenSC2

United States1072 Posts

December 11 2012 05:24 GMT

#74

On December 11 2012 00:59 TheBB wrote:
(I just had the domain lying around, the name doesn't mean anything.)

If you switch the "I" and the "U" from the domain name around, it's Caligula spelled backwards.

Rabiator

Germany3948 Posts

December 11 2012 05:37 GMT

#75

At first glance I thought this was awesome, but then I noticed the ridiculously low timeframe of just a few weeks. That timeframe doesnt really mean anything and ups and downs only show the daily form of the player or even some experimenting with new builds.

If you would use longer timeframes it would improve the actual precision of the data, since this much fluctuation is not really useful for predicting a trend.

Canucklehead

Canada5074 Posts

December 11 2012 06:05 GMT

#76

Do you weigh games differently?

No, I don't. Korean tournaments and players receive no special treatment. The GSL is difficult because good players play there; the players aren't good because they play in the GSL.

I can't agree with this and is a flaw in the system. I'm not sure how to fix it because it would be complicated, but not taking opponent's skill into consideration is a flaw. Giving the same weight for a win if someone beats say an MVP and if they beat a non or semi pro in an MLG open bracket is wrong.

Your reasoning of " The GSL is difficult because good players play there; the players aren't good because they play in the GSL" is also flawed because yes the players in GSL are good because they play in GSL because it's damn hard to qualify for it and to remain in it and not drop to code b.

opterown

Australia54784 Posts

December 11 2012 06:09 GMT

#77

On December 11 2012 15:05 Canucklehead wrote:

Show nested quote +

nono i think you've got it wrong;
what he means is that if you beat MVP in GSL finals, it's the same as beating MVP in some random online tournament.

TheBB

Switzerland5133 Posts

December 11 2012 08:27 GMT

#78

On December 11 2012 11:53 lazyitachi wrote:
Is there a definition of the Best? Most specialised somewhere?

Best: highest rating.
Most specialised: largest number of standard deviations between general rating and matchup rating.

The tendency is that new players with few games have wild matchup ratings, while the more established players have pretty even ones. That's why they come off as "most specialised." I tried to fix this with the standard deviation weighing, but it didn't work satisfactorily.

On December 11 2012 12:26 Shellshock1122 wrote:
We missed your stats in Code A yesterday

Will they be making a return tonight?

Yeah, those have fallen wayside a bit. I'm sure I'll be there for the evening session.

On December 11 2012 13:36 Blisse wrote:
Why Python 2.5.2, can I ask?

Hahaha... I had this virtual server and domain laying around, and I was halfway through the development before I realised it's on Debian Lenny, and so I'm condemned to use a bit outdated software. Thankfully all the libraries I needed were still available. Others have suggested me to use a different host, so I don't think I will bother trying to update this one.

On December 11 2012 14:24 RenSC2 wrote:

Show nested quote +

If you switch the "I" and the "U" from the domain name around, it's Caligula spelled backwards.

If you switch the C to the other end, it's Caligula spelled forwards....

On December 11 2012 15:05 Canucklehead wrote:

Show nested quote +

Like opterown said, I do take the opponent's skill into consideration. I just give the same consideration to wins over Mvp in the GSL as I do to wins over Mvp at your grandma's dinner party.

Cyanure

France51 Posts

December 11 2012 08:55 GMT

#79

I find your rating system underwhelming, if not inconsistant with its purpose:

I'll assume that you created this site and rating in order to have a base for your prediction tool (which is great by the way).
The whole sense of predicting player performance is based on the fact that you think that skill is robust, namely that your win from yesterday means (somehow) that you're more likely to win tomorrow.
On the other hand, if you chose a volatile rating system (like you did), it means, on the contrary that the skill is volatile.
If you think that the game is too volatile, you just cannot make prediction.

I feel like it's better to have a maybe less accurate but more robust rating, so that the predictions would not be overly different from one period to the next because of a bad week.

EDIT : maybe I was too negative. I really like what you did and I will probably check both your proba to win and your site for rating (since TLPD is not at all up to date). I just don't think you chose a good rating system.

y0su

Finland7871 Posts

December 11 2012 17:16 GMT

#80

I saw a featured streamer I'm not very familiar with... So I decided to check how the player ranked here. It was very cool to see their vs P/T/Z rank etc, however it would be really cool to see an additional

Rating vT XXXX (#yyy, #YY [race])

where you would get an additional rating for how high a player ranks within their own race. Maybe it would be simpler/cleaner to just allow a sort by each rating.

movitz

Sweden15 Posts

December 12 2012 20:49 GMT

#81

Its a beautiful homepage. Looks neat and professional without the ever-present, dull character/artwork testosterone stuff.
The data and stats are really awesome!
Good work!! I have it bookmarked.

KristofferAG

Norway25712 Posts

December 12 2012 20:52 GMT

#82

Dude, the website is amazing. Definitely going to be my go-to-page instead of ELO now.

Chig

Lithuania6 Posts

December 12 2012 20:58 GMT

#83

Awesome. I prefer this now then the others. Simple but enough to be the best out there.

Boucot

France15997 Posts

December 12 2012 21:05 GMT

#84

I love your work on TL threads when you put some stats for betting on a player and this website must have been very difficult to build but how a guy that nobody knows like MilkEA (#101) can be higher placed than players like IdrA, Stephano, NaNiwa, TheStC... I get the period reason but even with it, that makes no sense.

I hope you can find some adjustments for your rankings so we could have an undeniable ranking finally, what everybody wants since a very long time.

GL to you, you do very well !

ECHOZs

United States499 Posts

December 12 2012 21:20 GMT

#85

This is a great site, nice work. I do have a suggestion and that is being able to compare players side by side. For instance if there's a match between Parting vs Life it would great to be able to see their graphs and results side by side instead of having to open two tabs.

Apolo

Portugal1259 Posts

December 12 2012 21:23 GMT

#86

Really nice work :O added to favorites ! I just don't understand how come Mvp doesn't even make it to top 10, and someone like INnoVation who i've never heard of is ranked no. 4 O_o

LockeTazeline

2390 Posts

December 12 2012 22:02 GMT

#87

You deserve a medal.

KillerDucky

United States498 Posts

December 12 2012 22:28 GMT

#88

Can you add an option to filter the list to show only non-Koreans? Useful for foreign events etc.
Can you see the game results that this is based on?
It the source code available?

Chronald

United States619 Posts

December 12 2012 22:32 GMT

#89

Another proof that Protoss has always been the worst race.

T_T

revel8

United Kingdom3022 Posts

December 12 2012 22:50 GMT

#90

Hey, Great work BB. Always love stuff like this.

tinytin

Germany4 Posts

December 12 2012 22:58 GMT

#91

Amazing work. Tought about doing something similiar few days ago, realized it's too much work for me.
Some suggestions:

1) Like already mention: List the opponents in a period.
2) While you`re at it: Add the expected winrate against each opponent.
3) Performance in a period would be nice.
4) I would love to see a seperate online/offline rating besides a combined (and perhaps weighted) overall-rating.

I don't really like that optimize your model for prediction power. In order to predict the outcome of a game, I would add more factors (online/offline; tournament structure; how far are we in the tournament?; Travel?, ...). Rating (which should measure skill) is just one factor (albeit the most important) for predicting the outcome of a game.

BluePanther

United States2776 Posts

December 12 2012 23:03 GMT

#92

Neat tool, but I'm not sure I can take any rating system seriously that has Stephano so low... He's obviously a top 40 player. Is there something I'm missing?

alQahira

United States511 Posts

December 12 2012 23:04 GMT

#93

Love the site. Would like to see the highest rating a player has achieved on their individual pages, then you can compare their current performance to their peak easily. Also would be curious about having the all-time highest rating listed on the front page, so we can see how the current leaders compare to the all-time best.

arcane1129

United States271 Posts

December 12 2012 23:14 GMT

#94

When it says 2 week rating periods, what exactly does that entail? I imagine it can't be just games played within those 2 weeks, so what sort of emphasis do you have on older games vs newer ones?

Swwww

Switzerland812 Posts

December 12 2012 23:32 GMT

#95

Looks awesome! Keep up the good work.

.kv

United States2332 Posts

December 12 2012 23:41 GMT

#96

Great and all but Stephano so low in ratings
only 130th and you have Massan at 70 instead

Greenei

Germany1754 Posts

December 12 2012 23:41 GMT

#97

On December 13 2012 06:23 Apolo wrote:
Really nice work :O added to favorites ! I just don't understand how come Mvp doesn't even make it to top 10, and someone like INnoVation who i've never heard of is ranked no. 4 O_o

you are talking bogus, bro.

also read the faq, the rating system is really volatile and mvp hasn't done so well last season.

way too volatile imo. stphano at 130? yeah sure...

HolydaKing

21254 Posts

December 13 2012 00:41 GMT

#98

Nice work!

Also Stephano is indeed a little bit too low, even though i don't particularly like him. You seem to have altered the glicko rating a lot, as at sc2charts.net he's #13.

Too_MuchZerg

Finland2818 Posts

December 13 2012 00:56 GMT

#99

So BB what is last seasons GSL code S score average (from 32 players before it started) and perhaps projection how next seasons average looks like.

Daumen

Germany1073 Posts

December 13 2012 01:02 GMT

#100

I love statistics ;>

rift

1819 Posts

December 13 2012 01:36 GMT

#101

Have you considered separate ratings for Korean leagues and international tournaments?

vesicular

United States1310 Posts

December 13 2012 02:40 GMT

#102

On December 13 2012 08:03 BluePanther wrote:
Neat tool, but I'm not sure I can take any rating system seriously that has Stephano so low... He's obviously a top 40 player. Is there something I'm missing?

Yes, you're placing your own opinions higher than the actual data. Stephano has lost 60% of his games in the last month. He's on a downtrend, and that's what the data is showing. The numbers don't lie.

vesicular

United States1310 Posts

December 13 2012 02:45 GMT

#103

As someone with a mathematics degree, I love the site, awesome job. The only thing I'd like to see is Ratings Peak on the player page. Otherwise, very informative, and I like the period size, good call on that.

Gihi

384 Posts

December 13 2012 03:19 GMT

#104

I discovered a bug.
Mvp is not on #1

SpaceFighting

New Zealand690 Posts

December 13 2012 03:54 GMT

#105

On December 13 2012 12:19 Gihi wrote:
I discovered a bug.
Mvp is not on #1

LOL, all joking aside ive bookmarked the website and will use it to keep tabs on the players xD i really love it. thanks

TheBB

Switzerland5133 Posts

December 13 2012 14:02 GMT

#106

Whoa, just realised there have been more posts here.

Thanks for all the feedback and nice words

. What I think I will prioritise next is:

– Overview over games. (So that it's transparent how a player has gotten the rating he or she has.)
– Prediction tool for best-of-N matches (nothing more fancy than that to begin with.)

I hear what you're all saying about the volatility, and I guess maybe I went a bit overboard. This system is optimised for predictiability, and one that is less volatile will tend to suffer more upsets than it should, but it seems what people really want is a system that is a little more in line with how we judge performances over time. Which is fair. So here is what I could do:

(a) Lower the volatility in the future. This will stabilise things a bit, but will negatively impact predictive power.
(b) Lower the volatility and recompute all ratings from the start. It's difficult to say what the latest rating list will look like in this case.
(c) Publish two concurrent ratings, one volatile for predictions and one less volatile for rankings. I'm afraid this might confuse people.
(d) ... any other ideas?

Targe

United Kingdom14103 Posts

December 13 2012 14:13 GMT

#107

Innovation is Bogus.

LockeTazeline

2390 Posts

December 13 2012 15:14 GMT

#108

On December 13 2012 23:02 TheBB wrote:
Whoa, just realised there have been more posts here.

Thanks for all the feedback and nice words

Greedy me loves c.

WombaT

Northern Ireland26096 Posts

December 14 2012 03:33 GMT

#109

On December 13 2012 23:02 TheBB wrote:
Whoa, just realised there have been more posts here.

Thanks for all the feedback and nice words

Those are great ways to streamline, I think.

Perhaps you could add some options to filter the data. Your periods are rather small in terms of how you're actually presenting the data you have, perhaps you could make it adjustable, or give a few options.

Say you'd have:
A: The current short periods
B: A monthly tracker
C: The last 6 months

Or any variation thereof I suppose. Again though, perhaps that clutters things up too much/makes it a lot more difficult to design the website.

At the guy who said to separate Korean tournaments and foreign ones. I actually like that the BB isn't doing that thus far. Considering many of the high end tournaments are almost exclusively Korean nowadays the TLPD Korean/International distinction almost seems updated in terms of getting a good sense of where people are at in certain matchups.

PVJ

Hungary5221 Posts

December 14 2012 11:37 GMT

#110

great stuff, bookmarked.

pAzand

Sweden539 Posts

December 14 2012 12:19 GMT

#111

Brilliant work, thank you!

Gladiator6

Sweden7024 Posts

December 14 2012 12:38 GMT

#112

I like this a lot! Suprising to see RangeD as XvT specialist though, haven't even seen any games with him.

Conti

Germany2516 Posts

December 14 2012 12:44 GMT

#113

On December 13 2012 23:02 TheBB wrote:
Whoa, just realised there have been more posts here.

Thanks for all the feedback and nice words

I would be really curious about c. Not as a permanent solution, but just to see how the rankings differ. My guess is that, no matter the ranking, there will be people complaining because some long-time player is not high enough, or some up and coming player is ranked too low, etc., and I kinda doubt a different ranking will change that. Still, I'd be curious to see the difference.

MarcoBrei

Brazil66 Posts

December 14 2012 15:54 GMT

#114

nice work, but sc2proranks.com is better.

KumihO.

United States55 Posts

December 14 2012 15:56 GMT

#115

Wow. Now I have some stuff to explore. Thank you so much :D

frogrubdown

1266 Posts

December 14 2012 16:36 GMT

#116

On December 15 2012 00:54 MarcoBrei wrote:
nice work, but sc2proranks.com is better.

I would have expected some form of justification for this claim, but having checked out the site I can see why you didn't offer any.

Simply put, they don't have any serious justification for the arbitrary values they assign in their rankings process. The process also contains obvious flaws, such as treating all placements of a given sort equally rather than assessing runs by the actual opponents beaten along the way.

Why should anyone trust them over a method that aims for and tests itself against accurate predictions?

TheBB

Switzerland5133 Posts

December 14 2012 20:44 GMT

#117

I added some results lists, so that people can see where the numbers come from, more or less.

If you now open a player page you will see, immediately below the graph, a list of games that have been added and which are scheduled for inclusion in the next period.

In the table for historical data, you will also see a "details" link. If you click it, you can see some information about the rating calculation for that player for that period. It shows which games were included, the average rating of the opposition, and the expected score for the player given the opposition. You should see that the rating adjustments correlate with how much the player over- or underperformed. Note that the correlation isn't necessarily exact, since there are a few other factors that come into play (see the FAQ for more details on those.)

I also want to thank Conti, Grovbolle and KristofferAG for aiding me with populating the database with results.

On December 15 2012 00:54 MarcoBrei wrote:
nice work, but sc2proranks.com is better.

sc2proranks.com appear to be using a totally different idea. Of course while you are free to enjoy whatever system you desire, I don't particularly care for their method. Aligulac is more "results-oriented" (a player is rated above another if they can be expected to win vs. them right now) versus theirs which I would call "impact-oriented" (a player is rated above another if they were recently more in the spotlight), and like frogrubdown said, it's not at all clear how they come up with the values they use.

Oh, and aligulac.com is totally better looking.

</shameless plug>

sitromit

7051 Posts

December 14 2012 22:44 GMT

#118

On December 15 2012 00:54 MarcoBrei wrote:
nice work, but sc2proranks.com is better.

That's an awful ranking system using prize pools of tournaments as a metric, rather than which opponents you're facing.

ACrow

Germany6583 Posts

December 14 2012 23:09 GMT

#119

This looks like a superb site! I've always liked the statistics you post in LR threads (they tend to be quite accurate as well), so with the new more up to date data base it'll be even better now. Good job!!

One thing would be nice: to be able to sort for the three vs race win rates, so you could quickly view the best vs Z players for example.

WombaT

Northern Ireland26096 Posts

December 15 2012 01:12 GMT

#120

On December 15 2012 05:44 TheBB wrote:
I added some results lists, so that people can see where the numbers come from, more or less.

Show nested quote +

</shameless plug>

So true!

I'd love to help out in some fashion, but am not the most mathematically inclined of folks. Hope feedback/ideas were helpful man, really think this project has some potential and the site looks damn sexy.

birchman

Sweden393 Posts

December 15 2012 01:19 GMT

#121

Love it, certainly interesting to go through the lists.

And I agree, the site is graphically stellar!

MarcoBrei

Brazil66 Posts

December 15 2012 14:59 GMT

#122

On December 15 2012 01:36 frogrubdown wrote:

Show nested quote +

Arbitrary values? Can you say what are you talking about? Did you actually read the FAQ?
Tournaments are weighted based on prize pool, which is related to tournament relevance.
Many people seems to prefer ratings based on what opponent some player has beaten, I simply can't understand why this is so popular, because it makes no sense if you think for more than 5 minutes. You can just look at the "unnoficial world champion" thread, which takes this concept to the limit. The so called unnoficial world champion is a joke.
You said about predictions. If a rating system is designed mainly to make predictions, I can assure it will fail miserably.

Finally, let's take a look to the top 5 players of each site:

Aligulac:
Sniper
PartinG
Leenock
Life
Effort

Sc2proranks:
Hero
PartinG
Rain
Taeja
Leenock

Hero, top 8 in GSL, top 8 in BWC, champion of the last DreamHack and champion of the last NASL, is not even in top5 players of Aligulac. And Effort appears in top 5. Effort! Really?

Aligulac is a very good site, but his concept (as well other rating systems) seems to result in a weird list of top players.

frogrubdown

1266 Posts

December 15 2012 15:31 GMT

#123

This post contains an impossible number of confusions. It will take a lot more than me to fix all that's wrong with your thinking.

On December 15 2012 23:59 MarcoBrei wrote:

Show nested quote +

Arbitrary values? Can you say what are you talking about? Did you actually read the FAQ?
Tournaments are weighted based on prize pool, which is related to tournament relevance.

Yes I read the FAQ, which I linked to to illustrate how arbitrary their values are. For instance:

How the points of one tournament are distributed to players? Is it based on the prize gained of each player?

Actually no. Only the overall prize pool is used to determine the relevance of the tournament. The player position in one tournament will give points in this way: 1st: 100%; 2nd: 70%; 3rd and 4th: 45%, 5th to 8th: 25%; 9th to 16th: 10%

Want to guess where these numbers come from? If your answer wasn't 'Their ass', then you guessed wrongly.

The mere fact that the values have a vague tie to something correlated with player ability, positions in prestigious tournaments, does nothing to make the values assigned to said positions non-arbitrary. There is no justification for any numerical value that is used anywhere in the system, not even the cut-off of only giving any credit whatsoever to at least ro16.

Many people seems to prefer ratings based on what opponent some player has beaten, I simply can't understand why this is so popular, because it makes no sense if you think for more than 5 minutes.

People like ratings based who you've beaten because being good at starcraft is having a disposition or skill that one expects to normally and in the long run result in wins. Beating the best player in the world to get to the round of 32 is extremely strong evidence that you have such a skill. But, if you get eliminated in the next round, you will get zero credit for this on your favored system. Beating some of the worst players in the world to get to the round of 16, on the other hand does not provide nearly as much evidence of skill.

Every run to a given position in a tournament involves a different difficulty level depending on whom you faced. There is no reason not to tie a player ranking directly to the actual players they have faced.

You can just look at the "unnoficial world champion" thread, which takes this concept to the limit. The so called unnoficial world champion is a joke.

What!?!? That isn't this concept taken to the extreme, because it considers an absurdly small subset of the total gameplay evidence for who is the best player. The concept taken to the extreme is something like how the rankings on Aligulac work, because they take account of all the evidence from wins and losses, not just those against a single player selected for no particularly good reason.

You said about predictions. If a rating system is designed mainly to make predictions, I can assure it will fail miserably.

Once again, you provide no reasons for this claim because you have none. Successful predictions are how you test statistical models like these. That's how it works in baseball stats, poll-based election prediction models (which were monumentally successful this past cycle), everywhere. If your non-sense claim were correct, then it would undermine a lot more than this sc2 ranking system. And yet somehow, with no specific reason given, you think you can a priori rule out the success of prediction based models.

Finally, let's take a look to the top 5 players of each site:

Aligulac:
Sniper
PartinG
Leenock
Life
Effort

Sc2proranks:
Hero
PartinG
Rain
Taeja
Leenock

Hero, top 8 in GSL, top 8 in BWC, champion of the last DreamHack and champion of the last NASL, is not even in top5 players of Aligulac. And Effort appears in top 5. Effort! Really?

Aligulac is a very good site, but his concept (as well other rating systems) seems to result in a weird list of top players.

You once again demonstrate an amazingly powerful a priori insight that your fellows lack. Tell me, if you can be so confident about which players are actually the best currently without consulting the best available evidence, then why do you need any model to make predictions?

This is not to say that one should have a credence of 1 that the aligulac rankings are correct, especially given sc2's volatility. But the idea of evaluating solely against the perceived best players is absurd. People get perceived as the best (among other noisy factors) by winning high-profile events, regardless of whether their runs to get there provide the best evidence of their goodness. We should expect our perceptions of who is the best to be flawed to the extent that it relies directly on such unreliable evidence.

sibs

635 Posts

December 15 2012 15:41 GMT

#124

@MarcoBrei:

Hero is high because he goes to high-paying but relatively easy tournaments, he does this because he has the option. See how well he fares on the Korean TLPD (hint: Not that well).

00Visor

4337 Posts

December 15 2012 15:55 GMT

#125

On December 15 2012 23:59 MarcoBrei wrote:
Hero, top 8 in GSL, top 8 in BWC, champion of the last DreamHack and champion of the last NASL, is not even in top5 players of Aligulac. And Effort appears in top 5. Effort! Really?

Aligulac is a very good site, but his concept (as well other rating systems) seems to result in a weird list of top players.

Not your achievements are relevant, but WHO you win against and how often you lose. Hero plays tons of tournaments with some disappoint results (going 1-3 at MLG and so on) and the foreign tournaments don't have the toughest competition. HerO beat 1 korean at BWC, and 2 koreans (ForGG not really that notable at the moment) at Dreamhack. Its pretty logical he is not in the Top5.

Efforts results seem to be greatly influented by the MLGvsProleague tournament.

TheBB

Switzerland5133 Posts

December 15 2012 15:58 GMT

#126

Yeah, like I said, that system rewards namepower and -recognition. You get recognised if you win a lot of titles, so a ranking of titles (which is what it is, essentially) will obviously correlate well with the public view of who the top players are. And that's fine, really, just keep in mind what you're looking at. Sc2ProRanks is a ranking of who actually won tournaments lately and Aligulac is an attempt to rate who would win in a hypothetical game between player X and Y right now. If you tried to use the former to predict game outcomes I daresay it would fail quite spectacularly.

Because of this they aren't really comparable, in my opinion. It was never my intention to design a system that confirmed what we already know. I don't find that very interesting, and I don't find their system very interesting, either.

I claim (and I haven't even seen his games) that Effort is probably better than many is giving him credit for.

ES_JohnClark

United States1121 Posts

December 15 2012 16:00 GMT

#127

I have not completely dived into the rankings system yet.. but one thing that sticks out for me and bothers me.. is that you have decided that tournaments are to be weighted for relevence based on prizing. This is a hugely subjective variable that often times can have very little to do with actual skill levels that attend a specific event. A better idea would have been to set up a standard, based mostly on numerical values to determine the 'tier' of each event. Once a baseline structure for this has been established, then each tournament can be placed in a 'Tier' level based on the participants that attend.

Also, and I have touched on this in the past when talking about rankings, but any rankings system that uses a raw value to determine overall points is extremely flawed for eSports. Meaning, that if a player attends 10 events and earns points at each, regardless of how they finish, they will most likely be ranked higher then a player that attends only 5 events and yet has much better relative finishes.

As soon as I can get the time needed, I will release full details of my Global Points System that works to alleviate both of the issues that I have addressed.

Other then those 2 problems, the site looks good and any rankings at this point are better then none (in most cases).

Good stuff and I am know it must have taken you a good amount of time.

Hider

Denmark9407 Posts

December 15 2012 16:13 GMT

#128

On December 15 2012 23:59 MarcoBrei wrote:

Show nested quote +

Wow you really fall short on understanding statistics.

Prediction is the best way to determine one's true skills. The best player today is the player who has the highest probability of winning tomorrow. That's it.

Tournament price pools are still arbitrary and can be very misleading.

Nublakhan

United States12 Posts

December 15 2012 21:11 GMT

#129

You mention that you are borrowing heavily from Glicko; I'm assuming from Glicko-2?

Why not show the player's volatility rating/range in addition to their score? I see players start with a 1000 rating, are they starting with an RD of 350? You mention assigning and using category modifiers, how do you determine the player's category?

Apologies if this was explained in your write-up; I read it twice over trying to reference between what I knew about Glicko(-1). I'm a math nerd at heart, but not in education.

opterown

Australia54784 Posts

December 15 2012 21:28 GMT

#130

On December 16 2012 01:00 csn_JohnClark wrote:
I have not completely dived into the rankings system yet.. but one thing that sticks out for me and bothers me.. is that you have decided that tournaments are to be weighted for relevence based on prizing.

No he doesn't?

bduddy

United States1326 Posts

December 15 2012 21:39 GMT

#131

On December 16 2012 06:28 opterown wrote:

Show nested quote +

No he doesn't?

I believe he was referring to the other ratings site that was being argued about before, sc2proranks.com.

TheBB

Switzerland5133 Posts

December 15 2012 21:44 GMT

#132

New feature: predict matches. (Will expand with fancy graphics when I get time.)

On December 16 2012 06:11 Nublakhan wrote:
You mention that you are borrowing heavily from Glicko; I'm assuming from Glicko-2?

Why not show the player's volatility rating/range in addition to their score? I see players start with a 1000 rating, are they starting with an RD of 350? You mention assigning and using category modifiers, how do you determine the player's category?

Apologies if this was explained in your write-up; I read it twice over trying to reference between what I knew about Glicko(-1). I'm a math nerd at heart, but not in education.

No, Glicko-1. That was what I tried first and I got it working, so I didn't try anything more fancy.

I don't show the rating deviation because for almost everyone it's at the floor. (I had to use a pretty high floor to keep up with rapid changes. That's why the ratings are so volatile, and that's why it's so easy for players to keep their RD at the minimal allowed level.) I just didn't think it'd be interesting information.

Actually players start with a rating of 0 and a deviation of 0.5. The ratings you see on the site are scaled by adding 1 and multiplying by 1000, since this creates a scale that people are familiar with. The top players are usually around 1.5 in the internal scale. This corresponds to starting RD of 500. I use a RD floor of 0.13 (=130). I am debating lowering it to 0.1 in the future, and increasing the decay a bit. Presumably the scene has "settled" enough now to allow something like this.

The categories are simply races.

Conti

Germany2516 Posts

December 16 2012 18:45 GMT

#133

On December 16 2012 06:44 TheBB wrote:
New feature: predict matches. (Will expand with fancy graphics when I get time.)

Great job!

New feature request: Do the same just for groups: Let people enter a few names (4, usually), choose the format from a dropdown menu (round-robin, GSL, etc.), and have it calculate the group results.

Of course, the next step would be to calculate entire tournaments!

JohnAdams

United States111 Posts

December 17 2012 02:38 GMT

#134

This looks interesting, but it seems very volatile. Sniper wins the GSL so his rating catapults from 2513 to 2954, but then he loses a BO1 to Gumiho, who is a measly 18th, and his rating plummets to 2068. Am I missing something here?

Xapti

Canada2473 Posts

December 17 2012 02:50 GMT

#135

On December 11 2012 00:59 TheBB wrote:
(Yes, I'm not exactly a web designer. Hopefully it's still tolerable.)

If you did the web design, I'd say that's pretty good. I like the visual style. Even code-wise it's better than teamliquid.net's (main page) mess— although that's not saying much.

If you're looking for feedback though:
- try to use em instead of pixel for sizes (like widths) of containers containing text though, so that regardless of the font and text size used, it will scale properly for the user.
- I personally think making the whole row a link is a no no. Maybe it's just me, but I find it to be really annoying.
- There's no reason [that I can think of] not to use HTML table elements (table, tr, td, etc) to display your data; that's what tables are for.

Non html/css related:
- a sort function would be nice
- so would race-specific stats, or other stats in general

Personally I'm not into following this sort of thing at all, but I'm sure many others appreciate the effort you're putting in.

FakeDouble

Australia676 Posts

December 17 2012 02:59 GMT

#136

Can someone enlighten me as to what 'aligulac' means? I don't get it...

Otherwise, very cool. I love cool graphs!

TheBB

Switzerland5133 Posts

December 17 2012 08:01 GMT

#137

On December 17 2012 11:38 JohnAdams wrote:
This looks interesting, but it seems very volatile. Sniper wins the GSL so his rating catapults from 2513 to 2954, but then he loses a BO1 to Gumiho, who is a measly 18th, and his rating plummets to 2068. Am I missing something here?

I think you are. Sniper has never been 2954? He's one of those whose rise has been very steady.

http://aligulac.com/players/9/

On December 17 2012 11:59 FakeDouble wrote:
Can someone enlighten me as to what 'aligulac' means? I don't get it...

Nothing... It's a word I came up with as a kid. Don't quite remember how.

On December 17 2012 11:50 Xapti wrote:

Show nested quote +

- try to use em instead of pixel for sizes (like widths) of containers containing text though, so that regardless of the font and text size used, it will scale properly for the user.
- I personally think making the whole row a link is a no no. Maybe it's just me, but I find it to be really annoying.
- There's no reason [that I can think of] not to use HTML table elements (table, tr, td, etc) to display your data; that's what tables are for.

Non html/css related:
- a sort function would be nice
- so would race-specific stats, or other stats in general

1. Ok!
2. Fair enough.
3. Well, you can't make the whole row a link with an HTML table.

4, 5. Yeah, it's "in the pipeline" so to speak. I try to dedicate an hour or two each day but I can't always do that.

JohnAdams

United States111 Posts

December 17 2012 21:47 GMT

#138

TheBB : http://aligulac.com/players/9/period/73/

Conti

Germany2516 Posts

December 17 2012 21:53 GMT

#139

On December 18 2012 06:47 JohnAdams wrote:
TheBB : http://aligulac.com/players/9/period/73/

I think the ratings displayed here are the ratings of the opponents.

Emptyness

Bulgaria1016 Posts

December 17 2012 21:59 GMT

#140

Great job

, thank you very much for doing this

TheBB

Switzerland5133 Posts

December 17 2012 22:14 GMT

#141

On December 18 2012 06:47 JohnAdams wrote:
TheBB : http://aligulac.com/players/9/period/73/

Yeah, like Conti said, the numbers listed next to each opponent is the rating of the opponent (in the given matchup) at the time the match was played.

Edit: The traffic stats skyrocket when this thread is bumped at a US-friendly time, lol.

opterown

Australia54784 Posts

December 17 2012 23:25 GMT

#142

ok hmm after looking at recent results i think you may have them a bit too volatile, haha

TheBB

Switzerland5133 Posts

December 18 2012 12:25 GMT

#143

On December 18 2012 08:25 opterown wrote:
ok hmm after looking at recent results i think you may have them a bit too volatile, haha

Well, do I have good news for you then.

I made some tweaks today and I think I can make it a bit less volatile without impacting the predictive power. There are four parameters:

– RD (rating deviation) decay. How fast does uncertainy grow when a player doesn't play. Currently 0.01.
– Initial RD. How uncertain is the rating of a new player. Currently set at 0.5.
– Minimal RD. Currently set at 0.13.
– Period length. Currently 14 days. I won't touch this one.

A player's rating changes quickly if his or her RD is high. Thus a large minimal RD will create volatility among "stable" players, a large RD decay will create volatility among players who play less frequently, and a large initial RD will create volatility amont totally new players.

Here is a plot showing the predictive power of the original system.

How did I make this? Well, I went through every game in the training data set (containing almost 50000 games), and computed the ratings at the time the game was played, and assigned it a "slot" corresponding to how certain it was that the assumed stronger player would win. The slots are ranges of probabilities, i.e. 50-55%, 55-60% and so on. This is the "predicted winrate" of the x-axis. The black jagged line shows the actual winrate for each slot, and the dashed black line (slanting the other way) shows the number of games that was associated to each slot.

The dashed blue line shows the linear fit weighted by number of games, and the dashed red line shows the "ideal," namely actual winrate=predicted winrate over the board.

So you can see that the system works pretty well already, but ok, so maybe it's too volatile. Can we fix that?

This uses a higher decay rate and a lower minimum. Essentially this means that we allow the ratings of the most frequently playing players to become "more certain" but that the information of their skill level decays faster when they don't play.

Here I have upped the initial RD to 0.6 to try to fix the slight offset. Right now I think it looks almost perfect.

So this is what will happen. In a week, when the time comes to publish the new list, I will recompute all ratings, using a minimal RD of 0.06, initial RD of 0.6 and RD decay of 0.04.

What you should see is that the ratings of the most frequent players will be much more stable, but the ratings of players who play rarely will become unstable faster than before. Additionally, new players will adjust somewhat quicker than before.

Also, Conti has added a ton of missing SPL games to the database, so hopefully that will help with the Kespa players.

bittman

Australia8759 Posts

December 18 2012 13:15 GMT

#144

Just properly flicked through the site for the first time and I really like the work that's going into it.

myk3

Austria80 Posts

December 18 2012 13:46 GMT

#145

Awesome. Nice design too. Good job, thx!

Greenei

Germany1754 Posts

December 22 2012 14:24 GMT

#146

is there any way of calculating the new rating and new predictions yourself? with new i mean the "Results for next list" games. could you tell us how to calculate those ratingchanges, so i can do it myself when i need to?

Grovbolle

Denmark3811 Posts

December 22 2012 15:38 GMT

#147

On December 22 2012 23:24 Greenei wrote:
is there any way of calculating the new rating and new predictions yourself? with new i mean the "Results for next list" games. could you tell us how to calculate those ratingchanges, so i can do it myself when i need to?

Hi, I am one of the contributors of games to the site, and currently as far as I know it works in a way where we add data directly to his database, but I am not sure if the functionality/logic is located in an online version, obviously TheBB will be able to tell you, but since not all games are updated the second they are played, you will not have a "clean" rating because some games might not be added yet even though they have been played if you get to update the rating yourself

Odoakar

Croatia1837 Posts

December 22 2012 15:52 GMT

#148

Is there a way to see ELO for let's say top 10 players through 2 years on the same chart?

Grovbolle

Denmark3811 Posts

December 22 2012 16:06 GMT

#149

On December 23 2012 00:52 Odoakar wrote:
Is there a way to see ELO for let's say top 10 players through 2 years on the same chart?

Not currently, but I assume that it is something TheBB would implement when/if he has the time for it

Iodem

United States1173 Posts

December 24 2012 01:38 GMT

#150

http://aligulac.com/periods/73/page-2/

CombatEX is ranked as the second best foreign Protoss (1st being Grubby) xD

graan

Germany589 Posts

December 24 2012 03:10 GMT

#151

im glad you got a nice shout out by tlo at hsc.. im a statistics mayor and love to see some mathematical work, dont get your model over saturated, just stick to your data and keep it simple.. for example the best football predictions are only based on market values, if you base your research on 'upsets' you might get specific results right, but overall it gets off very fast.

ZigguratOfUr

Iraq16955 Posts

December 24 2012 03:19 GMT

#152

Some feedback on the predictions:

-The results are much too onesided when comparing mid-tier players, and top players. For example, your prediction about Leenock vs Sting for Fight Club was overwhelmingly in Leenock's favour(97.6%). You know that Sting won, of course, and it was an upset, but not as much of an upset as your prediction made it sound. Starcraft 2 is a game where most top-tier, or mid-tier players can take games of each other seemingly at random. You should probably move the predictions towards the mean.

-The predictions don't seem to take into account head-to-head results, which can somehow defy the players rankings or win-rates in that watchup. For example Goody's win-loss record versus Stephano is 7W-9L, while the (generally considered) much better player, PuMa, is only 2W-6L.

Gfire

United States1699 Posts

December 24 2012 03:51 GMT

#153

On December 24 2012 12:19 BrokenMirage wrote:
-The results are much too onesided when comparing mid-tier players, and top players. For example, your prediction about Leenock vs Sting for Fight Club was overwhelmingly in Leenock's favour(97.6%). You know that Sting won, of course, and it was an upset, but not as much of an upset as your prediction made it sound. Starcraft 2 is a game where most top-tier, or mid-tier players can take games of each other seemingly at random. You should probably move the predictions towards the mean.

It should be pushed towards 50/50 if there's a higher uncertainty for the players, shouldn't it?

Roonweld

United States144 Posts

December 24 2012 04:10 GMT

#154

So, this is what a Nate Silver for Starcraft looks like. This will be awesome for my liquibet ranking!

TheBB

Switzerland5133 Posts

December 25 2012 17:59 GMT

#155

On Thursday when the new list comes I will recompute all the ratings from the start using some different parameters. Hopefully this will help with many of your issues.

is there any way of calculating the new rating and new predictions yourself?

Yeah, but it involves a bit of programming. There is no closed form expression. This feature would be kinda cool to add to the site, I agree.

Is there a way to see ELO for let's say top 10 players through 2 years on the same chart?

Not yet.

The results are much too onesided when comparing mid-tier players, and top players. For example, your prediction about Leenock vs Sting for Fight Club was overwhelmingly in Leenock's favour(97.6%).

This is because the ratings adjust very quickly, so a player on a hot streak will be very highly rated. When the new ratings come on Thursday, they won't be so volatile, so presumably the top will be closer to the mid tier. Maybe.

I don't want to just adjust my predictions toward the mean based on gut feeling. Based on historical data, the assumed stronger player wins almost exactly as many games as he or she should according to the ratings, if not more in some cases.

It should be pushed towards 50/50 if there's a higher uncertainty for the players, shouldn't it?

Yes.

The predictions don't seem to take into account head-to-head results, which can somehow defy the players rankings or win-rates in that watchup.

That's right. There is a simple Bayesian model that can do this, but I need to work out a good way to weigh past results (recent ones vs. older).

felisconcolori

United States6168 Posts

December 25 2012 18:37 GMT

#156

It TheBB the new stats bonjwa?

Great site, I like the layout and feel. Very useful so far while poking at it.

StarGalaxy

Germany744 Posts

December 25 2012 19:02 GMT

#157

wow this is really impressive.
i wish we would had a ladder like that.
Blizzard hire that guy and make it happen!

sitromit

7051 Posts

December 25 2012 19:23 GMT

#158

On December 26 2012 02:59 TheBB wrote:

Show nested quote +

That's right. There is a simple Bayesian model that can do this, but I need to work out a good way to weigh past results (recent ones vs. older).

I don't think it's a good idea to take head-to-head into consideration, because even though there do seem to be some players who struggle against a particular opponent in a match up where they do quite well otherwise (hello MKP vs Mvp :p), it doesn't seem to be a factor the majority of the time.

slowbacontron

United States7722 Posts

December 25 2012 19:27 GMT

#159

On December 26 2012 04:02 OrbitalPlane wrote:
wow this is really impressive.
i wish we would had a ladder like that.
Blizzard hire that guy and make it happen!

Well, Blizzard's matchmaking system on ladder is already extremely good, isn't it?

StarGalaxy

Germany744 Posts

December 25 2012 23:09 GMT

#160

On December 26 2012 04:27 slowbacontron wrote:

Show nested quote +

Well, Blizzard's matchmaking system on ladder is already extremely good, isn't it?

the match making is great.
The rating system is horrible. (Even if you take out the bonus pool which inflates the rating.)
It's impossible to track your own development with the blizzard ranking.