Aligulac Feb 6 Update: Oops - Page 3

TheBB

Switzerland5133 Posts

February 07 2013 18:06 GMT

#41

On February 08 2013 02:16 KillerDucky wrote:

It's possible to fix this, some papers I read call it parameter smoothing, using backward filtering to smooth the past ratings. See for example this paper: http://tennis-skill-rankings.googlecode.com/hg-history/c977c53a3af2913e780e39666fe1a272cc298319/links/glicko.pdf

I thought about this (that's the paper I based my method on actually), but I didn't quite like the idea of past lists changing forever. When FIDE (chess) ratings are published they are set in stone, and you know for example that Kasparov's 2851 record from 1999 or Carlsen's 2872 at the moment will never be anything other than what they are. It makes it awkward for enthusiasts to track records. Not that I've noticed a lot of people tracking Aligulac records, since the pasts lists are changing anyway due to the expanding database (for the time being), but still, I wanted to give people the option.

Thoughts?

Edit: Just so it's clear, we're talking about basing ratings on both past and future results, so that the historical ratings look more correct in hindsight. It can fix some of the early problems by (for example) adjusting Koreans upwards because we now know that they have an average higher skill level.

KillerDucky

United States498 Posts

February 07 2013 18:54 GMT

#42

Maybe just run smoothing once. Really as long as you start from around October 2012 (MvP matches) and smooth backwards from there, most of the problems would probably be fixed.

MCXD

Australia2738 Posts

February 07 2013 18:58 GMT

#43

Just letting you know that I encountered an error when playing around with the prediction stuff:

+ Show Spoiler +

It seems like the round robin thing doesn't like large groups. (Was just an 8-man round robin w/ bo1 using the players shown)

EtherealDeath

United States8366 Posts

February 07 2013 19:03 GMT

#44

Lol funny rounding error that is.

TheBB

Switzerland5133 Posts

February 07 2013 19:11 GMT

#45

Ah yeah, you used a group that was so big it was forced to use Monte Carlo simulation. Thanks for the heads up.

ACrow

Germany6583 Posts

February 07 2013 19:56 GMT

#46

There is not a lot of difference between First and Last in the recent list.

Good job, always love your list! Glad you found a bug, it still seems a bit weird seeing Scarlett that high on the list, but w/e, math does not lie and it's only a model not the truth (whatever truth is).

Greenei

Germany1754 Posts

February 07 2013 22:22 GMT

#47

what does the '+-30' in the matchuppoints nad general rating actually mean? does it mean ~100% of the time the rating is in that area? or is that 1 or 2 or 3 standarddeviations? or the maximum amount that the rating will shift?

Grovbolle

Denmark3805 Posts

February 07 2013 23:05 GMT

#48

On February 08 2013 07:22 Greenei wrote:
what does the '+-30' in the matchuppoints nad general rating actually mean? does it mean ~100% of the time the rating is in that area? or is that 1 or 2 or 3 standarddeviations? or the maximum amount that the rating will shift?

Making a qualified guess, I would say it is +-3 standard deviations (meaning that 95% of the time, the actual rating falls within the confidence interval, i.e. Rating +- 3 St. Deviations.)

Edit: Of course 3 std's = 99% (I am retarded)

TheBB

Switzerland5133 Posts

February 07 2013 23:34 GMT

#49

It's actually just one estimated standard deviation, so it's a pretty weak confidence interval.

Greenei

Germany1754 Posts

February 08 2013 02:52 GMT

#50

Making a qualified guess, I would say it is +-3 standard deviations (meaning that 95% of the time, the actual rating falls within the confidence interval, i.e. Rating +- 3 St. Deviations.)

3 stds would be ~99%.

On February 08 2013 08:34 TheBB wrote:

Show nested quote +

It's actually just one estimated standard deviation, so it's a pretty weak confidence interval.

k thx. do you plan on making the database open source at any point? because i'd like to make some calculations of my own from time to time and there would be no point at all in starting an own database at this point.

Conti

Germany2516 Posts

February 08 2013 06:17 GMT

#51

On February 08 2013 11:52 Greenei wrote:

Show nested quote +

3 stds would be ~99%.

Show nested quote +

You can download an SQL database dump at http://aligulac.com/db/.

Greenei

Germany1754 Posts

February 08 2013 07:53 GMT

#52

On February 08 2013 15:17 Conti wrote:

Show nested quote +

You can download an SQL database dump at http://aligulac.com/db/.

ah thx, that was a bit hidden :D

Grovbolle

Denmark3805 Posts

February 08 2013 08:41 GMT

#53

On February 08 2013 08:34 TheBB wrote:

Show nested quote +

It's actually just one estimated standard deviation, so it's a pretty weak confidence interval.

Yeah ok, 68%

a3den

704 Posts

February 08 2013 12:00 GMT

#54

As a stats buff, gotta say it really is a nice website, like a cleaner and better version of TLPD (or sc2charts, whatever floated your boat). Both infuriated me for the longest time because they had the data and did nothing with it. You on the other hand understand that a db is as good as what you do with it. I also love how well your data is historized.

Downloading that Db dump from work is so tempting...

maty

Germany12 Posts

February 08 2013 13:10 GMT

#55

you could revisit the EG curse with those stats

MasterOfPuppets

Romania6942 Posts

February 10 2013 21:40 GMT

#56

So BB if you ever get particularly bored, could you make a prediction system for ProLeague/GSTL based on not only on player rating for both rosters but also maps? Or is it simply not going to be accurate enough to warrant the gargantuan effort involved in creating and implementing the system? xD

Conti

Germany2516 Posts

February 10 2013 22:14 GMT

#57

On February 11 2013 06:40 MasterOfPuppets wrote:
So BB if you ever get particularly bored, could you make a prediction system for ProLeague/GSTL based on not only on player rating for both rosters but also maps? Or is it simply not going to be accurate enough to warrant the gargantuan effort involved in creating and implementing the system? xD

There's currently no map information saved in the database, only matches and results. So before any kind of predictive ~~magic~~ math can be applied, we'd need that information for >100.000 games. And we'd need a whole lot more volunteers for that.

Nudge. Nudge.

Grovbolle

Denmark3805 Posts

February 10 2013 22:34 GMT

#58

On February 11 2013 07:14 Conti wrote:

Show nested quote +

Plus we (TheBB) had to rework how the entire database is configured because matches =/= games.

Plus it would be hard since a lot of LP-articles contain no mapinfo, even on big tournaments like MLG it is impossible to find map info for stuff like open bracket etc. So yeah, way too much work, whenever a new feature has to be "backtracked" as I like to call it, it literally takes our small team of 4-5 (TheBB, Conti, kiekaboe does a shit ton each and I + Inflicted does some as well) weeks, just look at this
http://aligulac.com/db/
"only" 64% is catalogued in the event hierarchy.

Epamynondas

387 Posts

February 10 2013 23:09 GMT

#59

On February 08 2013 03:06 TheBB wrote:
+ Show Spoiler +

On February 08 2013 02:16 KillerDucky wrote:

Show nested quote +

Maybe you could do some kind of backwards adjustement (or this "smoothing" you guys speak of) only on new players? Like, compute things normally for them for about 4 periods or something like that (or for a set amount of games played, i guess?), and then adjust their ratings retroactively, and then don't mess with their past ever again.

So imagine that I get a magical seed for Code S next season, and lose my first game of the group stages against Life (but only because i'm nervous). This doesn't give a lot of points to Life because I'm totally unknown at that point.

Then I proceed to stomp all competition and win Code S without dropping another map. Then your script readjusts my ratings and suddenly Life has a rating of like 3000 because he took a game off me.

And then pro players catch up to my silver strats and I don't win a game ever again.

Conti

Germany2516 Posts

February 10 2013 23:11 GMT

#60

..and sorting matches into events is about a gazillion times faster to do than adding maps to matches would be.

Prev 1 2 3 4 5 Next All

Please or register to reply.

Aligulac Feb 6 Update: Oops - Page 3

Completed

Ongoing

Upcoming