• Log InLog In
  • Register
Liquid`
Team Liquid Liquipedia
EDT 09:48
CEST 15:48
KST 22:48
  • Home
  • Forum
  • Calendar
  • Streams
  • Liquipedia
  • Features
  • Store
  • EPT
  • TL+
  • StarCraft 2
  • Brood War
  • Smash
  • Heroes
  • Counter-Strike
  • Overwatch
  • Liquibet
  • Fantasy StarCraft
  • TLPD
  • StarCraft 2
  • Brood War
  • Blogs
Forum Sidebar
Events/Features
News
Featured News
Tournament Spotlight: FEL Cracow 20257Power Rank - Esports World Cup 202577RSL Season 1 - Final Week9[ASL19] Finals Recap: Standing Tall15HomeStory Cup 27 - Info & Preview18
Community News
Google Play ASL (Season 20) Announced24BSL Team Wars - Bonyth, Dewalt, Hawk & Sziky teams10Weekly Cups (July 14-20): Final Check-up0Esports World Cup 2025 - Brackets Revealed19Weekly Cups (July 7-13): Classic continues to roll8
StarCraft 2
General
Power Rank - Esports World Cup 2025 Tournament Spotlight: FEL Cracow 2025 #1: Maru - Greatest Players of All Time I offer completely free coaching services What tournaments are world championships?
Tourneys
FEL Cracov 2025 (July 27) - $10,000 live event Esports World Cup 2025 $25,000 Streamerzone StarCraft Pro Series announced $5,000 WardiTV Summer Championship 2025 WardiTV Mondays
Strategy
How did i lose this ZvP, whats the proper response
Custom Maps
External Content
Mutation #239 Bad Weather Mutation # 483 Kill Bot Wars Mutation # 482 Wheel of Misfortune Mutation # 481 Fear and Lava
Brood War
General
Google Play ASL (Season 20) Announced [Update] ShieldBattery: 2025 Redesign Dewalt's Show Matches in China BGH Auto Balance -> http://bghmmr.eu/ BW General Discussion
Tourneys
[Megathread] Daily Proleagues [BSL20] Non-Korean Championship 4x BSL + 4x China CSL Xiamen International Invitational [CSLPRO] It's CSLAN Season! - Last Chance
Strategy
Simple Questions, Simple Answers [G] Mineral Boosting Does 1 second matter in StarCraft?
Other Games
General Games
Stormgate/Frost Giant Megathread Nintendo Switch Thread Total Annihilation Server - TAForever [MMORPG] Tree of Savior (Successor of Ragnarok) Path of Exile
Dota 2
Official 'what is Dota anymore' discussion
League of Legends
Heroes of the Storm
Simple Questions, Simple Answers Heroes of the Storm 2.0
Hearthstone
Heroes of StarCraft mini-set
TL Mafia
TL Mafia Community Thread Vanilla Mini Mafia
Community
General
US Politics Mega-thread UK Politics Mega-thread Stop Killing Games - European Citizens Initiative Things Aren’t Peaceful in Palestine Russo-Ukrainian War Thread
Fan Clubs
INnoVation Fan Club SKT1 Classic Fan Club!
Media & Entertainment
[\m/] Heavy Metal Thread Anime Discussion Thread Movie Discussion! [Manga] One Piece Korean Music Discussion
Sports
Formula 1 Discussion 2024 - 2025 Football Thread TeamLiquid Health and Fitness Initiative For 2023 NBA General Discussion
World Cup 2022
Tech Support
Installation of Windows 10 suck at "just a moment" Computer Build, Upgrade & Buying Resource Thread
TL Community
TeamLiquid Team Shirt On Sale The Automated Ban List
Blogs
Ping To Win? Pings And Their…
TrAiDoS
momentary artworks from des…
tankgirl
from making sc maps to makin…
Husyelt
StarCraft improvement
iopq
Socialism Anyone?
GreenHorizons
Eight Anniversary as a TL…
Mizenhauer
Customize Sidebar...

Website Feedback

Closed Threads



Active: 752 users

[SPL] Statisfaction - Page 2

Forum Index > News
47 CommentsPost a Reply
Prev 1 2 3 Next All
Heyoka
Profile Blog Joined March 2008
Katowice25012 Posts
Last Edited: 2009-06-06 16:52:26
June 06 2009 16:49 GMT
#21
On June 06 2009 17:29 jtype wrote:
Mine is based a lot more on gut feeling rather than statistics, but I definitely like your thinking on this and look forward to seeing more on your system.


Ultimately gut feeling is probably better right now, but the first few trials I've been running of this show a lot of promise. I'm expecting to be able to outpace any judgment-based systems within a few months.



Remember guys it takes past rounds into account, for example round 1 and 2 CJ are 10-12 KHAN are 15-7 . Doesnt account for recent slumping in the individual leagues by KHAN.


I do this very much on purpose. People are really fast to call something a slump when in reality its just a series of coinflips gone bad. The only time it would be relevant is the psychological effect on the players/coaches themselves if they think they are slumping, but I don't have that kind of info.

I want to run that KHAN game again though, those numbers seem a little wonky. While I think some of their players are under-performing a little, relative to their numbers, its not nearly to the extent people want to think they are.

75% khan over CJ, yea right..... thats soem good number crunching. does that mean you'll bet me 100$ to 300$ ? ?


3-1 is a probably too generous but find an escrow and I'm more than willing.

For individual player stats, consider using PMF. There are good docs online. Alternatively, you can do something very similar using BUGS (or BRUGS) that I've found is pretty useful for these sorts of things.


This is a good idea, thanks.
@RealHeyoka | ESL / DreamHack StarCraft Lead
jtype
Profile Blog Joined April 2009
England2167 Posts
Last Edited: 2009-06-06 17:16:36
June 06 2009 17:15 GMT
#22
On June 07 2009 01:49 heyoka wrote:
Show nested quote +
On June 06 2009 17:29 jtype wrote:
Mine is based a lot more on gut feeling rather than statistics, but I definitely like your thinking on this and look forward to seeing more on your system.

Ultimately gut feeling is probably better right now,


Lol! Well, you say that, but then you haven't seen my 5 wrong vote-streak on Liquibet.
JWD
Profile Blog Joined October 2007
United States12607 Posts
June 06 2009 17:37 GMT
#23
So great to have regular PL coverage back! Thanks riptide.
✌
MutaDoom
Profile Blog Joined January 2009
Canada1163 Posts
June 06 2009 17:38 GMT
#24
Wow, beautiful work, Heyoka. I imagine that took a while to work out :D

Thanks for the writeup, Riptide. GG.
Avidkeystamper
Profile Blog Joined June 2008
United States8552 Posts
June 06 2009 18:26 GMT
#25
Yeah, SPL write-ups are back. This one was really nice because of the new way you approached it. And I usually enjoy your articles, riptide. Thanks.
Jaedong
Syntax Lost
Profile Joined May 2009
Finland86 Posts
Last Edited: 2009-06-06 19:54:15
June 06 2009 19:52 GMT
#26
I'm currently working on using machine learning techniques to improve estimation of the probability a given player will win given his past performance (last 10), current elo and map statistics. Collecting the data to train the models though is quite tedious and will take quite some time before a reasonably accurate model is obtained (since they're based on actual data), however, I think such models should be more accurate than the simple rough calculations provided in the OP. At the moment I'm looking only at simple linear models like naïve Bayesian and logistic since they tend to perform better with limited data over non-linear models. I'm hoping to try out non-linear models like neural networks in the future, when I have sufficient data (which may be a while). I'll be sure to post my results on TL when they become interesting, however at the moment, there isn't much to say except data collection is tedious.
ReketSomething
Profile Blog Joined November 2008
United States6012 Posts
June 06 2009 20:03 GMT
#27
Wow this is really nice!

Total dedication and belief in numbers =) Thats how I like it
Jaedong :3
Syntax Lost
Profile Joined May 2009
Finland86 Posts
June 06 2009 20:30 GMT
#28
By the way, ELO can already be used as a probabilistic estimator for a given matchup. I haven't tested its accuracy or perplexity though, so I have no clue how good it might be. Overall, I don't think these techniques can beat the best human experts though, but they can infer interesting data about how certain data affects the likelihood of a player winning. Probably most interesting for everyone is how much the map matters statistically speaking.

I'm doing my Master's in Machine Learning and Data Mining, so its good practice for me anyway.
geno
Profile Blog Joined October 2007
United States1404 Posts
June 06 2009 22:39 GMT
#29
When I was reading the statistical analysis that was the first thing that came to mind as well. I've always found that comparing two players ELO (or better yet, their detailed matchup ELO) to be a better indicator of the match outcome than W-L ratios. Hell with the exception of a few S-class players and rising talents, even just looking at the straight up map balance statistics has been more accurate
Entertaining
Profile Joined September 2007
Canada793 Posts
June 07 2009 00:42 GMT
#30
The comeback by xellos was sick, zerg went alittle crazy tho, i dont know why he didnt keep expoin before 20minute-ish mark.
Heyoka
Profile Blog Joined March 2008
Katowice25012 Posts
Last Edited: 2009-06-07 00:49:43
June 07 2009 00:44 GMT
#31
On June 07 2009 04:52 Syntax Lost wrote:
I'm currently working on using machine learning techniques to improve estimation of the probability a given player will win given his past performance (last 10), current elo and map statistics. Collecting the data to train the models though is quite tedious and will take quite some time before a reasonably accurate model is obtained (since they're based on actual data), however, I think such models should be more accurate than the simple rough calculations provided in the OP. At the moment I'm looking only at simple linear models like naïve Bayesian and logistic since they tend to perform better with limited data over non-linear models. I'm hoping to try out non-linear models like neural networks in the future, when I have sufficient data (which may be a while). I'll be sure to post my results on TL when they become interesting, however at the moment, there isn't much to say except data collection is tedious.


They definitely should be. What I have here really amounts to an estimation to see if this kind of study has any merit - I think this shows it does. There are hundreds of ways to improve this now, in the future when we look at this way of doing it, it will seem pathetically inaccurate.

I have done some experiments with ELO and the my results generally aren't good. The player pool we're working with is so small that the results I got ended up not being very accurate, I think techniques used for handicapping baseball and football are more suited for this kind of thing.

The KeSPA system might actually work here though, as it does something similar but heavily weighs recent results. It is more suited to telling us how players are right now whereas ELO results seem better for how good a player was at his best or averaged through his career.
@RealHeyoka | ESL / DreamHack StarCraft Lead
peanutter
Profile Joined February 2009
Australia165 Posts
June 07 2009 00:48 GMT
#32
This system of calculations would probably earn you a good amount of correct bets in the long run. In regards to winning a liquibet season or the sort, you probably wouldn't end up first using this kind of system. Basing decisions on probability is always a good thing in the long run since a majority of the time you'll end up being correct most of the time. (poker anyone?) But then again there's outside factors that a statistical model can't account for. (poker bluffs?)
Heyoka
Profile Blog Joined March 2008
Katowice25012 Posts
June 07 2009 01:25 GMT
#33
The real reason this isn't optimal for liquibet is that liquibet is very heavily influenced by 1v1 matches. Straight probability is (right now) a poor way of determining how a msl/osl will play out in various stages. That will probably continue to be true even with much more advanced models of ranking specific players as well.
@RealHeyoka | ESL / DreamHack StarCraft Lead
Syntax Lost
Profile Joined May 2009
Finland86 Posts
June 07 2009 12:39 GMT
#34
On June 07 2009 09:44 heyoka wrote:
I have done some experiments with ELO and the my results generally aren't good. The player pool we're working with is so small that the results I got ended up not being very accurate, I think techniques used for handicapping baseball and football are more suited for this kind of thing.


The current limiting factor is the lack of data. Currently, I'm just collecting last 10 statistics for each player for the given matchup, plus their current ELO, matchup ELO and map statistics for the matchup. E.g. for Bisu vs Hiya, I take Bisu's last 10 games vs Terran as a binary vector plus his ELO vs Terran, Hiya's last 10 vs Protoss as another binary vector plus his ELO vs Protoss, both their general ELO scores and the matchup statistic given as a probability for the map they're to play on. This gives the data 25 dimensions, which is quite substantial and therefore requires significant data to properly cover all of these dimensions, but should give reasonable accuracy for estimating the actual distribution once sufficient data is obtained.

The KeSPA system might actually work here though, as it does something similar but heavily weighs recent results. It is more suited to telling us how players are right now whereas ELO results seem better for how good a player was at his best or averaged through his career.


Problem is that KeSPA only update their ranking once per month, so their value decreases over the month. I'm also unsure what kind of distribution they're trying to fit the players to (if they are trying to fit a distribution at all) which makes estimating the actual distribution of player results much more difficult. ELO is nice in that its designed to fit to players to a Normal distribution (which is a reasonable assumption, in my opinion) and assigns a uniform weight to all matches (rather than what seems like an arbitrary weight given by KeSPA) so its properties are easier to understand.

I prefer ELO over KeSPA's system for these reasons and it seems pretty reasonable when coupled with last 10 statistics (so present winning streaks are factored). I took a couple liberties in re-factoring a few statistics (basically, just resetting Xellos' stats since his joining the army) to help compensate for a few short-comings in the system.

Anyhow, I have no clue how well the results will turn out, as I still require far more data.

By the way, if anybody knows how to quickly get player's ELO scores at the time of past matchups quickly and easily, it will greatly speed up the process.
eshlow
Profile Joined June 2008
United States5210 Posts
Last Edited: 2009-06-07 17:10:17
June 07 2009 17:09 GMT
#35
Nice stats analysis. Always love to see this.


On June 06 2009 21:21 3 Lions wrote:
Show nested quote +
On June 06 2009 20:12 Balfazar wrote:
Have a look at the matchups for Khan vs CJ, Khan are definitely favorites. The only match where Khan don't have a clear advantage is Effort vs Stork. I will be surprised if they don't win.

Given Firebathero's current form, I can actually realistically see CJ win 3-1

+ Show Spoiler +
Ironically, FBH was the only one to win.
Overcoming Gravity: A Systematic Approach to Gymnastics and Bodyweight Strength
Pokernubz
Profile Joined May 2009
Canada88 Posts
June 07 2009 17:59 GMT
#36
On June 08 2009 02:09 eshlow wrote:
Nice stats analysis. Always love to see this.


Show nested quote +
On June 06 2009 21:21 3 Lions wrote:
On June 06 2009 20:12 Balfazar wrote:
Have a look at the matchups for Khan vs CJ, Khan are definitely favorites. The only match where Khan don't have a clear advantage is Effort vs Stork. I will be surprised if they don't win.

Given Firebathero's current form, I can actually realistically see CJ win 3-1

+ Show Spoiler +
Ironically, FBH was the only one to win.



FAUUUUUUUUUUUUUK WHY DID I CLICK THAT!?@?!?@!?@!?@

DONT DO IT PEOPLE! it's soooooo not worth it..

sorry for caps, but that just ruined my day.
Lee Jae Dong! Phil Ivey! Lee Young-Ho! Tom Dwan!
Pokernubz
Profile Joined May 2009
Canada88 Posts
June 07 2009 22:15 GMT
#37
I think I'm going to trade Iris, I want him out anyways.

Who do you think is a good substitute for him?
I've been thinking about the follow for trades:
Luxury, July, Hwasin, Mind

I think im goin to pick luxury, seems like the new maps favor zerg, and July just isn't getting much playing time these days. Although i like STX more then KTF as far as team points goes. That's pretty much the main reason why I'm not choosing Mind, he just won't be getting the team points with Wemade.

What are your guys thoughts on Hwasin, jesus i wish i didn't choose Berserker in the first place, damn donkey is failing big time. can't even beat + Show Spoiler +
FBH... FBH's tvt isn't even stellar. sigh.

Lee Jae Dong! Phil Ivey! Lee Young-Ho! Tom Dwan!
ghermination
Profile Blog Joined April 2008
United States2851 Posts
June 07 2009 22:46 GMT
#38
Your analysis is really good but my CJ fanboyism still wont let me say you're right.
U Gotta Skate.
Heyoka
Profile Blog Joined March 2008
Katowice25012 Posts
June 08 2009 01:24 GMT
#39
On June 07 2009 21:39 Syntax Lost wrote:
By the way, if anybody knows how to quickly get player's ELO scores at the time of past matchups quickly and easily, it will greatly speed up the process.



I asked PoP about that exact thing, he said it was in his "to do" list for TLPD but not functional yet. I assume he is a busy dude, maybe if you ask too we can get it bumped up on the list

Regarding KHAN vs CJ:

+ Show Spoiler +

I actually had a post written up on how I think KHAN is a more even match than people realize, because FBH had a much higher chance to win his match than people want to admit/will see. I guess that specific point was right, but in the end CJ still blew out KHAN so I can't brag

Sucks for that dude he didnt get back to me on taking the bet
@RealHeyoka | ESL / DreamHack StarCraft Lead
DogofWar
Profile Joined May 2009
United States10 Posts
June 08 2009 06:30 GMT
#40
I love the great work keep it up.

On June 08 2009 10:24 heyoka wrote:
Regarding KHAN vs CJ:

+ Show Spoiler +

I actually had a post written up on how I think KHAN is a more even match than people realize, because FBH had a much higher chance to win his match than people want to admit/will see. I guess that specific point was right, but in the end CJ still blew out KHAN so I can't brag

Sucks for that dude he didnt get back to me on taking the bet


+ Show Spoiler +
I wouldn't be too discouraged because one prediction went bad. One match is not adequate statistical evidence against your method.


Have you considered running the algorithm on past games and data in order to check its accuracy with a large sample?
Prev 1 2 3 Next All
Please log in or register to reply.
Live Events Refresh
FEL
09:00
Cracow 2025
Clem vs Krystianer
uThermal vs SKillousLIVE!
Reynor vs MaNa
Lambo vs Gerald
RotterdaM1751
ComeBackTV 1636
IndyStarCraft 487
WardiTV283
CranKy Ducklings194
Rex139
3DClanTV 93
LiquipediaDiscussion
[ Submit Event ]
Live Streams
Refresh
StarCraft 2
RotterdaM 1751
IndyStarCraft 487
Rex 139
BRAT_OK 72
ProTech68
MindelVK 27
StarCraft: Brood War
Britney 53317
Horang2 6809
EffOrt 2039
Barracks 1185
Hyuk 1120
Larva 891
BeSt 680
firebathero 504
Stork 455
Hyun 269
[ Show more ]
Soulkey 251
Last 243
Rush 108
Dewaltoss 71
Sharp 67
Movie 60
Shinee 57
Free 52
Sea.KH 52
sas.Sziky 45
sSak 40
sorry 34
Shine 33
zelot 22
yabsab 21
ajuk12(nOOB) 15
Terrorterran 11
Dota 2
qojqva4044
Gorgc3660
XcaliburYe452
420jenkins158
League of Legends
febbydoto10
Counter-Strike
sgares357
Heroes of the Storm
Khaldor480
Other Games
B2W.Neo1969
Beastyqt1139
Hui .362
DeMusliM242
Fuzer 179
oskar175
QueenE50
Organizations
Other Games
BasetradeTV22
StarCraft 2
Blizzard YouTube
StarCraft: Brood War
BSLTrovo
sctven
[ Show 16 non-featured ]
StarCraft 2
• Berry_CruncH312
• HeavenSC 22
• intothetv
• IndyKCrew
• AfreecaTV YouTube
• sooper7s
• Migwel
• LaughNgamezSOOP
• Kozan
StarCraft: Brood War
• STPLYoutube
• ZZZeroYoutube
• BSLYoutube
Dota 2
• C_a_k_e 4101
• WagamamaTV868
League of Legends
• Nemesis3091
• Jankos1470
Upcoming Events
BSL20 Non-Korean Champi…
12m
BSL20 Non-Korean Champi…
4h 12m
Bonyth vs Zhanhun
Dewalt vs Mihu
Hawk vs Sziky
Sziky vs QiaoGege
Mihu vs Hawk
Zhanhun vs Dewalt
Fengzi vs Bonyth
Wardi Open
21h 12m
Sparkling Tuna Cup
1d 20h
WardiTV European League
2 days
Online Event
2 days
uThermal 2v2 Circuit
3 days
The PondCast
3 days
Replay Cast
4 days
Korean StarCraft League
5 days
[ Show More ]
CranKy Ducklings
5 days
Sparkling Tuna Cup
6 days
Liquipedia Results

Completed

CSLPRO Last Chance 2025
Esports World Cup 2025
Murky Cup #2

Ongoing

Copa Latinoamericana 4
Jiahua Invitational
BSL 20 Non-Korean Championship
BSL 20 Team Wars
FEL Cracov 2025
CC Div. A S7
Underdog Cup #2
IEM Cologne 2025
FISSURE Playground #1
BLAST.tv Austin Major 2025
ESL Impact League Season 7
IEM Dallas 2025
PGL Astana 2025
Asian Champions League '25

Upcoming

ASL Season 20: Qualifier #1
ASL Season 20: Qualifier #2
ASL Season 20
CSLPRO Chat StarLAN 3
BSL Season 21
RSL Revival: Season 2
Maestros of the Game
SEL Season 2 Championship
WardiTV Summer 2025
uThermal 2v2 Main Event
HCC Europe
ESL Pro League S22
StarSeries Fall 2025
FISSURE Playground #2
BLAST Open Fall 2025
BLAST Open Fall Qual
Esports World Cup 2025
BLAST Bounty Fall 2025
BLAST Bounty Fall Qual
TLPD

1. ByuN
2. TY
3. Dark
4. Solar
5. Stats
6. Nerchio
7. sOs
8. soO
9. INnoVation
10. Elazer
1. Rain
2. Flash
3. EffOrt
4. Last
5. Bisu
6. Soulkey
7. Mini
8. Sharp
Sidebar Settings...

Advertising | Privacy Policy | Terms Of Use | Contact Us

Original banner artwork: Jim Warren
The contents of this webpage are copyright © 2025 TLnet. All Rights Reserved.