• Log InLog In
  • Register
Liquid`
Team Liquid Liquipedia
EDT 11:31
CET 16:31
KST 00:31
  • Home
  • Forum
  • Calendar
  • Streams
  • Liquipedia
  • Features
  • Store
  • EPT
  • TL+
  • StarCraft 2
  • Brood War
  • Smash
  • Heroes
  • Counter-Strike
  • Overwatch
  • Liquibet
  • Fantasy StarCraft
  • TLPD
  • StarCraft 2
  • Brood War
  • Blogs
Forum Sidebar
Events/Features
News
Featured News
[ASL20] Finals Preview: Arrival10TL.net Map Contest #21: Voting10[ASL20] Ro4 Preview: Descent11Team TLMC #5: Winners Announced!3[ASL20] Ro8 Preview Pt2: Holding On9
Community News
Merivale 8 Open - LAN - Stellar Fest3Chinese SC2 server to reopen; live all-star event in Hangzhou22Weekly Cups (Oct 13-19): Clem Goes for Four3BSL Team A vs Koreans - Sat-Sun 16:00 CET10Weekly Cups (Oct 6-12): Four star herO8
StarCraft 2
General
Could we add "Avoid Matchup" Feature for rankgame RotterdaM "Serral is the GOAT, and it's not close" Chinese SC2 server to reopen; live all-star event in Hangzhou The New Patch Killed Mech! Weekly Cups (Oct 13-19): Clem Goes for Four
Tourneys
Merivale 8 Open - LAN - Stellar Fest $5,000+ WardiTV 2025 Championship $3,500 WardiTV Korean Royale S4 Tenacious Turtle Tussle RSL Season 3 Qualifier Links and Dates
Strategy
Custom Maps
Map Editor closed ?
External Content
Mutation # 496 Endless Infection Mutation # 495 Rest In Peace Mutation # 494 Unstable Environment Mutation # 493 Quick Killers
Brood War
General
ASL20 Pre-season Tier List ranking! [ASL20] Finals Preview: Arrival Is there anyway to get a private coach? BGH Auto Balance -> http://bghmmr.eu/ BSL Team A vs Koreans - Sat-Sun 16:00 CET
Tourneys
[ASL20] Grand Finals ASL final tickets help [ASL20] Semifinal A Small VOD Thread 2.0
Strategy
Soma's 9 hatch build from ASL Game 2 Simple Questions, Simple Answers Roaring Currents ASL final Relatively freeroll strategies
Other Games
General Games
Stormgate/Frost Giant Megathread General RTS Discussion Thread Path of Exile Nintendo Switch Thread Dawn of War IV
Dota 2
Official 'what is Dota anymore' discussion LiquidDota to reintegrate into TL.net
League of Legends
Heroes of the Storm
Simple Questions, Simple Answers Heroes of the Storm 2.0
Hearthstone
Deck construction bug Heroes of StarCraft mini-set
TL Mafia
TL Mafia Community Thread SPIRED by.ASL Mafia {211640}
Community
General
US Politics Mega-thread Things Aren’t Peaceful in Palestine Russo-Ukrainian War Thread YouTube Thread The Chess Thread
Fan Clubs
White-Ra Fan Club The herO Fan Club!
Media & Entertainment
Anime Discussion Thread [Manga] One Piece Korean Music Discussion Series you have seen recently... Movie Discussion!
Sports
Formula 1 Discussion 2024 - 2026 Football Thread MLB/Baseball 2023 TeamLiquid Health and Fitness Initiative For 2023 NBA General Discussion
World Cup 2022
Tech Support
SC2 Client Relocalization [Change SC2 Language] Linksys AE2500 USB WIFI keeps disconnecting Computer Build, Upgrade & Buying Resource Thread
TL Community
The Automated Ban List Recent Gifted Posts
Blogs
The Benefits Of Limited Comm…
TrAiDoS
Sabrina was soooo lame on S…
Peanutsc
Our Last Hope in th…
KrillinFromwales
Certified Crazy
Hildegard
Customize Sidebar...

Website Feedback

Closed Threads



Active: 1621 users

What you don't know about statistics - Page 2

Forum Index > SC2 General
Post a Reply
Prev 1 2 3 4 5 6 7 Next All
corose
Profile Joined August 2011
United States31 Posts
May 04 2012 00:56 GMT
#21
???

This is horrible. He should include the fallacies that he makes too. There might be 3 things in this that make sense?

Why are we balancing a game around X's in a tournament who are better but underpowered, and who start crushing when it becomes balanced? I don't think Blizzard is...

And how do you know this is due to balance and not metagame shift, or players finally "getting" their races and utilizing what the race has to offer? Well, since your post is all anecdotal...you don't.

this means that players who are better stay in the tournament for longer, therefore they contribute more to the amount of games played


That's true.

since these are the players that overcome balance, again, it skews to 50-50 more than it is


That's not necessarily true.

Stephano is fucking good


True

If they were independent, sample size would be enough by a large margin to say something, but they are not independent. Because you're dealing with players, not just games.


That's a good point.

'Clearly since this is taken from the absolute top, TvZ is highly imbalanced as a thousand games is a very large sample size'. No! you cannot


Explanation? No? Just say something as matter of fact cause it's true? Is he saying MKP's skills is much higher than DRG's or something? Also, why is he using extreme examples that have 0 validity to prove his points? Why doesn't he use actual examples or realistic things? I don't think he can cause they wouldn't support his argument...

IF a ZvP game ends in the 0-5 minute range, the chance is 60% it ends in Z's favour.
Now the 'if' is so bloody important here, the game needn't end.


He goes on to describe the process of 7 pooling and the events following.
Isn't the goal for Blizzard to make the game balanced from start to end? Of course, that's what this whole new Call o Action: Antiga Shipyard is about. Minor tweaks to make vZ more balanced in early game and they are also looking at PvT endgame. If zerg has a 60% win rate up to the 5 minute mark, as shown by a good set of data (good sample size, players of relative equal skill), then the match up is imbalanced during that time frame, but perhaps not overall. If you saw that 7 pools, and any strategy were 50% effective across the entire course of the game, then you have a successfully balanced game. Obviously if Zerg wins 75% of games that end before the 5 minute mark, early game balance is an issue. If they win much less than 50%, then it's imbalanced in the opposite direction.

I'm not really sure what he is saying in this last part. He anecdotally describes a 7 pool, a P potentially scouting it and walling, and the result following the Z's drone deficit. I don't think this story gives us any insight on balance, or on anything that he's arguing. What IS he arguing? I think his mistake is thinking that games ending before the 5 minute mark aren't subjected to analysis, because they are early pools so balance isn't an issue, because the Win/Loss is due to player mistakes??? No matter if it's a mistake, an inability to scout, or whatever...if the results heavily favor one side, then some mechanism of compensation is needed. Ideally, you are looking for 50% win ratio across the entire game time and between all combination of races.

I'm not a statistician...but i'm guessing this guy isn't either...?

Also...I don't think Blizzard makes all of it's decisions based on tournaments, where like said here, player skill is a factor.
hnQ
Profile Blog Joined September 2010
113 Posts
May 04 2012 00:58 GMT
#22
He's right on what he said, not sure if people didn't know this already though...
Reptilia
Profile Joined June 2010
Chile913 Posts
May 04 2012 01:05 GMT
#23
because reddit is so much better. Lol.
i wouldnt be surprised if that guy had a tl account and got banned and got so butthurt he posted that.
The secret to creativity is knowing how to hide your sources
LaM
Profile Blog Joined September 2011
United States1321 Posts
May 04 2012 01:05 GMT
#24
On May 04 2012 09:56 corose wrote:
He goes on to describe the process of 7 pooling and the events following.
Isn't the goal for Blizzard to make the game balanced from start to end? Of course, that's what this whole new Call o Action: Antiga Shipyard is about. Minor tweaks to make vZ more balanced in early game and they are also looking at PvT endgame. If zerg has a 60% win rate up to the 5 minute mark, as shown by a good set of data (good sample size, players of relative equal skill), then the match up is imbalanced during that time frame, but perhaps not overall. If you saw that 7 pools, and any strategy were 50% effective across the entire course of the game, then you have a successfully balanced game. Obviously if Zerg wins 75% of games that end before the 5 minute mark, early game balance is an issue. If they win much less than 50%, then it's imbalanced in the opposite direction.

I'm not really sure what he is saying in this last part. He anecdotally describes a 7 pool, a P potentially scouting it and walling, and the result following the Z's drone deficit. I don't think this story gives us any insight on balance, or on anything that he's arguing. What IS he arguing? I think his mistake is thinking that games ending before the 5 minute mark aren't subjected to analysis, because they are early pools so balance isn't an issue, because the Win/Loss is due to player mistakes??? No matter if it's a mistake, an inability to scout, or whatever...if the results heavily favor one side, then some mechanism of compensation is needed. Ideally, you are looking for 50% win ratio across the entire game time and between all combination of races.

I'm not a statistician...but i'm guessing this guy isn't either...?

Also...I don't think Blizzard makes all of it's decisions based on tournaments, where like said here, player skill is a factor.


He is saying that seeing high Zerg win rates in the first 5 minutes and assuming early pools are imbalanced is wrong because games won by Zergs with early pools are often decided with in 5 minutes, but games won by Protoss against early pools often last longer than 5 minutes.

It has nothing to do with making mistakes/better player/whatever. It's just winrate graphs showing very incomplete, limited analysis of balance that are often very misleading.
Anything is Possible
shaldengeki
Profile Joined May 2009
United States104 Posts
May 04 2012 01:08 GMT
#25
While I agree that there are significant issues with the way that many statistics on TL are presented - I've posted on this before and I was the guy who nudged the monthly winrates graphs to add error bars in the first place - you're not doing the discourse any favors by reposting this, I think. There are hardly any statistical arguments actually made in the post - for instance:

On May 04 2012 09:00 Cyberonic wrote:
The TLPD winrate graphs are praetentious and amateuristic, sorry to say it but that's how it is, the error bars there are pure bollocks and are calculated using the rules of independent probability experiments, that is to say, it is assumed that the results of every series has no effect on the others, as if you flip a coin. If they were independent, sample size would be enough by a large margin to say something, but they are not independent. Because you're dealing with players, not just games. Good players simply ruin the idea of independent experiments.


There is something deeply hypocritical about decrying statistics discussions on TL for being superficial and then totally failing to present statistical evidence for your assertion that games outcomes are not independent. One would think that the actual mathematics would be pretty trivial, so simply asserting that "they are not independent because they are players" is committing exactly the sin that you're supposedly railing against.

Please reconsider reposting topics like this in the future, or at the very least, try to be productive and rigorous in your arguments if you truly want TL to be a community that is rigorous in its discussion!
LaM
Profile Blog Joined September 2011
United States1321 Posts
May 04 2012 01:09 GMT
#26
On May 04 2012 10:05 Reptilia wrote:
because reddit is so much better. Lol.
i wouldnt be surprised if that guy had a tl account and got banned and got so butthurt he posted that.


Did you read anything past the qualifier that was added on after his original post on Reddit?

Doesn't look like it. Your post has nothing to do with the vast majority of his post. Or where you saying he got butthurt that he got banned from TL so he went to Reddit and wrote an intelligent post about balance statistics and how they can be misleading?

All these douchey little TL > Reddit posts are the type of annoying shit that makes people think TL is pretentious anyways. I know he isn't any better for his equally douchey qualifier, but at least he followed it up with an informative, well-written post. Something I haven't seen much of from the Reddit bashers here. At least on Reddit your pathetic contributions would be downvoted enough so that I wouldn't have to waste time responding to them and could help clarify things for people who give a shit about having a meaningful discussion.
Anything is Possible
Quochobao
Profile Joined October 2010
United States350 Posts
May 04 2012 01:13 GMT
#27
All of what OP says about statistics is right -- that his tone is arrogant does not jeopardize the merit of this argument a bit.

Is he tactless in his approach? Probably. Is he right about statistics? Surely.

Don't be put off by the qualifier right away. That only proves TL to be not open-minded to the truth, no matter how harshly stated.
Best or nothing.
LaM
Profile Blog Joined September 2011
United States1321 Posts
May 04 2012 01:17 GMT
#28
On May 04 2012 10:08 shaldengeki wrote:
While I agree that there are significant issues with the way that many statistics on TL are presented - I've posted on this before and I was the guy who nudged the monthly winrates graphs to add error bars in the first place - you're not doing the discourse any favors by reposting this, I think. There are hardly any statistical arguments actually made in the post - for instance:

Show nested quote +
On May 04 2012 09:00 Cyberonic wrote:
The TLPD winrate graphs are praetentious and amateuristic, sorry to say it but that's how it is, the error bars there are pure bollocks and are calculated using the rules of independent probability experiments, that is to say, it is assumed that the results of every series has no effect on the others, as if you flip a coin. If they were independent, sample size would be enough by a large margin to say something, but they are not independent. Because you're dealing with players, not just games. Good players simply ruin the idea of independent experiments.


There is something deeply hypocritical about decrying statistics discussions on TL for being superficial and then totally failing to present statistical evidence for your assertion that games outcomes are not independent. One would think that the actual mathematics would be pretty trivial, so simply asserting that "they are not independent because they are players" is committing exactly the sin that you're supposedly railing against.


I don't agree with you at all.

I think it is pretty clear that game results are not independent outcomes. Consider a 10game match between DRG and Joe "Code B Protoss" Schmoe.

DRG wins the first 9 games. Any rational, logical observer would favor him greatly to win the 10th game, right? But winrate graphs still assume the outcome of Game 10 should have a 50/50 chance of going either way, like it's just a coin flip. Now, in massive sample sizes this would be corrected by enough players from every race being better than their opponent in any series so that it would smooth out any errors, but in month long samples from a tiny group of pros the deviations don't get corrected.

I think the math for showing that is extremely hard, but the logic behind it is very strong. Similar to how I'm sure you won't debate that 2+2 is valid, but you would have a pretty damn hard time mathematically proving addition to me.
Anything is Possible
Primadog
Profile Blog Joined April 2010
United States4411 Posts
May 04 2012 01:20 GMT
#29
A college drop out lecturing people on latin, how all social science are inherently pseudoscience, and statistics has no place in anything but the 'real sciences'. This is one moment where I wish reddit has a serious moderation policy.
Thank God and gunrun.
SappigeKutVolKots
Profile Joined May 2012
Netherlands1 Post
Last Edited: 2012-05-04 01:22:32
May 04 2012 01:20 GMT
#30
Okay, I wanted to stay away from this site, but I couldn't let some stuff go unanswered, I am drab, anyone can message me on reddit to verify this.

On May 04 2012 09:40 windsupernova wrote:
As much as I agree with him in some points. I don't like how he comes off as someone pretty arrogant and doesn't even present some kind of credentials on why he understand statistics more than 99% of people.I mean for all we know he could be some arrogant College kid who just passed his 1st statistics class.
I'm not making an argument from authority, I don't need credentials, even if I was a cow or an anencephalic protozoan, it doesn't matter, there is no need for credentials because I'm making an argument from reason, not from authority, I do not even need to cite any sources because my argument is purely rational, not empirical. If you ask for 'credentials' to verify this post then you lost and don't know how to verify academic literature.

My credentials are irrelevant, I'm not making an argument from authority. If you do not find yourself to have the confidence to check the correctness of my argument then you shouldn't agree or disagree either way. Say to yourself 'I don't understand what he's saying', above all, don't comment on a thread whose opening you don't understand, and move on with life.

Yes, I am very smug, I'm not even smug, I'm condescending, I'm not condescending because I have a higher education, I'm condescending because I'm fed up with stupidity, the arguments I put out are very easy and basic to understand and honestly, anyone reading those graphs should come to those conclusions, yet I've seen countless and countless people misinterpreting all those graphs without coming to the realization of these very basic givens, on both reddit and TL. I've seen 50 pages of TL posts discussing those graphics about probability of races to win at certain time intervals in matchups, and maybe 1-2 people pointed out how misleading it was because of the arguments I put out, and no one listened and other people go discuss trivial and unrelated stuff like 'sample size' while there are much bigger problems. I've seen the TLPD winrates posted on both TL and reddit and people discussed them for days and so few people initially pointed out that the lines between the graphs in the old aesthetic were completely ludicrous and they should be bars, and even fewer people were critical of the fact that the error bars were calculated by a means which assumes independent experiments, which they are not.

It doesn't take a genius to see this, it just takes allowing yourself to be critical. As soon as I ask a lot of those people 'There are some grave fallacies with those stats,c an you point them out?' they will most likely come with at least 80% of the shit I pointed out and probably with some things I overlooked. It doesn't take a brain, it takes not being a mindless drone and being critical of stuff that is being posted. As for credentials, I guarantee you that the people who post those TLPD winrate graphs either have no statistical credentials, or are wilfully lying to people and oversimplifying it, because it's just statistical faux pas.

But then he doesn't say how we should go about interpreting those statistics and providing proof.
We should interpret them as what they are. They are the winrates for this month, it says nothing of balance or any other interpretations you can make of them. You see what you get, and the error bars are, simply put, incorrect and a statistical gaffe. I'm not sure what they are supposed to mean, they don't mean anything if the map scores aren't independent probability experiments.

That being said I do think most of the people take a really simplistic approach to statistics, but well statistics are a hard subject to tackle
Nope, it's very easy, it's more that people like to see things that you can't conclude from stuff.

On May 04 2012 10:09 LaM wrote:
Show nested quote +
On May 04 2012 10:05 Reptilia wrote:
because reddit is so much better. Lol.
i wouldnt be surprised if that guy had a tl account and got banned and got so butthurt he posted that.


Did you read anything past the qualifier that was added on after his original post on Reddit?

Doesn't look like it. Your post has nothing to do with the vast majority of his post. Or where you saying he got butthurt that he got banned from TL so he went to Reddit and wrote an intelligent post about balance statistics and how they can be misleading?

All these douchey little TL > Reddit posts are the type of annoying shit that makes people think TL is pretentious anyways. I know he isn't any better for his equally douchey qualifier, but at least he followed it up with an informative, well-written post. Something I haven't seen much of from the Reddit bashers here. At least on Reddit your pathetic contributions would be downvoted enough so that I wouldn't have to waste time responding to them and could help clarify things for people who give a shit about having a meaningful discussion.


As linked in the OP, I did not add that qualifier on top myself, I never added the qualifier formally, someone asked me 'Have you posted it on TL' (the OP here), I said 'Nope', he asked 'why?', I said that which he quoted.

That said, I never mentioned TL in the original post, I was mainly critical of screddit and its continued misuse of statistics and it got upvoted to be the #1 post on the screddit first page. This exemplifies a quality of screddit that I feel TL heavily lacks.

Edit: Also:

Pepper_MD just sent you a month of reddit gold! Wasn't that nice?
Here's a note that was included:
I have degree in Stats. All I have to say is Thank You.


I have no idea what reddit gold is, is it good?

User was banned for this post.
Sporks, the mighty Xel'Naga Hybrid between Spoons and Forks
KingPaddy
Profile Joined November 2010
1053 Posts
May 04 2012 01:23 GMT
#31
On May 04 2012 10:13 Quochobao wrote:
All of what OP says about statistics is right -- that his tone is arrogant does not jeopardize the merit of this argument a bit.

Is he tactless in his approach? Probably. Is he right about statistics? Surely.

Don't be put off by the qualifier right away. That only proves TL to be not open-minded to the truth, no matter how harshly stated.

But the thing is, what he says in his arrogant manner ("99% of the people don't know this") is basically common knowledge.
Is it true? For the most part yes. Is it advanced mathematics or anything ground-breaking, thought-provocing or fallacy-revealing? No. He's just arguing common sense against an imagined 99%.
shaldengeki
Profile Joined May 2009
United States104 Posts
May 04 2012 01:27 GMT
#32
On May 04 2012 10:17 LaM wrote:
Show nested quote +
On May 04 2012 10:08 shaldengeki wrote:
While I agree that there are significant issues with the way that many statistics on TL are presented - I've posted on this before and I was the guy who nudged the monthly winrates graphs to add error bars in the first place - you're not doing the discourse any favors by reposting this, I think. There are hardly any statistical arguments actually made in the post - for instance:

On May 04 2012 09:00 Cyberonic wrote:
The TLPD winrate graphs are praetentious and amateuristic, sorry to say it but that's how it is, the error bars there are pure bollocks and are calculated using the rules of independent probability experiments, that is to say, it is assumed that the results of every series has no effect on the others, as if you flip a coin. If they were independent, sample size would be enough by a large margin to say something, but they are not independent. Because you're dealing with players, not just games. Good players simply ruin the idea of independent experiments.


There is something deeply hypocritical about decrying statistics discussions on TL for being superficial and then totally failing to present statistical evidence for your assertion that games outcomes are not independent. One would think that the actual mathematics would be pretty trivial, so simply asserting that "they are not independent because they are players" is committing exactly the sin that you're supposedly railing against.


I don't agree with you at all.

I think it is pretty clear that game results are not independent outcomes. Consider a 10game match between DRG and Joe "Code B Protoss" Schmoe.

DRG wins the first 9 games. Any rational, logical observer would favor him greatly to win the 10th game, right? But winrate graphs still assume the outcome of Game 10 should have a 50/50 chance of going either way, like it's just a coin flip. Now, in massive sample sizes this would be corrected by enough players from every race being better than their opponent in any series so that it would smooth out any errors, but in month long samples from a tiny group of pros the deviations don't get corrected.

I think the math for showing that is extremely hard, but the logic behind it is very strong. Similar to how I'm sure you won't debate that 2+2 is valid, but you would have a pretty damn hard time mathematically proving addition to me.


I think you're probably mistaking what the purpose of the winrate charts is, and what "independent outcomes" means in the context of repeated experiments. Of course you wouldn't apply the winrate charts to the situation you're describing - what they do is aggregate results across several skill levels and regions to provide a general indicator of race balance. Nobody is claiming that every single game between a protoss and a zerg has a 50/50 chance of going either way, and if this is how you're interpreting the winrate charts, that's definitely a problem on your end!

The issue you describe with skill impacting win chances is actually not an issue of independent events at all. If the events were dependent, then the results from all prior games between all zergs and all protosses would impact the win probability of the next game between a zerg and a protoss. This is not the issue at hand in your scenario, where you're talking about skill level of each player impacting win probabilities. That's the realm of ELO, and the winrate charts make no attempt at gauging the skill levels of each player.
awu25
Profile Joined April 2010
United States2003 Posts
May 04 2012 01:29 GMT
#33
It is annoying when people look at winrates and automatically assume X race is OP. However, the condescending tone of the post will probably a turn off for most people and they will continue to believe in these statistics instead of actually playing the game.
LaM
Profile Blog Joined September 2011
United States1321 Posts
May 04 2012 01:35 GMT
#34
On May 04 2012 10:27 shaldengeki wrote:
Show nested quote +
On May 04 2012 10:17 LaM wrote:
On May 04 2012 10:08 shaldengeki wrote:
While I agree that there are significant issues with the way that many statistics on TL are presented - I've posted on this before and I was the guy who nudged the monthly winrates graphs to add error bars in the first place - you're not doing the discourse any favors by reposting this, I think. There are hardly any statistical arguments actually made in the post - for instance:

On May 04 2012 09:00 Cyberonic wrote:
The TLPD winrate graphs are praetentious and amateuristic, sorry to say it but that's how it is, the error bars there are pure bollocks and are calculated using the rules of independent probability experiments, that is to say, it is assumed that the results of every series has no effect on the others, as if you flip a coin. If they were independent, sample size would be enough by a large margin to say something, but they are not independent. Because you're dealing with players, not just games. Good players simply ruin the idea of independent experiments.


There is something deeply hypocritical about decrying statistics discussions on TL for being superficial and then totally failing to present statistical evidence for your assertion that games outcomes are not independent. One would think that the actual mathematics would be pretty trivial, so simply asserting that "they are not independent because they are players" is committing exactly the sin that you're supposedly railing against.


I don't agree with you at all.

I think it is pretty clear that game results are not independent outcomes. Consider a 10game match between DRG and Joe "Code B Protoss" Schmoe.

DRG wins the first 9 games. Any rational, logical observer would favor him greatly to win the 10th game, right? But winrate graphs still assume the outcome of Game 10 should have a 50/50 chance of going either way, like it's just a coin flip. Now, in massive sample sizes this would be corrected by enough players from every race being better than their opponent in any series so that it would smooth out any errors, but in month long samples from a tiny group of pros the deviations don't get corrected.

I think the math for showing that is extremely hard, but the logic behind it is very strong. Similar to how I'm sure you won't debate that 2+2 is valid, but you would have a pretty damn hard time mathematically proving addition to me.


I think you're probably mistaking what the purpose of the winrate charts is, and what "independent outcomes" means in the context of repeated experiments. Of course you wouldn't apply the winrate charts to the situation you're describing - what they do is aggregate results across several skill levels and regions to provide a general indicator of race balance. Nobody is claiming that every single game between a protoss and a zerg has a 50/50 chance of going either way, and if this is how you're interpreting the winrate charts, that's definitely a problem on your end!

The issue you describe with skill impacting win chances is actually not an issue of independent events at all. If the events were dependent, then the results from all prior games between all zergs and all protosses would impact the win probability of the next game between a zerg and a protoss. This is not the issue at hand in your scenario, where you're talking about skill level of each player impacting win probabilities. That's the realm of ELO, and the winrate charts make no attempt at gauging the skill levels of each player.


And you are again making the mistake that winrate charts indicate win probabilities and balance, which is the whole point, THEY DON'T!

You wanted error bars added to winrate charts? Why? What error? They are cataloging winrates from the past month, where is the error coming from?

My mistake was in even ceding that error bars should be part of the chart and make any sense with them. They don't. I agree my explanation isn't applicable to the charts, but that isn't because my explanation is assuming things incorrectly, it's because the error bars shouldn't be there in the first place...
Anything is Possible
Haiq343
Profile Joined August 2011
United States2548 Posts
May 04 2012 01:35 GMT
#35
This is a pretty clear example of how you say things being as important as what you say. The content of the post is very good, correctly pointing out the people really like to conclude things from statistics that are totally not what the statistics say. It would make a much more compelling argument if it wasn't written by such an asshole.
I am enough of an artist to draw freely upon my imagination. Imagination is more important than knowledge. For knowledge is limited, whereas imagination encircles the world. -Einstein
Whole
Profile Blog Joined May 2010
United States6046 Posts
May 04 2012 01:36 GMT
#36
I prefer IdrA's way of saying this:

if statistics get you hard make one of those ladder analysis pages or something, but stop interfering with balance discussions.
Ansinjunger
Profile Joined November 2010
United States2451 Posts
May 04 2012 01:37 GMT
#37
On May 04 2012 09:26 dmasterding wrote:
Did any of you guys actually read the thing? He didn't actually give any opinions about the matchups, he was just trying to get rid of some misconceptions people had about interpretation of results. I am pretty sure that if the OP never mentioned this person was from r/SC you guys wouldn't be so biased against the author.



Frankly, he was asking for the bias by being biased against TL.

He didn't say anything new or ground breaking. I thought this was supposed to be some statistical elephant in the room where some expert of statistics would show us something very important. Instead, it basically said, "don't make poor assumptions." Well, good advice, I say.
Percutio
Profile Blog Joined April 2010
United States1672 Posts
Last Edited: 2012-05-04 01:41:55
May 04 2012 01:38 GMT
#38
This reminds of an old point that a lot of old brood war veterans like Artosis used to make about how the older maps were actually imbalanced despite the winrates they had because at the time dominant and superior players like boxer, iloveoov, and nada were skewing the results to the point where the map statistic should have been more than 50% win rate for terran because those players were much better than their opponents and had higher than 50% win rates and there weren't enough games from other average or less skilled players to reasonably counteract that fact. This is kind of related to how some maps were balanced at the top korean level for certain matchups, but not balanced for a top foreigner level. Essentially the balance in brood war and in brood war maps was relevant to the skill of the players at the time. Older maps might have sported balanced win rates only because of the strategy and skill of the players at the time. Applying modern strategies and the improved skill of players can suddenly reveal undiscovered imbalance.

I'm not saying whether or not that old point is correct, but it certainly had better concrete concepts than whatever this guy seems to be pointing out.

This extends even further, most tournaments have qualifiers, so say X is underpowered, the players who play X that get into the tournament are simply better because thety got in despite the imbalance, therefore as they are better, they will continue to win even despite the imbalance vested against them, thereby skewing the results to more 50-50 than it actually is.

This is kind of the bonjwa terran balance argument for old broodwar maps. However a lot of people didn't agree with this because they didn't believe the bonjwas were really that statistically significant. This causes issues when you talk about the sample size not being large enough.

If the sample size truly isn't large enough to make star players statistically insignificant then you would need to be able to identify "bonjwas" or something similar in player dominance to even hint that one race is underpowered with a 50% winrate. If you can't do that then you are making a judgement call that a few star players that play a specific race are that much superior to everyone else without winning enough to actually prove it. Without good proof the only other scenario you could argue is that it isn't the game that is balanced, but just the number of star players playing for each race (Which is silly because of the bizarre concept of improvement).
What does it matter how I loose it?
reneg
Profile Joined September 2010
United States859 Posts
May 04 2012 01:40 GMT
#39
On May 04 2012 10:35 LaM wrote:
Show nested quote +
On May 04 2012 10:27 shaldengeki wrote:
On May 04 2012 10:17 LaM wrote:
On May 04 2012 10:08 shaldengeki wrote:
While I agree that there are significant issues with the way that many statistics on TL are presented - I've posted on this before and I was the guy who nudged the monthly winrates graphs to add error bars in the first place - you're not doing the discourse any favors by reposting this, I think. There are hardly any statistical arguments actually made in the post - for instance:

On May 04 2012 09:00 Cyberonic wrote:
The TLPD winrate graphs are praetentious and amateuristic, sorry to say it but that's how it is, the error bars there are pure bollocks and are calculated using the rules of independent probability experiments, that is to say, it is assumed that the results of every series has no effect on the others, as if you flip a coin. If they were independent, sample size would be enough by a large margin to say something, but they are not independent. Because you're dealing with players, not just games. Good players simply ruin the idea of independent experiments.


There is something deeply hypocritical about decrying statistics discussions on TL for being superficial and then totally failing to present statistical evidence for your assertion that games outcomes are not independent. One would think that the actual mathematics would be pretty trivial, so simply asserting that "they are not independent because they are players" is committing exactly the sin that you're supposedly railing against.


I don't agree with you at all.

I think it is pretty clear that game results are not independent outcomes. Consider a 10game match between DRG and Joe "Code B Protoss" Schmoe.

DRG wins the first 9 games. Any rational, logical observer would favor him greatly to win the 10th game, right? But winrate graphs still assume the outcome of Game 10 should have a 50/50 chance of going either way, like it's just a coin flip. Now, in massive sample sizes this would be corrected by enough players from every race being better than their opponent in any series so that it would smooth out any errors, but in month long samples from a tiny group of pros the deviations don't get corrected.

I think the math for showing that is extremely hard, but the logic behind it is very strong. Similar to how I'm sure you won't debate that 2+2 is valid, but you would have a pretty damn hard time mathematically proving addition to me.


I think you're probably mistaking what the purpose of the winrate charts is, and what "independent outcomes" means in the context of repeated experiments. Of course you wouldn't apply the winrate charts to the situation you're describing - what they do is aggregate results across several skill levels and regions to provide a general indicator of race balance. Nobody is claiming that every single game between a protoss and a zerg has a 50/50 chance of going either way, and if this is how you're interpreting the winrate charts, that's definitely a problem on your end!

The issue you describe with skill impacting win chances is actually not an issue of independent events at all. If the events were dependent, then the results from all prior games between all zergs and all protosses would impact the win probability of the next game between a zerg and a protoss. This is not the issue at hand in your scenario, where you're talking about skill level of each player impacting win probabilities. That's the realm of ELO, and the winrate charts make no attempt at gauging the skill levels of each player.


And you are again making the mistake that winrate charts indicate win probabilities and balance, which is the whole point, THEY DON'T!

You wanted error bars added to winrate charts? Why? What error? They are cataloging winrates from the past month, where is the error coming from?

My mistake was in even ceding that error bars should be part of the chart and make any sense with them. They don't. I agree my explanation isn't applicable to the charts, but that isn't because my explanation is assuming things incorrectly, it's because the error bars shouldn't be there in the first place...


While that's exactly the point, they don't indicate balance, everyone seems to completely ignore this fact, and treat them as irrefutable evidence of "P is UP, T is OP, because they have been losing/winning, look at the graph, it's obvious."

When people start looking past the stats and actually thinking, you can use the winrates as just another fun useless fact
moose...indian
shaldengeki
Profile Joined May 2009
United States104 Posts
May 04 2012 01:40 GMT
#40
On May 04 2012 10:35 LaM wrote:
Show nested quote +
On May 04 2012 10:27 shaldengeki wrote:
On May 04 2012 10:17 LaM wrote:
On May 04 2012 10:08 shaldengeki wrote:
While I agree that there are significant issues with the way that many statistics on TL are presented - I've posted on this before and I was the guy who nudged the monthly winrates graphs to add error bars in the first place - you're not doing the discourse any favors by reposting this, I think. There are hardly any statistical arguments actually made in the post - for instance:

On May 04 2012 09:00 Cyberonic wrote:
The TLPD winrate graphs are praetentious and amateuristic, sorry to say it but that's how it is, the error bars there are pure bollocks and are calculated using the rules of independent probability experiments, that is to say, it is assumed that the results of every series has no effect on the others, as if you flip a coin. If they were independent, sample size would be enough by a large margin to say something, but they are not independent. Because you're dealing with players, not just games. Good players simply ruin the idea of independent experiments.


There is something deeply hypocritical about decrying statistics discussions on TL for being superficial and then totally failing to present statistical evidence for your assertion that games outcomes are not independent. One would think that the actual mathematics would be pretty trivial, so simply asserting that "they are not independent because they are players" is committing exactly the sin that you're supposedly railing against.


I don't agree with you at all.

I think it is pretty clear that game results are not independent outcomes. Consider a 10game match between DRG and Joe "Code B Protoss" Schmoe.

DRG wins the first 9 games. Any rational, logical observer would favor him greatly to win the 10th game, right? But winrate graphs still assume the outcome of Game 10 should have a 50/50 chance of going either way, like it's just a coin flip. Now, in massive sample sizes this would be corrected by enough players from every race being better than their opponent in any series so that it would smooth out any errors, but in month long samples from a tiny group of pros the deviations don't get corrected.

I think the math for showing that is extremely hard, but the logic behind it is very strong. Similar to how I'm sure you won't debate that 2+2 is valid, but you would have a pretty damn hard time mathematically proving addition to me.


I think you're probably mistaking what the purpose of the winrate charts is, and what "independent outcomes" means in the context of repeated experiments. Of course you wouldn't apply the winrate charts to the situation you're describing - what they do is aggregate results across several skill levels and regions to provide a general indicator of race balance. Nobody is claiming that every single game between a protoss and a zerg has a 50/50 chance of going either way, and if this is how you're interpreting the winrate charts, that's definitely a problem on your end!

The issue you describe with skill impacting win chances is actually not an issue of independent events at all. If the events were dependent, then the results from all prior games between all zergs and all protosses would impact the win probability of the next game between a zerg and a protoss. This is not the issue at hand in your scenario, where you're talking about skill level of each player impacting win probabilities. That's the realm of ELO, and the winrate charts make no attempt at gauging the skill levels of each player.


And you are again making the mistake that winrate charts indicate win probabilities and balance, which is the whole point, THEY DON'T!

You wanted error bars added to winrate charts? Why? What error? They are cataloging winrates from the past month, where is the error coming from?

My mistake was in even ceding that error bars should be part of the chart and make any sense with them. They don't. I agree my explanation isn't applicable to the charts, but that isn't because my explanation is assuming things incorrectly, it's because the error bars shouldn't be there in the first place...

Please remain calm. I'd love to have a level-headed discussion with you!

The winrate charts indicate win probabilities aggregated across each race. This is indisputable. They provide what I believe is a general indicator of balance - I don't believe that there are statistically-significant differences in skill between races, so it stands to reason that in the aggregate, this provides some information on the balance between races.

The error bars allow you to determine whether one month's average is significantly different from previous months. This is hugely important as before their addition people were making all sorts of wild claims as to how certain patches were throwing race balance off. Now that we can determine whether or not each month was significantly-different from previous months, we can more reasonably talk about whether or not changes to the game are having effects on winrates.
Prev 1 2 3 4 5 6 7 Next All
Please log in or register to reply.
Live Events Refresh
BSL Team A[vengers]
14:00
vs Korea
Gypsy vs nOOBLIVE!
JDConan vs ScanLIVE!
ZZZero.O133
LiquipediaDiscussion
CrankTV Team League
13:00
Preliminary Stage: 3 Bo5s
Team Liquid vs Shopify RebellionLIVE!
Team Vitality vs Team Falcon
LiquipediaDiscussion
[ Submit Event ]
Live Streams
Refresh
StarCraft 2
Railgan 112
IndyStarCraft 99
StarCraft: Brood War
Movie 1557
ZZZero.O 133
Dewaltoss 70
ToSsGirL 39
sas.Sziky 20
Terrorterran 6
iFU.spx 5
Dota 2
qojqva3856
Dendi1158
KheZu576
Fuzer 250
canceldota91
Super Smash Bros
Mew2King80
Westballz35
Heroes of the Storm
Khaldor386
Liquid`Hasu9
Other Games
singsing2545
ScreaM1854
B2W.Neo924
DeMusliM311
KnowMe309
Hui .199
XcaliburYe134
ArmadaUGS22
Organizations
Counter-Strike
PGL13550
StarCraft 2
WardiTV881
StarCraft: Brood War
Kim Chul Min (afreeca) 12
CasterMuse 9
StarCraft 2
Blizzard YouTube
StarCraft: Brood War
BSLTrovo
sctven
[ Show 16 non-featured ]
StarCraft 2
• StrangeGG 60
• intothetv
• AfreecaTV YouTube
• Kozan
• IndyKCrew
• LaughNgamezSOOP
• Migwel
• sooper7s
StarCraft: Brood War
• 3DClanTV 26
• Azhi_Dahaki12
• BSLYoutube
• STPLYoutube
• ZZZeroYoutube
Dota 2
• Ler96
League of Legends
• Jankos3801
Other Games
• WagamamaTV317
Upcoming Events
RSL Revival
1h 29m
Wardi Open
20h 29m
CrankTV Team League
21h 29m
Monday Night Weeklies
1d 1h
Replay Cast
1d 18h
WardiTV Invitational
1d 20h
CrankTV Team League
1d 21h
Replay Cast
2 days
CrankTV Team League
2 days
Replay Cast
3 days
[ Show More ]
The PondCast
3 days
CrankTV Team League
3 days
Replay Cast
4 days
WardiTV Invitational
4 days
CrankTV Team League
4 days
Replay Cast
5 days
Sparkling Tuna Cup
6 days
Liquipedia Results

Completed

Acropolis #4 - TS2
WardiTV TLMC #15
HCC Europe

Ongoing

BSL 21 Points
CSL 2025 AUTUMN (S18)
BSL 21 Team A
C-Race Season 1
IPSL Winter 2025-26
KCM Race Survival 2025 Season 4
CranK Gathers Season 2: SC II Pro Teams
EC S1
PGL Masters Bucharest 2025
Thunderpick World Champ.
CS Asia Championships 2025
ESL Pro League S22
StarSeries Fall 2025
FISSURE Playground #2
BLAST Open Fall 2025
BLAST Open Fall Qual
Esports World Cup 2025
BLAST Bounty Fall 2025

Upcoming

SC4ALL: Brood War
YSL S2
BSL Season 21
BSL 21 Non-Korean Championship
RSL Offline Finals
WardiTV 2025
RSL Revival: Season 3
Stellar Fest
SC4ALL: StarCraft II
META Madness #9
eXTREMESLAND 2025
ESL Impact League Season 8
SL Budapest Major 2025
BLAST Rivals Fall 2025
IEM Chengdu 2025
TLPD

1. ByuN
2. TY
3. Dark
4. Solar
5. Stats
6. Nerchio
7. sOs
8. soO
9. INnoVation
10. Elazer
1. Rain
2. Flash
3. EffOrt
4. Last
5. Bisu
6. Soulkey
7. Mini
8. Sharp
Sidebar Settings...

Advertising | Privacy Policy | Terms Of Use | Contact Us

Original banner artwork: Jim Warren
The contents of this webpage are copyright © 2025 TLnet. All Rights Reserved.