• Log InLog In
  • Register
Liquid`
Team Liquid Liquipedia
EDT 13:52
CEST 19:52
KST 02:52
  • Home
  • Forum
  • Calendar
  • Streams
  • Liquipedia
  • Features
  • Store
  • EPT
  • TL+
  • StarCraft 2
  • Brood War
  • Smash
  • Heroes
  • Counter-Strike
  • Overwatch
  • Liquibet
  • Fantasy StarCraft
  • TLPD
  • StarCraft 2
  • Brood War
  • Blogs
Forum Sidebar
Events/Features
News
Featured News
Classic wins Code S Season 2 (2025)12Code S RO4 & Finals Preview: herO, Rogue, Classic, GuMiho0TL Team Map Contest #5: Presented by Monster Energy6Code S RO8 Preview: herO, Zoun, Bunny, Classic7Code S RO8 Preview: Rogue, GuMiho, Solar, Maru3
Community News
Weekly Cups (June 9-15): herO doubles on GSL week2Firefly suspended by EWC, replaced by Lancer12Classic & herO RO8 Interviews: "I think it’s time to teach [Rogue] a lesson."2Rogue & GuMiho RO8 interviews: "Lifting that trophy would be a testament to all I’ve had to overcome over the years and how far I’ve come on this journey.8Code S RO8 Results + RO4 Bracket (2025 Season 2)14
StarCraft 2
General
The SCII GOAT: A statistical Evaluation TL Team Map Contest #5: Presented by Monster Energy Classic wins Code S Season 2 (2025) Weekly Cups (June 9-15): herO doubles on GSL week The Memories We Share - Facing the Final(?) GSL
Tourneys
EWC 2025 Regional Qualifiers (May 28-June 1) RSL: Revival, a new crowdfunded tournament series Sparkling Tuna Cup - Weekly Open Tournament SOOPer7s Showmatches 2025 $5,100+ SEL Season 2 Championship (SC: Evo)
Strategy
How did i lose this ZvP, whats the proper response Simple Questions Simple Answers [G] Darkgrid Layout
Custom Maps
[UMS] Zillion Zerglings
External Content
Mutation # 478 Instant Karma Mutation # 477 Slow and Steady Mutation # 476 Charnel House Mutation # 475 Hard Target
Brood War
General
bonjwa.tv: my AI project that translates BW videos BW General Discussion BGH Auto Balance -> http://bghmmr.eu/ StarCraft & BroodWar Campaign Speedrun Quest ASL20 Preliminary Maps
Tourneys
[BSL20] ProLeague Bracket Stage - LB Round 4 & 5 [ASL19] Grand Finals [BSL20] ProLeague Bracket Stage - WB Finals & LBR3 The Casual Games of the Week Thread
Strategy
Simple Questions, Simple Answers I am doing this better than progamers do. [G] How to get started on ladder as a new Z player
Other Games
General Games
Stormgate/Frost Giant Megathread Path of Exile Nintendo Switch Thread Beyond All Reason What do you want from future RTS games?
Dota 2
Official 'what is Dota anymore' discussion
League of Legends
Heroes of the Storm
Simple Questions, Simple Answers Heroes of the Storm 2.0
Hearthstone
Heroes of StarCraft mini-set
TL Mafia
TL Mafia Community Thread Vanilla Mini Mafia
Community
General
US Politics Mega-thread Things Aren’t Peaceful in Palestine Russo-Ukrainian War Thread UK Politics Mega-thread Echoes of Revolution and Separation
Fan Clubs
SKT1 Classic Fan Club! Maru Fan Club
Media & Entertainment
[\m/] Heavy Metal Thread [Manga] One Piece Anime Discussion Thread Korean Music Discussion
Sports
2024 - 2025 Football Thread TeamLiquid Health and Fitness Initiative For 2023 NHL Playoffs 2024 Formula 1 Discussion
World Cup 2022
Tech Support
Computer Build, Upgrade & Buying Resource Thread
TL Community
The Automated Ban List
Blogs
How Pro Gamers Cope with Str…
TrAiDoS
StarCraft improvement
iopq
Heero Yuy & the Tax…
KrillinFromwales
I was completely wrong ab…
jameswatts
Need Your Help/Advice
Glider
Trip to the Zoo
micronesia
Customize Sidebar...

Website Feedback

Closed Threads



Active: 27210 users

What you don't know about statistics

Forum Index > SC2 General
Post a Reply
Normal
Cyberonic
Profile Blog Joined September 2011
Germany80 Posts
Last Edited: 2012-05-04 07:33:28
May 04 2012 00:00 GMT
#1
This post is originally written by Drabzalver on Reddit. Since he does not have a TL account I asked and was allowed to repost if I include this:

Drabzalver on Reddit:
TL is the [swear word] of praetentious [swear word] where moderators reward supposed 'high quality posts' which are full of statistical and scientific garbage I just outlined just because they are 'praesented nicely' and the mods basically think that any long post with a lot of images and no swear words is intellectually advanced while often a lot of it is total garbage filled with wrong interpretations and grave statistical errors.
Also, I don't have a TL account, you're welcome to repost, but do include this qualifier, should be fun,

Clarification: This is not my opinion of this community and I only posted the quote because I was asked to and respect his author rights. The content is very interesting.


EDIT: A lot of people claim the OP/author just wants to bash the TL community. That is wrong! The text is directed towrds the reddit Community and was originally called "What reddit doesn't know about statistics" with me liberately altering the title... I see this might have let to some confusion.
-----------------------------------------------------------------------------------------------------------------------------------

What you don't know about statistics
by Drabzalver

Not a day goes by without some stat or graph being posted here or whenever in the community, 99% of the time this deals with winrates and balance, 99% of the people commenting on it have no clue how to interpret these numbers and how little they actually mean, and no, this has nothing to do with sample size. Stats and probability theory is actually something you study on for years and is a specialized field to avoid people from making the errors people make here. So stay a while and listen:

- Fallacy: winrates indicate balance

Yes and no, for the most part, they do not indicate balance, rather, they indicate balance shifts. It's so trivial to see that it boggles me that people don't figure this out on their own. Assume that race X was actually underpowered last month and is balanced this month. This means that the X players who actually qualified last month and stayed in tournaments are actually better than the Y and Z players, therefore, now that it gets balanced, as they are overal better, they start smashing Y and Z because they are better, thereby suddenly making the graph appear as X has been 'overbuffed' or whatever else while simply the few X players that were around in the scene were better. This will continue on until the mediocre Y and Z players who were more so carried by their race than the X players get weeded out.

This extends even further, most tournaments have qualifiers, so say X is underpowered, the players who play X that get into the tournament are simply better because thety got in despite the imbalance, therefore as they are better, they will continue to win even despite the imbalance vested against them, thereby skewing the results to more 50-50 than it actually is.

In fact, there are even more things wrong with this idea, because tournaments generally feature some form of elimination, this means that players who are better stay in the tournament for longer, therefore they contribute more to the amount of games played, since these are the players that overcome balance, again, it skews to 50-50 more than it is.

So yes, no matter how you have it, balance will skew to 50-50 more than it actually is, and monthily spikes and variation indicate balance shift more so than absolute balance. The only way to find out absolute balance is to get a random pool of pros, force them to play random, and have a round robin tournament to ensure that everyone plays the same amount of games. Very unfeasible to get enough games with that for a reasonable sample size.

- Fallacy: sample size is big enough

The TLPD winrate graphs are praetentious and amateuristic, sorry to say it but that's how it is, the error bars there are pure bollocks and are calculated using the rules of independent probability experiments, that is to say, it is assumed that the results of every series has no effect on the others, as if you flip a coin. If they were independent, sample size would be enough by a large margin to say something, but they are not independent. Because you're dealing with players, not just games. Good players simply ruin the idea of independent experiments. The fact that Stephano won 4-0 this time and 4-0 the last time is not independent, they have a common cause, Stephano is fucking good. More so than that, as indicated before, most tournaments feature a form of elimination, which means that because Stephano is fucking good he simply contributes more to the sample size than players who are not as good. One has to realize that in some single elimination tournaments, it's possible for the champion to have played 40% of all the games in the tournament...

As an extreme example to show that the idea is fundamentally flawed. Say you have 2 and only 2 players, the best 2 in the world, let's say you have MKP vs DRG, they play a thousand games, this ends 720-280 in MKP's favour. Can you then say 'Clearly since this is taken from the absolute top, TvZ is highly imbalanced as a thousand games is a very large sample size'. No! you cannot, and while this is an extreme example it shows the general idea and the fallacy thereof, the games are not independent probability experiments.

Especially in Korea, it is very likely that the TLPD graphs we're all so fond of do not indicate balance overall, they just indicate whichever race has a couple of good players this month that dominated everything. The Korean graph just shifts around every month while the amount of games should be large enough to stop that from happening, if they were independent experiments, but they aren't, they are quite dependent and how much the KR graph flips around each months demonstrates how unreliable it is.

Simply put, the amount of games has a large enough sample size to be significant, but the amount of players is way too small.

- Fallacy: advantages at certain times of matchups expressed in graphs

There are also a lot of graphs posted which supposedly indicate that some races may have an advantage at certain matchups. Oh boy do people misread what these graphs mean. Take this bad boy.

A naïve way to read it would simply be 'Hmm, Z has an early game advantage in ZvP, then it becomes about even, then P has a slight advantage up to the end, then Z again.', wrong; look closely, what does the graph actually say, it says this:
IF a ZvP game ends in the 0-5 minute range, the chance is 60% it ends in Z's favour.
Now the 'if' is so bloody important here, the game needn't end. Now, everyone of course realizes that that part is caused by early pools. Does Zerg really have a large advantage at that point? Can Zerg force a win at that point if they want to, are early pools overpowered? No, not at all, so what is going on?

Imagine a ZvP, Z decides to 7pool, P doesn't scout soon enough, lings get in, kill every probe, traalalala, P GG's. Game over in the first 5 minutes in Z's favour.
Okay, imagine a ZvP, Z decides to 7pool, P scouts in time, gets his wall up, damn, Z's like 'fuck man, shouldn't have done that'. But not necesarily GG's unless IdrA, the game goes on, Z however plays at such a disadvantage that in the next 5-10 minutes surely P will claim his victory unless P messes up.

See the fallacy? That Z has that 'early game advantage' doesn't mean that Z is more powerful or that 7pools are too powerful, it just means that IF the game immediately ends due to a 7pool it will most likely be in Z's favour. If the 7pool fails, the game doesn't end at that point, Z will most likely stay in the game and play from a significant disadvantage to lose later.

It is a grave statistical error of the magnitude of interpreting 'If a 8 year old child dies, the chance is the greatest he dies from a car swoop' as 'It is very likely 8 year old children die from car swoops'.

The graph doesn't even say how likely it is that the game ends at certain intervals. For all you it's far more likely for P to win in the late game than in the mid game, even though the graph indicates that if the game ends in the late game, the chance is higher that Z takes it. And even so, that still says nothing about advantages of races at certain times. One would assume that if a race is likely to win at time X, that race enjoys an advantage slightly before that time, no?

What would be far more intersting, though also not conclusive, would be a graph which outlines 'How large was the percentage of Z wins in ZvP at each interval', which is fundamentally different from 'at each interval, if the game ends, how often does it end into Z's favour in ZvP'. My bet is that because 7pools are actually quite rare, it would not at all show the huge spike for Z in the early game.
halfies
Profile Joined November 2011
United Kingdom327 Posts
May 04 2012 00:09 GMT
#2
praetentious?
really?
i hope that how he spelt it too, because it would be really funny if he wrote this much about people being pretentious and used big words that he couldn't even spell.
Lorizean
Profile Blog Joined March 2011
Germany1330 Posts
May 04 2012 00:12 GMT
#3
None of this is new or even in-depth. He starts of by saying that probability and stats are a filed that people study and continues on to bring arguments anybody could make. Not really proving a point there.
Not that I'm saying he's necessarily wrong, but having such an arrogant tone and then bringing something this trivial to the table is a bit... pretentious.
imMUTAble787
Profile Joined November 2011
United States680 Posts
May 04 2012 00:13 GMT
#4
reddit is a cesspool of memes and retardation

that being said, i love the job the mods do on these forums.

as far as stats etc go i really dont pay any mind to them. the only ones i think have any validity would be the GSL player stats because they are tournament level and reliable.
*eternalenvy fanboy*
Ripper41
Profile Joined July 2011
284 Posts
Last Edited: 2012-05-04 00:16:54
May 04 2012 00:16 GMT
#5
On May 04 2012 09:09 halfies wrote:
praetentious?
really?
i hope that how he spelt it too, because it would be really funny if he wrote this much about people being pretentious and used big words that he couldn't even spell.

Well he explains...
it's worse, he spelled it that way intentionally. still, he makes some good points in the substance of what he says.
Chocolate
Profile Blog Joined December 2010
United States2350 Posts
May 04 2012 00:17 GMT
#6
So basically the person who came up with this thinks that he is smarter than 99% of everyone else and then proceeds to list "fallacies" which tend to skew game results. Then he explains that ZvP ends early more often in favor of the zerg because of early pools.

I don't really see the value in this. It only shows what we already knew, that statistics aren't always true due to a couple factors. He also calls us pretentious but spells it oddly :/. The author himself strikes me as very pretentious.
Integra
Profile Blog Joined January 2008
Sweden5626 Posts
Last Edited: 2012-05-04 00:19:21
May 04 2012 00:18 GMT
#7
Yes, statistics taken out of context and without understanding of what they are suppose to prove makes them misleading. Good for reddit that they finally understand this

EDIT. @Chocolate: there is no value in the post, the poster just wanted to bash on TL.Net community and used statistics as an excuse.
"Dark Pleasure" | | I survived the Locust war of May 3, 2014
dmasterding
Profile Blog Joined January 2011
United States205 Posts
May 04 2012 00:19 GMT
#8
On May 04 2012 09:13 imMUTAble787 wrote:
reddit is a cesspool of memes and retardation

that being said, i love the job the mods do on these forums.

as far as stats etc go i really dont pay any mind to them. the only ones i think have any validity would be the GSL player stats because they are tournament level and reliable.


Please stop bashing the r/sc forums if you don't actually know what goes on over there. That was a lot more prevalent before, but even if you look at the front page right now you can see that there's a lot of starcraft legitimate related content and not just random shitty memes.
No tears now, only dreams.
Trumpstyle
Profile Joined May 2011
Sweden114 Posts
May 04 2012 00:23 GMT
#9
Sorry he's theory is totally garbage as he thinks the skill gap between the races are massive(it's minimum) and screws with the statistic.
dmasterding
Profile Blog Joined January 2011
United States205 Posts
May 04 2012 00:26 GMT
#10
Did any of you guys actually read the thing? He didn't actually give any opinions about the matchups, he was just trying to get rid of some misconceptions people had about interpretation of results. I am pretty sure that if the OP never mentioned this person was from r/SC you guys wouldn't be so biased against the author.
No tears now, only dreams.
i)awn
Profile Joined October 2011
United States189 Posts
Last Edited: 2012-05-04 00:28:04
May 04 2012 00:26 GMT
#11
While tournaments are indeed not reliable to calculate balance statistics because of the small pool and elimination process as Drabzalver said; tournaments and statistics still contain quite useful information. Many people might not interpret or analyze the numbers correctly but that doesn't mean that the numbers are useless.
Cyberonic
Profile Blog Joined September 2011
Germany80 Posts
May 04 2012 00:33 GMT
#12
On May 04 2012 09:26 dmasterding wrote:
Did any of you guys actually read the thing? He didn't actually give any opinions about the matchups, he was just trying to get rid of some misconceptions people had about interpretation of results. I am pretty sure that if the OP never mentioned this person was from r/SC you guys wouldn't be so biased against the author.


I think so, too... There's no reason for bashing around. I wrote why I included his text but I'd rather have a discussion on the topic and not its preface.
Bidj
Profile Joined September 2010
France98 Posts
May 04 2012 00:37 GMT
#13
At least a very interesting read.
Rooooaaaar
Jinsho
Profile Joined March 2011
United Kingdom3101 Posts
May 04 2012 00:37 GMT
#14
IdrA has been always been referring to point 1 and 2 when asked about winrates.
windsupernova
Profile Joined October 2010
Mexico5280 Posts
May 04 2012 00:40 GMT
#15
As much as I agree with him in some points. I don't like how he comes off as someone pretty arrogant and doesn't even present some kind of credentials on why he understand statistics more than 99% of people.I mean for all we know he could be some arrogant College kid who just passed his 1st statistics class.

Arguing from authority only works if you prove you are have some kind of authority. But his arguments are nice and he does seems to have some kind of understanding of statistics. But then he doesn't say how we should go about interpreting those statistics and providing proof.

That being said I do think most of the people take a really simplistic approach to statistics, but well statistics are a hard subject to tackle
"Its easy, just trust your CPU".-Boxer on being good at games
LaM
Profile Blog Joined September 2011
United States1321 Posts
May 04 2012 00:42 GMT
#16
On May 04 2012 09:13 imMUTAble787 wrote:
reddit is a cesspool of memes and retardation

that being said, i love the job the mods do on these forums.

as far as stats etc go i really dont pay any mind to them. the only ones i think have any validity would be the GSL player stats because they are tournament level and reliable.


Maybe you should read the OP instead of just the qualifier.
Anything is Possible
LaM
Profile Blog Joined September 2011
United States1321 Posts
May 04 2012 00:43 GMT
#17
On May 04 2012 09:23 Trumpstyle wrote:
Sorry he's theory is totally garbage as he thinks the skill gap between the races are massive(it's minimum) and screws with the statistic.


He makes literally 0 claims about the skill gap between races. Reread what he said.
Anything is Possible
JitnikoVi
Profile Joined May 2010
Russian Federation396 Posts
May 04 2012 00:45 GMT
#18
i dont understand the point of this... most of this is common sense yet it states that 99% of people dont know this?

really?
In theory yes, but theoretically, no.
Mephiztopheles1
Profile Blog Joined December 2010
1124 Posts
May 04 2012 00:49 GMT
#19
On May 04 2012 09:26 dmasterding wrote:
Did any of you guys actually read the thing? He didn't actually give any opinions about the matchups, he was just trying to get rid of some misconceptions people had about interpretation of results. I am pretty sure that if the OP never mentioned this person was from r/SC you guys wouldn't be so biased against the author.

I read it. He is extremely aggressive so he begets extremely aggressive answers.

Regardless of his tone and of any tl vs reddit thing brewing here, his points should accompany threads (better phrased, of course) like the TLPD win rates thread in my opinion so that users with no background in statistics (for whatever reason) can have a better grasp of what the data presented means and/or doesn't.
xrapture
Profile Blog Joined December 2011
United States1644 Posts
May 04 2012 00:55 GMT
#20
If that is a top of the line post from reddit then I'm glad I stick to TL lol.

He just said random stuff in an aggressive way (stuff everyone knows anyway) and called TLPD statistics pretentious? hypocrite much?
Everyone is either delusional, a nihlilst, or dead from suicide.
corose
Profile Joined August 2011
United States31 Posts
May 04 2012 00:56 GMT
#21
???

This is horrible. He should include the fallacies that he makes too. There might be 3 things in this that make sense?

Why are we balancing a game around X's in a tournament who are better but underpowered, and who start crushing when it becomes balanced? I don't think Blizzard is...

And how do you know this is due to balance and not metagame shift, or players finally "getting" their races and utilizing what the race has to offer? Well, since your post is all anecdotal...you don't.

this means that players who are better stay in the tournament for longer, therefore they contribute more to the amount of games played


That's true.

since these are the players that overcome balance, again, it skews to 50-50 more than it is


That's not necessarily true.

Stephano is fucking good


True

If they were independent, sample size would be enough by a large margin to say something, but they are not independent. Because you're dealing with players, not just games.


That's a good point.

'Clearly since this is taken from the absolute top, TvZ is highly imbalanced as a thousand games is a very large sample size'. No! you cannot


Explanation? No? Just say something as matter of fact cause it's true? Is he saying MKP's skills is much higher than DRG's or something? Also, why is he using extreme examples that have 0 validity to prove his points? Why doesn't he use actual examples or realistic things? I don't think he can cause they wouldn't support his argument...

IF a ZvP game ends in the 0-5 minute range, the chance is 60% it ends in Z's favour.
Now the 'if' is so bloody important here, the game needn't end.


He goes on to describe the process of 7 pooling and the events following.
Isn't the goal for Blizzard to make the game balanced from start to end? Of course, that's what this whole new Call o Action: Antiga Shipyard is about. Minor tweaks to make vZ more balanced in early game and they are also looking at PvT endgame. If zerg has a 60% win rate up to the 5 minute mark, as shown by a good set of data (good sample size, players of relative equal skill), then the match up is imbalanced during that time frame, but perhaps not overall. If you saw that 7 pools, and any strategy were 50% effective across the entire course of the game, then you have a successfully balanced game. Obviously if Zerg wins 75% of games that end before the 5 minute mark, early game balance is an issue. If they win much less than 50%, then it's imbalanced in the opposite direction.

I'm not really sure what he is saying in this last part. He anecdotally describes a 7 pool, a P potentially scouting it and walling, and the result following the Z's drone deficit. I don't think this story gives us any insight on balance, or on anything that he's arguing. What IS he arguing? I think his mistake is thinking that games ending before the 5 minute mark aren't subjected to analysis, because they are early pools so balance isn't an issue, because the Win/Loss is due to player mistakes??? No matter if it's a mistake, an inability to scout, or whatever...if the results heavily favor one side, then some mechanism of compensation is needed. Ideally, you are looking for 50% win ratio across the entire game time and between all combination of races.

I'm not a statistician...but i'm guessing this guy isn't either...?

Also...I don't think Blizzard makes all of it's decisions based on tournaments, where like said here, player skill is a factor.
hnQ
Profile Blog Joined September 2010
113 Posts
May 04 2012 00:58 GMT
#22
He's right on what he said, not sure if people didn't know this already though...
Reptilia
Profile Joined June 2010
Chile913 Posts
May 04 2012 01:05 GMT
#23
because reddit is so much better. Lol.
i wouldnt be surprised if that guy had a tl account and got banned and got so butthurt he posted that.
The secret to creativity is knowing how to hide your sources
LaM
Profile Blog Joined September 2011
United States1321 Posts
May 04 2012 01:05 GMT
#24
On May 04 2012 09:56 corose wrote:
He goes on to describe the process of 7 pooling and the events following.
Isn't the goal for Blizzard to make the game balanced from start to end? Of course, that's what this whole new Call o Action: Antiga Shipyard is about. Minor tweaks to make vZ more balanced in early game and they are also looking at PvT endgame. If zerg has a 60% win rate up to the 5 minute mark, as shown by a good set of data (good sample size, players of relative equal skill), then the match up is imbalanced during that time frame, but perhaps not overall. If you saw that 7 pools, and any strategy were 50% effective across the entire course of the game, then you have a successfully balanced game. Obviously if Zerg wins 75% of games that end before the 5 minute mark, early game balance is an issue. If they win much less than 50%, then it's imbalanced in the opposite direction.

I'm not really sure what he is saying in this last part. He anecdotally describes a 7 pool, a P potentially scouting it and walling, and the result following the Z's drone deficit. I don't think this story gives us any insight on balance, or on anything that he's arguing. What IS he arguing? I think his mistake is thinking that games ending before the 5 minute mark aren't subjected to analysis, because they are early pools so balance isn't an issue, because the Win/Loss is due to player mistakes??? No matter if it's a mistake, an inability to scout, or whatever...if the results heavily favor one side, then some mechanism of compensation is needed. Ideally, you are looking for 50% win ratio across the entire game time and between all combination of races.

I'm not a statistician...but i'm guessing this guy isn't either...?

Also...I don't think Blizzard makes all of it's decisions based on tournaments, where like said here, player skill is a factor.


He is saying that seeing high Zerg win rates in the first 5 minutes and assuming early pools are imbalanced is wrong because games won by Zergs with early pools are often decided with in 5 minutes, but games won by Protoss against early pools often last longer than 5 minutes.

It has nothing to do with making mistakes/better player/whatever. It's just winrate graphs showing very incomplete, limited analysis of balance that are often very misleading.
Anything is Possible
shaldengeki
Profile Joined May 2009
United States104 Posts
May 04 2012 01:08 GMT
#25
While I agree that there are significant issues with the way that many statistics on TL are presented - I've posted on this before and I was the guy who nudged the monthly winrates graphs to add error bars in the first place - you're not doing the discourse any favors by reposting this, I think. There are hardly any statistical arguments actually made in the post - for instance:

On May 04 2012 09:00 Cyberonic wrote:
The TLPD winrate graphs are praetentious and amateuristic, sorry to say it but that's how it is, the error bars there are pure bollocks and are calculated using the rules of independent probability experiments, that is to say, it is assumed that the results of every series has no effect on the others, as if you flip a coin. If they were independent, sample size would be enough by a large margin to say something, but they are not independent. Because you're dealing with players, not just games. Good players simply ruin the idea of independent experiments.


There is something deeply hypocritical about decrying statistics discussions on TL for being superficial and then totally failing to present statistical evidence for your assertion that games outcomes are not independent. One would think that the actual mathematics would be pretty trivial, so simply asserting that "they are not independent because they are players" is committing exactly the sin that you're supposedly railing against.

Please reconsider reposting topics like this in the future, or at the very least, try to be productive and rigorous in your arguments if you truly want TL to be a community that is rigorous in its discussion!
LaM
Profile Blog Joined September 2011
United States1321 Posts
May 04 2012 01:09 GMT
#26
On May 04 2012 10:05 Reptilia wrote:
because reddit is so much better. Lol.
i wouldnt be surprised if that guy had a tl account and got banned and got so butthurt he posted that.


Did you read anything past the qualifier that was added on after his original post on Reddit?

Doesn't look like it. Your post has nothing to do with the vast majority of his post. Or where you saying he got butthurt that he got banned from TL so he went to Reddit and wrote an intelligent post about balance statistics and how they can be misleading?

All these douchey little TL > Reddit posts are the type of annoying shit that makes people think TL is pretentious anyways. I know he isn't any better for his equally douchey qualifier, but at least he followed it up with an informative, well-written post. Something I haven't seen much of from the Reddit bashers here. At least on Reddit your pathetic contributions would be downvoted enough so that I wouldn't have to waste time responding to them and could help clarify things for people who give a shit about having a meaningful discussion.
Anything is Possible
Quochobao
Profile Joined October 2010
United States350 Posts
May 04 2012 01:13 GMT
#27
All of what OP says about statistics is right -- that his tone is arrogant does not jeopardize the merit of this argument a bit.

Is he tactless in his approach? Probably. Is he right about statistics? Surely.

Don't be put off by the qualifier right away. That only proves TL to be not open-minded to the truth, no matter how harshly stated.
Best or nothing.
LaM
Profile Blog Joined September 2011
United States1321 Posts
May 04 2012 01:17 GMT
#28
On May 04 2012 10:08 shaldengeki wrote:
While I agree that there are significant issues with the way that many statistics on TL are presented - I've posted on this before and I was the guy who nudged the monthly winrates graphs to add error bars in the first place - you're not doing the discourse any favors by reposting this, I think. There are hardly any statistical arguments actually made in the post - for instance:

Show nested quote +
On May 04 2012 09:00 Cyberonic wrote:
The TLPD winrate graphs are praetentious and amateuristic, sorry to say it but that's how it is, the error bars there are pure bollocks and are calculated using the rules of independent probability experiments, that is to say, it is assumed that the results of every series has no effect on the others, as if you flip a coin. If they were independent, sample size would be enough by a large margin to say something, but they are not independent. Because you're dealing with players, not just games. Good players simply ruin the idea of independent experiments.


There is something deeply hypocritical about decrying statistics discussions on TL for being superficial and then totally failing to present statistical evidence for your assertion that games outcomes are not independent. One would think that the actual mathematics would be pretty trivial, so simply asserting that "they are not independent because they are players" is committing exactly the sin that you're supposedly railing against.


I don't agree with you at all.

I think it is pretty clear that game results are not independent outcomes. Consider a 10game match between DRG and Joe "Code B Protoss" Schmoe.

DRG wins the first 9 games. Any rational, logical observer would favor him greatly to win the 10th game, right? But winrate graphs still assume the outcome of Game 10 should have a 50/50 chance of going either way, like it's just a coin flip. Now, in massive sample sizes this would be corrected by enough players from every race being better than their opponent in any series so that it would smooth out any errors, but in month long samples from a tiny group of pros the deviations don't get corrected.

I think the math for showing that is extremely hard, but the logic behind it is very strong. Similar to how I'm sure you won't debate that 2+2 is valid, but you would have a pretty damn hard time mathematically proving addition to me.
Anything is Possible
Primadog
Profile Blog Joined April 2010
United States4411 Posts
May 04 2012 01:20 GMT
#29
A college drop out lecturing people on latin, how all social science are inherently pseudoscience, and statistics has no place in anything but the 'real sciences'. This is one moment where I wish reddit has a serious moderation policy.
Thank God and gunrun.
SappigeKutVolKots
Profile Joined May 2012
Netherlands1 Post
Last Edited: 2012-05-04 01:22:32
May 04 2012 01:20 GMT
#30
Okay, I wanted to stay away from this site, but I couldn't let some stuff go unanswered, I am drab, anyone can message me on reddit to verify this.

On May 04 2012 09:40 windsupernova wrote:
As much as I agree with him in some points. I don't like how he comes off as someone pretty arrogant and doesn't even present some kind of credentials on why he understand statistics more than 99% of people.I mean for all we know he could be some arrogant College kid who just passed his 1st statistics class.
I'm not making an argument from authority, I don't need credentials, even if I was a cow or an anencephalic protozoan, it doesn't matter, there is no need for credentials because I'm making an argument from reason, not from authority, I do not even need to cite any sources because my argument is purely rational, not empirical. If you ask for 'credentials' to verify this post then you lost and don't know how to verify academic literature.

My credentials are irrelevant, I'm not making an argument from authority. If you do not find yourself to have the confidence to check the correctness of my argument then you shouldn't agree or disagree either way. Say to yourself 'I don't understand what he's saying', above all, don't comment on a thread whose opening you don't understand, and move on with life.

Yes, I am very smug, I'm not even smug, I'm condescending, I'm not condescending because I have a higher education, I'm condescending because I'm fed up with stupidity, the arguments I put out are very easy and basic to understand and honestly, anyone reading those graphs should come to those conclusions, yet I've seen countless and countless people misinterpreting all those graphs without coming to the realization of these very basic givens, on both reddit and TL. I've seen 50 pages of TL posts discussing those graphics about probability of races to win at certain time intervals in matchups, and maybe 1-2 people pointed out how misleading it was because of the arguments I put out, and no one listened and other people go discuss trivial and unrelated stuff like 'sample size' while there are much bigger problems. I've seen the TLPD winrates posted on both TL and reddit and people discussed them for days and so few people initially pointed out that the lines between the graphs in the old aesthetic were completely ludicrous and they should be bars, and even fewer people were critical of the fact that the error bars were calculated by a means which assumes independent experiments, which they are not.

It doesn't take a genius to see this, it just takes allowing yourself to be critical. As soon as I ask a lot of those people 'There are some grave fallacies with those stats,c an you point them out?' they will most likely come with at least 80% of the shit I pointed out and probably with some things I overlooked. It doesn't take a brain, it takes not being a mindless drone and being critical of stuff that is being posted. As for credentials, I guarantee you that the people who post those TLPD winrate graphs either have no statistical credentials, or are wilfully lying to people and oversimplifying it, because it's just statistical faux pas.

But then he doesn't say how we should go about interpreting those statistics and providing proof.
We should interpret them as what they are. They are the winrates for this month, it says nothing of balance or any other interpretations you can make of them. You see what you get, and the error bars are, simply put, incorrect and a statistical gaffe. I'm not sure what they are supposed to mean, they don't mean anything if the map scores aren't independent probability experiments.

That being said I do think most of the people take a really simplistic approach to statistics, but well statistics are a hard subject to tackle
Nope, it's very easy, it's more that people like to see things that you can't conclude from stuff.

On May 04 2012 10:09 LaM wrote:
Show nested quote +
On May 04 2012 10:05 Reptilia wrote:
because reddit is so much better. Lol.
i wouldnt be surprised if that guy had a tl account and got banned and got so butthurt he posted that.


Did you read anything past the qualifier that was added on after his original post on Reddit?

Doesn't look like it. Your post has nothing to do with the vast majority of his post. Or where you saying he got butthurt that he got banned from TL so he went to Reddit and wrote an intelligent post about balance statistics and how they can be misleading?

All these douchey little TL > Reddit posts are the type of annoying shit that makes people think TL is pretentious anyways. I know he isn't any better for his equally douchey qualifier, but at least he followed it up with an informative, well-written post. Something I haven't seen much of from the Reddit bashers here. At least on Reddit your pathetic contributions would be downvoted enough so that I wouldn't have to waste time responding to them and could help clarify things for people who give a shit about having a meaningful discussion.


As linked in the OP, I did not add that qualifier on top myself, I never added the qualifier formally, someone asked me 'Have you posted it on TL' (the OP here), I said 'Nope', he asked 'why?', I said that which he quoted.

That said, I never mentioned TL in the original post, I was mainly critical of screddit and its continued misuse of statistics and it got upvoted to be the #1 post on the screddit first page. This exemplifies a quality of screddit that I feel TL heavily lacks.

Edit: Also:

Pepper_MD just sent you a month of reddit gold! Wasn't that nice?
Here's a note that was included:
I have degree in Stats. All I have to say is Thank You.


I have no idea what reddit gold is, is it good?

User was banned for this post.
Sporks, the mighty Xel'Naga Hybrid between Spoons and Forks
KingPaddy
Profile Joined November 2010
1053 Posts
May 04 2012 01:23 GMT
#31
On May 04 2012 10:13 Quochobao wrote:
All of what OP says about statistics is right -- that his tone is arrogant does not jeopardize the merit of this argument a bit.

Is he tactless in his approach? Probably. Is he right about statistics? Surely.

Don't be put off by the qualifier right away. That only proves TL to be not open-minded to the truth, no matter how harshly stated.

But the thing is, what he says in his arrogant manner ("99% of the people don't know this") is basically common knowledge.
Is it true? For the most part yes. Is it advanced mathematics or anything ground-breaking, thought-provocing or fallacy-revealing? No. He's just arguing common sense against an imagined 99%.
shaldengeki
Profile Joined May 2009
United States104 Posts
May 04 2012 01:27 GMT
#32
On May 04 2012 10:17 LaM wrote:
Show nested quote +
On May 04 2012 10:08 shaldengeki wrote:
While I agree that there are significant issues with the way that many statistics on TL are presented - I've posted on this before and I was the guy who nudged the monthly winrates graphs to add error bars in the first place - you're not doing the discourse any favors by reposting this, I think. There are hardly any statistical arguments actually made in the post - for instance:

On May 04 2012 09:00 Cyberonic wrote:
The TLPD winrate graphs are praetentious and amateuristic, sorry to say it but that's how it is, the error bars there are pure bollocks and are calculated using the rules of independent probability experiments, that is to say, it is assumed that the results of every series has no effect on the others, as if you flip a coin. If they were independent, sample size would be enough by a large margin to say something, but they are not independent. Because you're dealing with players, not just games. Good players simply ruin the idea of independent experiments.


There is something deeply hypocritical about decrying statistics discussions on TL for being superficial and then totally failing to present statistical evidence for your assertion that games outcomes are not independent. One would think that the actual mathematics would be pretty trivial, so simply asserting that "they are not independent because they are players" is committing exactly the sin that you're supposedly railing against.


I don't agree with you at all.

I think it is pretty clear that game results are not independent outcomes. Consider a 10game match between DRG and Joe "Code B Protoss" Schmoe.

DRG wins the first 9 games. Any rational, logical observer would favor him greatly to win the 10th game, right? But winrate graphs still assume the outcome of Game 10 should have a 50/50 chance of going either way, like it's just a coin flip. Now, in massive sample sizes this would be corrected by enough players from every race being better than their opponent in any series so that it would smooth out any errors, but in month long samples from a tiny group of pros the deviations don't get corrected.

I think the math for showing that is extremely hard, but the logic behind it is very strong. Similar to how I'm sure you won't debate that 2+2 is valid, but you would have a pretty damn hard time mathematically proving addition to me.


I think you're probably mistaking what the purpose of the winrate charts is, and what "independent outcomes" means in the context of repeated experiments. Of course you wouldn't apply the winrate charts to the situation you're describing - what they do is aggregate results across several skill levels and regions to provide a general indicator of race balance. Nobody is claiming that every single game between a protoss and a zerg has a 50/50 chance of going either way, and if this is how you're interpreting the winrate charts, that's definitely a problem on your end!

The issue you describe with skill impacting win chances is actually not an issue of independent events at all. If the events were dependent, then the results from all prior games between all zergs and all protosses would impact the win probability of the next game between a zerg and a protoss. This is not the issue at hand in your scenario, where you're talking about skill level of each player impacting win probabilities. That's the realm of ELO, and the winrate charts make no attempt at gauging the skill levels of each player.
awu25
Profile Joined April 2010
United States2003 Posts
May 04 2012 01:29 GMT
#33
It is annoying when people look at winrates and automatically assume X race is OP. However, the condescending tone of the post will probably a turn off for most people and they will continue to believe in these statistics instead of actually playing the game.
LaM
Profile Blog Joined September 2011
United States1321 Posts
May 04 2012 01:35 GMT
#34
On May 04 2012 10:27 shaldengeki wrote:
Show nested quote +
On May 04 2012 10:17 LaM wrote:
On May 04 2012 10:08 shaldengeki wrote:
While I agree that there are significant issues with the way that many statistics on TL are presented - I've posted on this before and I was the guy who nudged the monthly winrates graphs to add error bars in the first place - you're not doing the discourse any favors by reposting this, I think. There are hardly any statistical arguments actually made in the post - for instance:

On May 04 2012 09:00 Cyberonic wrote:
The TLPD winrate graphs are praetentious and amateuristic, sorry to say it but that's how it is, the error bars there are pure bollocks and are calculated using the rules of independent probability experiments, that is to say, it is assumed that the results of every series has no effect on the others, as if you flip a coin. If they were independent, sample size would be enough by a large margin to say something, but they are not independent. Because you're dealing with players, not just games. Good players simply ruin the idea of independent experiments.


There is something deeply hypocritical about decrying statistics discussions on TL for being superficial and then totally failing to present statistical evidence for your assertion that games outcomes are not independent. One would think that the actual mathematics would be pretty trivial, so simply asserting that "they are not independent because they are players" is committing exactly the sin that you're supposedly railing against.


I don't agree with you at all.

I think it is pretty clear that game results are not independent outcomes. Consider a 10game match between DRG and Joe "Code B Protoss" Schmoe.

DRG wins the first 9 games. Any rational, logical observer would favor him greatly to win the 10th game, right? But winrate graphs still assume the outcome of Game 10 should have a 50/50 chance of going either way, like it's just a coin flip. Now, in massive sample sizes this would be corrected by enough players from every race being better than their opponent in any series so that it would smooth out any errors, but in month long samples from a tiny group of pros the deviations don't get corrected.

I think the math for showing that is extremely hard, but the logic behind it is very strong. Similar to how I'm sure you won't debate that 2+2 is valid, but you would have a pretty damn hard time mathematically proving addition to me.


I think you're probably mistaking what the purpose of the winrate charts is, and what "independent outcomes" means in the context of repeated experiments. Of course you wouldn't apply the winrate charts to the situation you're describing - what they do is aggregate results across several skill levels and regions to provide a general indicator of race balance. Nobody is claiming that every single game between a protoss and a zerg has a 50/50 chance of going either way, and if this is how you're interpreting the winrate charts, that's definitely a problem on your end!

The issue you describe with skill impacting win chances is actually not an issue of independent events at all. If the events were dependent, then the results from all prior games between all zergs and all protosses would impact the win probability of the next game between a zerg and a protoss. This is not the issue at hand in your scenario, where you're talking about skill level of each player impacting win probabilities. That's the realm of ELO, and the winrate charts make no attempt at gauging the skill levels of each player.


And you are again making the mistake that winrate charts indicate win probabilities and balance, which is the whole point, THEY DON'T!

You wanted error bars added to winrate charts? Why? What error? They are cataloging winrates from the past month, where is the error coming from?

My mistake was in even ceding that error bars should be part of the chart and make any sense with them. They don't. I agree my explanation isn't applicable to the charts, but that isn't because my explanation is assuming things incorrectly, it's because the error bars shouldn't be there in the first place...
Anything is Possible
Haiq343
Profile Joined August 2011
United States2548 Posts
May 04 2012 01:35 GMT
#35
This is a pretty clear example of how you say things being as important as what you say. The content of the post is very good, correctly pointing out the people really like to conclude things from statistics that are totally not what the statistics say. It would make a much more compelling argument if it wasn't written by such an asshole.
I am enough of an artist to draw freely upon my imagination. Imagination is more important than knowledge. For knowledge is limited, whereas imagination encircles the world. -Einstein
Whole
Profile Blog Joined May 2010
United States6046 Posts
May 04 2012 01:36 GMT
#36
I prefer IdrA's way of saying this:

if statistics get you hard make one of those ladder analysis pages or something, but stop interfering with balance discussions.
Ansinjunger
Profile Joined November 2010
United States2451 Posts
May 04 2012 01:37 GMT
#37
On May 04 2012 09:26 dmasterding wrote:
Did any of you guys actually read the thing? He didn't actually give any opinions about the matchups, he was just trying to get rid of some misconceptions people had about interpretation of results. I am pretty sure that if the OP never mentioned this person was from r/SC you guys wouldn't be so biased against the author.



Frankly, he was asking for the bias by being biased against TL.

He didn't say anything new or ground breaking. I thought this was supposed to be some statistical elephant in the room where some expert of statistics would show us something very important. Instead, it basically said, "don't make poor assumptions." Well, good advice, I say.
Percutio
Profile Blog Joined April 2010
United States1672 Posts
Last Edited: 2012-05-04 01:41:55
May 04 2012 01:38 GMT
#38
This reminds of an old point that a lot of old brood war veterans like Artosis used to make about how the older maps were actually imbalanced despite the winrates they had because at the time dominant and superior players like boxer, iloveoov, and nada were skewing the results to the point where the map statistic should have been more than 50% win rate for terran because those players were much better than their opponents and had higher than 50% win rates and there weren't enough games from other average or less skilled players to reasonably counteract that fact. This is kind of related to how some maps were balanced at the top korean level for certain matchups, but not balanced for a top foreigner level. Essentially the balance in brood war and in brood war maps was relevant to the skill of the players at the time. Older maps might have sported balanced win rates only because of the strategy and skill of the players at the time. Applying modern strategies and the improved skill of players can suddenly reveal undiscovered imbalance.

I'm not saying whether or not that old point is correct, but it certainly had better concrete concepts than whatever this guy seems to be pointing out.

This extends even further, most tournaments have qualifiers, so say X is underpowered, the players who play X that get into the tournament are simply better because thety got in despite the imbalance, therefore as they are better, they will continue to win even despite the imbalance vested against them, thereby skewing the results to more 50-50 than it actually is.

This is kind of the bonjwa terran balance argument for old broodwar maps. However a lot of people didn't agree with this because they didn't believe the bonjwas were really that statistically significant. This causes issues when you talk about the sample size not being large enough.

If the sample size truly isn't large enough to make star players statistically insignificant then you would need to be able to identify "bonjwas" or something similar in player dominance to even hint that one race is underpowered with a 50% winrate. If you can't do that then you are making a judgement call that a few star players that play a specific race are that much superior to everyone else without winning enough to actually prove it. Without good proof the only other scenario you could argue is that it isn't the game that is balanced, but just the number of star players playing for each race (Which is silly because of the bizarre concept of improvement).
What does it matter how I loose it?
reneg
Profile Joined September 2010
United States859 Posts
May 04 2012 01:40 GMT
#39
On May 04 2012 10:35 LaM wrote:
Show nested quote +
On May 04 2012 10:27 shaldengeki wrote:
On May 04 2012 10:17 LaM wrote:
On May 04 2012 10:08 shaldengeki wrote:
While I agree that there are significant issues with the way that many statistics on TL are presented - I've posted on this before and I was the guy who nudged the monthly winrates graphs to add error bars in the first place - you're not doing the discourse any favors by reposting this, I think. There are hardly any statistical arguments actually made in the post - for instance:

On May 04 2012 09:00 Cyberonic wrote:
The TLPD winrate graphs are praetentious and amateuristic, sorry to say it but that's how it is, the error bars there are pure bollocks and are calculated using the rules of independent probability experiments, that is to say, it is assumed that the results of every series has no effect on the others, as if you flip a coin. If they were independent, sample size would be enough by a large margin to say something, but they are not independent. Because you're dealing with players, not just games. Good players simply ruin the idea of independent experiments.


There is something deeply hypocritical about decrying statistics discussions on TL for being superficial and then totally failing to present statistical evidence for your assertion that games outcomes are not independent. One would think that the actual mathematics would be pretty trivial, so simply asserting that "they are not independent because they are players" is committing exactly the sin that you're supposedly railing against.


I don't agree with you at all.

I think it is pretty clear that game results are not independent outcomes. Consider a 10game match between DRG and Joe "Code B Protoss" Schmoe.

DRG wins the first 9 games. Any rational, logical observer would favor him greatly to win the 10th game, right? But winrate graphs still assume the outcome of Game 10 should have a 50/50 chance of going either way, like it's just a coin flip. Now, in massive sample sizes this would be corrected by enough players from every race being better than their opponent in any series so that it would smooth out any errors, but in month long samples from a tiny group of pros the deviations don't get corrected.

I think the math for showing that is extremely hard, but the logic behind it is very strong. Similar to how I'm sure you won't debate that 2+2 is valid, but you would have a pretty damn hard time mathematically proving addition to me.


I think you're probably mistaking what the purpose of the winrate charts is, and what "independent outcomes" means in the context of repeated experiments. Of course you wouldn't apply the winrate charts to the situation you're describing - what they do is aggregate results across several skill levels and regions to provide a general indicator of race balance. Nobody is claiming that every single game between a protoss and a zerg has a 50/50 chance of going either way, and if this is how you're interpreting the winrate charts, that's definitely a problem on your end!

The issue you describe with skill impacting win chances is actually not an issue of independent events at all. If the events were dependent, then the results from all prior games between all zergs and all protosses would impact the win probability of the next game between a zerg and a protoss. This is not the issue at hand in your scenario, where you're talking about skill level of each player impacting win probabilities. That's the realm of ELO, and the winrate charts make no attempt at gauging the skill levels of each player.


And you are again making the mistake that winrate charts indicate win probabilities and balance, which is the whole point, THEY DON'T!

You wanted error bars added to winrate charts? Why? What error? They are cataloging winrates from the past month, where is the error coming from?

My mistake was in even ceding that error bars should be part of the chart and make any sense with them. They don't. I agree my explanation isn't applicable to the charts, but that isn't because my explanation is assuming things incorrectly, it's because the error bars shouldn't be there in the first place...


While that's exactly the point, they don't indicate balance, everyone seems to completely ignore this fact, and treat them as irrefutable evidence of "P is UP, T is OP, because they have been losing/winning, look at the graph, it's obvious."

When people start looking past the stats and actually thinking, you can use the winrates as just another fun useless fact
moose...indian
shaldengeki
Profile Joined May 2009
United States104 Posts
May 04 2012 01:40 GMT
#40
On May 04 2012 10:35 LaM wrote:
Show nested quote +
On May 04 2012 10:27 shaldengeki wrote:
On May 04 2012 10:17 LaM wrote:
On May 04 2012 10:08 shaldengeki wrote:
While I agree that there are significant issues with the way that many statistics on TL are presented - I've posted on this before and I was the guy who nudged the monthly winrates graphs to add error bars in the first place - you're not doing the discourse any favors by reposting this, I think. There are hardly any statistical arguments actually made in the post - for instance:

On May 04 2012 09:00 Cyberonic wrote:
The TLPD winrate graphs are praetentious and amateuristic, sorry to say it but that's how it is, the error bars there are pure bollocks and are calculated using the rules of independent probability experiments, that is to say, it is assumed that the results of every series has no effect on the others, as if you flip a coin. If they were independent, sample size would be enough by a large margin to say something, but they are not independent. Because you're dealing with players, not just games. Good players simply ruin the idea of independent experiments.


There is something deeply hypocritical about decrying statistics discussions on TL for being superficial and then totally failing to present statistical evidence for your assertion that games outcomes are not independent. One would think that the actual mathematics would be pretty trivial, so simply asserting that "they are not independent because they are players" is committing exactly the sin that you're supposedly railing against.


I don't agree with you at all.

I think it is pretty clear that game results are not independent outcomes. Consider a 10game match between DRG and Joe "Code B Protoss" Schmoe.

DRG wins the first 9 games. Any rational, logical observer would favor him greatly to win the 10th game, right? But winrate graphs still assume the outcome of Game 10 should have a 50/50 chance of going either way, like it's just a coin flip. Now, in massive sample sizes this would be corrected by enough players from every race being better than their opponent in any series so that it would smooth out any errors, but in month long samples from a tiny group of pros the deviations don't get corrected.

I think the math for showing that is extremely hard, but the logic behind it is very strong. Similar to how I'm sure you won't debate that 2+2 is valid, but you would have a pretty damn hard time mathematically proving addition to me.


I think you're probably mistaking what the purpose of the winrate charts is, and what "independent outcomes" means in the context of repeated experiments. Of course you wouldn't apply the winrate charts to the situation you're describing - what they do is aggregate results across several skill levels and regions to provide a general indicator of race balance. Nobody is claiming that every single game between a protoss and a zerg has a 50/50 chance of going either way, and if this is how you're interpreting the winrate charts, that's definitely a problem on your end!

The issue you describe with skill impacting win chances is actually not an issue of independent events at all. If the events were dependent, then the results from all prior games between all zergs and all protosses would impact the win probability of the next game between a zerg and a protoss. This is not the issue at hand in your scenario, where you're talking about skill level of each player impacting win probabilities. That's the realm of ELO, and the winrate charts make no attempt at gauging the skill levels of each player.


And you are again making the mistake that winrate charts indicate win probabilities and balance, which is the whole point, THEY DON'T!

You wanted error bars added to winrate charts? Why? What error? They are cataloging winrates from the past month, where is the error coming from?

My mistake was in even ceding that error bars should be part of the chart and make any sense with them. They don't. I agree my explanation isn't applicable to the charts, but that isn't because my explanation is assuming things incorrectly, it's because the error bars shouldn't be there in the first place...

Please remain calm. I'd love to have a level-headed discussion with you!

The winrate charts indicate win probabilities aggregated across each race. This is indisputable. They provide what I believe is a general indicator of balance - I don't believe that there are statistically-significant differences in skill between races, so it stands to reason that in the aggregate, this provides some information on the balance between races.

The error bars allow you to determine whether one month's average is significantly different from previous months. This is hugely important as before their addition people were making all sorts of wild claims as to how certain patches were throwing race balance off. Now that we can determine whether or not each month was significantly-different from previous months, we can more reasonably talk about whether or not changes to the game are having effects on winrates.
LaM
Profile Blog Joined September 2011
United States1321 Posts
May 04 2012 01:59 GMT
#41
On May 04 2012 10:40 shaldengeki wrote:
Show nested quote +
On May 04 2012 10:35 LaM wrote:
On May 04 2012 10:27 shaldengeki wrote:
On May 04 2012 10:17 LaM wrote:
On May 04 2012 10:08 shaldengeki wrote:
While I agree that there are significant issues with the way that many statistics on TL are presented - I've posted on this before and I was the guy who nudged the monthly winrates graphs to add error bars in the first place - you're not doing the discourse any favors by reposting this, I think. There are hardly any statistical arguments actually made in the post - for instance:

On May 04 2012 09:00 Cyberonic wrote:
The TLPD winrate graphs are praetentious and amateuristic, sorry to say it but that's how it is, the error bars there are pure bollocks and are calculated using the rules of independent probability experiments, that is to say, it is assumed that the results of every series has no effect on the others, as if you flip a coin. If they were independent, sample size would be enough by a large margin to say something, but they are not independent. Because you're dealing with players, not just games. Good players simply ruin the idea of independent experiments.


There is something deeply hypocritical about decrying statistics discussions on TL for being superficial and then totally failing to present statistical evidence for your assertion that games outcomes are not independent. One would think that the actual mathematics would be pretty trivial, so simply asserting that "they are not independent because they are players" is committing exactly the sin that you're supposedly railing against.


I don't agree with you at all.

I think it is pretty clear that game results are not independent outcomes. Consider a 10game match between DRG and Joe "Code B Protoss" Schmoe.

DRG wins the first 9 games. Any rational, logical observer would favor him greatly to win the 10th game, right? But winrate graphs still assume the outcome of Game 10 should have a 50/50 chance of going either way, like it's just a coin flip. Now, in massive sample sizes this would be corrected by enough players from every race being better than their opponent in any series so that it would smooth out any errors, but in month long samples from a tiny group of pros the deviations don't get corrected.

I think the math for showing that is extremely hard, but the logic behind it is very strong. Similar to how I'm sure you won't debate that 2+2 is valid, but you would have a pretty damn hard time mathematically proving addition to me.


I think you're probably mistaking what the purpose of the winrate charts is, and what "independent outcomes" means in the context of repeated experiments. Of course you wouldn't apply the winrate charts to the situation you're describing - what they do is aggregate results across several skill levels and regions to provide a general indicator of race balance. Nobody is claiming that every single game between a protoss and a zerg has a 50/50 chance of going either way, and if this is how you're interpreting the winrate charts, that's definitely a problem on your end!

The issue you describe with skill impacting win chances is actually not an issue of independent events at all. If the events were dependent, then the results from all prior games between all zergs and all protosses would impact the win probability of the next game between a zerg and a protoss. This is not the issue at hand in your scenario, where you're talking about skill level of each player impacting win probabilities. That's the realm of ELO, and the winrate charts make no attempt at gauging the skill levels of each player.


And you are again making the mistake that winrate charts indicate win probabilities and balance, which is the whole point, THEY DON'T!

You wanted error bars added to winrate charts? Why? What error? They are cataloging winrates from the past month, where is the error coming from?

My mistake was in even ceding that error bars should be part of the chart and make any sense with them. They don't. I agree my explanation isn't applicable to the charts, but that isn't because my explanation is assuming things incorrectly, it's because the error bars shouldn't be there in the first place...

Please remain calm. I'd love to have a level-headed discussion with you!

The winrate charts indicate win probabilities aggregated across each race. This is indisputable. They provide what I believe is a general indicator of balance - I don't believe that there are statistically-significant differences in skill between races, so it stands to reason that in the aggregate, this provides some information on the balance between races.

The error bars allow you to determine whether one month's average is significantly different from previous months. This is hugely important as before their addition people were making all sorts of wild claims as to how certain patches were throwing race balance off. Now that we can determine whether or not each month was significantly-different from previous months, we can more reasonably talk about whether or not changes to the game are having effects on winrates.


I'm very calm thanks

It's just you seem to be very oddly ignoring the relevant points of discussion in a biased attempt to defend your error bars.

You keep making the claim, with no backing, that the win rates provide some reflection of balance. People, including you in this discussion, always like to attack the straw man that people saying winrates don't indicate balance are making that assumption because of differences in the skill level of the races. If you read the OP, it is addressing exactly why they don't show any useful information about balance, for a set of reasons completely different than the one straw man reason you do address.

I'm still confused as to what you think the error bars add to the discussion. The winrates are results, not predictions. That means the only source of error would be missing results or incorrectly entering results into the graph. But that certainly isn't how you are presenting them. In this case you seem to be advocating them so that people can see a win rate of X% with an imagined deviation indicating it falls within a range from Y% to Z% and therefore not freak out since last month the win rate was A% which also falls between Y% and Z%. That's not really a useful error bar, its just tacking some extra shit on so that people can continue to try and draw erroneous balance conclusions from something that indicates nothing about balance.

Again, your viewing winrate graphs as balance graphs. I think that's the root of why we disagree about the error bars.
Anything is Possible
Befree
Profile Joined April 2010
695 Posts
May 04 2012 02:01 GMT
#42
I guess I was hoping for a little bit more of a comprehensive approach to this. Rather than a set of three, relatively specific, "fallacies."

I think there are a lot of people who don't think critically on statistics shown to them, but this doesn't do much to help them. You're just pointing out three possible ways mistakes could be made and then explaining them away with common sense. It's not really an approach from statistics and its not really about statistics, it's logic. All of what you explained was logic.

Now while people certainly could use a reminder to use logic more often, this is not related to their misunderstanding of the fundamentals of statistics. From you introduction, I got the impression that you were going to explain some fundamentals of statistics and explain how you should apply them to SC2 data.

While this post does indeed mention statistics, it is just used as an arbitrary example for illustrating logical fallacies which are not fundamentally related to statistics in any way. A more apt title would be "Mistakes in reasoning."
divito
Profile Blog Joined January 2011
Canada1213 Posts
May 04 2012 02:19 GMT
#43
Reddit is the new 4chan. That said, he has some points in the methodology and idea behind some of the things people commonly mistake; can't say I'm a fan of the way he went about it, but he uses Reddit, so that explains that.
Skype: divito7
whatthefat
Profile Blog Joined August 2010
United States918 Posts
May 04 2012 02:22 GMT
#44
All trivially obvious points. I suppose some people have these misconceptions, but this is hardly news. Based on the title, I was hoping for some deeper insights, or at least something more thoughtful, like this.
SlayerS_BoxeR: "I always feel sorry towards Greg (Grack?) T_T"
Woizit
Profile Joined June 2011
801 Posts
Last Edited: 2012-05-04 02:32:07
May 04 2012 02:27 GMT
#45
Are win rates an indicator of true balance? I guess not.

But can we even know what the true balance is? I guess not either.

What we use win rates for is to see if the game remains competitive, which will reflect a perceived balance. That is what almost everybody cares about more than a remote fantasy of true balance.
Jisall
Profile Blog Joined January 2011
United States2054 Posts
May 04 2012 02:28 GMT
#46
Makes solid points. If there was one thing I learned in my stats class is how broken statistics can get. It is a very tricky subject to understand.
Monk: Because being a badass is more fun then playing a dude wearing a scarf.. ... Ite fuck it, Witch Doctor cuz I like killing stuff in a timely mannor.
Complete
Profile Joined October 2009
United States1864 Posts
May 04 2012 02:36 GMT
#47
Why do people have to be aggressive and insulting when they're trying to have an intellectual argument?
figq
Profile Blog Joined May 2010
12519 Posts
May 04 2012 02:42 GMT
#48
The working assumption of balancing is that each race has equal skill representation among the pool of players. It's a flawed assumption, of course, but there isn't any better way. The samples of professional games aren't big enough, but that's all we got to work with; the ladder is less reliable in terms of quality. Should we be cautious in our claims, sure. Finally, the time intervals winrates just need added weight in number of games. Otherwise Tasteless is like omg 100% winrate 0-5 minutes; while it most likely means only one game ended so soon and it happened to be a win.
If you stand next to my head, you can hear the ocean. - Day[9]
Glockateer
Profile Joined June 2009
United States254 Posts
May 04 2012 02:45 GMT
#49
Even though not every one can agree on this subject, we're likely to agree that the blizzard dev team is trying TOO HARD to tweak the game. They screw it up the majority of the time instead of waiting and doing minor tweaks every 6 months and such. The game never gets to develop before someone gets the nerf bat just because another race is winning in the metagame. Terran continually get screwed over in late game because the early game is relatively powerful to try and make a ~50% ratio. They've shown countless times that they want a pretty statistic more than any other aspect... and they do it based off their crappy map pool of the past which is down right idiotic. Bigger maps would've been better than the barracks +5 second build time, for example, instead of having to deal with bigger maps and the change now.
GET SM4SHED
nakedsurfer
Profile Joined April 2011
Canada500 Posts
May 04 2012 02:52 GMT
#50
I was going to read the OP until I read the first post. He talks about statistics but starts off with saying 99% people don't know things twice. I don't think thats a proper way to start an argument about statistics. Especially if you're going to ask the "re-poster" if you will to post that first paragraph about TL calling us out on how we aren't the best at writing threads.
Root4Root
Oboeman
Profile Joined January 2011
Canada3980 Posts
May 04 2012 02:53 GMT
#51
His points are all sensible, and to me they all seemed like common sense.
pOnarreT
Profile Joined March 2012
155 Posts
May 04 2012 02:56 GMT
#52
On May 04 2012 09:00 Cyberonic wrote:
This post is originally written by Drabzalver on Reddit. Since he does not have a TL account I asked and was allowed to repost if I include this:

Drabzalver on Reddit:
Show nested quote +
TL is the [swear word] of praetentious [swear word] where moderators reward supposed 'high quality posts' which are full of statistical and scientific garbage I just outlined just because they are 'praesented nicely' and the mods basically think that any long post with a lot of images and no swear words is intellectually advanced while often a lot of it is total garbage filled with wrong interpretations and grave statistical errors.
Also, I don't have a TL account, you're welcome to repost, but do include this qualifier, should be fun,



I somewhat agree about the above statement and I think the fallacies stuff just posted were targeted to those balance whiners in TL
Paaaz
Profile Blog Joined November 2011
Philippines33 Posts
May 04 2012 02:56 GMT
#53
With a bunch of you lot replying because of his statement and not because of the article proves his point, I'm just saying.
Vul
Profile Blog Joined April 2011
United States685 Posts
May 04 2012 03:19 GMT
#54
On May 04 2012 10:20 SappigeKutVolKots wrote:
Okay, I wanted to stay away from this site, but I couldn't let some stuff go unanswered, I am drab, anyone can message me on reddit to verify this.

Show nested quote +
On May 04 2012 09:40 windsupernova wrote:
As much as I agree with him in some points. I don't like how he comes off as someone pretty arrogant and doesn't even present some kind of credentials on why he understand statistics more than 99% of people.I mean for all we know he could be some arrogant College kid who just passed his 1st statistics class.
I'm not making an argument from authority, I don't need credentials, even if I was a cow or an anencephalic protozoan, it doesn't matter, there is no need for credentials because I'm making an argument from reason, not from authority, I do not even need to cite any sources because my argument is purely rational, not empirical. If you ask for 'credentials' to verify this post then you lost and don't know how to verify academic literature.

My credentials are irrelevant, I'm not making an argument from authority. If you do not find yourself to have the confidence to check the correctness of my argument then you shouldn't agree or disagree either way. Say to yourself 'I don't understand what he's saying', above all, don't comment on a thread whose opening you don't understand, and move on with life.

Yes, I am very smug, I'm not even smug, I'm condescending, I'm not condescending because I have a higher education, I'm condescending because I'm fed up with stupidity, the arguments I put out are very easy and basic to understand and honestly, anyone reading those graphs should come to those conclusions, yet I've seen countless and countless people misinterpreting all those graphs without coming to the realization of these very basic givens, on both reddit and TL. I've seen 50 pages of TL posts discussing those graphics about probability of races to win at certain time intervals in matchups, and maybe 1-2 people pointed out how misleading it was because of the arguments I put out, and no one listened and other people go discuss trivial and unrelated stuff like 'sample size' while there are much bigger problems. I've seen the TLPD winrates posted on both TL and reddit and people discussed them for days and so few people initially pointed out that the lines between the graphs in the old aesthetic were completely ludicrous and they should be bars, and even fewer people were critical of the fact that the error bars were calculated by a means which assumes independent experiments, which they are not.

It doesn't take a genius to see this, it just takes allowing yourself to be critical. As soon as I ask a lot of those people 'There are some grave fallacies with those stats,c an you point them out?' they will most likely come with at least 80% of the shit I pointed out and probably with some things I overlooked. It doesn't take a brain, it takes not being a mindless drone and being critical of stuff that is being posted. As for credentials, I guarantee you that the people who post those TLPD winrate graphs either have no statistical credentials, or are wilfully lying to people and oversimplifying it, because it's just statistical faux pas.

Show nested quote +
But then he doesn't say how we should go about interpreting those statistics and providing proof.
We should interpret them as what they are. They are the winrates for this month, it says nothing of balance or any other interpretations you can make of them. You see what you get, and the error bars are, simply put, incorrect and a statistical gaffe. I'm not sure what they are supposed to mean, they don't mean anything if the map scores aren't independent probability experiments.

Show nested quote +
That being said I do think most of the people take a really simplistic approach to statistics, but well statistics are a hard subject to tackle
Nope, it's very easy, it's more that people like to see things that you can't conclude from stuff.

Show nested quote +
On May 04 2012 10:09 LaM wrote:
On May 04 2012 10:05 Reptilia wrote:
because reddit is so much better. Lol.
i wouldnt be surprised if that guy had a tl account and got banned and got so butthurt he posted that.


Did you read anything past the qualifier that was added on after his original post on Reddit?

Doesn't look like it. Your post has nothing to do with the vast majority of his post. Or where you saying he got butthurt that he got banned from TL so he went to Reddit and wrote an intelligent post about balance statistics and how they can be misleading?

All these douchey little TL > Reddit posts are the type of annoying shit that makes people think TL is pretentious anyways. I know he isn't any better for his equally douchey qualifier, but at least he followed it up with an informative, well-written post. Something I haven't seen much of from the Reddit bashers here. At least on Reddit your pathetic contributions would be downvoted enough so that I wouldn't have to waste time responding to them and could help clarify things for people who give a shit about having a meaningful discussion.


As linked in the OP, I did not add that qualifier on top myself, I never added the qualifier formally, someone asked me 'Have you posted it on TL' (the OP here), I said 'Nope', he asked 'why?', I said that which he quoted.

That said, I never mentioned TL in the original post, I was mainly critical of screddit and its continued misuse of statistics and it got upvoted to be the #1 post on the screddit first page. This exemplifies a quality of screddit that I feel TL heavily lacks.

Edit: Also:

Pepper_MD just sent you a month of reddit gold! Wasn't that nice?
Here's a note that was included:
I have degree in Stats. All I have to say is Thank You.


I have no idea what reddit gold is, is it good?


Even if your argument is true, it was poorly written and therefore deserves the terrible response that you're getting.

You need to realize that most people don't really care about the content of the OP, and it's not because they're stupid. What you're writing about is a niche interest. Stupidity isn't rampant because more people (and by people I mean forum posters, which I imagine is primarily below college age, by the way) don't spend their time critically evaluating video game statistics.

It's amazing to me that you're so indignant about people reacting negatively towards your condescension, simply because your argument is true or whatever it is you think justifies your tone. You're writing as if all that matters is the accuracy of what you're saying and not how you present it. Statistics can't teach you common sense I guess
Jono7272
Profile Joined November 2010
United Kingdom6330 Posts
May 04 2012 03:21 GMT
#55
Lol at how pretentiously he spells pretentious. (Also, wrongly, this is English not Latin)
Innovation | Flash | Mvp | Byun | TY
jakethesnake
Profile Blog Joined May 2011
Canada4948 Posts
May 04 2012 03:21 GMT
#56
On May 04 2012 10:08 shaldengeki wrote:
While I agree that there are significant issues with the way that many statistics on TL are presented - I've posted on this before and I was the guy who nudged the monthly winrates graphs to add error bars in the first place - you're not doing the discourse any favors by reposting this, I think. There are hardly any statistical arguments actually made in the post - for instance:

Show nested quote +
On May 04 2012 09:00 Cyberonic wrote:
The TLPD winrate graphs are praetentious and amateuristic, sorry to say it but that's how it is, the error bars there are pure bollocks and are calculated using the rules of independent probability experiments, that is to say, it is assumed that the results of every series has no effect on the others, as if you flip a coin. If they were independent, sample size would be enough by a large margin to say something, but they are not independent. Because you're dealing with players, not just games. Good players simply ruin the idea of independent experiments.


There is something deeply hypocritical about decrying statistics discussions on TL for being superficial and then totally failing to present statistical evidence for your assertion that games outcomes are not independent. One would think that the actual mathematics would be pretty trivial, so simply asserting that "they are not independent because they are players" is committing exactly the sin that you're supposedly railing against.

Please reconsider reposting topics like this in the future, or at the very least, try to be productive and rigorous in your arguments if you truly want TL to be a community that is rigorous in its discussion!


I think this is really more helpful than the OP. For something that starts with a complaint about TL being pretentious, I found the post really condesending and a gross over simplification. Yes, it is true that people on TL do awful things with statistics, but as a statistician at a University, the people that I provide numbers to at work often do just as awful things with much more serious numbers. As well, for a post decrying people's statistical knowledge, it doesn't help to begin your post by making up some stats.

Also, if you want to prove something about math, you have to use math. Don't just tell people that they are wrong, help people understand how to use stats properly.
Community Newsjjakji || jjakji || jjakji || jjakji || jjakji || jjakji || jjakji nshoseo.jpg
jakethesnake
Profile Blog Joined May 2011
Canada4948 Posts
May 04 2012 03:31 GMT
#57
On May 04 2012 10:59 LaM wrote:
Show nested quote +
On May 04 2012 10:40 shaldengeki wrote:
On May 04 2012 10:35 LaM wrote:
On May 04 2012 10:27 shaldengeki wrote:
On May 04 2012 10:17 LaM wrote:
On May 04 2012 10:08 shaldengeki wrote:
While I agree that there are significant issues with the way that many statistics on TL are presented - I've posted on this before and I was the guy who nudged the monthly winrates graphs to add error bars in the first place - you're not doing the discourse any favors by reposting this, I think. There are hardly any statistical arguments actually made in the post - for instance:

On May 04 2012 09:00 Cyberonic wrote:
The TLPD winrate graphs are praetentious and amateuristic, sorry to say it but that's how it is, the error bars there are pure bollocks and are calculated using the rules of independent probability experiments, that is to say, it is assumed that the results of every series has no effect on the others, as if you flip a coin. If they were independent, sample size would be enough by a large margin to say something, but they are not independent. Because you're dealing with players, not just games. Good players simply ruin the idea of independent experiments.


There is something deeply hypocritical about decrying statistics discussions on TL for being superficial and then totally failing to present statistical evidence for your assertion that games outcomes are not independent. One would think that the actual mathematics would be pretty trivial, so simply asserting that "they are not independent because they are players" is committing exactly the sin that you're supposedly railing against.


I don't agree with you at all.

I think it is pretty clear that game results are not independent outcomes. Consider a 10game match between DRG and Joe "Code B Protoss" Schmoe.

DRG wins the first 9 games. Any rational, logical observer would favor him greatly to win the 10th game, right? But winrate graphs still assume the outcome of Game 10 should have a 50/50 chance of going either way, like it's just a coin flip. Now, in massive sample sizes this would be corrected by enough players from every race being better than their opponent in any series so that it would smooth out any errors, but in month long samples from a tiny group of pros the deviations don't get corrected.

I think the math for showing that is extremely hard, but the logic behind it is very strong. Similar to how I'm sure you won't debate that 2+2 is valid, but you would have a pretty damn hard time mathematically proving addition to me.


I think you're probably mistaking what the purpose of the winrate charts is, and what "independent outcomes" means in the context of repeated experiments. Of course you wouldn't apply the winrate charts to the situation you're describing - what they do is aggregate results across several skill levels and regions to provide a general indicator of race balance. Nobody is claiming that every single game between a protoss and a zerg has a 50/50 chance of going either way, and if this is how you're interpreting the winrate charts, that's definitely a problem on your end!

The issue you describe with skill impacting win chances is actually not an issue of independent events at all. If the events were dependent, then the results from all prior games between all zergs and all protosses would impact the win probability of the next game between a zerg and a protoss. This is not the issue at hand in your scenario, where you're talking about skill level of each player impacting win probabilities. That's the realm of ELO, and the winrate charts make no attempt at gauging the skill levels of each player.


And you are again making the mistake that winrate charts indicate win probabilities and balance, which is the whole point, THEY DON'T!

You wanted error bars added to winrate charts? Why? What error? They are cataloging winrates from the past month, where is the error coming from?

My mistake was in even ceding that error bars should be part of the chart and make any sense with them. They don't. I agree my explanation isn't applicable to the charts, but that isn't because my explanation is assuming things incorrectly, it's because the error bars shouldn't be there in the first place...

Please remain calm. I'd love to have a level-headed discussion with you!

The winrate charts indicate win probabilities aggregated across each race. This is indisputable. They provide what I believe is a general indicator of balance - I don't believe that there are statistically-significant differences in skill between races, so it stands to reason that in the aggregate, this provides some information on the balance between races.

The error bars allow you to determine whether one month's average is significantly different from previous months. This is hugely important as before their addition people were making all sorts of wild claims as to how certain patches were throwing race balance off. Now that we can determine whether or not each month was significantly-different from previous months, we can more reasonably talk about whether or not changes to the game are having effects on winrates.


I'm very calm thanks

It's just you seem to be very oddly ignoring the relevant points of discussion in a biased attempt to defend your error bars.

You keep making the claim, with no backing, that the win rates provide some reflection of balance. People, including you in this discussion, always like to attack the straw man that people saying winrates don't indicate balance are making that assumption because of differences in the skill level of the races. If you read the OP, it is addressing exactly why they don't show any useful information about balance, for a set of reasons completely different than the one straw man reason you do address.

I'm still confused as to what you think the error bars add to the discussion. The winrates are results, not predictions. That means the only source of error would be missing results or incorrectly entering results into the graph. But that certainly isn't how you are presenting them. In this case you seem to be advocating them so that people can see a win rate of X% with an imagined deviation indicating it falls within a range from Y% to Z% and therefore not freak out since last month the win rate was A% which also falls between Y% and Z%. That's not really a useful error bar, its just tacking some extra shit on so that people can continue to try and draw erroneous balance conclusions from something that indicates nothing about balance.

Again, your viewing winrate graphs as balance graphs. I think that's the root of why we disagree about the error bars.


I want to stick up for the error bars. Just because the graph displays results and not predictions, doesn't mean that the error bars are meaningless. If I measure something in the lab, I still place an uncertainty on that value even though it is a direct measurement. Likewise, even if I directly measure winrates, you still have an error rate on it. Why? Because not all games are included. Because these games aren't scientific outcomes and are not reproducable. Because the factors from one month to the next change and error rates are a simple way to show that. Sure, they aren't perfect, but don't think that just because they are a direct measurement that they don't have error and uncertainty attached.
Community Newsjjakji || jjakji || jjakji || jjakji || jjakji || jjakji || jjakji nshoseo.jpg
pOnarreT
Profile Joined March 2012
155 Posts
May 04 2012 03:34 GMT
#58
You guys don't get it, his post has got nothing to do with statistics, it's about the TL community. He wants to prove his point that you create a thread in TL with a bunch of fancy writing, but the bottom line is, his true intention is to bash the people in TL. Again, I somewhat agree.
figq
Profile Blog Joined May 2010
12519 Posts
May 04 2012 03:41 GMT
#59
On May 04 2012 12:34 pOnarreT wrote:
You guys don't get it, his post has got nothing to do with statistics, it's about the TL community. He wants to prove his point that you create a thread in TL with a bunch of fancy writing, but the bottom line is, his true intention is to bash the people in TL. Again, I somewhat agree.
I'm sure that's why he originally named his thread "What reddit does not understand about statistics" and posted it on reddit. Clearly all to do with TL.
If you stand next to my head, you can hear the ocean. - Day[9]
algorithm0r
Profile Joined May 2010
Canada486 Posts
May 04 2012 03:53 GMT
#60
Exactly true. What the OP doesn't mention is that Blizz "balances" based on statistics gathered this way. I can't say they are foolish in interpreting the data, but the skewing of data towards balanced by the matchmaker and the statistical points in the OP make balancing really hard.
Bagration
Profile Blog Joined October 2011
United States18282 Posts
May 04 2012 03:55 GMT
#61
He's quite the controversial character on r/sc as well, I've seen quite a few posts by him that buried by downvotes. He brings up a few good points, though it is interesting to see how he bears significant ill-will towards TL.
Team Slayers, Axiom-Acer and Vile forever
pOnarreT
Profile Joined March 2012
155 Posts
May 04 2012 03:59 GMT
#62
On May 04 2012 12:41 figq wrote:
Show nested quote +
On May 04 2012 12:34 pOnarreT wrote:
You guys don't get it, his post has got nothing to do with statistics, it's about the TL community. He wants to prove his point that you create a thread in TL with a bunch of fancy writing, but the bottom line is, his true intention is to bash the people in TL. Again, I somewhat agree.
I'm sure that's why he originally named his thread "What reddit does not understand about statistics" and posted it on reddit. Clearly all to do with TL.



He got you
Natespank
Profile Joined November 2011
Canada449 Posts
May 04 2012 04:00 GMT
#63
He makes good points, it's foolish to ignore a rude person's good ideas (though fine to abuse him for it).
Serpico
Profile Joined May 2010
4285 Posts
Last Edited: 2012-05-04 04:03:58
May 04 2012 04:02 GMT
#64
On May 04 2012 12:55 Bagration wrote:
He's quite the controversial character on r/sc as well, I've seen quite a few posts by him that buried by downvotes. He brings up a few good points, though it is interesting to see how he bears significant ill-will towards TL.



All the "this isn't reddit/4chan" stuff I've seen here at times is a bit sanctimonious. When you see how many people have to get warned/banned all the time you realize TL isn't some shining center for civility.
BraneSC2
Profile Joined May 2010
United States123 Posts
May 04 2012 04:09 GMT
#65
I love the "99%" statistic provided, then the post goes on to talk about fallacies. It's humorously ironic, especially when the post is about statistics and the first one provided is a blatant lie.
No fighting in the war room!
Jumbled
Profile Joined September 2010
1543 Posts
May 04 2012 04:15 GMT
#66
Most of his objections are just basic statistical limitations that are well-known on TL, and are usually brought up in any related thread. Inevitably, there will always be some ignorant posters who don't know about them and draw false conclusions, but they usually get deservedly panned.

His claim that people are more likely to get away with a rubbish post if it's nicely formatted does have some credence, but it's hardly a damning indictment.
TheRealDudeMan
Profile Joined April 2011
United States213 Posts
May 04 2012 04:18 GMT
#67
The only thing I find stupid is that he has no conclusion to how statistics can be used or what should be used. It just seems to me like he refutes any kind of information just to refute it. Maybe he is just tired of people basing balance off of win rates which is flimsy by itself to say the least. I don't see anything positive or informative about this.
"They claim that we violent. We named after Tyrants. This revolution won't be televised we keep it silent."
AKomrade
Profile Blog Joined March 2010
United States582 Posts
May 04 2012 04:25 GMT
#68
OP is pretty freaking right. TL is swamped by ridiculous statistics that mean next to nothing. Every week there is some new "mathematical study" that claims that race X is op/up because of a number that is nearly irrelevant to the game (like the analysis of mining per race which found T has an advantage of .5 minerals over a minute or something).

There is too much overanalysis done with very little understanding of the way the game operates and the evidence generated from these experiments just fuels more race wars than solving anything. It isn't exclusive to TL however, and to be honest, if you're going to put some statistics you found anywhere, it'd be here, if for no other reason than the quality of the posters and admins relative to other sites.

The TLPD stats too are just a summary of a wide range of talent across many games and many people treat them like they are the EXACT evaluation of ability and "race imbalance" when it is much more subtle than just simply who wins more. Its surprising how often people focus on the mathematical parts of Starcraft as rationalization for wins/losses when a good part of the game is based on things you can't measure like positioning, star sense, psychological advantages, etc. Cost effectiveness is a part of the game, obviously, but its blown so horribly out of proportion.





Actually, a lot of this post is really reasonable and makes quite a bit of sense to me. Even his attitude is understandable when you read the comments on posts in the strategy forums and LR posts. Why are people so pissed? He isn't even really attacking TL.
ALL HAIL THE KING IN THE NORTH! HAIL! HAIL!
wuhan_clan
Profile Joined April 2012
United States5609 Posts
Last Edited: 2012-05-04 04:58:12
May 04 2012 04:55 GMT
#69
Why do people even bother to respond if they don't even read the entire OP? This is is a problem rampant in forums anywhere on the internet, not just TL or Reddit.

On topic, the only real logical conclusion you can come to from the win/rate charts is this. If I select a random PvZ game from last month, the probability of selecting a game where P wins is x%, based on the chart (i don't know what it actually is, havent looked at the latest charts).
Arguments can be made that certain win/loss rates might be some indication of balance but with so many factors involved (and no raw data), any kind of statistical analysis is pointless.

Edit: win/loss rate can be used to identify a POSSIBILITY of an imbalance in a certain matchup but the actual win/loss rate stat is not evidence of any imbalance.
MustNotSleep
Profile Joined August 2010
United States8 Posts
May 04 2012 05:30 GMT
#70
The fallacy of Drabzalvers intelligence; he stumbles through his logic like a dog. Moving sloppily from one point to the next, he is so stuck in his narrow minded thinking that he can only focus on concepts as situations. While I understand relativism is a wonderful argument Drabzalver , it also holds many fallacies of its own. Your arguments are coincidental; they are so dependent on the situation at hand. While you are attempting to thwart a scientific study, you don’t realize that you’re only exemplifying stupidity by having an online epiphany involving the fundamental logic that a chimp involves stacking blocks. Drabzalver, where do you get off? The fact that your post has even made it into the TL community speaks loudly that others feel the same as you. Why I must not know; is it not obvious that these studies must take place? Even if you want to bring all of your player and race balances into the numbers, are statistics not necessary in performing formal calculations of balance?
MustNotSleep
While everyone around me drowns i float. . . .
Schnullerbacke13
Profile Joined August 2010
Germany1199 Posts
May 04 2012 05:49 GMT
#71
he's right. however same as in politics, the public opinion enjoys simplified thinking and therefore bashes him :D
21 is half the truth
cristo1122
Profile Blog Joined November 2010
Australia505 Posts
May 04 2012 06:04 GMT
#72
relevent ideas and well formed badly presented with a lack of emotional detachment that this sort of matter requires (imo ofc). TBH havent actually looked graphs on win rates at all but to get an idea what he is talking about. The data itself imo is incomplete and is not modified to take into account different variables that would have an impact on the result. For example we could say that korean players at MLG performed less optimal than foriegnors as they had a significantlly lower win rate than occurs when they are at a GSL event however this over view of the win rate does not take into the many variables that would impact on said winrate such as jet-lag, 1st set win rate (btw interesting bit of triva is that the player who gets to 2 wins in the OSL will win overall), degree that they have had to practise due to travel, fact that they are not sleeping their regular beds even. Though hese examples are somewat strange it highlights the fact that the winrate of a player and or a race (which is a sample of players) is not that usefull considering it dosent take into account the effects of lots of other impacts that could effect the outcome. That is why for any sort of deep analyisis they are sub-optimal as they do not take into account maps, players, 1st set wins, recent patch changes, etc. For a general sort of overview they are interesting but they are not representitive of true balance.


For example medusa (this taken from liquidpedia) The original map was very difficult for Terran against Protoss; though the statistics do not appear terribly imbalanced at first glance (39-18 in favor of Protoss), they become much more damning once you realize that Flash went 7-1 against Protoss on the map, without which the statistics reduce to a dismal 38-11.


if one were to take the overall winrate and use to form a complex argument about the balance on this map it would not take into account flash's skill (which btw is ridiculous) thus one can be given a false impression on the balance on the map as significant portion of the T wins are from a single player
ZvP imbalanced blizzards solution nerf terran
Torte de Lini
Profile Blog Joined September 2010
Germany38463 Posts
May 04 2012 06:15 GMT
#73
This is basic statistics 101 in university. He's not saying anything new, just assuming we're unqualified to understand the underlining meanings of stats.
https://twitter.com/#!/TorteDeLini (@TorteDeLini)
Jerubaal
Profile Blog Joined June 2010
United States7684 Posts
May 04 2012 06:22 GMT
#74
There are no facts, only interpretations.
I'm not stupid, a marauder just shot my brain.
HyunA
Profile Joined July 2010
Romania362 Posts
May 04 2012 06:29 GMT
#75
unfortunately, i aggree with the dude from reddit.
arbitrageur
Profile Joined December 2010
Australia1202 Posts
Last Edited: 2012-05-04 06:34:40
May 04 2012 06:34 GMT
#76
LMAO.

A criticism of the use of statistics with PURE WORDS. If this was an assignment topic in a statistics subject at university you would get 0%.
sandg
Profile Joined July 2011
Australia123 Posts
May 04 2012 07:05 GMT
#77
What I don't know about statistics? Who's being pretentious here lmao...

I'm pretty sure when people see TLPD winrates they don't instantly assume balances/imbalances and take it as gospel, and I'm pretty sure that blizzard doesn't balance the game around statistics gathered from the pro-level.
The mind is capable of anything, because everything is in it.
Cyx.
Profile Joined November 2010
Canada806 Posts
May 04 2012 07:07 GMT
#78
So despite all the hate going on in this thread, I see people making points every single month in the TLPD Winrates thread over and over and over that use some of the fallacies he listed as arguments. And he's right - everything he warned against is something that you shouldn't do if you're trying to interpret stats in a meaningful way. If everyone read this before they commented on winrates we'd be a better place. Fuck everyone hating on Reddit.
SeaSwift
Profile Blog Joined February 2011
Scotland4486 Posts
May 04 2012 07:16 GMT
#79
I read his thing on reddit, and I read his "credentials".

http://www.reddit.com/r/starcraft/comments/t5izg/what_reddit_does_not_understand_about_statistics/c4js5pq?context=3

There they are, if anyone wants to read them. Basically, he's just a uni drop out who was bad at stats.
oZe
Profile Joined January 2011
Sweden492 Posts
May 04 2012 07:31 GMT
#80
On May 04 2012 09:09 halfies wrote:
praetentious?
really?
i hope that how he spelt it too, because it would be really funny if he wrote this much about people being pretentious and used big words that he couldn't even spell.


You don't even begin a sentence with a capitalized letter, made even more fun by the fact that it was "I". Thereby forfeiting all rights to complain about spelling. Also if you have a problem with his argument, attack his argument.

In American English, spelt means exclusively a hardy wheat grown mostly in Europe, and the verb spell makes spelled in its past-tense and past-participial forms.

Ps. I put my money on you being the creator of the bad boy ^^
The worst kinds of organized crime are religion & government.
Cyberonic
Profile Blog Joined September 2011
Germany80 Posts
May 04 2012 07:36 GMT
#81
On May 04 2012 15:34 arbitrageur wrote:
LMAO.

A criticism of the use of statistics with PURE WORDS. If this was an assignment topic in a statistics subject at university you would get 0%.


So what you say is you need statistics to prove statistics are used wrongly?
surfinbird1
Profile Joined September 2009
Germany999 Posts
May 04 2012 07:43 GMT
#82
Haven't learned anything of note. Thought this guy had something to add.
life of lively to live to life of full life thx to shield battery
RoberP
Profile Joined March 2011
United Kingdom101 Posts
May 04 2012 07:45 GMT
#83
99% of the time this deals with winrates and balance, 99% of the people commenting on it have no clue how to interpret these numbers

Scumbag Drabzalver - rages at false statistics - makes up statistics in the first paragraph...
Rannasha
Profile Blog Joined August 2010
Netherlands2398 Posts
May 04 2012 07:52 GMT
#84
I was expecting some hard-core math in this thread when I saw the title. Or at least some math. I was disappointed to find that it was just a regurgitation of what should be blatantly obvious mixed with bad assumptions and lots of words linked together with shaky logic.
Such flammable little insects!
Surili
Profile Blog Joined March 2011
United Kingdom1141 Posts
May 04 2012 07:52 GMT
#85
Firstly i would like to say that being polite, disliking swearing and poorly writting language is a GOOD thing. It keeps everyone civil, and makes text easily legible, thereby reducing misinformation.

On May 04 2012 09:00 Cyberonic wrote:
- Fallacy: advantages at certain times of matchups expressed in graphs

There are also a lot of graphs posted which supposedly indicate that some races may have an advantage at certain matchups. Oh boy do people misread what these graphs mean. Take this bad boy.

A naïve way to read it would simply be 'Hmm, Z has an early game advantage in ZvP, then it becomes about even, then P has a slight advantage up to the end, then Z again.', wrong; look closely, what does the graph actually say, it says this:
IF a ZvP game ends in the 0-5 minute range, the chance is 60% it ends in Z's favour.
Now the 'if' is so bloody important here, the game needn't end. Now, everyone of course realizes that that part is caused by early pools. Does Zerg really have a large advantage at that point? Can Zerg force a win at that point if they want to, are early pools overpowered? No, not at all, so what is going on?

Imagine a ZvP, Z decides to 7pool, P doesn't scout soon enough, lings get in, kill every probe, traalalala, P GG's. Game over in the first 5 minutes in Z's favour.
Okay, imagine a ZvP, Z decides to 7pool, P scouts in time, gets his wall up, damn, Z's like 'fuck man, shouldn't have done that'. But not necesarily GG's unless IdrA, the game goes on, Z however plays at such a disadvantage that in the next 5-10 minutes surely P will claim his victory unless P messes up.

See the fallacy? That Z has that 'early game advantage' doesn't mean that Z is more powerful or that 7pools are too powerful, it just means that IF the game immediately ends due to a 7pool it will most likely be in Z's favour. If the 7pool fails, the game doesn't end at that point, Z will most likely stay in the game and play from a significant disadvantage to lose later.

It is a grave statistical error of the magnitude of interpreting 'If a 8 year old child dies, the chance is the greatest he dies from a car swoop' as 'It is very likely 8 year old children die from car swoops'.

The graph doesn't even say how likely it is that the game ends at certain intervals. For all you it's far more likely for P to win in the late game than in the mid game, even though the graph indicates that if the game ends in the late game, the chance is higher that Z takes it. And even so, that still says nothing about advantages of races at certain times. One would assume that if a race is likely to win at time X, that race enjoys an advantage slightly before that time, no?

What would be far more intersting, though also not conclusive, would be a graph which outlines 'How large was the percentage of Z wins in ZvP at each interval', which is fundamentally different from 'at each interval, if the game ends, how often does it end into Z's favour in ZvP'. My bet is that because 7pools are actually quite rare, it would not at all show the huge spike for Z in the early game.


But I do have to say that THIS section, was a very good point, that people completely misunderstand.

Frankly i like any post that rejects that we have any real understanding of imbalance.
The world is ending what should we do about it?
GeneralSnoop
Profile Joined February 2011
United States142 Posts
May 04 2012 08:04 GMT
#86
good job mods. i'm glad you keep people like this out of our community. this guy calling TL pretentious.........
"I could probably live in trees" - LiquidJinro
Chenz
Profile Joined November 2010
Sweden1197 Posts
May 04 2012 08:13 GMT
#87
What the guy says is nothing new, but a lot of posters on this forum take the tlpd statistics for more than it is.It's true what he says, you can't judge balance by those statistics, especially not over the time period of 2 or even 3 months.
hypercube
Profile Joined April 2010
Hungary2735 Posts
May 04 2012 08:17 GMT
#88
The tone is annoying but all the points he makes are valid. And not all of them are obvious either.
"Sending people in rockets to other planets is a waste of money better spent on sending rockets into people on this planet."
Roxor9999
Profile Joined December 2011
Netherlands771 Posts
May 04 2012 08:35 GMT
#89
So basically that guy on reddit says something that we already know and he thinks that we don't know in a very arrogant and condescending tone. I don't see a reason for this thread to exists since these points have already been expresses multiple times in for say the win rates thread every month. The only reason I can think of is for him to feel smarter then everyone else which is a dick move.
Grohg
Profile Joined March 2011
United States243 Posts
May 04 2012 09:07 GMT
#90
I'm pretty sure anyone who actually cares about the stats understands the basics he is attempting to explain. Anyone who has taken a statistics course will learn about logical fallacies before almost anything else. It's wonderful to see that he thinks people who frequent TL are incapable of pointing out obvious errors. I'm pretty sure almost any thread with stats in it gets analyze critically by multiple people...in fact it's probably the first thing that happens when a long post with a lot of graphs and charts pops up. The only thing about it is that the people who really don't care post in the thread first saying how great the OP is without having read it all. Jump to page 3 or 4 in those threads and someone will be correcting all of the mistakes or critiquing the post. TL isn't full of idiots and I think the winrate charts are widely understood to simply be something interesting to see...not to balance a game off of. Of course there are people who open up TL and find threads to whine about balance in. These people have major confirmation bias and it should be expected to see it pop up in a thread where balance is potential up for discussion.

Keep your ego in check...your high horse is a lot shorter than you think. If you'd stop to look up you'd be able to see that there are plenty of people on your level and above you and your pretentious comments.
You can't spell slaughter without laughter.
halfies
Profile Joined November 2011
United Kingdom327 Posts
May 04 2012 09:17 GMT
#91
On May 04 2012 09:16 Ripper41 wrote:
Show nested quote +
On May 04 2012 09:09 halfies wrote:
praetentious?
really?
i hope that how he spelt it too, because it would be really funny if he wrote this much about people being pretentious and used big words that he couldn't even spell.

Well he explains...
it's worse, he spelled it that way intentionally. still, he makes some good points in the substance of what he says.

so hes spelling pretentious wrong because hes pretentious? oh the irony.
he certainly doesnt seem to say anything wrong, but theres not much thats really enlightening either
straycat
Profile Blog Joined May 2011
230 Posts
May 04 2012 09:29 GMT
#92
Please, could you all stop being so insulted (wrongly insulted at that, since it was aimed at redditors in the first place) and actually try to read it? Contrary to what many have posted in their attempts to bring some of the hurt from their bleeding egos back to the proxied OP, it was an interesting read with interesting perspectives.
Chunhyang
Profile Joined December 2011
Bangladesh1389 Posts
May 04 2012 09:45 GMT
#93
On May 04 2012 18:29 straycat wrote:
Please, could you all stop being so insulted (wrongly insulted at that, since it was aimed at redditors in the first place) and actually try to read it? Contrary to what many have posted in their attempts to bring some of the hurt from their bleeding egos back to the proxied OP, it was an interesting read with interesting perspectives.


That most people already knew.
If you could reason with haters, there would be no haters. YGTMYFT
Cyberonic
Profile Blog Joined September 2011
Germany80 Posts
May 04 2012 09:55 GMT
#94
On May 04 2012 18:45 Chunhyang wrote:
Show nested quote +
On May 04 2012 18:29 straycat wrote:
Please, could you all stop being so insulted (wrongly insulted at that, since it was aimed at redditors in the first place) and actually try to read it? Contrary to what many have posted in their attempts to bring some of the hurt from their bleeding egos back to the proxied OP, it was an interesting read with interesting perspectives.


That most people already knew.


If that was true, why would there be so many posts containing balance whine with "supporting" their viewpoints by exactly the type of graphs criticized?
KuKri
Profile Joined February 2012
Germany168 Posts
May 04 2012 10:06 GMT
#95
On May 04 2012 18:55 Cyberonic wrote:
Show nested quote +
On May 04 2012 18:45 Chunhyang wrote:
On May 04 2012 18:29 straycat wrote:
Please, could you all stop being so insulted (wrongly insulted at that, since it was aimed at redditors in the first place) and actually try to read it? Contrary to what many have posted in their attempts to bring some of the hurt from their bleeding egos back to the proxied OP, it was an interesting read with interesting perspectives.


That most people already knew.


If that was true, why would there be so many posts containing balance whine with "supporting" their viewpoints by exactly the type of graphs criticized?

Hey, Mr. Statistician, your degree in statistics should give you the answer.
XenoX101
Profile Joined February 2011
Australia729 Posts
Last Edited: 2012-05-04 11:33:29
May 04 2012 10:43 GMT
#96
On May 04 2012 18:55 Cyberonic wrote:
Show nested quote +
On May 04 2012 18:45 Chunhyang wrote:
On May 04 2012 18:29 straycat wrote:
Please, could you all stop being so insulted (wrongly insulted at that, since it was aimed at redditors in the first place) and actually try to read it? Contrary to what many have posted in their attempts to bring some of the hurt from their bleeding egos back to the proxied OP, it was an interesting read with interesting perspectives.


That most people already knew.


If that was true, why would there be so many posts containing balance whine with "supporting" their viewpoints by exactly the type of graphs criticized?


This is the biggest issue I have with people saying this is 'common knowledge', if people were/are already well aware of the specific issues with these graphs (i.e. not just for example the general idea of what a confounding variable is) then many of the stat-pointing balance whine posts would either A) not exist or B) not be taken as seriously by the majority. Yet people freely point to these graphs as a beacon of balance truth whenever they get posted and nobody says a word.

Also, if these issues were common knowledge, then why haven't statistic-based threads such as the 'win rate over game length' one addressed them specifically? This kind of scrutiny should be common practice in statistics threads, and if it isn't mentioned by the OP then posters should be critical of this kind of omission because of how integral it is to the meaningfulness of the statistics.

The winrates post is an even worse example of a statistics thread, created completely devoid of any statistical analysis whatsoever and with very few people being critical of the graph and none of them being critical of the thread. Why do simple undeveloped threads always get removed yet simple undeveloped graphs persist? Regardless of whether people were aware of what OP said or not, it really needs to start being put into practice if we want to have proper discussions about trends in SC2.

Cyberonic
Profile Blog Joined September 2011
Germany80 Posts
Last Edited: 2012-05-04 11:33:32
May 04 2012 11:33 GMT
#97
On May 04 2012 19:06 KuKri wrote:
Show nested quote +
On May 04 2012 18:55 Cyberonic wrote:
On May 04 2012 18:45 Chunhyang wrote:
On May 04 2012 18:29 straycat wrote:
Please, could you all stop being so insulted (wrongly insulted at that, since it was aimed at redditors in the first place) and actually try to read it? Contrary to what many have posted in their attempts to bring some of the hurt from their bleeding egos back to the proxied OP, it was an interesting read with interesting perspectives.


That most people already knew.


If that was true, why would there be so many posts containing balance whine with "supporting" their viewpoints by exactly the type of graphs criticized?

Hey, Mr. Statistician, your degree in statistics should give you the answer.


haha, good point. However note that OP/author are not the same!
Lord_Maximus
Profile Joined March 2012
Denmark46 Posts
May 04 2012 11:50 GMT
#98
Why are people in this thread badmouthing this? It's completely correct and it's something a lot of people get wrong. If "praetentious" and "reddit" weren't said no one would mind.
straycat
Profile Blog Joined May 2011
230 Posts
Last Edited: 2012-05-04 12:00:37
May 04 2012 12:00 GMT
#99
On May 04 2012 18:45 Chunhyang wrote:
Show nested quote +
On May 04 2012 18:29 straycat wrote:
it was an interesting read with interesting perspectives.


That most people already knew.


1. Sigh.
2. QED.
C[h]ili
Profile Joined December 2011
Germany167 Posts
May 04 2012 12:06 GMT
#100
Oh well, that guy is writing a romane on how winrate statistics are biased because ability is an unobservable variable. That is hardly new, but we take a look on winratios nevertheless, because an important function of statistics is to facilitate discussion and communication.
halfies
Profile Joined November 2011
United Kingdom327 Posts
May 04 2012 12:23 GMT
#101
On May 04 2012 16:31 oZe wrote:
Show nested quote +
On May 04 2012 09:09 halfies wrote:
praetentious?
really?
i hope that how he spelt it too, because it would be really funny if he wrote this much about people being pretentious and used big words that he couldn't even spell.


You don't even begin a sentence with a capitalized letter, made even more fun by the fact that it was "I". Thereby forfeiting all rights to complain about spelling. Also if you have a problem with his argument, attack his argument.

In American English, spelt means exclusively a hardy wheat grown mostly in Europe, and the verb spell makes spelled in its past-tense and past-participial forms.

Ps. I put my money on you being the creator of the bad boy ^^

i wasn't complaining about anything except how pretentious it is to spell pretentious like that for the reasons he does it. i don't care about the spelling aspect as much as the reasons for it, which also explains why i don't bother to capitalize everything properly on a forum, since it doesn't matter and is easier.
Telenil
Profile Joined September 2010
France484 Posts
May 04 2012 13:47 GMT
#102
His overly agressive tone notwithstanding, he actually made some excellent points, the most important of all being that what matters is the number of players, not the number of games. Also good points about all-in defense making the defending race "favored" in the next few minutes.
Mass Recall: Brood War campaigns on SC2: http://www.teamliquid.net/forum/viewmessage.php?topic_id=303166
fortheGG
Profile Joined April 2011
United Kingdom1002 Posts
May 04 2012 14:04 GMT
#103
I remember seeing Artosis post something similar before.. Yes people (and hence mods because mods are people right?) like seeing nice and simple graphs/figures/conclusions but I don't think TL has more of that than anywhere else.

On May 04 2012 21:23 halfies wrote:
Show nested quote +
On May 04 2012 16:31 oZe wrote:
On May 04 2012 09:09 halfies wrote:
praetentious?
really?
i hope that how he spelt it too, because it would be really funny if he wrote this much about people being pretentious and used big words that he couldn't even spell.


You don't even begin a sentence with a capitalized letter, made even more fun by the fact that it was "I". Thereby forfeiting all rights to complain about spelling. Also if you have a problem with his argument, attack his argument.

In American English, spelt means exclusively a hardy wheat grown mostly in Europe, and the verb spell makes spelled in its past-tense and past-participial forms.

Ps. I put my money on you being the creator of the bad boy ^^

i wasn't complaining about anything except how pretentious it is to spell pretentious like that for the reasons he does it. i don't care about the spelling aspect as much as the reasons for it, which also explains why i don't bother to capitalize everything properly on a forum, since it doesn't matter and is easier.


The writer's pov was that people don't bother looking into details and try to talk about things they don't know much about. How is your post not proof of that.
Rakasha
Profile Joined April 2010
Canada71 Posts
May 04 2012 14:32 GMT
#104
I agree to the point he is trying to make, that statistic is not something you can trow out without context. And it's true that's a science but what else do we have to try to check if the game is balance? I doubt that even blizzard can check every player if they are pushing it in their favor just by being good player.

Still think it's the best way to check the game.

PS: sorry for the mistake, english is not my primary language.
Do not take life too seriously. You will never get out of it alive.
Nerski
Profile Blog Joined November 2010
United States1095 Posts
Last Edited: 2012-05-04 18:24:04
May 04 2012 18:22 GMT
#105
Do I or most people probably like the way he went about that post, probably not at all.

Does he make some points that some people realized exist, but just largely ignore because of the 'not worth it' factor. By all means yes, that's the case.

Let me expound upon what I've said above so it's not misconstrued as to what I was getting at. The VAST majority of people who either A) Care about the game or B) Care about the competitive arena of the game, are on the overall pretty satisfied with the product. Though I'm sure someone can find a statistic that states otherwise

I could make a long winded example, but I'll post the TLDR version of why most people fall under the 'not worth it' factor in regards to statistical posts.

1. The majority of people making a complaint based on a bias and statistics will not be swayed in their opinion. If they took the time to find a stat to try to prove a point, they are probably going to hold fast to that point and will die before the yield. That however, won't stop equally biased people for or against from arguing in a thread.

2. Most people realize someone posting statistics to make a point doesn't care if they are reading the stats wrong or right. They are trying to vent and make a point, so arguing with them will just make you go around in circles.

Overall I find his bashing of a particular community pointless, just as pointless as arguing with someone who has a clearly biased opinion on a matter. I wouldn't condone someone bashing on any community, by making a blanket judgment based on a very small statistical sample size to lump the whole community into it (see what I did there).
Twitter: @GoForNerski /// Youtube: Youtube.com/nerskisc
latan
Profile Joined July 2010
740 Posts
May 04 2012 18:27 GMT
#106
I thought it was a very bad and mostly uninformative post. the OP admitted to not even knowing that much about statistics, and you can tell by what he writes.
Jisall
Profile Blog Joined January 2011
United States2054 Posts
May 04 2012 19:28 GMT
#107
On May 05 2012 03:27 latan wrote:
I thought it was a very bad and mostly uninformative post. the OP admitted to not even knowing that much about statistics, and you can tell by what he writes.


Post isn't from the OP, post is quoted from a man on reddit.
Monk: Because being a badass is more fun then playing a dude wearing a scarf.. ... Ite fuck it, Witch Doctor cuz I like killing stuff in a timely mannor.
Greenei
Profile Joined November 2011
Germany1754 Posts
May 04 2012 19:44 GMT
#108
how about saying what we CAN get from the statistics and how things SHOULD be done instead of saying"this and this and this is bad".
IMBA IMBA IMBA IMBA IMBA IMBA
DarkPlasmaBall
Profile Blog Joined March 2010
United States44145 Posts
May 04 2012 19:59 GMT
#109
Drabzalver just created arbitrary fallacies and then disproved them. They don't even necessarily correlate to what we argue over.

Basically he's just smugly (or, one might say, "praetentiously") knocking down his own strawman arguments and then talking about how shitty TL is.

But now I'm glad to know that 1000 games of MKP vs. DRG doesn't define TvZ balance. Of course, he doesn't talk about how that particular example is nonsensical because it also necessitates how they'll learn each other's playstyles and the like (e.g. MKP's strategies don't represent all high-level Terrans' mindsets), and so Drabzalver's anecdote is pretty much garbage.

But his focus on how the common cause of a player winning two matches in a row 4-0 is that he's "fucking good" was so polished and well-spoken... I don't know how I could have missed such a revelation without it being pointed out to me by this guy.

I'm also glad I learned from him that 7 pools are quite rare though. And that Reddit really does live up to its reputation.
"There is nothing more satisfying than looking at a crowd of people and helping them get what I love." ~Day[9] Daily #100
Drowsy
Profile Blog Joined November 2005
United States4876 Posts
May 04 2012 20:30 GMT
#110
Fallacy 1: Mostly true, but when you examine 3 months of data and average it, assuming there haven't been any major changes in metagame, maps, or patching, you can very easily draw conclusions about balance based on winrates alone. April/March/Februrary works pretty well.

Fallacy 2: Not true and plainly really dumb. Internationally, there are enough players in each race to balance out differences in player skill between races. In Korea, this may not be as true, but to claim that MKP alone will shift the balance of a matchup internationally is stupid. Nobody is that dominant in sc2.

Fallacy 3: Agree wholeheartedly, can't really control for build order/bo losses.

Its kind of academic now, as balance is very close to 50/50/50 internationally and nothing is really broken. Teamliquid just has some insane bias against protoss in pvt despite strong evidence contrary to popular beliefs.
Our Protoss, Who art in Aiur HongUn be Thy name; Thy stalker come, Thy will be blunk, on ladder as it is in Micro Tourny. Give us this win in our daily ladder, and forgive us our cheeses, As we forgive those who play zerg against us.
1st_Panzer_Div.
Profile Joined November 2010
United States621 Posts
May 04 2012 20:51 GMT
#111
Hah, he calls out all the statistical bs perfectly. Maybe not in the kindest way, but he nailed it mostly. The TLPD non-korean chart is actually useful, and as SC2 progresses it will become much more accurate as the skill gap between players narrows.

This is also exactly why Blizzard is loathe to release a lot of numbers, because people have no idea what the hell they're looking at, and will make crap up perfectly to fit their answers.
Manager, Team RIP ZeeZ
Raid
Profile Joined September 2010
United States398 Posts
May 04 2012 20:55 GMT
#112
Seems like just another guy who is just full of him self and wrote a self masturbation post.. I like how he doesn't have a TL account and seemingly thinks he knows he smarter than 99% of everyone here bwahahaha.
freetgy
Profile Joined November 2010
1720 Posts
May 04 2012 20:59 GMT
#113
everyone that studied at least half year statistics knows that statistics that are used on this forums by players don't mean shit on balance why do you think there has do be done solid math/statistical hypotesis testing to do reliable deduction about balance.
zefreak
Profile Blog Joined December 2011
United States2731 Posts
May 04 2012 21:11 GMT
#114
People need to stop complaining about the author's writing style and aggression and focus on the content. He is 100% about the use of statistics in the SC2 scene. They are worthless. They are actually worse than worthless because they give people the illusion of knowledge where none exists.
www.gosu-sc.com - Starcraft News, Strategy and Merchandise
coverpunch
Profile Joined December 2011
United States2093 Posts
May 04 2012 21:15 GMT
#115
The general points are good, the specifics are not and his conclusions are not. The statistics that people generally use to judge balance have lots of fallacies and are used in misleading ways to confirm prejudicial beliefs. They have lots of problems and aren't particularly informative. But the OP needs to provide a better alternative or even some other direction that we should go. Should we abandon trying to statistically analyze StarCraft altogether? Should everyone just STFU about balance?
freetgy
Profile Joined November 2010
1720 Posts
Last Edited: 2012-05-04 21:19:40
May 04 2012 21:18 GMT
#116
On May 05 2012 06:15 coverpunch wrote:
The general points are good, the specifics are not and his conclusions are not. The statistics that people generally use to judge balance have lots of fallacies and are used in misleading ways to confirm prejudicial beliefs. They have lots of problems and aren't particularly informative. But the OP needs to provide a better alternative or even some other direction that we should go. Should we abandon trying to statistically analyze StarCraft altogether? Should everyone just STFU about balance?


you can statistically analyze your own games, to draw conclusions (to understand your weaknesses to fix them)
Still has nothing to do with balance but just your results.
NeonFox
Profile Joined January 2011
2373 Posts
May 04 2012 21:24 GMT
#117
Like the guy or not he's completely right, and this should be included as a disclaimer above any post talking about winrates.
zefreak
Profile Blog Joined December 2011
United States2731 Posts
May 04 2012 21:27 GMT
#118
On May 05 2012 06:15 coverpunch wrote:
The general points are good, the specifics are not and his conclusions are not. The statistics that people generally use to judge balance have lots of fallacies and are used in misleading ways to confirm prejudicial beliefs. They have lots of problems and aren't particularly informative. But the OP needs to provide a better alternative or even some other direction that we should go. Should we abandon trying to statistically analyze StarCraft altogether? Should everyone just STFU about balance?


Sometimes admitting that there is no strong answer currently is the best option. If you haven't read Taleb's The Black Swan or Fooled By Randomness I really recommend it. The issue isn't just attempting to do statistical analysis with insufficient data and improper methods, but an overconfidence in the results and the blind acceptance by a mathematically inept community.
www.gosu-sc.com - Starcraft News, Strategy and Merchandise
vrok
Profile Joined August 2009
Sweden2541 Posts
Last Edited: 2012-05-04 22:09:25
May 04 2012 22:03 GMT
#119
Some pretty obvious shit everyone should know simply by employing some plain old common sense, and probably gets said in every single statistics thread. And then coupled with random bullshit. It should be titled what most people already know. Keep crappy reddit rants on reddit please.

What's even the point of arguing overall balance anymore? There isn't one. Whatever issues remain are marginal at best in the grand scheme of things.
"Starcraft 2 very easy game" - White-Ra
FeyFey
Profile Joined September 2010
Germany10114 Posts
May 04 2012 22:15 GMT
#120
people study statistics for years, so that most people don't notice how far they stretched the stats in their favor. Otherwise statistics are fairly simple to read. But if people make graphs out of them and show you, they are fishy either way.
Nokshalees
Profile Joined March 2012
United States120 Posts
May 04 2012 22:20 GMT
#121
Sure is pretentious reddit circlejerk in here. I didn't get to finish that long pretentious post because I don't think you know how statistics work.

Why not make more dramas and upboat more just to get le karma XDXD and let TL do their thing.
MKP/MVP/Kas/ThorZaiN/Jinro/ForGG | I MAEK HAE BYUNG. | #terranprideworldwide
See.Blue
Profile Blog Joined October 2008
United States2673 Posts
May 04 2012 22:40 GMT
#122
TL is very praetentious. Lol
Destroyr
Profile Joined November 2011
Germany299 Posts
May 04 2012 22:54 GMT
#123
Great write up thanks
taddel
Profile Joined April 2012
Germany13 Posts
May 05 2012 00:08 GMT
#124
I completely agree with the author.
To the people who think it's common sense what he wrote I have to say that there are plenty of posts on team liquid which make exactly the mistakes he mentioned.
Azarkon
Profile Joined January 2010
United States21060 Posts
Last Edited: 2012-05-05 00:32:48
May 05 2012 00:29 GMT
#125
On May 04 2012 09:00 Cyberonic wrote:
Drabzalver

- Fallacy: winrates indicate balance

Yes and no, for the most part, they do not indicate balance, rather, they indicate balance shifts. It's so trivial to see that it boggles me that people don't figure this out on their own. Assume that race X was actually underpowered last month and is balanced this month. This means that the X players who actually qualified last month and stayed in tournaments are actually better than the Y and Z players, therefore, now that it gets balanced, as they are overal better, they start smashing Y and Z because they are better, thereby suddenly making the graph appear as X has been 'overbuffed' or whatever else while simply the few X players that were around in the scene were better. This will continue on until the mediocre Y and Z players who were more so carried by their race than the X players get weeded out.

...

The only way to find out absolute balance is to get a random pool of pros, force them to play random, and have a round robin tournament to ensure that everyone plays the same amount of games. Very unfeasible to get enough games with that for a reasonable sample size.


It's worse than this. Your definition of balance shifts is that it varies with the distribution of better players in a month's tournaments. But it's not just players that are shifting, it's their meta-game. Say Naniwa figures out a new mothership rush style tomorrow that Zerg has no answer for. The win rate for P vs. Z during that month is going to be lopsided in favor of Protoss because all the Protoss pros are going to switch to it and crush their Zerg opponents. But then a few weeks later, Stephano figures out a three hatch mass ultra style that beats the crap out of the mothership rush. Then the pendulum swings right back and those Protoss who practiced their arses off on how to rush motherships is going to get creamed until they readjust.

Because of this, getting a random pool of pros, forcing them to play random, and having a round robin tournament to ensure that everyone plays the same amount of games is still isn't going to give you absolute balance - because the pros are going to play each race the way the meta-game tells them to play, which gives you the balance of the current meta-game but not the balance of the game.

A lot of obvious imbalances are detectable through observing lopsided match-ups and thinking through the game logic to see that the lopsidedness is due to game design and not player skill, but there is no way to figure out the absolute balance of the game without solving the game, and thereby knowing the optimal strategies in each match-up.
Chargelot
Profile Blog Joined December 2010
2275 Posts
Last Edited: 2012-05-05 00:45:02
May 05 2012 00:44 GMT
#126
Anyone who says "pray-tentious" is probably the most pretentious bastard alive. I don't understand why this shit has to be shared on Team Liquid. What the fuck is reddit even good for if we just discuss it all here?
if (post == "stupid") { document.getElementById('post').style.display = 'none'; }
upperbound
Profile Joined September 2011
United States2300 Posts
Last Edited: 2012-05-05 00:51:07
May 05 2012 00:50 GMT
#127
It's too bad the OP of the reddit thread is such a douche, because there are probably feasible ways to account for some of the statistical inaccuracy to produce "truer" results. However, the TLPD graphs are by no means useless as approximations for the true statistics and attempts to characterize them as such are probably equally misguided as carrying them as gospel.
CluEleSs_UK
Profile Blog Joined August 2010
United Kingdom583 Posts
May 05 2012 01:14 GMT
#128
Kinda sad that in typical TL fashion, somebody makes an intelligent correct post, and instead of discussing it, they spend 7 pages complaining about his spelling/declaring they already knew everything he said.

Good post, well written, statistics very rarely have useful connotations in the world of Starcraft, and are reserved only for people with bogus arguments they're trying to defend.
"If it turns out he is leaving the ESL to focus on cooking crystal meth I'll agree that it is somewhat disgraceful, but I'll hold off judgement until then."
Normal
Please log in or register to reply.
Live Events Refresh
Road to EWC
14:00
Global Qualifier - Day 2
goblin vs HeRoMaRinELIVE!
Ryung vs Astrea
ewc_black2889
ComeBackTV 1566
WardiTV772
BRAT_OK 250
CranKy Ducklings208
Rex153
3DClanTV 137
EnkiAlexander 99
Liquipedia
[ Submit Event ]
Live Streams
Refresh
StarCraft 2
BRAT_OK 250
Rex 153
Hui .115
StarCraft: Brood War
Britney 31625
Calm 4792
PianO 351
firebathero 254
EffOrt 118
Hyun 49
scan(afreeca) 31
Terrorterran 14
Dota 2
Gorgc6876
capcasts48
League of Legends
Dendi1018
Counter-Strike
summit1g8224
qojqva921
flusha341
Super Smash Bros
hungrybox1070
Mew2King77
Heroes of the Storm
Khaldor809
Other Games
tarik_tv58104
gofns46158
FrodaN1935
Mlord718
B2W.Neo656
C9.Mang0531
ArmadaUGS156
KnowMe141
elazer59
Organizations
Other Games
EGCTV1764
gamesdonequick1265
StarCraft 2
Blizzard YouTube
StarCraft: Brood War
BSLTrovo
sctven
[ Show 17 non-featured ]
StarCraft 2
• AfreecaTV YouTube
• sooper7s
• intothetv
• Migwel
• Kozan
• IndyKCrew
• LaughNgamezSOOP
StarCraft: Brood War
• blackmanpl 47
• Michael_bg 9
• Eskiya23 8
• STPLYoutube
• ZZZeroYoutube
• BSLYoutube
Dota 2
• WagamamaTV476
• Ler122
Counter-Strike
• Nemesis6746
Other Games
• imaqtpie779
Upcoming Events
BSL: ProLeague
8m
UltrA vs Sziky
Dewalt vs MadiNho
Replay Cast
1d 16h
Replay Cast
3 days
The PondCast
3 days
Replay Cast
4 days
BSL: ProLeague
6 days
SOOP
6 days
Sparkling Tuna Cup
6 days
Liquipedia Results

Completed

NPSL Lushan
2025 GSL S2
Heroes 10 EU

Ongoing

JPL Season 2
BSL 2v2 Season 3
BSL Season 20
Acropolis #3
KCM Race Survival 2025 Season 2
CSL 17: 2025 SUMMER
Copa Latinoamericana 4
Championship of Russia 2025
RSL Revival: Season 1
Murky Cup #2
BLAST.tv Austin Major 2025
ESL Impact League Season 7
IEM Dallas 2025
PGL Astana 2025
Asian Champions League '25
BLAST Rivals Spring 2025
MESA Nomadic Masters
CCT Season 2 Global Finals
IEM Melbourne 2025
YaLLa Compass Qatar 2025
PGL Bucharest 2025

Upcoming

CSLPRO Last Chance 2025
CSLPRO Chat StarLAN 3
K-Championship
uThermal 2v2 Main Event
SEL Season 2 Championship
Esports World Cup 2025
HSC XXVII
BLAST Open Fall Qual
Esports World Cup 2025
BLAST Bounty Fall 2025
BLAST Bounty Fall Qual
IEM Cologne 2025
FISSURE Playground #1
TLPD

1. ByuN
2. TY
3. Dark
4. Solar
5. Stats
6. Nerchio
7. sOs
8. soO
9. INnoVation
10. Elazer
1. Rain
2. Flash
3. EffOrt
4. Last
5. Bisu
6. Soulkey
7. Mini
8. Sharp
Sidebar Settings...

Advertising | Privacy Policy | Terms Of Use | Contact Us

Original banner artwork: Jim Warren
The contents of this webpage are copyright © 2025 TLnet. All Rights Reserved.