AlphaStar released: Deepmind Research on Ladder - Page 11

Xain0n

Italy3963 Posts

July 29 2019 10:10 GMT

#201

On July 29 2019 11:53 Muliphein wrote:
But clearly it is making a lot of mistakes in the micro and battle engage department.

And you saying that 'it has no idea' when it is a neural net and 'learns builds from other players' when it is trained by playing against itself, makes any further debate useless.

Show nested quote +

On July 29 2019 09:41 Xain0n wrote:

On July 29 2019 08:47 Muliphein wrote:

On July 29 2019 08:22 Inrau wrote:

On July 29 2019 03:19 Muliphein wrote:
The AI has exactly the same units as the player has. So saying the AI is playing rugby with tanks rather than human players is a false analogy. The analogy works, any analogy works up to a point, but it shows exactly why what AlphaGo is doing is fair. Not why it is unfair.

AlphaStar does not have to box-select units to move. The AI does not have any mouse trail so to speak. All players paint the map with their cursors.
[image loading]

The limitations are nice, locking the actions to a camera, lowering the APM. But Alphastar can still do things at 120APM that would take a human 600 APM.

The AI might as well be playing with three keyboards and mice. Not to mention the clicking one pixel in the top right corner to select a building or a unit it needs.

Yes, the AI is playing the same game, but without inherently human limitations. I have no idea what the point is that you are trying to argue. You think it is unfair for an AI to solve an AI problem if human limitations aren't hard coded in? Do you also think a chess AI needs to be forced to take a piss break because humans will I evitably have to do this as well under standard time control? Where draw the line. Why don't you support the view that for any AI to beat a AI problem, it needs to solve the problem by modeling a human brain solving the problem?

All this comes from the delusion that people believe SC2 is richer and more intellectually pleasing than it actually is. People cannot accept that the ideal play is to build mass roaches/stalkers and combined with perfect macro, micro and deciding when to engage. So the AI needs to be limited to play more like a human, and then the AI wil either lose to humans, or finally come up with genius elegant strategies.

Yet, all the facts we have yell at us the opposite. So please stop bringing up 'fairness' because there cannot be such a thing. As long as the AI doesn't get units with more hp or free resources, or the ability to see through the FoW, it is playing the wrong game. And when it seems stupid because it doesn't truly understand what is going on in the game, but it is beating all the best human players (and yes we are not quite there yet at all), maybe then you guys will accept that 'understanding the game' doesn't really matter for winning. (And let me note that Alpha Zero (chess or go) also don't really understand the game. They just happen to take the correct action). They cannot explain to you why they do what they do and it requires quite a bit of effort from Deepmind engineering to figure that out.)

If this is Deepmind's goal with Starcraft 2, they are wasting time and money. If they believed, as you seem to, that beating every player with inhuman macro and micro would be the right way of playing Sc2, I don't know why they would use a neural network for the task.

So because this disappointed your intellectual curiosity, for something that likely isn't even there to begin with, Deepmind is wasting their time and money when in fact they set up an RTS game, up to now played by only a bunch of scripts, as a math problem that gets solved by their neural net architecture and training methods, which generalized very well to similar real-world problems. Yeah, that makes sense. I my field of biophysics, Deepmind has a neural network that does better structure prediction of protein folding than any of the existing algorithms. And that specific competition has been running since 1994. Deepmind entered it last year for the firs time and immediately won.

Do you know how much money is invested in drug development that involves protein folding or protein protein interactions each year? You have absolutely no idea what you are talking about.

Show nested quote +

And in SC2, Alphastar makes micro decisions superior to all humans and beats most humans, even before they finalized their version to challenge the top player. And in Chess/Go Alphazero sees patterns impossible to see by a human.

Show nested quote +

SC2 isn't a game of strategy. It is a game decision making and execution. Deepmind is only making their AI 'play like a human' to not offend the SC2 community too much. Alphafold also doesn't fold proteins 'like a human'. It solves the problem. And in SC2, that problem is winning the game. Not 'coming up with strategies that please Xainon. And this is achieved through superior micro, superior macro, superior multitasking, and superior battle engage decisions. Not through hard countering the enemy's build or trying to trick your opponent into hard countering something you aren't actually doing.

Show nested quote +

No. All I care about is to see how well they are able to develop the strongest playing AI possible. Not an AI that can pass a Turing test through SC2 play. And in the mean time, I get annoyed by people who for emotional selfish reasons decide to deliberately misunderstand SC2 (I assume you aren't truly ignorant) and be too lazy to learn the basics of ML and deep neural networks while still believing their misunderstandings about the nature of Alphastar is worthwhile for others to read. others.

Let's start from the conclusion, then. If Deepmind's goal was yours, why would they apply limitations at all?
Why would they ever step back from the iteration that beat Mana with inhuman map awareness and stalker micro?
Maybe they don't just want to create the strongest possible AI playing sc2? They are doing that "not to offend sc2 community"? Why would we ever get offended? Machines have been mechanichally outperforming men for a long time already.
I didn't call Deepmind complaining on how they should please my intellectual curiosity, they are choosing themselves to force Alphastar to resemble a human more with every single step.

You are right, I don't know what Alphafold is doing or how much money is invested on that project; I just don't see why would you choose a game as complex as Sc2 if your goal would just be to make a neural network perform a task much faster and much more precisely than humans with no "decision making" involved.
AlphaGo sees pattern human mind can't, but we can try to learn from it by studying its moves; if Alphastar uses 40k apm, we can witness such prowess and learn nothing.

So you get annoyed on our lack of understanding regarding Deepmind and Alphastar? Do I have to remind you Team Liquid is a forum focused on RTS games?
Go somewhere else if you want to discuss the intricacies of neural networks with people understanding them as much as you do.

When we come to sc2 itself, how can you affirm sc2 is not a game of strategy? Have you, Muliphein, solved the game? It seems pure conceit to me.
Sc2 surely is a game of strategy when two mechanichally limited humans play it while it probably is as you say when an AI faces a human; how can you know how the game looks like when two unbound agents are playing it?

TitanEX1

14 Posts

July 29 2019 11:53 GMT

#202

Currently Casted Live. Our announcement:

CobaltBlu

United States919 Posts

July 29 2019 13:32 GMT

#203

I would like to see them release it on ladder for longer period of time with no barcode. I want to see how fragile it is vs novel strategies.

deacon.frost

Czech Republic12129 Posts

July 29 2019 14:15 GMT

#204

On July 29 2019 22:32 CobaltBlu wrote:
I would like to see them release it on ladder for longer period of time with no barcode. I want to see how fragile it is vs novel strategies.

I think they want to test the AI interaction against humans, not people interaction against AI

(the latter would result in abusive strategies that wouldn't be played against humans)

If anyone answers "would I play differentally had I known I play AI" - YES, then barcode is valid. Considering smoe reactions in this thread...

Edit>
At the same time I wouldn't mind seeing how abusive people would get against verified agents, so they may want to go into both types as this would be an interesting experiment either.

ShoCkeyy

7815 Posts

July 29 2019 14:24 GMT

#205

On July 29 2019 23:15 deacon.frost wrote:

Show nested quote +

I think they want to test the AI interaction against humans, not people interaction against AI

Edit>
At the same time I wouldn't mind seeing how abusive people would get against verified agents, so they may want to go into both types as this would be an interesting experiment either.

Your edit was my initial post, thanks for that. I was going to say, it'll be cool to see both variations.

Haukinger

Germany131 Posts

July 29 2019 14:37 GMT

#206

On July 29 2019 08:22 Inrau wrote:
AlphaStar does not have to box-select units to move. The AI does not have any mouse trail so to speak. All players paint the map with their cursors.
[image loading]

The limitations are nice, locking the actions to a camera, lowering the APM. But Alphastar can still do things at 120APM that would take a human 600 APM.

These are limitations of the game client, not from the game. The game is just the rules, e.g. when issuing an attack order to a marine to a target within range, it will instantly do x damage. Or when issuing a blink order to a stalker, it will instantly blink.

Remove the "instantly" from the rules, i.e. introduce universal cooldown and lag, and AI and human are on equal grounds. Not to mention you'd also remove exploits like stutterstepping or so called "warpprism micro".

Acrofales

Spain18044 Posts

July 29 2019 15:33 GMT

#207

On July 29 2019 23:37 Haukinger wrote:

Show nested quote +

It isn't really a limitation of the game client at all. It's an issue with human ability to perform a maximum number of actions per minute. The game client's ability to process actions per minute isn't the bottleneck there. It's a human control issue. It is simply easier to select the whole army and then drag the tanks elsewhere than to select each part of the army (or even, each unit individually) and give them different commands. Because it is so much easier to do, that makes it *more* optimal for a human to do the theoretically less optimal army micro (because the tanks spenda few milliseconds moving in the wrong direction). Meanwhile, the AI doesn't have this issue, so direct each part of the army immediately to its position. This ties in a bit to my earlier response to Muliphein, so I will continue that conversation here as well.

On July 29 2019 03:19 Muliphein wrote:

Show nested quote +

We are now trying to make a machine that is intelligent. In a philosophical sense, that is no different from making a machine that runs fast on wheels or that generates a lot of force. APM isn't limited by the human body. It is limited by the human mind. People cannot think fast enough and cannot think in parallel at all. Research shows that humans basically do not multitask.

Making a machine that is able to come up with 2000 actions a minute IS exactly like building a car with 2000 horsepower. Humans only have about 0.1 horsepower. So the machines win there with a way bigger margin. That this is not the type of intelligence where humans traditionally beat out machines is besides the point.

The AI has exactly the same units as the player has. So saying the AI is playing rugby with tanks rather than human players is a false analogy. The analogy works, any analogy works up to a point, but it shows exactly why what AlphaGo is doing is fair. Not why it is unfair.

Sure, the mind *might* be the bottleneck in hand-eye coordination, but I doubt it. I suspect that eAPM would be a lot higher if we had a perfect brain-starcraft interface. It only takes watching a few games by progamers to know that hand-eye coordination is a large part of the mechanics needed to play SC2, and a misclick (not a misthought, just a mistake in clicking on the wrong pixel) can cost you the game. However, as an AI researcher myself, I am quite confident when I say that making a perfect micro bot is not the part that the AlphaStar researchers are interested in. You don't throw tons of supercomputing resources to make a perfect micro bot. They aren't interested in "winning" at starcraft per se. It's just that winning at starcraft is a good benchmark for how good they are at solving a specific type of problem. They are interested in the problems of planning and adapting a strategy hampered by "real world" limitations.

Show nested quote +

I don't think this is an accurate account of the consensus, if there was any, at that time. Decades ago, it was actually a minority that correctly recognized that the brain is a machine like any other. And that in principle a machine could be build that does the same thing as a brain, only better. Respectable scientists for a long time placed the brain outside of any biological context. General principles of biology were not applied to it. Only with the rise of cognitive science did this change.

AI has gone through a number of "winters". The first of these was in the late 60s and 70s when it was clear that machines were not soon going to be "more intelligent" than humans despite early breakthroughs such as winning at backgammon or robots being able to correctly recognize simple objects and colors.

And you don't need to bring Cartesian duality in here, but if you do, there have been philosophers since the early 20th century who have questioned that duality, and the more we have learned about the brain, biology and particularly *computation*, the stronger the criticisms became. In particular, early AI researchers in the 60s didn't give two hoots about such arguments, and the Turing test as an evaluation tool for AI should make that clear. Note that the mind-brain duality argument is still not completely settled, although imho anybody arguing in favour of dualism is not understanding the concept of emergence.

The second AI winter was in the 90s and 00s, when it was clear that neural networks and expert machines *also* had serious limitations and despite early successes in visual object recognition and automated logical reasoning, there were still obvious gaps in what AIs could do. Deep learning has made AI, once again, reemerge from a winter. A cautious man would be hesitant to declare the problem will now be solved. In particular, things like abstract moral decision making and introspection are things that we don't really know how to do right now, and while deep learning looks a lot like a miracle, it is the same old neural networks we used in the 80s, but with more computing power and better optimization algorithms. Of course, I could also be describing a human brain...

But you are right that for the last decades it was just an issue of actually building a machine, because it proved to be quite challenging. Yes, it is true in some sense that just raw calculation wouldn't be enough. But it is very easy to calculate the phase space of Go and to then see that raw calculation was never going to solve that. And we have known for a long time that humans use pattern recognition properties of a neural network to play these games so well.

In fact, the opposite is true as people thought chess and go would be 'safe' from computers for a decade or two more than they actually were.

Show nested quote +

This is besides the point, but I beg to differ. Doing complex tasks is quite challenging for robots. It would be extremely challenging to build a robot that a human top rugby player could control using some VR interface (like in Avatar) that would allow for a similar level of play as the actual rugby player playing himself. We are decades off from that. But you were actually trying to make another point. So be careful with your language.

Sure, I don't really know how hard it is to build a robot that could play rugby. I'd argue that all you need is a remote control car with enough armored plating and horsepower, and a "ball catching, and holding mechanism". But it's beside the point. If you don't like the rugby tank, just stick to the racecar for "running" a sprint. It is an uninteresting problem. It becomes interesting when we add restrictions such as "the 100m dash must be run on 2 legs", because bipedal robotic running is something we still haven't solved adequately (although we are getting better at it).

Show nested quote +

So which one is it? Did we take way longer to solve these games? Or did we do it earlier than expected?

Both? You know I was talking about 60 years of history, with periods of unbridled optimism and AI winters of doom and gloom?

Show nested quote +

Perfect micro is an AI challenge. Not a 'how fast can I issue commands through an embedded systems interface'-challenge. That it is not the AI challenge most people are interested in, for the simple reason that it learns human players nothing new about the game, is besides the point.

It may be the case that in SC2, unlike in chess and go, an AI can play way way above the best humans without doing anything that humans hadn't realized or discovered themselves.

This all comes back to one important point. RTS games are games of execution and small scale decision making(tactics). They are not games of strategy. And their complexity is quite basis. There aren't layers upon layers that reshape how the game is played as you ascend the skill curve. Yes, the move space is huge and sparse, but in essence it is a straightforward game. Build an army stronger than your opponent, then force a fight and win the game. That's the entire game in a nutshell.

See above, I disagree. Mechanics are part of it, and the "least interesting" part from an AI perspective, but SC2 is definitely a game of strategy if you add limitations to the mechanics. The "build an army stronger than your opponent's and go and kill him with it" is a rather simplistic way of looking at it. I have no doubt that a completely perfectly executed blink stalker warp prism immortal rush "solves" the game if you allow 10000 APM (or so). And then you can definitely say that strategy is irrelevant, as the only thing to figure out is optimal movements on a map, which is a bit of a trivial problem. But if you limit the possible actions, you find that overall strategies become important, and it is not at all obvious what army is the strongest army and what is the best way to get there without just dying first. E.g. 3rd CC before Rax is sometimes possible, but straight up build order countered by plenty of early game aggression builds. But being a little bit more greedy than your opponent is generally a good strategy to get an advantage in the long term, and timing attacks exist to punish opponents exactly at moments when you expect them to be greedy and your aggression can punish them. 10k APM blink stalker micro would indeed thwart all these puny attacks, but it is irrelevant to SC as we understand the game, where strategy plays a real role. And it is exactly that part of the game that AlphaStar is designed to "solve", just as AlphaGo "solved" Go (a game where hand-eye coordination is mostly irrelevant).

alexanderzero

United States659 Posts

July 29 2019 16:17 GMT

#208

Regarding AlphaStar's apparent lack of strategy I really do question whether or not its a problem with the scale/computing power of the neural network, or a design flaw. People say that AlphaStar doesn't have the ability to react to things but that's not exactly true. The decisions that it makes during battles are direct responses to the things done by the opponent, like flying its phoenixes around and picking off units that venture too far from the group, and then engaging fully once it has a large enough army advantage.

I know that people make this distinction between tactics and strategy, but this is an artificial boundary that exists in the minds of humans. There is nothing fundamental about the theory of the game that justifies this division. The fact that it is able to think tactically is evidence that there are aspects of the game that it does understand and have the capability to reason about. Presumably if it's capacity to reason was increased to include more variables, it would start considering thing like scouting and tech switches more often. That, and more training time to allow it to do more experiments and map out more of the game.

skdsk

138 Posts

July 29 2019 16:51 GMT

#209

http://vod.afreecatv.com/PLAYER/STATION/46401370 vod of the alphastar cast event...

Muliphein

49 Posts

July 29 2019 19:35 GMT

#210

On July 29 2019 17:30 -Archangel- wrote:
Wasn't the point of this project to get AI that can solve problems? Having inhuman micro is not solving problems.

You have got to be fucking kidding!

It is like sending you to fight Superman. Superman will learn nothing beating your 1 000 000 times while all you might eventually do is somehow find kryptonite and beat him without it ever being a fair fight.

It is not about learning about SC2. It is about learning how to set up deep learning problems. And stop talking about fairness.
And the thing you hope AI will tell you about SC2 is very likely not there. People keep talking about the AI discovering new builds that humans can copy to become better. It is not going to happen because it is not relevant to high level AI play. An AI does not have the weakness that it wants to be 'clever'. And a deep learning AI will just relentlessly play the way it thinks is optimal.

There is not even a discussion that it it possible to find a hole in a deep learning AI. The AI is only as good as its training. Take the simple case of a 'Is it a cat or a dog' image recognition AI. If you provide an image of either a cat or dog in a very unusual pose, the AI might fail terribly, even though to us humans it is clearly a cat or dog. With any deep learning AI you can find input data where the AI will get it horribly wrong. But the point is that this is a tiny subset of the real input data where it fails, while for the vast majority of the input it does very well (and either outperforms humans overall or is most cost-efficient economy-wise even if humans are better). This is why when you watch the Alphago documentary, they were afraid of AlphaGo going 'delusional'.

A deep learning AI will not engage in mindgames and it will not cut corners and take risk on BOs in interesting ways. It either is fundamentally incapable of doing so, because it cares only about winning and not about being clever, because it isn't concluding anything or doing reasoning or deduction, because it has been trained playing other AIs, because it is a generalized algorithm that does the same thing for a specific game state, and because it isn't emotional or insecure. Or it won't because it is fundamentally suboptimal to play that way. And this makes sense because players like Flash also don't try to play a 'strategic' game. The AI just presents its best play and if that is not good enough it will stubbornly lose without adapting. Humans have insecurities and feel the need to outsmart their opponent. They want to do something to get an edge. They fear their opponent tricking them. They fear that playing straight up they will lose. They feel that in this match they need to do something that will guarantee them the win. A human will not be satisfied with a 51% win chance. It will try to come up with something to do better. The AI doesn't care. Hence, the AI has no need for doing marginal plays that may result into huge rewards. It will simply not explore that part of the phase space, even if there are pockets there that are really good, because overall this is a losing part of the phase space. The AI will converge in a smooth and consistent part of the phase space where it is easy to move into better versions of itself, as the network is being trained.

Sadistx

Zimbabwe5568 Posts

July 31 2019 06:10 GMT

#211

If there's anything I learned from Deep AI projects (including the Texas Hold'em NL 6-max poker AI released recently), is that AI optimizes for unexploitability, which in the context of SC2 is for least risky strategies. I believe the term used is 'Regret minimization'. It seems logical.

That it achieves a win rate of above 50% while doing this is just a side effect of what it optimizes for.

I'm honestly not particularly educated in this field, though, so correct me if what I typed is nonsense!

Acrofales

Spain18044 Posts

July 31 2019 08:13 GMT

#212

On July 31 2019 15:10 Sadistx wrote:
If there's anything I learned from Deep AI projects (including the Texas Hold'em NL 6-max poker AI released recently), is that AI optimizes for unexploitability, which in the context of SC2 is for least risky strategies. I believe the term used is 'Regret minimization'. It seems logical.

That it achieves a win rate of above 50% while doing this is just a side effect of what it optimizes for.

I'm honestly not particularly educated in this field, though, so correct me if what I typed is nonsense!

Actually it maxes its reward function. You can definitely do regret minimization by building that into the reward function (or the optimization algorithm), but there's no reason to assume that was applied. In a game with almost rock-paper-scissors like strategies, and the bots trained by adversarial games, I'm not even sure what to look for to distinguish a bot with regret minimization and without.

Equalizer

Canada115 Posts

July 31 2019 16:36 GMT

#213

At least according to Deepmind's blog post (https://deepmind.com/blog/alphastar-mastering-real-time-strategy-game-starcraft-ii/) they trained using a mixture over agent strategies using the game theory concept of the Nash equilibrium.

The basic point is that even though it may play a strategy that has a hard countered it should at randomly choose other strategies that would do well against this counter some of the time. I suppose this makes the most sense for openings but after that perhaps not so much.

What is odd is that in the games identified to almost certainly be against AlphaStar seems to have very little randomness so they may of just chosen the agent with the highest win rate for real world testing.

DimmuKlok

United States225 Posts

July 31 2019 16:54 GMT

#214

How does AlphaStar deal with cloaked units? Cloaked units are technically visible but rely on the human element to not be detected.

Acrofales

Spain18044 Posts

August 01 2019 09:49 GMT

#215

On August 01 2019 01:54 DimmuKlok wrote:
How does AlphaStar deal with cloaked units? Cloaked units are technically visible but rely on the human element to not be detected.

That depends on the API, but insofar as I know, that is deterministic, so if the AI is looking at the right part of the map, it will "see" the cloaked units. Whether it reacts is then part of AlphaStar. That, in turn, is heavily dependent on whether this situation occurred sufficiently often with enough salience to train a counter.

If you recall the showmatches, it reacted instantly and decisively when DTs appeared, but that was with full map vision. With only a "screen" sized area visible at any time, it may not have trained enough with that. Or maybe it did and reacts well?

Prev 1 9 10 11 All

Please or register to reply.

AlphaStar released: Deepmind Research on Ladder - Page 11

Completed

Ongoing

Upcoming