• Log InLog In
  • Register
Liquid`
Team Liquid Liquipedia
EST 16:58
CET 22:58
KST 06:58
  • Home
  • Forum
  • Calendar
  • Streams
  • Liquipedia
  • Features
  • Store
  • EPT
  • TL+
  • StarCraft 2
  • Brood War
  • Smash
  • Heroes
  • Counter-Strike
  • Overwatch
  • Liquibet
  • Fantasy StarCraft
  • TLPD
  • StarCraft 2
  • Brood War
  • Blogs
Forum Sidebar
Events/Features
News
Featured News
RSL Revival - 2025 Season Finals Preview6RSL Season 3 - Playoffs Preview0RSL Season 3 - RO16 Groups C & D Preview0RSL Season 3 - RO16 Groups A & B Preview2TL.net Map Contest #21: Winners12
Community News
Weekly Cups (Dec 1-7): Clem doubles, Solar gets over the hump1Weekly Cups (Nov 24-30): MaxPax, Clem, herO win2BGE Stara Zagora 2026 announced15[BSL21] Ro.16 Group Stage (C->B->A->D)4Weekly Cups (Nov 17-23): Solar, MaxPax, Clem win3
StarCraft 2
General
RSL Revival - 2025 Season Finals Preview Weekly Cups (Dec 1-7): Clem doubles, Solar gets over the hump Chinese SC2 server to reopen; live all-star event in Hangzhou Maestros of the Game: Live Finals Preview (RO4) BGE Stara Zagora 2026 announced
Tourneys
RSL Offline Finals Info - Dec 13 and 14! Tenacious Turtle Tussle 2025 RSL Offline Finals Dates + Ticket Sales! Sparkling Tuna Cup - Weekly Open Tournament StarCraft2.fi 15th Anniversary Cup
Strategy
Custom Maps
Map Editor closed ?
External Content
Mutation # 503 Fowl Play Mutation # 502 Negative Reinforcement Mutation # 501 Price of Progress Mutation # 500 Fright night
Brood War
General
[BSL21] RO8 Bracket & Prediction Contest BGH Auto Balance -> http://bghmmr.eu/ BW General Discussion FlaSh on: Biggest Problem With SnOw's Playstyle Let's talk about Metropolis
Tourneys
[ASL20] Grand Finals [BSL21] RO8 - Day 2 - Sunday 21:00 CET [BSL21] RO8 - Day 1 - Saturday 21:00 CET Small VOD Thread 2.0
Strategy
Game Theory for Starcraft Simple Questions, Simple Answers Fighting Spirit mining rates Current Meta
Other Games
General Games
Dawn of War IV Path of Exile Stormgate/Frost Giant Megathread Awesome Games Done Quick 2026! Nintendo Switch Thread
Dota 2
Official 'what is Dota anymore' discussion
League of Legends
Heroes of the Storm
Simple Questions, Simple Answers Heroes of the Storm 2.0
Hearthstone
Deck construction bug Heroes of StarCraft mini-set
TL Mafia
Mafia Game Mode Feedback/Ideas Survivor II: The Amazon Sengoku Mafia TL Mafia Community Thread
Community
General
Russo-Ukrainian War Thread US Politics Mega-thread Things Aren’t Peaceful in Palestine YouTube Thread European Politico-economics QA Mega-thread
Fan Clubs
White-Ra Fan Club
Media & Entertainment
Anime Discussion Thread [Manga] One Piece Movie Discussion!
Sports
Formula 1 Discussion 2024 - 2026 Football Thread
World Cup 2022
Tech Support
Computer Build, Upgrade & Buying Resource Thread
TL Community
TL+ Announced Where to ask questions and add stream?
Blogs
How Sleep Deprivation Affect…
TrAiDoS
I decided to write a webnov…
DjKniteX
James Bond movies ranking - pa…
Topin
Thanks for the RSL
Hildegard
Customize Sidebar...

Website Feedback

Closed Threads



Active: 1958 users

Heuristics for Hotkey-based Player Identification

Blogs > Loser777
Post a Reply
Loser777
Profile Blog Joined January 2008
1931 Posts
Last Edited: 2013-09-20 03:51:03
September 19 2013 22:39 GMT
#1
This is the accompanying blog post to this thread:
http://www.teamliquid.net/forum/viewmessage.php?topic_id=429661

Please check that out before reading this!

That control group configurations allow skilled individuals to identify smurfs
has been known since the days of Brood War and notably demonstrated by roMAD. I
present a small collection of heuristics for systematically evaluating the
similarity between different hotkey setups and subsequently a method for
replay-based player identification. Some of these techniques are currently
implemented in vroMAD. These techniques will hopefully allow for the automatic
identification of players from large repositories of replay data.

Similarity Measures
Similarity measures are used in a wide variety of disciplines ranging from
applied mathematics to bioinformatics. They are commonly used in similarity
matrices, which can be thought of as a graph describing how close a given data
point is to another. We can adopt the idea of a similarity measure to hotkey
setups used by various players. The relevant question posed is: given two
players, how can we quantify how similar their hotkey setups are?

If we have a quantification of similarity between hotkey setups, we can do
several things:
  • Generate a ranking of similarity between players from an anonymous/"unknown"
    replay and a database of replays with confirmed player identifications
  • Cluster players from replays into groups with similar setups
  • From the clusters, classify players


At the moment, vroMAD only does the first of those things.

Now, we move into the juicy details and introduce three similarity models:

Frequency-Distribution Based Similarity
This "low-hanging fruit" (I know people hate this phrase) is the similarity
measure is currently implemented in vroMAD because of its extreme simplicity.
Frequency-Distributions of hotkey selection are generated from replay data. That
is, given the player data from a replay, we proceed as follows:
1. Extract all hotkey selections
2. Bin each of these selections according to their number {0,1,2,3,4,5,6,7,8,9}
3. Generate a frequency distribution vector in R^10 space where each element
corresponds to the frequency of selection of a specific hotkey.

Example vector: [0 0.5 0.2 0.2 0.3 0.1 0 0 0 0] (selections/second)

4. Calculate a similarity using a Gaussian function: given player 1 with
frequency distribution x1 and player 2 with frequency distribution x2, we
compute exp(-((x1-x2)^2)/2(sigma)^2). In this case, we take sigma as the
standard deviation of data with respect to each of the dimensions.

Note that this measure is theoretically race-agnostic. That is, it is not
directly influenced by a player's race, as it is not mapped to any race-specific
unit or buildings. This is what I refer to as a "roMAD-complete" similarity
measure, as it can be used to inform on players suspected of offracing. (roMAD
was famously able to identify off-racing progamers just from their hotkey
setups)


Fixed Unit Mapping Based Similarity
This is the "most-obvious" model for identification, and works as follows: given
a player and a race, we generate a vector in R^10 where the value of each
dimension corresponds to a race-specific unit e.g. Drone/Hatchery/Queen/Roach
for Zerg. For hotkeys with multiple types of units bound, we simply choose the
most frequent unit or adopt a similar technique. This technique is not
"roMAD-complete" unless we choose a very general mapping of unit types to
numbers. With a hotkey vector for each player, we apply the Gaussian function as
described previously.

For the sake of example, say we have a Zerg player and 1 maps to Roach, 2 to
Hydra, 4 to Hatchery, 5 to Queen, and 7 to Infestor. -1 Maps to no-selection.
Example vector: [-1 1 2 7 4 5 -1 -1 -1 -1] (unitless)

Floating Unit Mapping Based Similarity
We can improve formulation of "Fixed Unit Mapping Based Similarity". This is
because of each of these techniques attempt to map a hotkey setup into some
vector space and compute a similarity based on distance. However, it can be
seen that "Fixed Unit Mapping Based Similarity" doesn't generalize well to the
concept of distance. That is, (given two Zerg players) if one binds
control-group 1 to Roaches only, and other to Zerglings only, what is the
distance between their setups? Even if we say ground units are closer to other
ground units and further from air units and even further from buildings, "Fixed
Unit Mapping Based Similarity" remains an awkward model. To address this
problem, I introduce the "floating" version of this model. This model switches
the organization of the vector: that is, we instead define classes or types of
units a priori as the dimensions of our vector and assign values based on the
control-group number. Here, "floating" refers to the dimension of the vector.
This model generalizes better to the idea of a distance: we can say hotkey
setups where a given type of unit is mapped 1 key apart are closer than hotkey
setups where the same type of unit is mapped 4 keys apart. To compute a
similarity from this model, we again apply the Gaussian function described
previously. Note that the "roMAD-completeness" of this model depends on whether
we choose classes to be abstract such as "air units/ground units/buildings" or
race-specifc units.

For the sake of example, we define the first dimension as
Marine/Marauder/Medivac, the second as Viking/Banshee/Raven, the third as
Spellcasters, the fourth as Command Centers, the Fifth as ground production, the
Sixth as air production, and the Seventh as upgrades.
Example vector: [1 3 2 6 4 5 6] (unitless coordinates, but generalizes to
distance)


Note on the Gaussian function used:
The Gaussian function used has a range of (0, 1], and essentially operates
on the raw Euclidean distance of the vectors. Identical vectors have similarity
1, whereas very dissimilar vectors will have a similarity close to 0. For
experimental purposes, vroMAD also includes the ability to rank based on the raw
Euclidean distance in vroMAD. A high similarity corresponds to low Euclidean
distance, and vice versa.

***
6581
purakushi
Profile Joined August 2012
United States3301 Posts
Last Edited: 2013-09-19 22:48:04
September 19 2013 22:47 GMT
#2
Really neat stuff! Keep up the cool work~
I like reading what people do with their coding skills. :D
T P Z sagi
EatThePath
Profile Blog Joined September 2009
United States3943 Posts
September 20 2013 15:51 GMT
#3
thanks for the writeup
Comprehensive strategic intention: DNE
Please log in or register to reply.
Live Events Refresh
BSL 21
20:00
Playoffs - Day 1
Sziky vs StRyKeR
Hawk vs Dewalt
ZZZero.O281
LiquipediaDiscussion
[ Submit Event ]
Live Streams
Refresh
StarCraft 2
White-Ra 369
JuggernautJason145
IndyStarCraft 123
mouzStarbuck 70
Nathanias 69
ROOTCatZ 57
StarCraft: Brood War
Britney 15703
Shuttle 497
ZZZero.O 281
LaStScan 54
Shinee 15
Dota 2
Dendi1432
LuMiX1
Counter-Strike
fl0m8986
Super Smash Bros
hungrybox261
Heroes of the Storm
Khaldor279
Other Games
Grubby6386
DeMusliM728
Liquid`Hasu335
ToD86
Mew2King74
Models4
Organizations
Other Games
gamesdonequick1525
BasetradeTV62
StarCraft 2
angryscii 15
Blizzard YouTube
StarCraft: Brood War
BSLTrovo
sctven
[ Show 14 non-featured ]
StarCraft 2
• HeavenSC 18
• AfreecaTV YouTube
• intothetv
• Kozan
• IndyKCrew
• LaughNgamezSOOP
• Migwel
• sooper7s
StarCraft: Brood War
• BSLYoutube
• STPLYoutube
• ZZZeroYoutube
Other Games
• imaqtpie5283
• Shiphtur293
• tFFMrPink 19
Upcoming Events
RSL Revival
6h 32m
Classic vs Reynor
herO vs Zoun
WardiTV 2025
15h 2m
herO vs ShoWTimE
SHIN vs herO
Clem vs herO
SHIN vs Clem
SHIN vs ShoWTimE
Clem vs ShoWTimE
IPSL
19h 2m
Sziky vs JDConan
BSL 21
22h 2m
Tech vs Cross
Bonyth vs eOnzErG
Replay Cast
1d 11h
Wardi Open
1d 14h
Monday Night Weeklies
1d 19h
Sparkling Tuna Cup
2 days
Replay Cast
4 days
The PondCast
4 days
[ Show More ]
CranKy Ducklings
6 days
SC Evo League
6 days
BSL 21
6 days
Liquipedia Results

Completed

Acropolis #4 - TS3
RSL Revival: Season 3
Kuram Kup

Ongoing

IPSL Winter 2025-26
KCM Race Survival 2025 Season 4
YSL S2
BSL Season 21
Slon Tour Season 2
WardiTV 2025
RSL Offline Finals
META Madness #9
SL Budapest Major 2025
ESL Impact League Season 8
BLAST Rivals Fall 2025
IEM Chengdu 2025
PGL Masters Bucharest 2025
Thunderpick World Champ.
CS Asia Championships 2025
ESL Pro League S22

Upcoming

BSL 21 Non-Korean Championship
Acropolis #4
IPSL Spring 2026
Bellum Gens Elite Stara Zagora 2026
HSC XXVIII
Big Gabe Cup #3
PGL Cluj-Napoca 2026
IEM Kraków 2026
BLAST Bounty Winter 2026
BLAST Bounty Winter Qual
eXTREMESLAND 2025
TLPD

1. ByuN
2. TY
3. Dark
4. Solar
5. Stats
6. Nerchio
7. sOs
8. soO
9. INnoVation
10. Elazer
1. Rain
2. Flash
3. EffOrt
4. Last
5. Bisu
6. Soulkey
7. Mini
8. Sharp
Sidebar Settings...

Advertising | Privacy Policy | Terms Of Use | Contact Us

Original banner artwork: Jim Warren
The contents of this webpage are copyright © 2025 TLnet. All Rights Reserved.