• Log InLog In
  • Register
Liquid`
Team Liquid Liquipedia
EDT 03:43
CEST 09:43
KST 16:43
  • Home
  • Forum
  • Calendar
  • Streams
  • Liquipedia
  • Features
  • Store
  • EPT
  • TL+
  • StarCraft 2
  • Brood War
  • Smash
  • Heroes
  • Counter-Strike
  • Overwatch
  • Liquibet
  • Fantasy StarCraft
  • TLPD
  • StarCraft 2
  • Brood War
  • Blogs
Forum Sidebar
Events/Features
News
Featured News
Team Liquid Map Contest #22 - The Finalists13[ASL21] Ro16 Preview Pt1: Fresh Flow9[ASL21] Ro24 Preview Pt2: News Flash10[ASL21] Ro24 Preview Pt1: New Chaos0Team Liquid Map Contest #22 - Presented by Monster Energy21
Community News
2026 GSL Season 1 Qualifiers11Maestros of the Game 2 announced32026 GSL Tour plans announced10Weekly Cups (April 6-12): herO doubles, "Villains" prevail1MaNa leaves Team Liquid20
StarCraft 2
General
https://www.facebook.com/QinuxFootRevitaReviews/ Team Liquid Map Contest #22 - The Finalists Weekly Cups (April 6-12): herO doubles, "Villains" prevail MaNa leaves Team Liquid Oliveira Would Have Returned If EWC Continued
Tourneys
2026 GSL Season 1 Qualifiers Sparkling Tuna Cup - Weekly Open Tournament Master Swan Open (Global Bronze-Master 2) SEL Doubles (SC Evo Bimonthly) $5,000 WardiTV TLMC tournament - Presented by Monster Energy
Strategy
Custom Maps
[D]RTS in all its shapes and glory <3 [A] Nemrods 1/4 players [M] (2) Frigid Storage
External Content
Mutation # 521 Memorable Boss The PondCast: SC2 News & Results Mutation # 520 Moving Fees Mutation # 519 Inner Power
Brood War
General
BGH Auto Balance -> http://bghmmr.eu/ Pros React To: Tulbo in Ro.16 Group A ASL21 General Discussion BW General Discussion [BSL22] RO32 Group Stage
Tourneys
[ASL21] Ro16 Group B Small VOD Thread 2.0 Korean KCM Race Survival 2026 Season 2 [BSL22] RO32 Group D - Sunday 21:00 CEST
Strategy
Simple Questions, Simple Answers What's the deal with APM & what's its true value Any training maps people recommend? Fighting Spirit mining rates
Other Games
General Games
Nintendo Switch Thread General RTS Discussion Thread Battle Aces/David Kim RTS Megathread Stormgate/Frost Giant Megathread Starcraft Tabletop Miniature Game
Dota 2
The Story of Wings Gaming Official 'what is Dota anymore' discussion
League of Legends
G2 just beat GenG in First stand
Heroes of the Storm
Simple Questions, Simple Answers Heroes of the Storm 2.0
Hearthstone
Deck construction bug Heroes of StarCraft mini-set
TL Mafia
Vanilla Mini Mafia Mafia Game Mode Feedback/Ideas TL Mafia Community Thread Five o'clock TL Mafia
Community
General
Things Aren’t Peaceful in Palestine US Politics Mega-thread Russo-Ukrainian War Thread YouTube Thread Canadian Politics Mega-thread
Fan Clubs
The IdrA Fan Club
Media & Entertainment
Anime Discussion Thread [Req][Books] Good Fantasy/SciFi books [Manga] One Piece Movie Discussion!
Sports
2024 - 2026 Football Thread McBoner: A hockey love story Formula 1 Discussion Cricket [SPORT]
World Cup 2022
Tech Support
[G] How to Block Livestream Ads
TL Community
The Automated Ban List
Blogs
Reappraising The Situation T…
TrAiDoS
lurker extra damage testi…
StaticNine
Broowar part 2
qwaykee
Funny Nicknames
LUCKY_NOOB
Iranian anarchists: organize…
XenOsky
ASL S21 English Commentary…
namkraft
Customize Sidebar...

Website Feedback

Closed Threads



Active: 2864 users

An interesting complex programming problem

Blogs > Qzy
Post a Reply
1 2 Next All
Qzy
Profile Blog Joined July 2010
Denmark1121 Posts
Last Edited: 2011-05-21 15:55:19
May 21 2011 15:44 GMT
#1
Hi programmers/math people.

Okay, here's the problem.
I have a hashmap with Strings as keys and values pointing to objects (as seen below in java)

HashMap<String, SomeObject>

The chars within a single string is element of the set {0, 1, #}. # is a wildcard which can represent either a 0 or 1.

When presented by a message, ie: 011010111 (a message's char is element of the set {0, 1}), the following strings are satisfied:
01#01#111
#1101011#
011010111
#########
etc., due to their wildcards.

Which look up/sorting method would you do, such that you have the fastest algorithm to store the strings and also find the strings which are satisfied?

Bruteforce
Complexity: Finding all satisfied strings: O(n*p) with n = population of strings, p = size of string.

Bruteforce works ofcourse:
for(all strings in hashmap)
is string satisfied? Save it
next string

Tree
Complexity: O(p*n), but very unlikely that all strings are found in ONE leaf. Constructing the tree O(2^p) (!HOLY FUCK!)

Keeping a tree which branches every time a wildcard apears in a string. Each leaf in the tree has a hashset, which looks like the one above. The string's SomeObject ie, 01#01#111 would be possible to find in 4 leafs of the tree:
010010111
010011111
011011111
011010111

The problem is constructing the tree... if the String is big like 20-30 chars, the construction is simply too big to be possible.

How would you do it?

*****
TG Sambo... Intel classic! Life of lively to live to life of full life thx to shield battery
Cube
Profile Blog Joined February 2008
Canada777 Posts
Last Edited: 2011-05-21 16:21:33
May 21 2011 16:03 GMT
#2
what I want to do is solve the problem by "folding" the strings into unique integers somehow, but i'm not sure it can be done.

edit: I really don't think I can help you, sorry.

edit2: what about making a new hashmap with no wildcards by replicating each string/object pairing 2^(num #s) times, then sorting the strings as integers. (big setup time, subsequent searches are O(lgn)).
Qzy
Profile Blog Joined July 2010
Denmark1121 Posts
Last Edited: 2011-05-21 16:29:24
May 21 2011 16:23 GMT
#3
On May 22 2011 01:03 Cube wrote:
what I want to do is solve the problem by "folding" the strings into unique integers somehow, but i'm not sure it can be done.

edit: I really don't think I can help you, sorry.


Might actually be a good idea.

Then it's an experiment of how much fold it required to sort it into serveral small hashmaps
Ignore what i wrote, I gotta think more about it.
TG Sambo... Intel classic! Life of lively to live to life of full life thx to shield battery
Famulus
Profile Joined April 2011
United States8 Posts
May 21 2011 16:33 GMT
#4
What about bruteforcing the other way. Assuming you only care about getting the correct object and not the actual string, make an entry in the hash table for every possible message for each string with a wildcard.
pullarius1
Profile Blog Joined May 2010
United States523 Posts
Last Edited: 2011-05-21 16:35:48
May 21 2011 16:34 GMT
#5
Just to clarify, there are no limits on the sizes or types of data, eg a string could be a million characters long and the population of acceptable strings could be arbitrarily large? Also, are we assuming that all strings we're working with are of the same length?

I guess what I'm really asking is whether the problem is for an actual project in real life or just a theoretical puzzle?
@pullarius1
Qzy
Profile Blog Joined July 2010
Denmark1121 Posts
May 21 2011 16:38 GMT
#6
On May 22 2011 01:33 Famulus wrote:
What about bruteforcing the other way. Assuming you only care about getting the correct object and not the actual string, make an entry in the hash table for every possible message for each string with a wildcard.


It would be a good idea, but every possible message is 2^(length of string), that's

length -> combinations
20 -> 1,048,576
40 -> 1,099,511,627,776 (in my case)

Not scalable :/.
TG Sambo... Intel classic! Life of lively to live to life of full life thx to shield battery
Qzy
Profile Blog Joined July 2010
Denmark1121 Posts
May 21 2011 16:40 GMT
#7
On May 22 2011 01:34 pullarius1 wrote:
Just to clarify, there are no limits on the sizes or types of data, eg a string could be a million characters long and the population of acceptable strings could be arbitrarily large? Also, are we assuming that all strings we're working with are of the same length?

I guess what I'm really asking is whether the problem is for an actual project in real life or just a theoretical puzzle?


Perfectly good questions. All strings are the same length, and so is the message that needs to be satisfied. It's for an XCS engine - I'll provide a paper in a sec.
TG Sambo... Intel classic! Life of lively to live to life of full life thx to shield battery
Qzy
Profile Blog Joined July 2010
Denmark1121 Posts
Last Edited: 2011-05-21 16:49:05
May 21 2011 16:43 GMT
#8
algorithmic description of XCS

It's an AI learning technique, based on "Learning classifier systems". You don't really need to read it to understand the problem though.

Bunch of strings with #10 in it, and a message with 10 which needs to find the strings that satisfies it.
TG Sambo... Intel classic! Life of lively to live to life of full life thx to shield battery
arioch
Profile Joined May 2010
England403 Posts
May 21 2011 17:15 GMT
#9
I am interested to see if someone comes up with an alternative to iteration for this as I parse huge data files on a daily basis for work.

I often find myself setting up foreach loops with regular expressions to loop through hashtables in perl, and always wondered if there was a more efficient way of doing it.
Mx.DeeP
Profile Joined February 2008
China25 Posts
May 21 2011 17:18 GMT
#10
If you're not worried about memory, you can just take the initial HashMap and convert it into a new HashMap<String, ArrayList<SomeObject>> where the key is only {0,1}. You just iterate through the original HashMap and convert all '#' into '0' and '1'. This is worst case O(2^p) for a String of all '#' for storing, but gives you O(1) look-up time.
Qzy
Profile Blog Joined July 2010
Denmark1121 Posts
Last Edited: 2011-05-21 17:33:34
May 21 2011 17:32 GMT
#11
On May 22 2011 02:18 Mx.DeeP wrote:
If you're not worried about memory, you can just take the initial HashMap and convert it into a new HashMap<String, ArrayList<SomeObject>> where the key is only {0,1}. You just iterate through the original HashMap and convert all '#' into '0' and '1'. This is worst case O(2^p) for a String of all '#' for storing, but gives you O(1) look-up time.


Exactly - that's the "tree" i talked about..

My message (in my problem) has 40 bits, that's 2^40 in construction of that tree... 1,099,511,627,776 nodes in it - would take too long :/.

I'm gonna go work out, and think about it. .
TG Sambo... Intel classic! Life of lively to live to life of full life thx to shield battery
pullarius1
Profile Blog Joined May 2010
United States523 Posts
Last Edited: 2011-05-21 17:34:14
May 21 2011 17:32 GMT
#12
I'm not sure how much you get to work with the lists beforehand, but obviously if you could sort the list of wild strings before hand it would help a lot. But it would be a waste of time if the number of strings you were checking for matches for were very low. That is, if you have 100 01# strings, but only needed to find the matches for a few 10 strings, sorting would probably hurt. But if you had 100 strings to match, it would probably be worth your while.

One thing I thought of that is probably not useful at all:
For each string, rehash it into integers in the following way-
For each placenumber i, assign the the 2i-th and (2i-1)th prime to it. If that place number holds a 1 one, choose the odd prime, a 0, choose the even prime, a # choose neither. Multiply all the chosen primes together.

For instance 10110 would be (2 or 3) (5 or 7) (11 or 13) (17 or 19) (23 or 29) 2*7*11*17*29 = 75,922

While #01#0 would be (2 or 3) (5 or 7) (11 or 13) (17 or 19) (23 or29) 7*11*29 = 2,233

The benefit of this system would be that wild strings would divide precisely the strings that satisfied them. For whatever that's worth.

...sometimes I wish I had taken some practical programming classes in school :-(.

@pullarius1
Cube
Profile Blog Joined February 2008
Canada777 Posts
May 21 2011 18:07 GMT
#13
On May 22 2011 02:32 pullarius1 wrote:
I'm not sure how much you get to work with the lists beforehand, but obviously if you could sort the list of wild strings before hand it would help a lot. But it would be a waste of time if the number of strings you were checking for matches for were very low. That is, if you have 100 01# strings, but only needed to find the matches for a few 10 strings, sorting would probably hurt. But if you had 100 strings to match, it would probably be worth your while.

One thing I thought of that is probably not useful at all:
For each string, rehash it into integers in the following way-
For each placenumber i, assign the the 2i-th and (2i-1)th prime to it. If that place number holds a 1 one, choose the odd prime, a 0, choose the even prime, a # choose neither. Multiply all the chosen primes together.

For instance 10110 would be (2 or 3) (5 or 7) (11 or 13) (17 or 19) (23 or 29) 2*7*11*17*29 = 75,922

While #01#0 would be (2 or 3) (5 or 7) (11 or 13) (17 or 19) (23 or29) 7*11*29 = 2,233

The benefit of this system would be that wild strings would divide precisely the strings that satisfied them. For whatever that's worth.

...sometimes I wish I had taken some practical programming classes in school :-(.



this is basically what I had in mind but as the string size grows arbitrarily large this becomes impractical. :[
Oracle
Profile Blog Joined May 2007
Canada411 Posts
Last Edited: 2011-05-21 19:20:00
May 21 2011 18:14 GMT
#14
On May 22 2011 02:32 pullarius1 wrote:
I'm not sure how much you get to work with the lists beforehand, but obviously if you could sort the list of wild strings before hand it would help a lot. But it would be a waste of time if the number of strings you were checking for matches for were very low. That is, if you have 100 01# strings, but only needed to find the matches for a few 10 strings, sorting would probably hurt. But if you had 100 strings to match, it would probably be worth your while.

One thing I thought of that is probably not useful at all:
For each string, rehash it into integers in the following way-
For each placenumber i, assign the the 2i-th and (2i-1)th prime to it. If that place number holds a 1 one, choose the odd prime, a 0, choose the even prime, a # choose neither. Multiply all the chosen primes together.

For instance 10110 would be (2 or 3) (5 or 7) (11 or 13) (17 or 19) (23 or 29) 2*7*11*17*29 = 75,922

While #01#0 would be (2 or 3) (5 or 7) (11 or 13) (17 or 19) (23 or29) 7*11*29 = 2,233

The benefit of this system would be that wild strings would divide precisely the strings that satisfied them. For whatever that's worth.

...sometimes I wish I had taken some practical programming classes in school :-(.



well thats actually a great solution, since if message modulo hashed-key = 0 then such an index satisfies the constraint.

So if you map every key by the hash function to this form, and store it in the next slot in the database, as well as with its object pointer, then simply do a linear search for message modulo hashed key = 0 on each element of the array

this circumvents directly hashing onto an array location since that hash function increases faster than n factorial
Qzy
Profile Blog Joined July 2010
Denmark1121 Posts
Last Edited: 2011-05-21 19:16:33
May 21 2011 19:02 GMT
#15
Okay, I gotta re-read it all, cos I'm a bit lost on this one.. .

Edit: okay, I read it! I need to write a sketch over it Might actually work with modulus it with the message.

Your only problem is if the String has 40 wildcards in it - then it takes a long time to write all the possible prime combinations (right?) ... or if you ignore wildcards, is 40x # = 0?

I'm gonna write an algorithm for this rly quick .
TG Sambo... Intel classic! Life of lively to live to life of full life thx to shield battery
Oracle
Profile Blog Joined May 2007
Canada411 Posts
May 21 2011 19:19 GMT
#16
So when you store an object by its key, create an array with the hashed version of the key (H_i) O(p) and its object pointer O(1). Then store both into the next available position in the dataset O(1).

When you're searching for keys which satisfy a certain message:
First hash the key O(p) = H_k.
Then do a linear check over all database entries such that H_k modulo H_i = 0 and return it. O(l)

p = length of string
l = length of dataset
Qzy
Profile Blog Joined July 2010
Denmark1121 Posts
Last Edited: 2011-05-21 19:34:53
May 21 2011 19:34 GMT
#17
The problem is then, what if the dataset is 5,000,000 strings? :/
TG Sambo... Intel classic! Life of lively to live to life of full life thx to shield battery
Oracle
Profile Blog Joined May 2007
Canada411 Posts
May 21 2011 19:40 GMT
#18
insertion is O(p)
extraction is O(p+l) in which l will probably dominate p so O(l)

which is still acceptable by any means (l = length of array, so linear time)
5,000,000 wouldn't take an enormous amount of time (in fact 5,000,000 is actually really fast to compute)
Qzy
Profile Blog Joined July 2010
Denmark1121 Posts
May 21 2011 19:42 GMT
#19
I'm thinking it might be possible to speed up look up.. Perhaps with tree-search, or other sorting methods.. Ofcourse this would kill the insertion-time.
TG Sambo... Intel classic! Life of lively to live to life of full life thx to shield battery
haxorz
Profile Blog Joined June 2009
United States138 Posts
May 21 2011 19:50 GMT
#20
^ Yes, it is. I've been thinking about this for the past hour or so and have coded up a working implementation in Java. I'll PM you once I write more tests.
And theres the GG.
1 2 Next All
Please log in or register to reply.
Live Events Refresh
Next event in 2h 17m
[ Submit Event ]
Live Streams
Refresh
StarCraft: Brood War
Sharp 1398
Stork 415
Leta 359
Tasteless 252
Hm[arnc] 145
soO 64
ggaemo 40
yabsab 20
Sacsri 14
Backho 12
[ Show more ]
Icarus 12
ZergMaN 8
Dota 2
XaKoH 403
ODPixel103
NeuroSwarm93
League of Legends
JimRising 607
Counter-Strike
Stewie2K1086
shoxiejesuss395
allub59
Super Smash Bros
Mew2King89
Heroes of the Storm
Trikslyr27
Other Games
summit1g8686
singsing625
ceh9546
C9.Mang0400
crisheroes116
Organizations
Counter-Strike
PGL113
StarCraft: Brood War
UltimateBattle 64
StarCraft 2
Blizzard YouTube
StarCraft: Brood War
BSLTrovo
sctven
[ Show 12 non-featured ]
StarCraft 2
• Berry_CruncH299
• AfreecaTV YouTube
• intothetv
• Kozan
• IndyKCrew
• LaughNgamezSOOP
• Migwel
• sooper7s
StarCraft: Brood War
• BSLYoutube
• STPLYoutube
• ZZZeroYoutube
League of Legends
• Rush1198
Upcoming Events
Escore
2h 17m
WardiTV Map Contest Tou…
3h 17m
OSC
7h 17m
Big Brain Bouts
8h 17m
MaNa vs goblin
Scarlett vs Spirit
Serral vs herO
Korean StarCraft League
19h 17m
CranKy Ducklings
1d 2h
WardiTV Map Contest Tou…
1d 3h
IPSL
1d 8h
WolFix vs nOmaD
dxtr13 vs Razz
BSL
1d 11h
UltrA vs KwarK
Gosudark vs cavapoo
dxtr13 vs HBO
Doodle vs Razz
CranKy Ducklings
1d 16h
[ Show More ]
Sparkling Tuna Cup
2 days
WardiTV Map Contest Tou…
2 days
Ladder Legends
2 days
BSL
2 days
StRyKeR vs rasowy
Artosis vs Aether
JDConan vs OyAji
Hawk vs izu
IPSL
2 days
JDConan vs TBD
Aegong vs rasowy
Replay Cast
3 days
Wardi Open
3 days
Afreeca Starleague
3 days
Bisu vs Ample
Jaedong vs Flash
Monday Night Weeklies
3 days
RSL Revival
3 days
Afreeca Starleague
4 days
Barracks vs Leta
Royal vs Light
WardiTV Map Contest Tou…
4 days
RSL Revival
5 days
Replay Cast
5 days
The PondCast
6 days
WardiTV Map Contest Tou…
6 days
Replay Cast
6 days
Liquipedia Results

Completed

Proleague 2026-04-15
RSL Revival: Season 4
NationLESS Cup

Ongoing

BSL Season 22
ASL Season 21
CSL 2026 SPRING (S20)
IPSL Spring 2026
KCM Race Survival 2026 Season 2
Escore Tournament S2: W3
StarCraft2 Community Team League 2026 Spring
WardiTV TLMC #16
Nations Cup 2026
IEM Rio 2026
PGL Bucharest 2026
Stake Ranked Episode 1
BLAST Open Spring 2026
ESL Pro League S23 Finals
ESL Pro League S23 Stage 1&2
PGL Cluj-Napoca 2026
IEM Kraków 2026

Upcoming

Escore Tournament S2: W4
Acropolis #4
BSL 22 Non-Korean Championship
CSLAN 4
Kung Fu Cup 2026 Grand Finals
HSC XXIX
uThermal 2v2 2026 Main Event
2026 GSL S2
RSL Revival: Season 5
2026 GSL S1
XSE Pro League 2026
IEM Cologne Major 2026
Stake Ranked Episode 2
CS Asia Championships 2026
IEM Atlanta 2026
Asian Champions League 2026
PGL Astana 2026
BLAST Rivals Spring 2026
TLPD

1. ByuN
2. TY
3. Dark
4. Solar
5. Stats
6. Nerchio
7. sOs
8. soO
9. INnoVation
10. Elazer
1. Rain
2. Flash
3. EffOrt
4. Last
5. Bisu
6. Soulkey
7. Mini
8. Sharp
Sidebar Settings...

Advertising | Privacy Policy | Terms Of Use | Contact Us

Original banner artwork: Jim Warren
The contents of this webpage are copyright © 2026 TLnet. All Rights Reserved.