New Search Engine - Now Live - Page 2

R1CH
Profile Blog Joined May 2007
Netherlands10342 Posts
December 23 2009 07:54 GMT
#21
Spelling suggestions are something I've considered adding at a later date. Infix (partial) matching on a word is supported, but would substantially increase the index size. I want to see how it performs by default first.

And Mystlord, Badjas is correct. Updates right now take less than a second, but that will grow as more data is added since the last full index. I actually had to index everything four times today to get everything how I wanted it.
Administrator
Twitter: @R1CH_TL
TL+ Member
Mystlord *
Profile Blog Joined July 2008
United States10264 Posts
December 23 2009 08:01 GMT
#22
On December 23 2009 16:54 R1CH wrote:
Spelling suggestions are something I've considered adding at a later date. Infix (partial) matching on a word is supported, but would substantially increase the index size. I want to see how it performs by default first.

And Mystlord, Badjas is correct. Updates right now take less than a second, but that will grow as more data is added since the last full index. I actually had to index everything four times today to get everything how I wanted it.

Ok then, never mind. I don't think there are any major problems with the new search engine then. The edit thing might be a problem because we'd lose a lot of new material (I'm thinking updates to topics like Stylish FPVODs and Day[9] podcasts).

Regardless, if I find any bugs, then I'll report them here.
It is impossible to be a citizen if you don't make an effort to understand the most basic activities of your government. It is very difficult to thrive in an increasingly competitive world if you're a nation of doods.
Cambium
Profile Blog Joined June 2004
United States16368 Posts
December 23 2009 08:02 GMT
#23
That's pretty interesting. I've never heard of Sphinx before.

I've worked with Lucene extensively in the past. It has top-notch performance, and it can be coded to address all of your problems. Maybe you can have a look when you are really bored over the holidays. I'll definitely have a look at Sphinx.

Good job re-writing the search engine though!

Regarding the non-edit problem, can't you delete the edited posts from the indices and then simply re-index them?
When you want something, all the universe conspires in helping you to achieve it.
Cambium
Profile Blog Joined June 2004
United States16368 Posts
Last Edited: 2009-12-23 08:11:25
December 23 2009 08:09 GMT
#24
After reading some comments, I have another proposal for stale content (this is actually what I did with Lucene).

Given that:
Building indices from scratch takes 30 minutes
Incremental updates take much less time

What we can do is keep two sets of indices, call them A and B. Find a time when TL is least busy (i.e. least stress on the server) and call this time T.

We incrementally update one set of indices, A, until time T. At time T, we completely remove the other set, B, and rebuild it from scratch. While B is being built, we still update A, and all search queries run against A. Once B is complete, we "dump" A and run all subsequent queries against B. At the next time T, the roles swap and A is the one rebuilt. This way, the indices will be at most one day stale.

Is this feasible?
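
A rough sketch of the A/B idea above, in Python rather than Sphinx or Lucene. Everything here is hypothetical: `Index`, `RotatingSearch` and the in-memory `posts` dict are stand-ins for illustration only. The rebuild is synchronous in this toy, so it skips the part where A keeps receiving updates while B is being built.

```python
from dataclasses import dataclass, field


@dataclass
class Index:
    """Toy in-memory index (post id -> text); a stand-in for a real search index."""
    docs: dict = field(default_factory=dict)

    def add(self, post_id, text):
        self.docs[post_id] = text

    def search(self, word):
        # Whole-word match only, returned in post id order.
        return sorted(pid for pid, text in self.docs.items() if word in text.split())


class RotatingSearch:
    """Serves queries from a live index and swaps in a fresh rebuild at time T."""

    def __init__(self, posts):
        self.posts = posts         # source of truth, e.g. the forum database
        self.live = self._build()  # the set of indices that answers queries

    def _build(self):
        # Full build from scratch: reads the current text of every post.
        index = Index()
        for post_id, text in self.posts.items():
            index.add(post_id, text)
        return index

    def new_post(self, post_id, text):
        self.posts[post_id] = text
        self.live.add(post_id, text)  # cheap incremental update

    def edit_post(self, post_id, text):
        # Assumption: indexed posts cannot be modified in place,
        # so the live index keeps the old text until the next rotation.
        self.posts[post_id] = text

    def rotate(self):
        # Time T: rebuild the standby set, then switch queries over to it.
        standby = self._build()
        self.live = standby

    def search(self, word):
        return self.live.search(word)
```

An edit only becomes searchable after `rotate()` runs, which is where the "one day stale in the worst case" bound comes from if T comes around daily.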
When you want something, all the universe conspires in helping you to achieve it.
R1CH
Profile Blog Joined May 2007
Netherlands10342 Posts
Last Edited: 2009-12-23 08:14:06
December 23 2009 08:10 GMT
#25
When I say Sphinx indexes can't be modified, I mean it. A more flexible index format is on their todo list, though. See http://www.sphinxsearch.com/docs/manual-0.9.9.html#conf-mva-updates-pool

EDIT: TL doesn't really have an off-peak time. However, the indexer can be throttled with configurable IOPS, I/O size and delay settings, so if we want an update to take 4 hours, it can. Refreshing the index every day like this is certainly possible, but I'd rather avoid it: even with very conservative settings, it's still going to suck up a lot of RAM and IO. Maybe a weekly update is doable.
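
To illustrate what throttling by IOPS and I/O size means, here is a minimal Python sketch. It is not how Sphinx implements it; `IoThrottle`, `read_throttled`, `max_iops` and `max_iosize` are illustrative names, and the clock and sleep functions are injectable only so the behaviour can be checked without actually waiting.

```python
import time


class IoThrottle:
    """Caps operations per second by sleeping between them."""

    def __init__(self, max_iops, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = 1.0 / max_iops  # minimum gap between two operations
        self.clock = clock
        self.sleep = sleep
        self.next_allowed = None            # earliest time the next op may start

    def wait(self):
        now = self.clock()
        if self.next_allowed is not None and now < self.next_allowed:
            self.sleep(self.next_allowed - now)
            now = self.next_allowed
        self.next_allowed = now + self.min_interval


def read_throttled(data, max_iosize, throttle):
    """Yield data in chunks of at most max_iosize bytes, one throttled op per chunk."""
    for offset in range(0, len(data), max_iosize):
        throttle.wait()
        yield data[offset:offset + max_iosize]
```

Lowering `max_iops` or `max_iosize` stretches the same amount of work over a longer wall-clock time, which is the trade-off described above: a slower rebuild in exchange for less I/O pressure at any given moment.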
Administrator
Twitter: @R1CH_TL
TL+ Member
Cambium
Profile Blog Joined June 2004
United States16368 Posts
Last Edited: 2009-12-23 08:19:06
December 23 2009 08:12 GMT
#26
On December 23 2009 17:10 R1CH wrote:
When I say Sphinx indexes can't be modified, I mean it. A more flexible index format is on their todo list, though. See http://www.sphinxsearch.com/docs/manual-0.9.9.html#conf-mva-updates-pool


Ah that sucks. Hopefully they'll implement that soon
When you want something, all the universe conspires in helping you to achieve it.
Cambium
Profile Blog Joined June 2004
United States16368 Posts
Last Edited: 2009-12-23 08:18:01
December 23 2009 08:16 GMT
#27
On December 23 2009 17:10 R1CH wrote:
EDIT: TL doesn't really have an off-peak time. However, the indexer can be throttled with configurable IOPS, I/O size and delay settings, so if we want an update to take 4 hours, it can. Refreshing the index every day like this is certainly possible, but I'd rather avoid it: even with very conservative settings, it's still going to suck up a lot of RAM and IO. Maybe a weekly update is doable.


Ha!

edit:

lucene owns~! (maybe sphinx won't pick this up)
When you want something, all the universe conspires in helping you to achieve it.
R1CH
Profile Blog Joined May 2007
Netherlands10342 Posts
December 23 2009 08:19 GMT
#28
Seriously, we only have on-peak times.

[image loading]
Administrator
Twitter: @R1CH_TL
TL+ Member
Cambium
Profile Blog Joined June 2004
United States16368 Posts
December 23 2009 08:21 GMT
#29
after squinting: weekly updates on sunday nights?
When you want something, all the universe conspires in helping you to achieve it.
writer22816
Profile Blog Joined September 2008
United States5775 Posts
December 23 2009 08:23 GMT
#30
Thanks a ton!
8/4/12 never forget, never forgive.
R1CH
Profile Blog Joined May 2007
Netherlands10342 Posts
December 23 2009 08:24 GMT
#31
Perhaps. I'll need to play around with the indexer options to see if the throttling is effective. A full rebuild would be good every now and then regardless, to minimize the size of the delta index.
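
For readers unfamiliar with the main + delta arrangement, a small Python sketch of the general idea follows. It is a toy under stated assumptions: `MainDeltaSearch` and its methods are made-up names, and plain dicts stand in for the real indexes.

```python
class MainDeltaSearch:
    """Large main index built rarely, plus a small delta holding newer posts."""

    def __init__(self):
        self.main = {}   # post id -> text, as of the last full rebuild
        self.delta = {}  # post id -> text, posts added since that rebuild

    def index_new(self, post_id, text):
        # Cheap update: only the small delta is touched.
        self.delta[post_id] = text

    def search(self, word):
        # Query both indexes and merge the hits.
        hits = set()
        for index in (self.main, self.delta):
            hits.update(pid for pid, text in index.items() if word in text.split())
        return sorted(hits)

    def full_rebuild(self, all_posts):
        # Fold everything into main and start over with an empty delta.
        self.main = dict(all_posts)
        self.delta = {}
```

The delta grows with every post until the next `full_rebuild`, which is why an occasional full rebuild keeps the incremental updates fast.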
Administrator
Twitter: @R1CH_TL
TL+ Member
Cambium
Profile Blog Joined June 2004
United States16368 Posts
December 23 2009 08:25 GMT
#32
Out of curiosity, how long did it take you to write the new indexing service?
When you want something, all the universe conspires in helping you to achieve it.
Cambium
Profile Blog Joined June 2004
United States16368 Posts
Last Edited: 2009-12-23 08:28:06
December 23 2009 08:27 GMT
#33
On December 23 2009 17:24 R1CH wrote:
Perhaps. I'll need to play around with the indexer options to see if the throttling is effective. A full rebuild would be good every now and then regardless, to minimize the size of the delta index.


It would probably improve performance as well. Lucene definitely didn't like incremental indexing (and updates) as much as I'd have liked.

Maybe suggest an "optimize" function to re-organize the existing index, if such a method doesn't exist yet?
When you want something, all the universe conspires in helping you to achieve it.
R1CH
Profile Blog Joined May 2007
Netherlands10342 Posts
December 23 2009 08:31 GMT
#34
That's one area where Sphinx excels: it can query any number of indexes, even on remote hosts, with no performance penalty (other than the obvious aggregate index sizes). Most of the work setting up Sphinx was making sure all the charsets worked out properly (Unicode and such) and setting up automatic updating / index rotation. The actual PHP code was quite simple since they provide a really good PHP API.
Administrator
Twitter: @R1CH_TL
TL+ Member
Ilikestarcraft
Profile Blog Joined November 2004
Korea (South)17735 Posts
December 23 2009 08:44 GMT
#35
On December 23 2009 16:07 R1CH wrote:
Once a post is indexed, it's indexed. If a post is edited after the search engine has indexed it, any changes in the edit will not be searchable. Get your post right first time!

none of my posts are going to be searchable now

Thanks for the fix r1ch.
ils
"Nana is a goddess. Or at very least, Nana is my goddess." - KazeHydra
Harem
Profile Joined November 2007
United States11393 Posts
December 23 2009 08:55 GMT
#36
On December 23 2009 17:44 Ilikestarcraft wrote:
On December 23 2009 16:07 R1CH wrote:
Once a post is indexed, it's indexed. If a post is edited after the search engine has indexed it, any changes in the edit will not be searchable. Get your post right first time!

none of my posts are going to be searchable now


-_________-

Anyways, fuck yes being able to search for 3 letter words now. Thanks R1ch. <33
Moderator。◕‿◕。
darktreb
Profile Joined May 2007
United States3017 Posts
December 23 2009 09:00 GMT
#37
On December 23 2009 16:23 R1CH wrote:
I really don't think it's as big an issue as you think, most edits are made within a few minutes of a post being posted and there's a low chance it would get indexed in that timeframe.


That's a really good point. Sounds good to me.
ShaLLoW[baY]
Profile Blog Joined January 2007
Canada12499 Posts
December 23 2009 09:06 GMT
#38
...I don't understand a single technical term in this thread. R1CH, you seem really smart. I'd love to be your Sphinx and suck the RAM out of your delta index
ALEXISONFIRE ARE FUCKING BACK (sAviOr for life)
H
Profile Blog Joined July 2007
New Zealand6138 Posts
December 23 2009 09:25 GMT
#39
R1CH why has SA not already done this
explain it to me in terms a dumb person would understand
[iHs]HCO | のヮの | pachi & plexa ownz | RIP _
NovaTheFeared
Profile Blog Joined October 2004
United States7232 Posts
December 23 2009 09:33 GMT
#40
Is the non-updated index going to be a problem on live report topics, where the OP gets edited many times over the course of hours of gameplay?
日本語が分かりますか