• Log InLog In
  • Register
Liquid`
Team Liquid Liquipedia
EDT 17:34
CEST 23:34
KST 06:34
  • Home
  • Forum
  • Calendar
  • Streams
  • Liquipedia
  • Features
  • Store
  • EPT
  • TL+
  • StarCraft 2
  • Brood War
  • Smash
  • Heroes
  • Counter-Strike
  • Overwatch
  • Liquibet
  • Fantasy StarCraft
  • TLPD
  • StarCraft 2
  • Brood War
  • Blogs
Forum Sidebar
Events/Features
News
Featured News
Code S RO12 Preview: Maru, Trigger, Rogue, NightMare12Code S RO12 Preview: Cure, sOs, Reynor, Solar15[ASL19] Ro8 Preview: Unyielding3Official Ladder Map Pool Update (April 28, 2025)17[ASL19] Ro8 Preview: Rejuvenation8
Community News
Code S Season 1 - RO12 Group A Results (2025)4$1,250 WardiTV May [May 6th-May 18th]5Clem wins PiG Sty Festival #67Weekly Cups (April 28-May 4): ByuN & Astrea break through1Nexon wins bid to develop StarCraft IP content, distribute Overwatch mobile game29
StarCraft 2
General
Clem wins PiG Sty Festival #6 How does the number of casters affect your enjoyment of esports? Code S Season 1 - RO12 Group A Results (2025) Code S RO12 Preview: Maru, Trigger, Rogue, NightMare Nexon wins bid to develop StarCraft IP content, distribute Overwatch mobile game
Tourneys
[GSL 2025] Code S:Season 1 - RO12 - Group B [GSL 2025] Code S:Season 1 - RO12 - Group A $1,250 WardiTV May [May 6th-May 18th] SOOPer7s Showmatches 2025 Sea Duckling Open (Global, Bronze-Diamond)
Strategy
[G] PvT Cheese: 13 Gate Proxy Robo Simple Questions Simple Answers
Custom Maps
[UMS] Zillion Zerglings
External Content
Mutation # 472 Dead Heat Mutation # 471 Delivery Guaranteed Mutation # 470 Certain Demise Mutation # 469 Frostbite
Brood War
General
BW General Discussion BGH auto balance -> http://bghmmr.eu/ Recent recommended BW games Preserving Battlereports.com OGN to release AI-upscaled StarLeague from Feb 24
Tourneys
[BSL20] RO32 Group E - Sunday 20:00 CET [BSL20] RO32 Group F - Saturday 20:00 CET [ASL19] Ro8 Day 4 [CSLPRO] $1000 Spring is Here!
Strategy
[G] How to get started on ladder as a new Z player Creating a full chart of Zerg builds [G] Mineral Boosting
Other Games
General Games
Stormgate/Frost Giant Megathread What do you want from future RTS games? Nintendo Switch Thread Grand Theft Auto VI Battle Aces/David Kim RTS Megathread
Dota 2
Official 'what is Dota anymore' discussion
League of Legends
LiquidLegends to reintegrate into TL.net
Heroes of the Storm
Simple Questions, Simple Answers
Hearthstone
Heroes of StarCraft mini-set
TL Mafia
Vanilla Mini Mafia TL Mafia Community Thread TL Mafia Plays: Diplomacy TL Mafia: Generative Agents Showdown Survivor II: The Amazon
Community
General
Things Aren’t Peaceful in Palestine Elon Musk's lies, propaganda, etc. US Politics Mega-thread Ask and answer stupid questions here! Russo-Ukrainian War Thread
Fan Clubs
Serral Fan Club
Media & Entertainment
Movie Discussion! Anime Discussion Thread [Books] Wool by Hugh Howey Surprisingly good films/Hidden Gems
Sports
2024 - 2025 Football Thread NHL Playoffs 2024 NBA General Discussion Formula 1 Discussion
World Cup 2022
Tech Support
Computer Build, Upgrade & Buying Resource Thread Cleaning My Mechanical Keyboard How to clean a TTe Thermaltake keyboard?
TL Community
BLinD-RawR 50K Post Watch Party The Automated Ban List TL.net Ten Commandments
Blogs
Why 5v5 Games Keep Us Hooked…
TrAiDoS
Info SLEgma_12
SLEgma_12
SECOND COMMING
XenOsky
WombaT’s Old BW Terran Theme …
WombaT
Heero Yuy & the Tax…
KrillinFromwales
BW PvZ Balance hypothetic…
Vasoline73
Test Entry for subject
xumakis
Customize Sidebar...

Website Feedback

Closed Threads



Active: 12731 users

New Search Engine - Now Live - Page 2

Forum Index > General Forum
152 CommentsPost a Reply
Prev 1 2 3 4 5 6 7 8 Next All
R1CH
Profile Blog Joined May 2007
Netherlands10340 Posts
December 23 2009 07:54 GMT
#21
Spelling suggestions are something I've considered adding at a later date. Infix (partial) matching on a word is supported, but would substantially increase the index size. I want to see how it performs by default first.

And Mystlord, Badjas is correct. Updates right now takes less than a second, but will grow as more data since the last full index is added. I actually had to index everything four times today to get everything how I wanted it.
AdministratorTwitter: @R1CH_TL
Mystlord *
Profile Blog Joined July 2008
United States10264 Posts
December 23 2009 08:01 GMT
#22
On December 23 2009 16:54 R1CH wrote:
Spelling suggestions are something I've considered adding at a later date. Infix (partial) matching on a word is supported, but would substantially increase the index size. I want to see how it performs by default first.

And Mystlord, Badjas is correct. Updates right now takes less than a second, but will grow as more data since the last full index is added. I actually had to index everything four times today to get everything how I wanted it.

Ok then never mind . I don't think there are any major problems with the new search engine then. The edit thing might be a problem because we'd lose a lot of new material (I'm thinking updates to topics like Stylish FPVODs and Day[9] podcasts).

Regardless, if I find any bugs, then I'll report them here.
It is impossible to be a citizen if you don't make an effort to understand the most basic activities of your government. It is very difficult to thrive in an increasingly competitive world if you're a nation of doods.
Cambium
Profile Blog Joined June 2004
United States16368 Posts
December 23 2009 08:02 GMT
#23
That's pretty interesting. I've never heard of sphinx before.

I've worked with Lucene extensively in the past, and it has top-notch performance, and it can be coded to address all of your problems. Maybe you can have a look when you are really bored over the holidays. I'll definitely have a look at Sphinx

Good job re-writing the search engine though!

Regarding the non-edit problem, can't you delete contents from the indices, and then simply re-index edited contents?
When you want something, all the universe conspires in helping you to achieve it.
Cambium
Profile Blog Joined June 2004
United States16368 Posts
Last Edited: 2009-12-23 08:11:25
December 23 2009 08:09 GMT
#24
After reading some comments, I have another proposal for stale contents (this is actually what I did with Lucene)

Given that:
Building indices from scratch takes 30 minutes
Incremental updates take much less time

What we can do is keep two sets of indices, call them A and B. Find a time when TL is least busy (i.e. least stress on the server) and call this time T.

We incrementally update one set of indices, A, until time T; at time T, we completely remove the other set of indices, B, and rebuild it from scratch. While B is being built, we obviously still update A, and all search queries will be run against A. After B is complete, we "dump" A, and run all subsequent queries against B. And we switch B with A again at time T. This way, the indices will be 1-day stale in the worst case.

Is this feasible?
When you want something, all the universe conspires in helping you to achieve it.
R1CH
Profile Blog Joined May 2007
Netherlands10340 Posts
Last Edited: 2009-12-23 08:14:06
December 23 2009 08:10 GMT
#25
When I say Sphinx indexes can't be modified, I mean it . It is on their todo list for a more flexible index format though. See http://www.sphinxsearch.com/docs/manual-0.9.9.html#conf-mva-updates-pool

EDIT: TL doesn't really have an off-peak time, however the indexer can use a configurable IOPS, iosize and delay so if we want it to take 4 hours to update, it can. Refreshing the index every day like this is certainly possible, but I'd rather avoid it since even with very conservative settings, it's still going to suck up a lot of RAM and IO. Maybe a weekly update is doable.
AdministratorTwitter: @R1CH_TL
Cambium
Profile Blog Joined June 2004
United States16368 Posts
Last Edited: 2009-12-23 08:19:06
December 23 2009 08:12 GMT
#26
On December 23 2009 17:10 R1CH wrote:
When I say Sphinx indexes can't be modified, I mean it . It is on their todo list for a more flexible index format though. See http://www.sphinxsearch.com/docs/manual-0.9.9.html#conf-mva-updates-pool


Ah that sucks. Hopefully they'll implement that soon
When you want something, all the universe conspires in helping you to achieve it.
Cambium
Profile Blog Joined June 2004
United States16368 Posts
Last Edited: 2009-12-23 08:18:01
December 23 2009 08:16 GMT
#27
On December 23 2009 17:10 R1CH wrote:
EDIT: TL doesn't really have an off-peak time, however the indexer can use a configurable IOPS, iosize and delay so if we want it to take 4 hours to update, it can. Refreshing the index every day like this is certainly possible, but I'd rather avoid it since even with very conservative settings, it's still going to suck up a lot of RAM and IO. Maybe a weekly update is doable.


Ha!

edit:

lucene owns~! (maybe sphinx won't pick this up)
When you want something, all the universe conspires in helping you to achieve it.
R1CH
Profile Blog Joined May 2007
Netherlands10340 Posts
December 23 2009 08:19 GMT
#28
Seriously, we only have on-peak times .

[image loading]
AdministratorTwitter: @R1CH_TL
Cambium
Profile Blog Joined June 2004
United States16368 Posts
December 23 2009 08:21 GMT
#29
after squinting: weekly updates on sunday nights?
When you want something, all the universe conspires in helping you to achieve it.
writer22816
Profile Blog Joined September 2008
United States5775 Posts
December 23 2009 08:23 GMT
#30
Thanks a ton!
8/4/12 never forget, never forgive.
R1CH
Profile Blog Joined May 2007
Netherlands10340 Posts
December 23 2009 08:24 GMT
#31
Perhaps, I'll need to play around with the indexer options to see if the throttling is effective. A full rebuild would be good every now and then regardless to minimize the size of the delta index.
AdministratorTwitter: @R1CH_TL
Cambium
Profile Blog Joined June 2004
United States16368 Posts
December 23 2009 08:25 GMT
#32
Out of curiosity, how long did it take you to write the new indexing service?
When you want something, all the universe conspires in helping you to achieve it.
Cambium
Profile Blog Joined June 2004
United States16368 Posts
Last Edited: 2009-12-23 08:28:06
December 23 2009 08:27 GMT
#33
On December 23 2009 17:24 R1CH wrote:
Perhaps, I'll need to play around with the indexer options to see if the throttling is effective. A full rebuild would be good every now and then regardless to minimize the size of the delta index.


It would probably improve performance as well. Lucene definitely didn't like incremental indexing (and updates) as much as I'd liked.

Maybe suggest an "optimize" function to re-organize the existing index if such method doesn't exist yet?
When you want something, all the universe conspires in helping you to achieve it.
R1CH
Profile Blog Joined May 2007
Netherlands10340 Posts
December 23 2009 08:31 GMT
#34
That's one area where Sphinx excels, it can query any number of indexes, even on remote hosts with no performance penalty (other than the obvious aggregate index sizes). Most of the work setting up Sphinx was making sure all the charsets worked out properly (unicode and such) and setting up automatic updating / index rotation. The actual PHP code was quite simple since they provide a really good PHP API.
AdministratorTwitter: @R1CH_TL
Ilikestarcraft
Profile Blog Joined November 2004
Korea (South)17726 Posts
December 23 2009 08:44 GMT
#35
On December 23 2009 16:07 R1CH wrote:
Once a post is indexed, it's indexed. If a post is edited after the search engine has indexed it, any changes in the edit will not be searchable. Get your post right first time!

none of my posts are going to be searchable now

Thanks for the fix r1ch.
"Nana is a goddess. Or at very least, Nana is my goddess." - KazeHydra
Harem
Profile Joined November 2007
United States11390 Posts
December 23 2009 08:55 GMT
#36
On December 23 2009 17:44 Ilikestarcraft wrote:
Show nested quote +
On December 23 2009 16:07 R1CH wrote:
Once a post is indexed, it's indexed. If a post is edited after the search engine has indexed it, any changes in the edit will not be searchable. Get your post right first time!

none of my posts are going to be searchable now


-_________-

Anyways, fuck yes being able to search for 3 letter words now. Thanks R1ch. <33
Moderator。◕‿◕。
darktreb
Profile Joined May 2007
United States3016 Posts
December 23 2009 09:00 GMT
#37
On December 23 2009 16:23 R1CH wrote:
I really don't think it's as big an issue as you think, most edits are made within a few minutes of a post being posted and there's a low chance it would get indexed in that timeframe.


That's a really good point. Sounds good to me.
ShaLLoW[baY]
Profile Blog Joined January 2007
Canada12499 Posts
December 23 2009 09:06 GMT
#38
...I don't understand a single technical term in this thread. R1CH, you seem really smart. I'd love to be your Sphinx and suck the RAM out of your delta index
ALEXISONFIRE ARE FUCKING BACK (sAviOr for life)
H
Profile Blog Joined July 2007
New Zealand6138 Posts
December 23 2009 09:25 GMT
#39
R1CH why has SA not already done this
explain it to me in terms a dumb person would understand
[iHs]HCO | のヮの | pachi & plexa ownz | RIP _
NovaTheFeared
Profile Blog Joined October 2004
United States7212 Posts
December 23 2009 09:33 GMT
#40
Is the non updated index going to be a problem on live report topics where the OP gets edited many times over the course of hours of gameplay?
日本語が分かりますか
Prev 1 2 3 4 5 6 7 8 Next All
Please log in or register to reply.
Live Events Refresh
Circuito Brasileiro de…
19:00
A Decisão - Playoffs D1
CosmosSc2 512
CranKy Ducklings170
EnkiAlexander 71
davetesta23
Liquipedia
BSL Season 20
18:00
RO32 - Group F
WolFix vs ZZZero
Razz vs Zazu
ZZZero.O226
LiquipediaDiscussion
[ Submit Event ]
Live Streams
Refresh
StarCraft 2
CosmosSc2 512
ProTech106
Ketroc 81
StarCraft: Brood War
ZZZero.O 226
sSak 75
Movie 33
soO 25
Sexy 13
Dota 2
Dendi1926
LuMiX1
Counter-Strike
Stewie2K1410
Fnx 698
flusha423
Heroes of the Storm
Khaldor371
Other Games
summit1g12635
Grubby4487
FrodaN3033
B2W.Neo776
crisheroes386
mouzStarbuck360
NeuroSwarm111
ViBE37
Organizations
Other Games
gamesdonequick1941
StarCraft 2
ESL.tv139
angryscii 58
Other Games
BasetradeTV24
StarCraft 2
Blizzard YouTube
StarCraft: Brood War
BSLTrovo
sctven
[ Show 18 non-featured ]
StarCraft 2
• printf 92
• tFFMrPink 18
• IndyKCrew
• musti20045 0
• AfreecaTV YouTube
• intothetv
• Kozan
• sooper7s
• LaughNgamezSOOP
• Migwel
StarCraft: Brood War
• 3DClanTV 34
• STPLYoutube
• ZZZeroYoutube
• BSLYoutube
Dota 2
• Ler117
League of Legends
• Doublelift4871
Other Games
• Scarra1053
• Shiphtur337
Upcoming Events
Online Event
6h 26m
MaxPax vs herO
SHIN vs Cure
Clem vs MaxPax
ShoWTimE vs herO
ShoWTimE vs Clem
Sparkling Tuna Cup
12h 26m
WardiTV Invitational
13h 26m
AllThingsProtoss
13h 26m
uThermal 2v2 Circuit
16h 26m
Chat StarLeague
18h 26m
BSL Season 20
20h 26m
MadiNho vs dxtr13
Gypsy vs Dark
Circuito Brasileiro de…
21h 26m
Afreeca Starleague
1d 12h
BeSt vs Light
Wardi Open
1d 13h
[ Show More ]
PiGosaur Monday
2 days
Afreeca Starleague
2 days
Snow vs Soulkey
Replay Cast
2 days
WardiTV Invitational
2 days
Replay Cast
3 days
GSL Code S
3 days
ByuN vs Rogue
herO vs Cure
Replay Cast
4 days
GSL Code S
4 days
Classic vs Reynor
GuMiho vs Maru
The PondCast
4 days
RSL Revival
5 days
GSL Code S
5 days
Korean StarCraft League
6 days
RSL Revival
6 days
Liquipedia Results

Completed

BSL Nation Wars Season 2
PiG Sty Festival 6.0
Calamity Stars S2

Ongoing

JPL Season 2
ASL Season 19
YSL S1
BSL 2v2 Season 3
BSL Season 20
China & Korea Top Challenge
KCM Race Survival 2025 Season 2
CSLPRO Spring 2025
2025 GSL S1
Heroes 10 EU
PGL Astana 2025
Asian Champions League '25
ECL Season 49: Europe
BLAST Rivals Spring 2025
MESA Nomadic Masters
CCT Season 2 Global Finals
IEM Melbourne 2025
YaLLa Compass Qatar 2025
PGL Bucharest 2025
BLAST Open Spring 2025
ESL Pro League S21

Upcoming

NPSL S3
CSLPRO Last Chance 2025
CSLAN 2025
Esports World Cup 2025
HSC XXVII
Championship of Russia 2025
Bellum Gens Elite Stara Zagora 2025
2025 GSL S2
DreamHack Dallas 2025
IEM Cologne 2025
FISSURE Playground #1
BLAST.tv Austin Major 2025
ESL Impact League Season 7
IEM Dallas 2025
TLPD

1. ByuN
2. TY
3. Dark
4. Solar
5. Stats
6. Nerchio
7. sOs
8. soO
9. INnoVation
10. Elazer
1. Rain
2. Flash
3. EffOrt
4. Last
5. Bisu
6. Soulkey
7. Mini
8. Sharp
Sidebar Settings...

Advertising | Privacy Policy | Terms Of Use | Contact Us

Original banner artwork: Jim Warren
The contents of this webpage are copyright © 2025 TLnet. All Rights Reserved.