• Log InLog In
  • Register
Liquid`
Team Liquid Liquipedia
EST 18:39
CET 00:39
KST 08:39
  • Home
  • Forum
  • Calendar
  • Streams
  • Liquipedia
  • Features
  • Store
  • EPT
  • TL+
  • StarCraft 2
  • Brood War
  • Smash
  • Heroes
  • Counter-Strike
  • Overwatch
  • Liquibet
  • Fantasy StarCraft
  • TLPD
  • StarCraft 2
  • Brood War
  • Blogs
Forum Sidebar
Events/Features
News
Featured News
TL.net Map Contest #21: Winners11Intel X Team Liquid Seoul event: Showmatches and Meet the Pros10[ASL20] Finals Preview: Arrival13TL.net Map Contest #21: Voting12[ASL20] Ro4 Preview: Descent11
Community News
[TLMC] Fall/Winter 2025 Ladder Map Rotation8Weekly Cups (Nov 3-9): Clem Conquers in Canada4SC: Evo Complete - Ranked Ladder OPEN ALPHA8StarCraft, SC2, HotS, WC3, Returning to Blizzcon!45$5,000+ WardiTV 2025 Championship7
StarCraft 2
General
[TLMC] Fall/Winter 2025 Ladder Map Rotation Mech is the composition that needs teleportation t Weekly Cups (Nov 3-9): Clem Conquers in Canada Craziest Micro Moments Of All Time? SC: Evo Complete - Ranked Ladder OPEN ALPHA
Tourneys
Tenacious Turtle Tussle RSL S3 Round of 16 Master Swan Open (Global Bronze-Master 2) Constellation Cup - Main Event - Stellar Fest Sparkling Tuna Cup - Weekly Open Tournament
Strategy
Custom Maps
Map Editor closed ?
External Content
Mutation # 499 Chilling Adaptation Mutation # 498 Wheel of Misfortune|Cradle of Death Mutation # 497 Battle Haredened Mutation # 496 Endless Infection
Brood War
General
[ASL20] Ask the mapmakers — Drop your questions BW General Discussion FlaSh on: Biggest Problem With SnOw's Playstyle Terran 1:35 12 Gas Optimization BGH Auto Balance -> http://bghmmr.eu/
Tourneys
[Megathread] Daily Proleagues [BSL21] RO32 Group D - Sunday 21:00 CET [BSL21] RO32 Group C - Saturday 21:00 CET [ASL20] Grand Finals
Strategy
Current Meta PvZ map balance How to stay on top of macro? Soma's 9 hatch build from ASL Game 2
Other Games
General Games
Nintendo Switch Thread Stormgate/Frost Giant Megathread EVE Corporation Should offensive tower rushing be viable in RTS games? Path of Exile
Dota 2
Official 'what is Dota anymore' discussion
League of Legends
Heroes of the Storm
Simple Questions, Simple Answers Heroes of the Storm 2.0
Hearthstone
Deck construction bug Heroes of StarCraft mini-set
TL Mafia
TL Mafia Community Thread SPIRED by.ASL Mafia {211640}
Community
General
Things Aren’t Peaceful in Palestine US Politics Mega-thread Russo-Ukrainian War Thread Canadian Politics Mega-thread The Games Industry And ATVI
Fan Clubs
White-Ra Fan Club The herO Fan Club!
Media & Entertainment
[Manga] One Piece Anime Discussion Thread Movie Discussion! Korean Music Discussion Series you have seen recently...
Sports
2024 - 2026 Football Thread Formula 1 Discussion NBA General Discussion MLB/Baseball 2023 TeamLiquid Health and Fitness Initiative For 2023
World Cup 2022
Tech Support
SC2 Client Relocalization [Change SC2 Language] Linksys AE2500 USB WIFI keeps disconnecting Computer Build, Upgrade & Buying Resource Thread
TL Community
The Automated Ban List
Blogs
Dyadica Gospel – a Pulp No…
Hildegard
Coffee x Performance in Espo…
TrAiDoS
Saturation point
Uldridge
DnB/metal remix FFO Mick Go…
ImbaTosS
Reality "theory" prov…
perfectspheres
Customize Sidebar...

Website Feedback

Closed Threads



Active: 1577 users

New Search Engine - Now Live - Page 2

Forum Index > General Forum
152 CommentsPost a Reply
Prev 1 2 3 4 5 6 7 8 Next All
R1CH
Profile Blog Joined May 2007
Netherlands10341 Posts
December 23 2009 07:54 GMT
#21
Spelling suggestions are something I've considered adding at a later date. Infix (partial) matching on a word is supported, but would substantially increase the index size. I want to see how it performs by default first.

And Mystlord, Badjas is correct. Updates right now takes less than a second, but will grow as more data since the last full index is added. I actually had to index everything four times today to get everything how I wanted it.
AdministratorTwitter: @R1CH_TL
TL+ Member
Mystlord *
Profile Blog Joined July 2008
United States10264 Posts
December 23 2009 08:01 GMT
#22
On December 23 2009 16:54 R1CH wrote:
Spelling suggestions are something I've considered adding at a later date. Infix (partial) matching on a word is supported, but would substantially increase the index size. I want to see how it performs by default first.

And Mystlord, Badjas is correct. Updates right now takes less than a second, but will grow as more data since the last full index is added. I actually had to index everything four times today to get everything how I wanted it.

Ok then never mind . I don't think there are any major problems with the new search engine then. The edit thing might be a problem because we'd lose a lot of new material (I'm thinking updates to topics like Stylish FPVODs and Day[9] podcasts).

Regardless, if I find any bugs, then I'll report them here.
It is impossible to be a citizen if you don't make an effort to understand the most basic activities of your government. It is very difficult to thrive in an increasingly competitive world if you're a nation of doods.
Cambium
Profile Blog Joined June 2004
United States16368 Posts
December 23 2009 08:02 GMT
#23
That's pretty interesting. I've never heard of sphinx before.

I've worked with Lucene extensively in the past, and it has top-notch performance, and it can be coded to address all of your problems. Maybe you can have a look when you are really bored over the holidays. I'll definitely have a look at Sphinx

Good job re-writing the search engine though!

Regarding the non-edit problem, can't you delete contents from the indices, and then simply re-index edited contents?
When you want something, all the universe conspires in helping you to achieve it.
Cambium
Profile Blog Joined June 2004
United States16368 Posts
Last Edited: 2009-12-23 08:11:25
December 23 2009 08:09 GMT
#24
After reading some comments, I have another proposal for stale contents (this is actually what I did with Lucene)

Given that:
Building indices from scratch takes 30 minutes
Incremental updates take much less time

What we can do is keep two sets of indices, call them A and B. Find a time when TL is least busy (i.e. least stress on the server) and call this time T.

We incrementally update one set of indices, A, until time T; at time T, we completely remove the other set of indices, B, and rebuild it from scratch. While B is being built, we obviously still update A, and all search queries will be run against A. After B is complete, we "dump" A, and run all subsequent queries against B. And we switch B with A again at time T. This way, the indices will be 1-day stale in the worst case.

Is this feasible?
When you want something, all the universe conspires in helping you to achieve it.
R1CH
Profile Blog Joined May 2007
Netherlands10341 Posts
Last Edited: 2009-12-23 08:14:06
December 23 2009 08:10 GMT
#25
When I say Sphinx indexes can't be modified, I mean it . It is on their todo list for a more flexible index format though. See http://www.sphinxsearch.com/docs/manual-0.9.9.html#conf-mva-updates-pool

EDIT: TL doesn't really have an off-peak time, however the indexer can use a configurable IOPS, iosize and delay so if we want it to take 4 hours to update, it can. Refreshing the index every day like this is certainly possible, but I'd rather avoid it since even with very conservative settings, it's still going to suck up a lot of RAM and IO. Maybe a weekly update is doable.
AdministratorTwitter: @R1CH_TL
TL+ Member
Cambium
Profile Blog Joined June 2004
United States16368 Posts
Last Edited: 2009-12-23 08:19:06
December 23 2009 08:12 GMT
#26
On December 23 2009 17:10 R1CH wrote:
When I say Sphinx indexes can't be modified, I mean it . It is on their todo list for a more flexible index format though. See http://www.sphinxsearch.com/docs/manual-0.9.9.html#conf-mva-updates-pool


Ah that sucks. Hopefully they'll implement that soon
When you want something, all the universe conspires in helping you to achieve it.
Cambium
Profile Blog Joined June 2004
United States16368 Posts
Last Edited: 2009-12-23 08:18:01
December 23 2009 08:16 GMT
#27
On December 23 2009 17:10 R1CH wrote:
EDIT: TL doesn't really have an off-peak time, however the indexer can use a configurable IOPS, iosize and delay so if we want it to take 4 hours to update, it can. Refreshing the index every day like this is certainly possible, but I'd rather avoid it since even with very conservative settings, it's still going to suck up a lot of RAM and IO. Maybe a weekly update is doable.


Ha!

edit:

lucene owns~! (maybe sphinx won't pick this up)
When you want something, all the universe conspires in helping you to achieve it.
R1CH
Profile Blog Joined May 2007
Netherlands10341 Posts
December 23 2009 08:19 GMT
#28
Seriously, we only have on-peak times .

[image loading]
AdministratorTwitter: @R1CH_TL
TL+ Member
Cambium
Profile Blog Joined June 2004
United States16368 Posts
December 23 2009 08:21 GMT
#29
after squinting: weekly updates on sunday nights?
When you want something, all the universe conspires in helping you to achieve it.
writer22816
Profile Blog Joined September 2008
United States5775 Posts
December 23 2009 08:23 GMT
#30
Thanks a ton!
8/4/12 never forget, never forgive.
R1CH
Profile Blog Joined May 2007
Netherlands10341 Posts
December 23 2009 08:24 GMT
#31
Perhaps, I'll need to play around with the indexer options to see if the throttling is effective. A full rebuild would be good every now and then regardless to minimize the size of the delta index.
AdministratorTwitter: @R1CH_TL
TL+ Member
Cambium
Profile Blog Joined June 2004
United States16368 Posts
December 23 2009 08:25 GMT
#32
Out of curiosity, how long did it take you to write the new indexing service?
When you want something, all the universe conspires in helping you to achieve it.
Cambium
Profile Blog Joined June 2004
United States16368 Posts
Last Edited: 2009-12-23 08:28:06
December 23 2009 08:27 GMT
#33
On December 23 2009 17:24 R1CH wrote:
Perhaps, I'll need to play around with the indexer options to see if the throttling is effective. A full rebuild would be good every now and then regardless to minimize the size of the delta index.


It would probably improve performance as well. Lucene definitely didn't like incremental indexing (and updates) as much as I'd liked.

Maybe suggest an "optimize" function to re-organize the existing index if such method doesn't exist yet?
When you want something, all the universe conspires in helping you to achieve it.
R1CH
Profile Blog Joined May 2007
Netherlands10341 Posts
December 23 2009 08:31 GMT
#34
That's one area where Sphinx excels, it can query any number of indexes, even on remote hosts with no performance penalty (other than the obvious aggregate index sizes). Most of the work setting up Sphinx was making sure all the charsets worked out properly (unicode and such) and setting up automatic updating / index rotation. The actual PHP code was quite simple since they provide a really good PHP API.
AdministratorTwitter: @R1CH_TL
TL+ Member
Ilikestarcraft
Profile Blog Joined November 2004
Korea (South)17731 Posts
December 23 2009 08:44 GMT
#35
On December 23 2009 16:07 R1CH wrote:
Once a post is indexed, it's indexed. If a post is edited after the search engine has indexed it, any changes in the edit will not be searchable. Get your post right first time!

none of my posts are going to be searchable now

Thanks for the fix r1ch.
"Nana is a goddess. Or at very least, Nana is my goddess." - KazeHydra
Harem
Profile Joined November 2007
United States11393 Posts
December 23 2009 08:55 GMT
#36
On December 23 2009 17:44 Ilikestarcraft wrote:
Show nested quote +
On December 23 2009 16:07 R1CH wrote:
Once a post is indexed, it's indexed. If a post is edited after the search engine has indexed it, any changes in the edit will not be searchable. Get your post right first time!

none of my posts are going to be searchable now


-_________-

Anyways, fuck yes being able to search for 3 letter words now. Thanks R1ch. <33
Moderator。◕‿◕。
darktreb
Profile Joined May 2007
United States3016 Posts
December 23 2009 09:00 GMT
#37
On December 23 2009 16:23 R1CH wrote:
I really don't think it's as big an issue as you think, most edits are made within a few minutes of a post being posted and there's a low chance it would get indexed in that timeframe.


That's a really good point. Sounds good to me.
ShaLLoW[baY]
Profile Blog Joined January 2007
Canada12499 Posts
December 23 2009 09:06 GMT
#38
...I don't understand a single technical term in this thread. R1CH, you seem really smart. I'd love to be your Sphinx and suck the RAM out of your delta index
ALEXISONFIRE ARE FUCKING BACK (sAviOr for life)
H
Profile Blog Joined July 2007
New Zealand6138 Posts
December 23 2009 09:25 GMT
#39
R1CH why has SA not already done this
explain it to me in terms a dumb person would understand
[iHs]HCO | のヮの | pachi & plexa ownz | RIP _
NovaTheFeared
Profile Blog Joined October 2004
United States7229 Posts
December 23 2009 09:33 GMT
#40
Is the non updated index going to be a problem on live report topics where the OP gets edited many times over the course of hours of gameplay?
日本語が分かりますか
Prev 1 2 3 4 5 6 7 8 Next All
Please log in or register to reply.
Live Events Refresh
Tenacious Turtle Tussle
23:00
Biweekly #35
CranKy Ducklings69
LiquipediaDiscussion
[ Submit Event ]
Live Streams
Refresh
StarCraft 2
White-Ra 351
PiGStarcraft308
ProTech114
Livibee 95
ZombieGrub44
StarCraft: Brood War
Shuttle 570
Artosis 482
UpATreeSC 72
Sexy 30
Rock 30
NaDa 15
Dota 2
syndereN252
Counter-Strike
Foxcn234
Super Smash Bros
C9.Mang0101
Mew2King85
Other Games
summit1g4547
Grubby4137
shahzam397
Maynarde89
Nathanias22
fpsfer 1
Organizations
StarCraft 2
Blizzard YouTube
StarCraft: Brood War
BSLTrovo
sctven
[ Show 18 non-featured ]
StarCraft 2
• Hupsaiya 70
• RyuSc2 52
• Adnapsc2 10
• HeavenSC 3
• Kozan
• LaughNgamezSOOP
• sooper7s
• AfreecaTV YouTube
• intothetv
• Migwel
• IndyKCrew
StarCraft: Brood War
• STPLYoutube
• ZZZeroYoutube
• BSLYoutube
Dota 2
• masondota2931
League of Legends
• imaqtpie3112
Other Games
• Shiphtur242
• Scarra106
Upcoming Events
The PondCast
10h 21m
RSL Revival
10h 21m
Solar vs Zoun
MaxPax vs Bunny
Kung Fu Cup
12h 21m
ByuN vs ShoWTimE
Classic vs Cure
Reynor vs TBD
WardiTV Korean Royale
12h 21m
PiGosaur Monday
1d 1h
RSL Revival
1d 10h
Classic vs Creator
Cure vs TriGGeR
Kung Fu Cup
1d 12h
herO vs TBD
CranKy Ducklings
2 days
RSL Revival
2 days
herO vs Gerald
ByuN vs SHIN
Kung Fu Cup
2 days
[ Show More ]
IPSL
2 days
ZZZero vs rasowy
Napoleon vs KameZerg
BSL 21
2 days
Tarson vs Julia
Doodle vs OldBoy
eOnzErG vs WolFix
StRyKeR vs Aeternum
Sparkling Tuna Cup
3 days
RSL Revival
3 days
Reynor vs sOs
Maru vs Ryung
Kung Fu Cup
3 days
WardiTV Korean Royale
3 days
BSL 21
3 days
JDConan vs Semih
Dragon vs Dienmax
Tech vs NewOcean
TerrOr vs Artosis
IPSL
3 days
Dewalt vs WolFix
eOnzErG vs Bonyth
Replay Cast
3 days
Wardi Open
4 days
Monday Night Weeklies
4 days
WardiTV Korean Royale
5 days
The PondCast
6 days
Replay Cast
6 days
Liquipedia Results

Completed

Proleague 2025-11-07
Stellar Fest: Constellation Cup
Eternal Conflict S1

Ongoing

C-Race Season 1
IPSL Winter 2025-26
KCM Race Survival 2025 Season 4
SOOP Univ League 2025
YSL S2
BSL Season 21
BLAST Rivals Fall 2025
IEM Chengdu 2025
PGL Masters Bucharest 2025
Thunderpick World Champ.
CS Asia Championships 2025
ESL Pro League S22
StarSeries Fall 2025
FISSURE Playground #2
BLAST Open Fall 2025
BLAST Open Fall Qual

Upcoming

SLON Tour Season 2
BSL 21 Non-Korean Championship
Acropolis #4
IPSL Spring 2026
HSC XXVIII
RSL Offline Finals
WardiTV 2025
RSL Revival: Season 3
META Madness #9
BLAST Bounty Winter 2026
BLAST Bounty Winter 2026: Closed Qualifier
eXTREMESLAND 2025
ESL Impact League Season 8
SL Budapest Major 2025
TLPD

1. ByuN
2. TY
3. Dark
4. Solar
5. Stats
6. Nerchio
7. sOs
8. soO
9. INnoVation
10. Elazer
1. Rain
2. Flash
3. EffOrt
4. Last
5. Bisu
6. Soulkey
7. Mini
8. Sharp
Sidebar Settings...

Advertising | Privacy Policy | Terms Of Use | Contact Us

Original banner artwork: Jim Warren
The contents of this webpage are copyright © 2025 TLnet. All Rights Reserved.