• Log InLog In
  • Register
Liquid`
Team Liquid Liquipedia
EDT 06:35
CEST 12:35
KST 19:35
  • Home
  • Forum
  • Calendar
  • Streams
  • Liquipedia
  • Features
  • Store
  • EPT
  • TL+
  • StarCraft 2
  • Brood War
  • Smash
  • Heroes
  • Counter-Strike
  • Overwatch
  • Liquibet
  • Fantasy StarCraft
  • TLPD
  • StarCraft 2
  • Brood War
  • Blogs
Forum Sidebar
Events/Features
News
Featured News
Team TLMC #5 - Finalists & Open Tournaments0[ASL20] Ro16 Preview Pt2: Turbulence10Classic Games #3: Rogue vs Serral at BlizzCon9[ASL20] Ro16 Preview Pt1: Ascent10Maestros of the Game: Week 1/Play-in Preview12
Community News
BSL 2025 Warsaw LAN + Legends Showmatch0Weekly Cups (Sept 8-14): herO & MaxPax split cups4WardiTV TL Team Map Contest #5 Tournaments1SC4ALL $6,000 Open LAN in Philadelphia8Weekly Cups (Sept 1-7): MaxPax rebounds & Clem saga continues29
StarCraft 2
General
#1: Maru - Greatest Players of All Time Weekly Cups (Sept 8-14): herO & MaxPax split cups Team Liquid Map Contest #21 - Presented by Monster Energy SpeCial on The Tasteless Podcast Team TLMC #5 - Finalists & Open Tournaments
Tourneys
RSL: Revival, a new crowdfunded tournament series Maestros of The Game—$20k event w/ live finals in Paris Sparkling Tuna Cup - Weekly Open Tournament SC4ALL $6,000 Open LAN in Philadelphia WardiTV TL Team Map Contest #5 Tournaments
Strategy
Custom Maps
External Content
Mutation # 491 Night Drive Mutation # 490 Masters of Midnight Mutation # 489 Bannable Offense Mutation # 488 What Goes Around
Brood War
General
ASL20 General Discussion ASL TICKET LIVE help! :D Soulkey on ASL S20 BW General Discussion NaDa's Body
Tourneys
[ASL20] Ro16 Group D [ASL20] Ro16 Group C [Megathread] Daily Proleagues BSL 2025 Warsaw LAN + Legends Showmatch
Strategy
Simple Questions, Simple Answers Muta micro map competition Fighting Spirit mining rates [G] Mineral Boosting
Other Games
General Games
Stormgate/Frost Giant Megathread Nintendo Switch Thread Path of Exile Borderlands 3 General RTS Discussion Thread
Dota 2
Official 'what is Dota anymore' discussion LiquidDota to reintegrate into TL.net
League of Legends
Heroes of the Storm
Simple Questions, Simple Answers Heroes of the Storm 2.0
Hearthstone
Heroes of StarCraft mini-set
TL Mafia
TL Mafia Community Thread
Community
General
UK Politics Mega-thread US Politics Mega-thread Canadian Politics Mega-thread Things Aren’t Peaceful in Palestine Russo-Ukrainian War Thread
Fan Clubs
The Happy Fan Club!
Media & Entertainment
Movie Discussion! [Manga] One Piece Anime Discussion Thread
Sports
2024 - 2026 Football Thread Formula 1 Discussion MLB/Baseball 2023
World Cup 2022
Tech Support
Linksys AE2500 USB WIFI keeps disconnecting Computer Build, Upgrade & Buying Resource Thread High temperatures on bridge(s)
TL Community
BarCraft in Tokyo Japan for ASL Season5 Final The Automated Ban List
Blogs
I <=> 9
KrillinFromwales
The Personality of a Spender…
TrAiDoS
A very expensive lesson on ma…
Garnet
hello world
radishsoup
Lemme tell you a thing o…
JoinTheRain
RTS Design in Hypercoven
a11
Evil Gacha Games and the…
ffswowsucks
Customize Sidebar...

Website Feedback

Closed Threads



Active: 1903 users

New Search Engine - Now Live - Page 2

Forum Index > General Forum
152 CommentsPost a Reply
Prev 1 2 3 4 5 6 7 8 Next All
R1CH
Profile Blog Joined May 2007
Netherlands10341 Posts
December 23 2009 07:54 GMT
#21
Spelling suggestions are something I've considered adding at a later date. Infix (partial) matching on a word is supported, but would substantially increase the index size. I want to see how it performs by default first.

And Mystlord, Badjas is correct. Updates right now takes less than a second, but will grow as more data since the last full index is added. I actually had to index everything four times today to get everything how I wanted it.
AdministratorTwitter: @R1CH_TL
Mystlord *
Profile Blog Joined July 2008
United States10264 Posts
December 23 2009 08:01 GMT
#22
On December 23 2009 16:54 R1CH wrote:
Spelling suggestions are something I've considered adding at a later date. Infix (partial) matching on a word is supported, but would substantially increase the index size. I want to see how it performs by default first.

And Mystlord, Badjas is correct. Updates right now takes less than a second, but will grow as more data since the last full index is added. I actually had to index everything four times today to get everything how I wanted it.

Ok then never mind . I don't think there are any major problems with the new search engine then. The edit thing might be a problem because we'd lose a lot of new material (I'm thinking updates to topics like Stylish FPVODs and Day[9] podcasts).

Regardless, if I find any bugs, then I'll report them here.
It is impossible to be a citizen if you don't make an effort to understand the most basic activities of your government. It is very difficult to thrive in an increasingly competitive world if you're a nation of doods.
Cambium
Profile Blog Joined June 2004
United States16368 Posts
December 23 2009 08:02 GMT
#23
That's pretty interesting. I've never heard of sphinx before.

I've worked with Lucene extensively in the past, and it has top-notch performance, and it can be coded to address all of your problems. Maybe you can have a look when you are really bored over the holidays. I'll definitely have a look at Sphinx

Good job re-writing the search engine though!

Regarding the non-edit problem, can't you delete contents from the indices, and then simply re-index edited contents?
When you want something, all the universe conspires in helping you to achieve it.
Cambium
Profile Blog Joined June 2004
United States16368 Posts
Last Edited: 2009-12-23 08:11:25
December 23 2009 08:09 GMT
#24
After reading some comments, I have another proposal for stale contents (this is actually what I did with Lucene)

Given that:
Building indices from scratch takes 30 minutes
Incremental updates take much less time

What we can do is keep two sets of indices, call them A and B. Find a time when TL is least busy (i.e. least stress on the server) and call this time T.

We incrementally update one set of indices, A, until time T; at time T, we completely remove the other set of indices, B, and rebuild it from scratch. While B is being built, we obviously still update A, and all search queries will be run against A. After B is complete, we "dump" A, and run all subsequent queries against B. And we switch B with A again at time T. This way, the indices will be 1-day stale in the worst case.

Is this feasible?
When you want something, all the universe conspires in helping you to achieve it.
R1CH
Profile Blog Joined May 2007
Netherlands10341 Posts
Last Edited: 2009-12-23 08:14:06
December 23 2009 08:10 GMT
#25
When I say Sphinx indexes can't be modified, I mean it . It is on their todo list for a more flexible index format though. See http://www.sphinxsearch.com/docs/manual-0.9.9.html#conf-mva-updates-pool

EDIT: TL doesn't really have an off-peak time, however the indexer can use a configurable IOPS, iosize and delay so if we want it to take 4 hours to update, it can. Refreshing the index every day like this is certainly possible, but I'd rather avoid it since even with very conservative settings, it's still going to suck up a lot of RAM and IO. Maybe a weekly update is doable.
AdministratorTwitter: @R1CH_TL
Cambium
Profile Blog Joined June 2004
United States16368 Posts
Last Edited: 2009-12-23 08:19:06
December 23 2009 08:12 GMT
#26
On December 23 2009 17:10 R1CH wrote:
When I say Sphinx indexes can't be modified, I mean it . It is on their todo list for a more flexible index format though. See http://www.sphinxsearch.com/docs/manual-0.9.9.html#conf-mva-updates-pool


Ah that sucks. Hopefully they'll implement that soon
When you want something, all the universe conspires in helping you to achieve it.
Cambium
Profile Blog Joined June 2004
United States16368 Posts
Last Edited: 2009-12-23 08:18:01
December 23 2009 08:16 GMT
#27
On December 23 2009 17:10 R1CH wrote:
EDIT: TL doesn't really have an off-peak time, however the indexer can use a configurable IOPS, iosize and delay so if we want it to take 4 hours to update, it can. Refreshing the index every day like this is certainly possible, but I'd rather avoid it since even with very conservative settings, it's still going to suck up a lot of RAM and IO. Maybe a weekly update is doable.


Ha!

edit:

lucene owns~! (maybe sphinx won't pick this up)
When you want something, all the universe conspires in helping you to achieve it.
R1CH
Profile Blog Joined May 2007
Netherlands10341 Posts
December 23 2009 08:19 GMT
#28
Seriously, we only have on-peak times .

[image loading]
AdministratorTwitter: @R1CH_TL
Cambium
Profile Blog Joined June 2004
United States16368 Posts
December 23 2009 08:21 GMT
#29
after squinting: weekly updates on sunday nights?
When you want something, all the universe conspires in helping you to achieve it.
writer22816
Profile Blog Joined September 2008
United States5775 Posts
December 23 2009 08:23 GMT
#30
Thanks a ton!
8/4/12 never forget, never forgive.
R1CH
Profile Blog Joined May 2007
Netherlands10341 Posts
December 23 2009 08:24 GMT
#31
Perhaps, I'll need to play around with the indexer options to see if the throttling is effective. A full rebuild would be good every now and then regardless to minimize the size of the delta index.
AdministratorTwitter: @R1CH_TL
Cambium
Profile Blog Joined June 2004
United States16368 Posts
December 23 2009 08:25 GMT
#32
Out of curiosity, how long did it take you to write the new indexing service?
When you want something, all the universe conspires in helping you to achieve it.
Cambium
Profile Blog Joined June 2004
United States16368 Posts
Last Edited: 2009-12-23 08:28:06
December 23 2009 08:27 GMT
#33
On December 23 2009 17:24 R1CH wrote:
Perhaps, I'll need to play around with the indexer options to see if the throttling is effective. A full rebuild would be good every now and then regardless to minimize the size of the delta index.


It would probably improve performance as well. Lucene definitely didn't like incremental indexing (and updates) as much as I'd liked.

Maybe suggest an "optimize" function to re-organize the existing index if such method doesn't exist yet?
When you want something, all the universe conspires in helping you to achieve it.
R1CH
Profile Blog Joined May 2007
Netherlands10341 Posts
December 23 2009 08:31 GMT
#34
That's one area where Sphinx excels, it can query any number of indexes, even on remote hosts with no performance penalty (other than the obvious aggregate index sizes). Most of the work setting up Sphinx was making sure all the charsets worked out properly (unicode and such) and setting up automatic updating / index rotation. The actual PHP code was quite simple since they provide a really good PHP API.
AdministratorTwitter: @R1CH_TL
Ilikestarcraft
Profile Blog Joined November 2004
Korea (South)17727 Posts
December 23 2009 08:44 GMT
#35
On December 23 2009 16:07 R1CH wrote:
Once a post is indexed, it's indexed. If a post is edited after the search engine has indexed it, any changes in the edit will not be searchable. Get your post right first time!

none of my posts are going to be searchable now

Thanks for the fix r1ch.
"Nana is a goddess. Or at very least, Nana is my goddess." - KazeHydra
Harem
Profile Joined November 2007
United States11390 Posts
December 23 2009 08:55 GMT
#36
On December 23 2009 17:44 Ilikestarcraft wrote:
Show nested quote +
On December 23 2009 16:07 R1CH wrote:
Once a post is indexed, it's indexed. If a post is edited after the search engine has indexed it, any changes in the edit will not be searchable. Get your post right first time!

none of my posts are going to be searchable now


-_________-

Anyways, fuck yes being able to search for 3 letter words now. Thanks R1ch. <33
Moderator。◕‿◕。
darktreb
Profile Joined May 2007
United States3016 Posts
December 23 2009 09:00 GMT
#37
On December 23 2009 16:23 R1CH wrote:
I really don't think it's as big an issue as you think, most edits are made within a few minutes of a post being posted and there's a low chance it would get indexed in that timeframe.


That's a really good point. Sounds good to me.
ShaLLoW[baY]
Profile Blog Joined January 2007
Canada12499 Posts
December 23 2009 09:06 GMT
#38
...I don't understand a single technical term in this thread. R1CH, you seem really smart. I'd love to be your Sphinx and suck the RAM out of your delta index
ALEXISONFIRE ARE FUCKING BACK (sAviOr for life)
H
Profile Blog Joined July 2007
New Zealand6138 Posts
December 23 2009 09:25 GMT
#39
R1CH why has SA not already done this
explain it to me in terms a dumb person would understand
[iHs]HCO | のヮの | pachi & plexa ownz | RIP _
NovaTheFeared
Profile Blog Joined October 2004
United States7224 Posts
December 23 2009 09:33 GMT
#40
Is the non updated index going to be a problem on live report topics where the OP gets edited many times over the course of hours of gameplay?
日本語が分かりますか
Prev 1 2 3 4 5 6 7 8 Next All
Please log in or register to reply.
Live Events Refresh
RSL Revival
10:00
Season 2: Playoffs Day 5
Maru vs ReynorLIVE!
Cure vs TriGGeR
Tasteless709
Crank 514
IndyStarCraft 132
CranKy Ducklings91
Rex71
3DClanTV 63
IntoTheiNu 25
LiquipediaDiscussion
[ Submit Event ]
Live Streams
Refresh
StarCraft 2
Tasteless 639
Crank 514
IndyStarCraft 132
Rex 64
StarCraft: Brood War
Calm 7184
Bisu 1119
Horang2 408
Hyuk 318
actioN 304
Stork 297
Pusan 195
ZerO 165
Hyun 158
Snow 145
[ Show more ]
Mini 144
EffOrt 124
Light 121
Soma 107
Liquid`Ret 74
Last 73
ToSsGirL 58
sorry 58
HiyA 49
Soulkey 42
ggaemo 31
Sharp 31
Free 26
Mind 21
scan(afreeca) 19
Rush 15
SilentControl 11
Sexy 9
Terrorterran 6
Dota 2
singsing2156
XcaliburYe182
League of Legends
JimRising 306
Counter-Strike
olofmeister1634
shoxiejesuss609
x6flipin336
allub190
Other Games
XaKoH 143
NeuroSwarm55
Trikslyr15
Organizations
StarCraft: Brood War
Kim Chul Min (afreeca) 640
StarCraft 2
Blizzard YouTube
StarCraft: Brood War
BSLTrovo
sctven
[ Show 14 non-featured ]
StarCraft 2
• AfreecaTV YouTube
• intothetv
• Kozan
• IndyKCrew
• LaughNgamezSOOP
• Migwel
• sooper7s
StarCraft: Brood War
• iopq 1
• BSLYoutube
• STPLYoutube
• ZZZeroYoutube
League of Legends
• Jankos1554
• Stunt627
Other Games
• WagamamaTV172
Upcoming Events
Map Test Tournament
25m
The PondCast
2h 25m
RSL Revival
23h 25m
Zoun vs Classic
Korean StarCraft League
1d 16h
BSL Open LAN 2025 - War…
1d 21h
RSL Revival
1d 23h
BSL Open LAN 2025 - War…
2 days
RSL Revival
2 days
Online Event
3 days
Wardi Open
4 days
[ Show More ]
Monday Night Weeklies
4 days
Sparkling Tuna Cup
4 days
LiuLi Cup
6 days
The PondCast
6 days
Liquipedia Results

Completed

Proleague 2025-09-10
Chzzk MurlocKing SC1 vs SC2 Cup #2
HCC Europe

Ongoing

BSL 20 Team Wars
KCM Race Survival 2025 Season 3
BSL 21 Points
ASL Season 20
CSL 2025 AUTUMN (S18)
LASL Season 20
RSL Revival: Season 2
Maestros of the Game
StarSeries Fall 2025
FISSURE Playground #2
BLAST Open Fall 2025
BLAST Open Fall Qual
Esports World Cup 2025
BLAST Bounty Fall 2025
BLAST Bounty Fall Qual
IEM Cologne 2025
FISSURE Playground #1

Upcoming

2025 Chongqing Offline CUP
BSL World Championship of Poland 2025
IPSL Winter 2025-26
BSL Season 21
SC4ALL: Brood War
BSL 21 Team A
Stellar Fest
SC4ALL: StarCraft II
EC S1
ESL Impact League Season 8
SL Budapest Major 2025
BLAST Rivals Fall 2025
IEM Chengdu 2025
PGL Masters Bucharest 2025
MESA Nomadic Masters Fall
Thunderpick World Champ.
CS Asia Championships 2025
ESL Pro League S22
TLPD

1. ByuN
2. TY
3. Dark
4. Solar
5. Stats
6. Nerchio
7. sOs
8. soO
9. INnoVation
10. Elazer
1. Rain
2. Flash
3. EffOrt
4. Last
5. Bisu
6. Soulkey
7. Mini
8. Sharp
Sidebar Settings...

Advertising | Privacy Policy | Terms Of Use | Contact Us

Original banner artwork: Jim Warren
The contents of this webpage are copyright © 2025 TLnet. All Rights Reserved.