New Search Engine - Now Live - Page 2

R1CH
Profile Blog Joined May 2007
Netherlands10340 Posts
December 23 2009 07:54 GMT
#21
Spelling suggestions are something I've considered adding at a later date. Infix (partial) matching on a word is supported, but would substantially increase the index size. I want to see how it performs by default first.

And Mystlord, Badjas is correct. Updates right now take less than a second, but that will grow as more data is added since the last full index. I actually had to index everything four times today to get everything how I wanted it.
AdministratorTwitter: @R1CH_TL
Mystlord *
Profile Blog Joined July 2008
United States10264 Posts
December 23 2009 08:01 GMT
#22
On December 23 2009 16:54 R1CH wrote:
Spelling suggestions are something I've considered adding at a later date. Infix (partial) matching on a word is supported, but would substantially increase the index size. I want to see how it performs by default first.

And Mystlord, Badjas is correct. Updates right now take less than a second, but that will grow as more data is added since the last full index. I actually had to index everything four times today to get everything how I wanted it.

Ok then, never mind. I don't think there are any major problems with the new search engine then. The edit thing might be a problem because we'd lose a lot of new material (I'm thinking updates to topics like Stylish FPVODs and Day[9] podcasts).

Regardless, if I find any bugs, then I'll report them here.
It is impossible to be a citizen if you don't make an effort to understand the most basic activities of your government. It is very difficult to thrive in an increasingly competitive world if you're a nation of doods.
Cambium
Profile Blog Joined June 2004
United States16368 Posts
December 23 2009 08:02 GMT
#23
That's pretty interesting. I've never heard of Sphinx before.

I've worked with Lucene extensively in the past. It has top-notch performance, and it can be coded to address all of your problems. Maybe you can have a look when you are really bored over the holidays. I'll definitely have a look at Sphinx.

Good job re-writing the search engine though!

Regarding the non-edit problem, can't you delete contents from the indices, and then simply re-index edited contents?
When you want something, all the universe conspires in helping you to achieve it.
Cambium
Profile Blog Joined June 2004
United States16368 Posts
Last Edited: 2009-12-23 08:11:25
December 23 2009 08:09 GMT
#24
After reading some comments, I have another proposal for stale content (this is actually what I did with Lucene).

Given that:
Building indices from scratch takes 30 minutes
Incremental updates take much less time

What we can do is keep two sets of indices, call them A and B. Find a time when TL is least busy (i.e. least stress on the server) and call this time T.

We incrementally update one set of indices, A, until time T; at time T, we completely remove the other set of indices, B, and rebuild it from scratch. While B is being built, we obviously still update A, and all search queries are run against A. After B is complete, we "dump" A and run all subsequent queries against B. At the next time T, A and B swap roles again. This way, the indices will be 1 day stale in the worst case.

Is this feasible?
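
The A/B scheme above can be sketched roughly as follows. This is a toy in-memory model, not Lucene or Sphinx code: `build_index`, `DoubleBufferedIndex` and its methods are made-up names for illustration, and the "index" is just a word-to-post-id dictionary. It also ignores posts that arrive while the rebuild is running, which a real setup would have to replay into the new index.

```python
from typing import Dict, Iterable, List, Set, Tuple

Post = Tuple[int, str]  # (post_id, text); hypothetical shape for illustration


def build_index(posts: Iterable[Post]) -> Dict[str, Set[int]]:
    """Build a toy inverted index: word -> set of post ids."""
    index: Dict[str, Set[int]] = {}
    for post_id, text in posts:
        for word in text.lower().split():
            index.setdefault(word, set()).add(post_id)
    return index


class DoubleBufferedIndex:
    """Serve queries from one index while a fresh one is built on the side."""

    def __init__(self, posts: Iterable[Post]) -> None:
        self.live = build_index(posts)

    def add_incremental(self, post: Post) -> None:
        # Between rebuilds, new posts are only appended to the live index.
        post_id, text = post
        for word in text.lower().split():
            self.live.setdefault(word, set()).add(post_id)

    def rebuild_and_swap(self, current_posts: Iterable[Post]) -> None:
        # At time T: build the standby index from scratch. Queries keep
        # hitting self.live until the single assignment below swaps it.
        standby = build_index(current_posts)
        self.live = standby

    def search(self, word: str) -> List[int]:
        return sorted(self.live.get(word.lower(), set()))


idx = DoubleBufferedIndex([(1, "zerg rush build")])
idx.add_incremental((2, "protoss build order"))
# Post 1 gets edited; only the from-scratch rebuild picks up the new text.
idx.rebuild_and_swap([(1, "terran build"), (2, "protoss build order")])
```

The point of the swap is that edited posts become searchable after the next rebuild, at the cost of building a second full index while the first one is still serving queries.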
When you want something, all the universe conspires in helping you to achieve it.
R1CH
Profile Blog Joined May 2007
Netherlands10340 Posts
Last Edited: 2009-12-23 08:14:06
December 23 2009 08:10 GMT
#25
When I say Sphinx indexes can't be modified, I mean it. A more flexible index format is on their todo list though. See http://www.sphinxsearch.com/docs/manual-0.9.9.html#conf-mva-updates-pool

EDIT: TL doesn't really have an off-peak time. However, the indexer can use a configurable IOPS, iosize and delay, so if we want it to take 4 hours to update, it can. Refreshing the index every day like this is certainly possible, but I'd rather avoid it since even with very conservative settings, it's still going to suck up a lot of RAM and IO. Maybe a weekly update is doable.
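
As a rough illustration of how the IOPS and iosize limits translate into rebuild time, here is a back-of-the-envelope sketch. The parameter names mirror what I believe are the Sphinx indexer settings `max_iops` and `max_iosize`, but the functions themselves are hypothetical helpers, and the 1 GiB data size is an invented number (the thread does not say how much data TL indexes). The result is only a lower bound from I/O, ignoring CPU time and any extra delay between queries.

```python
def throttled_throughput(max_iops: int, max_iosize: int) -> int:
    """Upper bound on bytes per second when both limits are in force."""
    if max_iops <= 0 or max_iosize <= 0:
        # This sketch only models the throttled case.
        raise ValueError("both limits must be positive")
    return max_iops * max_iosize


def min_rebuild_seconds(total_bytes: int, max_iops: int, max_iosize: int) -> float:
    """Lower bound on rebuild time if I/O is the only bottleneck."""
    return total_bytes / throttled_throughput(max_iops, max_iosize)


def iops_for_target(total_bytes: int, target_seconds: float, max_iosize: int) -> float:
    """IOPS cap that stretches the I/O over roughly target_seconds."""
    return total_bytes / (target_seconds * max_iosize)


ASSUMED_BYTES = 2**30  # hypothetical 1 GiB of data to read
seconds = min_rebuild_seconds(ASSUMED_BYTES, max_iops=40, max_iosize=64 * 1024)
four_hour_cap = iops_for_target(ASSUMED_BYTES, 4 * 3600, 64 * 1024)
```

Lowering the cap spreads the same total I/O over a longer window; it does not reduce the total work, which is why even conservative settings still cost RAM and IO overall.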
AdministratorTwitter: @R1CH_TL
Cambium
Profile Blog Joined June 2004
United States16368 Posts
Last Edited: 2009-12-23 08:19:06
December 23 2009 08:12 GMT
#26
On December 23 2009 17:10 R1CH wrote:
When I say Sphinx indexes can't be modified, I mean it. A more flexible index format is on their todo list though. See http://www.sphinxsearch.com/docs/manual-0.9.9.html#conf-mva-updates-pool


Ah, that sucks. Hopefully they'll implement that soon.
When you want something, all the universe conspires in helping you to achieve it.
Cambium
Profile Blog Joined June 2004
United States16368 Posts
Last Edited: 2009-12-23 08:18:01
December 23 2009 08:16 GMT
#27
On December 23 2009 17:10 R1CH wrote:
EDIT: TL doesn't really have an off-peak time. However, the indexer can use a configurable IOPS, iosize and delay, so if we want it to take 4 hours to update, it can. Refreshing the index every day like this is certainly possible, but I'd rather avoid it since even with very conservative settings, it's still going to suck up a lot of RAM and IO. Maybe a weekly update is doable.


Ha!

edit:

lucene owns~! (maybe sphinx won't pick this up)
When you want something, all the universe conspires in helping you to achieve it.
R1CH
Profile Blog Joined May 2007
Netherlands10340 Posts
December 23 2009 08:19 GMT
#28
Seriously, we only have on-peak times.

[image loading]
AdministratorTwitter: @R1CH_TL
Cambium
Profile Blog Joined June 2004
United States16368 Posts
December 23 2009 08:21 GMT
#29
after squinting: weekly updates on sunday nights?
When you want something, all the universe conspires in helping you to achieve it.
writer22816
Profile Blog Joined September 2008
United States5775 Posts
December 23 2009 08:23 GMT
#30
Thanks a ton!
8/4/12 never forget, never forgive.
R1CH
Profile Blog Joined May 2007
Netherlands10340 Posts
December 23 2009 08:24 GMT
#31
Perhaps. I'll need to play around with the indexer options to see if the throttling is effective. A full rebuild every now and then would be good regardless, to minimize the size of the delta index.
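
For anyone wondering what the delta index means in practice, here is a toy model of a main + delta arrangement. It assumes the delta is simply "every post with an id higher than the last one in the main index", which is the usual way this is set up with Sphinx as far as I know; the thread does not confirm TL's exact configuration. `MainPlusDelta` and its methods are invented names, and the index is a plain dictionary.

```python
from typing import Dict, List, Set, Tuple

Post = Tuple[int, str]  # (post_id, text); ids are assumed to increase over time
Index = Dict[str, Set[int]]


def build_index(posts: List[Post]) -> Index:
    """Build a toy inverted index: word -> set of post ids."""
    index: Index = {}
    for post_id, text in posts:
        for word in text.lower().split():
            index.setdefault(word, set()).add(post_id)
    return index


class MainPlusDelta:
    def __init__(self) -> None:
        self.main: Index = {}
        self.delta: Index = {}
        self.max_main_id = 0  # highest post id covered by the main index

    def full_rebuild(self, posts: List[Post]) -> None:
        # Slow path: reindex everything and empty the delta.
        self.main = build_index(posts)
        self.max_main_id = max((pid for pid, _ in posts), default=0)
        self.delta = {}

    def update_delta(self, posts: List[Post]) -> int:
        # Fast path: rebuild the delta from posts newer than the main index.
        # The returned count grows until the next full rebuild.
        recent = [p for p in posts if p[0] > self.max_main_id]
        self.delta = build_index(recent)
        return len(recent)

    def search(self, word: str) -> List[int]:
        # A query consults both indexes and merges the hits.
        key = word.lower()
        return sorted(self.main.get(key, set()) | self.delta.get(key, set()))
```

In this model an edit to a post that is already in the main index is invisible to `update_delta`, so the old text stays searchable and the new text does not until `full_rebuild` runs. That matches the "once a post is indexed, it's indexed" behaviour described earlier in the thread.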
AdministratorTwitter: @R1CH_TL
Cambium
Profile Blog Joined June 2004
United States16368 Posts
December 23 2009 08:25 GMT
#32
Out of curiosity, how long did it take you to write the new indexing service?
When you want something, all the universe conspires in helping you to achieve it.
Cambium
Profile Blog Joined June 2004
United States16368 Posts
Last Edited: 2009-12-23 08:28:06
December 23 2009 08:27 GMT
#33
On December 23 2009 17:24 R1CH wrote:
Perhaps. I'll need to play around with the indexer options to see if the throttling is effective. A full rebuild every now and then would be good regardless, to minimize the size of the delta index.


It would probably improve performance as well. Lucene definitely didn't like incremental indexing (and updates) as much as I'd have liked.

Maybe suggest an "optimize" function to reorganize the existing index, if such a method doesn't exist yet?
When you want something, all the universe conspires in helping you to achieve it.
R1CH
Profile Blog Joined May 2007
Netherlands10340 Posts
December 23 2009 08:31 GMT
#34
That's one area where Sphinx excels: it can query any number of indexes, even on remote hosts, with no performance penalty (other than the obvious aggregate index sizes). Most of the work setting up Sphinx was making sure all the charsets worked out properly (Unicode and such) and setting up automatic updating / index rotation. The actual PHP code was quite simple since they provide a really good PHP API.
AdministratorTwitter: @R1CH_TL
Ilikestarcraft
Profile Blog Joined November 2004
Korea (South)17726 Posts
December 23 2009 08:44 GMT
#35
On December 23 2009 16:07 R1CH wrote:
Once a post is indexed, it's indexed. If a post is edited after the search engine has indexed it, any changes in the edit will not be searchable. Get your post right first time!

none of my posts are going to be searchable now

Thanks for the fix r1ch.
"Nana is a goddess. Or at very least, Nana is my goddess." - KazeHydra
Harem
Profile Joined November 2007
United States11390 Posts
December 23 2009 08:55 GMT
#36
On December 23 2009 17:44 Ilikestarcraft wrote:
On December 23 2009 16:07 R1CH wrote:
Once a post is indexed, it's indexed. If a post is edited after the search engine has indexed it, any changes in the edit will not be searchable. Get your post right first time!

none of my posts are going to be searchable now


-_________-

Anyways, fuck yes being able to search for 3 letter words now. Thanks R1ch. <33
Moderator。◕‿◕。
darktreb
Profile Joined May 2007
United States3016 Posts
December 23 2009 09:00 GMT
#37
On December 23 2009 16:23 R1CH wrote:
I really don't think it's as big an issue as you think, most edits are made within a few minutes of a post being posted and there's a low chance it would get indexed in that timeframe.


That's a really good point. Sounds good to me.
ShaLLoW[baY]
Profile Blog Joined January 2007
Canada12499 Posts
December 23 2009 09:06 GMT
#38
...I don't understand a single technical term in this thread. R1CH, you seem really smart. I'd love to be your Sphinx and suck the RAM out of your delta index
ALEXISONFIRE ARE FUCKING BACK (sAviOr for life)
H
Profile Blog Joined July 2007
New Zealand6138 Posts
December 23 2009 09:25 GMT
#39
R1CH why has SA not already done this
explain it to me in terms a dumb person would understand
[iHs]HCO | のヮの | pachi & plexa ownz | RIP _
NovaTheFeared
Profile Blog Joined October 2004
United States7222 Posts
December 23 2009 09:33 GMT
#40
Is the non-updated index going to be a problem on live report topics where the OP gets edited many times over the course of hours of gameplay?
日本語が分かりますか