#bm25

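🌗 PostgreSQL BM25 Full-Text Search: Speed Up Performance with These Tips
➤ Debunking the myth of slow performance and unlocking the real speed of PostgreSQL full-text search
blog.vectorchord.ai/postgresql
This article examines performance problems with PostgreSQL's built-in full-text search (FTS) and corrects the benchmark setup used in an earlier analysis by Neon. It argues that Neon's benchmark may not have used optimized settings, which left standard FTS far behind pg_search, an extension built on Tantivy, a library written in Rust. By precomputing and storing the `tsvector` and turning off the GIN index's `fastupdate` option, standard FTS can run roughly 50 times faster. The article also discusses the limits of ranking performance (`ts_rank`) and recommends the VectorChord-BM25 extension for faster, more relevant search results.
#PostgreSQL #FullTextSearch #BM25 #PerformanceOptimization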

VectorChord · PostgreSQL BM25 Full-Text Search: Speed Up Performance with These Tips · By Jinjing Zhou
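
The two tuning steps summarized in the post above (a stored `tsvector` and a GIN index with `fastupdate` off) might look roughly like the sketch below. It is not the article's own code: the table `docs`, its columns `id` and `body`, the column `body_tsv`, and the index name are all hypothetical, generated columns assume PostgreSQL 12 or later, and the `%s` placeholder assumes a psycopg-style DB-API cursor.

```python
# Hypothetical schema: a table "docs" with columns "id" and "body" (text).
SETUP_STATEMENTS = [
    # Store the tsvector once per row instead of calling to_tsvector() in every query.
    """
    ALTER TABLE docs
        ADD COLUMN body_tsv tsvector
        GENERATED ALWAYS AS (to_tsvector('english', body)) STORED
    """,
    # Index the stored column with GIN and disable the fastupdate pending list.
    """
    CREATE INDEX docs_body_tsv_idx
        ON docs USING gin (body_tsv)
        WITH (fastupdate = off)
    """,
]

# Match against the stored column so the index can be used directly.
SEARCH_QUERY = """
    SELECT id
    FROM docs
    WHERE body_tsv @@ to_tsquery('english', %s)
"""


def apply_setup(cursor):
    """Run the one-time schema changes on an open DB-API cursor."""
    for statement in SETUP_STATEMENTS:
        cursor.execute(statement)


def search_docs(cursor, tsquery):
    """Return the ids of rows matching a tsquery string such as 'postgres & search'."""
    cursor.execute(SEARCH_QUERY, (tsquery,))
    return cursor.fetchall()
```

With a real connection, `apply_setup(conn.cursor())` would be run once, and `search_docs` afterwards. The sketch covers matching only; it does not address the `ts_rank` ranking cost the post mentions.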

🚀✨ Wow, #PostgreSQL just got a shiny new toy to play well with others! Apparently, adding #BM25 ranking makes it 3x faster than Elasticsearch, because, you know, exaggerated #performance boasts never get old. 🤔🔍 So strap in, folks, because slightly quicker database searches are the next big thing! 😂📚
blog.vectorchord.ai/vectorchor #Elasticsearch #database #search #speed #HackerNews #ngated

VectorChord · VectorChord-BM25: Revolutionize PostgreSQL Search with BM25 Ranking · By Jinjing Zhou

These past days, I've been working on and having lots of fun with Housaku (豊作), a personal search engine built on SQLite's FTS5. Currently it has:

- Support for file formats like .txt, .md, .pdf, .epub, and .docx
- Basic web scraping functionality
- BM25 for search results, thanks to SQLite's FTS5

And files are processed in parallel, making indexing faster than I expected!

If you're interested, check it out at:

github.com/dnlzrgz/housaku

GitHub · dnlzrgz/housaku: A powerful yet simple personal search engine built on top of SQLite's FTS5.
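
For readers curious how BM25 ranking comes "for free" with SQLite's FTS5, here is a minimal sketch using Python's standard `sqlite3` module. It is not taken from Housaku: the table name `docs`, its columns, and the sample documents are made up for illustration, and it assumes the local SQLite build has FTS5 enabled.

```python
import sqlite3


def build_index(documents):
    """Create an in-memory FTS5 table and load (path, body) pairs into it."""
    conn = sqlite3.connect(":memory:")
    # "path" is stored but not indexed; only "body" is searchable.
    conn.execute("CREATE VIRTUAL TABLE docs USING fts5(path UNINDEXED, body)")
    conn.executemany("INSERT INTO docs (path, body) VALUES (?, ?)", documents)
    return conn


def search(conn, query, limit=5):
    """Return (path, score) pairs, best match first.

    FTS5's built-in "rank" column defaults to bm25(), where a lower
    (more negative) value means a better match, so ascending order works.
    """
    rows = conn.execute(
        "SELECT path, rank FROM docs WHERE docs MATCH ? ORDER BY rank LIMIT ?",
        (query, limit),
    )
    return rows.fetchall()


if __name__ == "__main__":
    conn = build_index([
        ("notes/sqlite.md", "sqlite fts5 full text search with bm25 ranking"),
        ("notes/garden.txt", "tomato harvest notes for the garden"),
        ("notes/engines.md", "search engines rank documents by relevance"),
    ])
    for path, score in search(conn, "search"):
        print(path, score)
```

The query string uses FTS5's own match syntax, so plain words work as shown, while punctuation or operators in user input would need quoting.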