From charlesreid1

Revision as of 01:53, 27 July 2018 by Admin

Code

Code here: https://git.charlesreid1.com/charlesreid1/whoosh-search

Notes

Whoosh is a library for building search engines in Python.

You define a schema and create an index, then start to add documents to the index. The result is a search index built by whoosh.

Because building a search engine is a broad task that can cover a really wide range of use cases, Whoosh is highly configurable. Unfortunately, that means the documentation is often hard to follow, because it stays at too high a level.

This page collects notes on a Whoosh setup to create a search index for something nice and concrete: a pile of Markdown files.