✨ Features Overview
Indexly is a lightweight, modular document indexing + search engine. Here’s the latest overview:
Search
- Full-text search (FTS5) across content and metadata
- Boolean operators:
AND
,OR
,NOT
,NEAR
- Phrase search with quotes
search "term"
- Fuzzy search support via SQLite extensions
- Smart ranking and scoring
Tag Detection
- Extracts custom tags from document content
- Regex-based virtual tag matcher (
fts_core.py
) - Works with
.pdf
,.docx
,.eml
,.msg
,.txt
,.md
,.xlsx
- CLI previews via
tag list
command
Caching System
- Smart result caching for repeat searches
- Auto refresh if documents change
- Control via
--no-cache
CSV Analyzer
- Auto-detects delimiters
- Computes mean, median, stddev, IQR
- Value counts for categorical data
- Outputs in Markdown or TXT
Supported Formats
.pdf, .docx, .xlsx, .csv, .msg, .eml, .md, .txt, .json, .xml, .epub, .pptx, .odt, HTML, JS, CSS, Python, Logs, images (.jpg, .png, .tiff, .bmp)
CLI & Extensibility
- Modular CLI in
cli_utils.py
- Scriptable, clear logging system
- Fully open-source
Metadata Indexing
- Extracts title, author, subject, dates
- PDF and Office metadata
- Image EXIF metadata
Developer Focus
- Extend
tag_fields
infts_core.py
- Modify
filetype_utils.py
to support new formats