✨ Features Overview

Indexly is a lightweight, modular document indexing + search engine. Here’s the latest overview:


  • Full-text search (FTS5) across content and metadata
  • Boolean operators: AND, OR, NOT, NEAR
  • Phrase search with quotes search "term"
  • Fuzzy search support via SQLite extensions
  • Smart ranking and scoring

Search demo placeholder


Tag Detection

  • Extracts custom tags from document content
  • Regex-based virtual tag matcher (fts_core.py)
  • Works with .pdf, .docx, .eml, .msg, .txt, .md, .xlsx
  • CLI previews via tag list command

Tags placeholder


Caching System

  • Smart result caching for repeat searches
  • Auto refresh if documents change
  • Control via --no-cache

CSV Analyzer

  • Auto-detects delimiters
  • Computes mean, median, stddev, IQR
  • Value counts for categorical data
  • Outputs in Markdown or TXT

CSV analysis placeholder


Supported Formats

.pdf, .docx, .xlsx, .csv, .msg, .eml, .md, .txt, .json, .xml, .epub, .pptx, .odt, HTML, JS, CSS, Python, Logs, images (.jpg, .png, .tiff, .bmp)


CLI & Extensibility

  • Modular CLI in cli_utils.py
  • Scriptable, clear logging system
  • Fully open-source

Metadata Indexing

  • Extracts title, author, subject, dates
  • PDF and Office metadata
  • Image EXIF metadata

metadata-indexing


Developer Focus

  • Extend tag_fields in fts_core.py
  • Modify filetype_utils.py to support new formats

Docs