Backend comparison

Backend          Startup   PyTorch required   Custom HF models
onnx (default)   ~100ms    No                 ONNX-exported models only
torch            ~2–3s     Yes                Any HuggingFace model

ONNX backend (default)

Uses fastembed with ONNX Runtime. Recommended for most users.
  • Fast startup (~100ms model load on first embed() call)
  • No PyTorch installation required
  • Works with ONNX-exported HuggingFace models
# Default — no configuration needed
vecgrep

PyTorch backend

Uses sentence-transformers with PyTorch. Use this when you need a model that isn’t available in ONNX format.
  • Slower startup (~2–3s)
  • Supports any HuggingFace sentence-transformer model
  • Automatically uses Metal (Apple Silicon), CUDA (NVIDIA), or CPU
VECGREP_BACKEND=torch vecgrep
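
The backend switch can be sketched as a small dispatcher that reads the environment variable. This is an illustrative sketch, not vecgrep's actual source — the `select_backend` function name and the validation logic are assumptions; only the `VECGREP_BACKEND` variable and its `onnx`/`torch` values come from the docs above.

```python
import os

VALID_BACKENDS = ("onnx", "torch")

def select_backend() -> str:
    """Pick the embedding backend from the environment.

    Hypothetical dispatch logic -- vecgrep's real code may differ.
    """
    # onnx is the documented default when the variable is unset
    backend = os.environ.get("VECGREP_BACKEND", "onnx")
    if backend not in VALID_BACKENDS:
        raise ValueError(
            f"VECGREP_BACKEND must be one of {VALID_BACKENDS}, got {backend!r}"
        )
    return backend

# Unset -> default ONNX backend
os.environ.pop("VECGREP_BACKEND", None)
print(select_backend())  # onnx

# VECGREP_BACKEND=torch -> PyTorch backend
os.environ["VECGREP_BACKEND"] = "torch"
print(select_backend())  # torch
```

Validating the value up front (rather than silently falling back) makes a typo like `VECGREP_BACKEND=pytorch` fail loudly instead of quietly loading the wrong backend.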