Private document chat,
built in.

Drop a file. Ask anything. Nothing leaves your machine.

Most "chat with your documents" tools work by uploading your file to a server, embedding it with a cloud API, and storing the chunks in someone else's vector database. That's three places your data has touched before the model answers a single question. tailor. does the entire pipeline locally , chunking, embedding, retrieval, and inference all run on your device.

What you can drop in

PDFs (text and scanned, with OCR via a multimodal model). Word documents. Spreadsheets , tailor. understands cell context and can run analysis. Source code with language-aware semantic chunking, so the model sees functions and classes as units instead of arbitrary line ranges. Images and charts, processed directly by vision models. Audio files, transcribed by local Whisper before retrieval. Markdown, plain text, ePub, JSON, CSV.

How retrieval actually works

tailor. runs a local embedding model (you can pick one in settings , defaults to a strong general-purpose model under 500MB). When you add a file, it's chunked, embedded, and stored in a local SQLite-backed vector index. Queries are embedded the same way, retrieved by cosine similarity with optional BM25 hybrid scoring, and the top-K chunks are passed to the chat model with citations. Citations link back to the exact page or line in the original file.

Why this is better than uploading to ChatGPT

Privacy first , nothing leaves the device. But there's a performance angle too: ChatGPT-style document chat tends to truncate or summarize large files because of context-window economics on the cloud side. tailor. uses local context however the model supports it, which means you can routinely work with 200-page reports or whole codebases without the model losing track.

Scanned PDFs and image-heavy documents

When a PDF has scanned pages or embedded images, tailor. uses a local vision-capable model to extract content directly. No separate OCR step, no cloud OCR API. The same loop handles whiteboard photos, screenshots, and chart images.

Codebases

Point tailor. at a directory and it walks every file with the right strategy: language-aware chunking that splits source code on function and class boundaries, Markdown handling for docs, structure-preserving handling for config. Then ask questions like "where is the authentication middleware defined" or "what's the difference between handleRequest in v2 and v3" and tailor. retrieves the right files before answering.

Questions

How large a document can I work with?

There's no fixed limit. The embedding index is on-disk so it scales to terabytes. Per-query, tailor. retrieves only the relevant chunks, so a 10,000-page archive works the same as a 10-page memo.

Are my embeddings stored anywhere off-device?

No. The vector index is a SQLite file in your tailor. data directory. You can back it up, copy it to another machine, or delete it , it's just a file.

Can I share a document index with a teammate?

Yes. tailor. has end-to-end encrypted workspace sharing over your local network. Two tailor. instances on the same Wi-Fi auto-discover each other, or pair via a one-shot QR code, then exchange workspaces directly over TLS with pinned certificate fingerprints. No cloud, no account, no server in the middle.

Which embedding model does tailor. use?

Defaults to a strong general-purpose model (~500MB on disk). You can pick a different one in settings , the catalog includes options optimized for code, multilingual content, and longer-context retrieval.

Try tailor. free for 7 days.

Full access. No credit card required. Mac, Windows, and Linux.

Start free trial → See pricing

Private document chat,built in.