AI-powered expert knowledge,
with full provenance

From raw documents to cited answers in four steps. Every answer traces back to the original source.

Step 1

Creator uploads corpus

Creators upload their knowledge: PDFs, books, transcripts, research papers, course recordings. Any format, any size. The system accepts it all.

PDF EPUB DOCX TXT MD HTML
Step 2

H100 GPU processes

NVIDIA H100 GPUs run the full pipeline: OCR extracts text, atomic chunking preserves meaning, embeddings capture semantics, and provenance chains track every transformation.

OCR Chunking Embeddings Provenance
Step 3

Subscriber asks questions

Subscribers use the AI chat interface to ask questions in natural language. The system searches across the creator's entire knowledge base using semantic search and cross-encoder reranking.

Step 4

Cited answers with document trail

Every answer includes inline citations linking back to the original document, page, and paragraph. Click any citation to open the document viewer and see the source in full context.

The technology behind it

Built on battle-tested open-source components with a commercial layer on top.

Marker-PDF OCR

State-of-the-art PDF extraction that handles scanned documents, complex layouts, tables, and multi-column text with high fidelity.

Nomic Embeddings

768-dimensional embeddings that capture semantic meaning. Run locally for privacy or on H100s for speed. No data leaves your machine unless you choose cloud processing.

Cross-Encoder Reranking

After initial retrieval, a cross-encoder model re-scores results for precision. The most relevant chunks always surface first.

SQLite-vec

Vector search built directly into SQLite. No external database servers. Your entire knowledge base is a single portable file.

Provenance Chains

Every transformation is tracked: original file, OCR output, chunks, embeddings. Full audit trail from source to answer.

Atomic Chunking

Documents are split into meaning-preserving chunks that respect paragraph boundaries, headers, and logical structure. No information is lost at chunk boundaries.

Try It Free

50 free queries per month. No credit card required.