AI-powered expert knowledge,
with full provenance
From raw documents to cited answers in four steps. Every answer traces back to the original source.
Creator uploads corpus
Creators upload their knowledge: PDFs, books, transcripts, research papers, course recordings. Any format, any size. The system accepts it all.
H100 GPUs process it
NVIDIA H100 GPUs run the full pipeline: OCR extracts text, atomic chunking preserves meaning, embeddings capture semantics, and provenance chains track every transformation.
Subscriber asks questions
Subscribers use the AI chat interface to ask questions in natural language. The system retrieves candidates from the creator's entire knowledge base with semantic search, then reorders them with cross-encoder reranking.
Cited answers with document trail
Every answer includes inline citations linking back to the original document, page, and paragraph. Click any citation to open the document viewer and see the source in full context.
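As a rough illustration of what a citation carries, here is a minimal Python sketch. The `Citation` fields mirror the document, page, and paragraph mentioned above; the class name, the `render_answer` helper, and the numbered-marker format are assumptions made for this example, not the product's actual data model.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Citation:
    document: str   # original file name (hypothetical field name)
    page: int       # 1-based page number
    paragraph: int  # 1-based paragraph index on that page


def render_answer(sentences):
    """Join (sentence, citation) pairs into text with numbered inline markers."""
    sources = []  # unique citations, in first-seen order
    parts = []
    for text, cite in sentences:
        if cite not in sources:
            sources.append(cite)
        parts.append(f"{text} [{sources.index(cite) + 1}]")
    footnotes = [
        f"[{i}] {c.document}, p. {c.page}, para. {c.paragraph}"
        for i, c in enumerate(sources, start=1)
    ]
    return " ".join(parts), footnotes
```

Reusing one marker number per source keeps the trail short when several sentences rest on the same paragraph.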
The technology behind it
Built on battle-tested open-source components with a commercial layer on top.
Marker-PDF OCR
State-of-the-art PDF extraction that handles scanned documents, complex layouts, tables, and multi-column text with high fidelity.
Nomic Embeddings
768-dimensional embeddings that capture semantic meaning. Run locally for privacy or on H100s for speed. No data leaves your machine unless you choose cloud processing.
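To make "768-dimensional embeddings" concrete, the sketch below builds 768-dimensional vectors and compares them with cosine similarity. It does not call the Nomic model: `toy_embed` is a hypothetical stand-in that only hashes words into buckets, so it measures word overlap rather than meaning. The comparison step is the part that carries over to real embeddings.

```python
import hashlib
import math

DIM = 768  # matches the dimensionality stated above


def toy_embed(text):
    """Stand-in embedder (assumption): hash each word into one of DIM buckets,
    then scale the vector to unit length."""
    vec = [0.0] * DIM
    for word in text.lower().split():
        bucket = int(hashlib.sha256(word.encode("utf-8")).hexdigest(), 16) % DIM
        vec[bucket] += 1.0
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm else vec


def cosine(a, b):
    """Cosine similarity; the inputs are unit length, so this is a dot product."""
    return sum(x * y for x, y in zip(a, b))
```

A real embedding model would place paraphrases close together even when they share no words, which this toy version cannot do.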
Cross-Encoder Reranking
After initial retrieval, a cross-encoder model re-scores each candidate against the query for precision, so the most relevant chunks surface first.
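The retrieve-then-rerank pattern can be sketched in a few lines of Python. Both scoring functions here are hypothetical stand-ins: `first_stage` counts shared words instead of running a vector search, and `pair_score` is a simple heuristic in place of a cross-encoder model. What the sketch preserves is the shape of the process: a cheap pass selects candidates, then a scorer that sees the query and the chunk together reorders them.

```python
def first_stage(query, chunks, k):
    """Cheap recall step: rank chunks by how many query words they contain."""
    q = set(query.lower().split())
    ranked = sorted(
        chunks, key=lambda c: len(q & set(c.lower().split())), reverse=True
    )
    return ranked[:k]


def pair_score(query, chunk):
    """Stand-in for a cross-encoder (assumption): score the (query, chunk) pair
    jointly as the fraction of the chunk's words that appear in the query."""
    q = set(query.lower().split())
    words = chunk.lower().split()
    return sum(w in q for w in words) / len(words) if words else 0.0


def rerank(query, candidates, score=pair_score):
    """Precision step: reorder the candidates by the pairwise score."""
    return sorted(candidates, key=lambda c: score(query, c), reverse=True)
```

Because the pairwise scorer runs once per candidate, it is only applied to the short list from the first stage rather than to the whole knowledge base.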
SQLite-vec
Vector search built directly into SQLite. No external database servers. Your entire knowledge base is a single portable file.
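The single-file idea can be illustrated with Python's built-in `sqlite3` module. This sketch deliberately does not load the sqlite-vec extension or use its query syntax: it stores each embedding as a float32 BLOB and does a brute-force scan in Python, which is an assumption made to keep the example dependency-free. The table and function names are hypothetical.

```python
import math
import sqlite3
import struct


def pack(vec):
    """Serialize a list of floats as little-endian float32 bytes."""
    return struct.pack(f"<{len(vec)}f", *vec)


def unpack(blob):
    """Inverse of pack: 4 bytes per float32 value."""
    return list(struct.unpack(f"<{len(blob) // 4}f", blob))


def open_store(path):
    """Open (or create) the one-file knowledge base at `path`."""
    db = sqlite3.connect(path)
    db.execute(
        "CREATE TABLE IF NOT EXISTS chunks "
        "(id INTEGER PRIMARY KEY, text TEXT, embedding BLOB)"
    )
    return db


def add_chunk(db, text, vec):
    db.execute(
        "INSERT INTO chunks (text, embedding) VALUES (?, ?)", (text, pack(vec))
    )
    db.commit()


def nearest(db, query_vec, k=3):
    """Brute-force scan: the k chunk texts closest to query_vec (Euclidean)."""
    rows = db.execute("SELECT text, embedding FROM chunks").fetchall()
    rows.sort(key=lambda r: math.dist(query_vec, unpack(r[1])))
    return [text for text, _ in rows[:k]]
```

Passing a file path to `open_store` yields a database that can be copied or backed up like any other file; a vector extension would replace the Python-side scan with an indexed query.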
Provenance Chains
Every transformation is tracked: original file, OCR output, chunks, embeddings. Full audit trail from source to answer.
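One common way to build such an audit trail is a hash-linked list of records, sketched below in Python. This is an illustration under assumptions, not a description of the actual implementation: the record fields, the stage names, and the choice of SHA-256 are all hypothetical.

```python
import hashlib
import json


def _digest(record):
    """Hash a record's fields in a stable key order."""
    return hashlib.sha256(
        json.dumps(record, sort_keys=True).encode("utf-8")
    ).hexdigest()


def add_step(chain, stage, payload):
    """Return a new chain with a record linking this stage's output (bytes)
    to the previous record."""
    record = {
        "stage": stage,
        "content_hash": hashlib.sha256(payload).hexdigest(),
        "parent": chain[-1]["record_hash"] if chain else None,
    }
    record["record_hash"] = _digest(record)  # computed before the key is added
    return chain + [record]


def verify(chain):
    """Check that every record is intact and points at its predecessor."""
    parent = None
    for record in chain:
        body = {k: v for k, v in record.items() if k != "record_hash"}
        if record["parent"] != parent or record["record_hash"] != _digest(body):
            return False
        parent = record["record_hash"]
    return True


# Usage: one record per transformation, from source file to chunks.
chain = add_step([], "original_file", b"%PDF-1.7 ...")
chain = add_step(chain, "ocr_output", b"Extracted text")
chain = add_step(chain, "chunk", b"Extracted")
```

Since each record hashes its parent, altering any earlier stage changes every hash after it, which is what makes the trail checkable from answer back to source.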
Atomic Chunking
Documents are split into meaning-preserving chunks that respect paragraph boundaries, headers, and logical structure. No information is lost at chunk boundaries.
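A simplified version of boundary-respecting chunking might look like the Python sketch below. It assumes the extracted text is Markdown-like, with blank lines between paragraphs and headers starting with `#`; the function name and the `max_chars` limit are hypothetical. Paragraphs are never cut in the middle, and a header always starts a new chunk.

```python
def chunk_paragraphs(text, max_chars=500):
    """Group whole paragraphs into chunks of at most max_chars characters.
    A paragraph longer than max_chars becomes its own chunk, uncut."""
    paragraphs = [p.strip() for p in text.split("\n\n") if p.strip()]
    chunks, current = [], []
    for para in paragraphs:
        starts_section = para.startswith("#")
        too_big = len("\n\n".join(current + [para])) > max_chars
        if current and (starts_section or too_big):
            chunks.append("\n\n".join(current))
            current = []
        current.append(para)
    if current:
        chunks.append("\n\n".join(current))
    return chunks
```

Joining the returned chunks with blank lines reproduces the input paragraphs in order, so nothing is dropped at a chunk boundary.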
50 free queries per month. No credit card required.