Research

Benchmarks

Evaluation on real organizational data. All benchmarks evaluated against meeting transcripts, email threads, Slack conversations, and project documents.

SignalDescriptionAccuracyLatency
Intent ClassificationRoute queries to the right strategy97%270ms
Memory TriageAuto-classify and summarize94%2.1s
Entity ExtractionPeople, orgs, decisions, actions92%1.8s
Sentiment AnalysisEmotional tone detection91%350ms
Relationship LinkingCross-memory connections89%1.5s
Query ExpansionEnrich vague queries87%700ms
Answer GenerationCited conversational answers85%4.2s

Benchmarks are run on each model version. v1.4 results shown. v1.5 targets 95%+ across all signals.