Research
Benchmarks
Evaluated on real organizational data: every benchmark is run against meeting transcripts, email threads, Slack conversations, and project documents.
| Signal | Description | Accuracy | Latency |
|---|---|---|---|
| Intent Classification | Route queries to the right strategy | 97% | 270ms |
| Memory Triage | Auto-classify and summarize | 94% | 2.1s |
| Entity Extraction | People, orgs, decisions, actions | 92% | 1.8s |
| Sentiment Analysis | Emotional tone detection | 91% | 350ms |
| Relationship Linking | Cross-memory connections | 89% | 1.5s |
| Query Expansion | Enrich vague queries | 87% | 700ms |
| Answer Generation | Cited conversational answers | 85% | 4.2s |
Benchmarks are run on each model version. The results shown are for v1.4; v1.5 targets 95% or higher accuracy across all signals.
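To make the distance to the v1.5 target concrete, the short sketch below computes how many percentage points each signal still needs. The accuracy and latency figures are copied from the table above; the names `V1_4_RESULTS`, `TARGET_ACCURACY`, and `gaps_to_target` are hypothetical and used only for illustration, not part of any published tooling.

```python
# v1.4 results copied from the table above: signal -> (accuracy %, latency in seconds).
# The dictionary layout is an assumption made for this illustration.
V1_4_RESULTS = {
    "Intent Classification": (97, 0.270),
    "Memory Triage": (94, 2.1),
    "Entity Extraction": (92, 1.8),
    "Sentiment Analysis": (91, 0.350),
    "Relationship Linking": (89, 1.5),
    "Query Expansion": (87, 0.700),
    "Answer Generation": (85, 4.2),
}

TARGET_ACCURACY = 95  # v1.5 target, in percent


def gaps_to_target(results, target=TARGET_ACCURACY):
    """Return {signal: percentage points still needed} for signals below target."""
    return {
        name: target - accuracy
        for name, (accuracy, _latency) in results.items()
        if accuracy < target
    }


if __name__ == "__main__":
    for name, gap in gaps_to_target(V1_4_RESULTS).items():
        print(f"{name}: {gap} points below target")
```

On the v1.4 numbers, six of the seven signals fall below the target, with gaps ranging from 1 point (Memory Triage) to 10 points (Answer Generation). Only Intent Classification already meets it.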