-
-
Notifications
You must be signed in to change notification settings - Fork 0
Home
Zero-overhead, agent-ready AI search and cognitive memory β embedded in a single JVM.
Spector is a Java-native AI search engine and cognitive memory system that combines SIMD-accelerated vector search, keyword search (BM25), and biologically-inspired memory consolidation into a single embeddable library. No Docker, no external databases, no Python β just a JAR.
Connect AI agents via the built-in MCP server (Claude Desktop, Cursor, custom agents), embed directly in your Spring Boot app, or run standalone. Spector delivers sub-millisecond search at scale with zero garbage collection pressure thanks to Project Panama off-heap memory.
| Metric | Value |
|---|---|
| π§ Cognitive Recall | 0.13ms p50 at 1M memories |
| β‘ Vector Search | 88Β΅s p50 (10K docs, 128-dim) |
| π Peak QPS | 61,011 concurrent searches |
| π€ MCP Tools | 13 tools (6 search + 7 cognitive memory) |
| ποΈ Compression | 4Γβ32Γ (SVASQ-8 to IVF-PQ) |
| β Test Suite | 685+ tests, all passing |
| π¦ Dependencies | Zero (JDK only) |
| Page | What you'll learn |
|:-----|:------------------|
| [[Quick Start|Getting-Started--Quickstart]] | Build, run, and search in 5 minutes |
| [[MCP Server Guide|Sdk-Usage--Mcp-Server]] | Connect Claude Desktop, Cursor, or custom agents |
| [[Installation|Getting-Started--Installation]] | Prerequisites and setup options |
| [[Configuration|Configuration--Parameters]] | All parameters with tuning advice |
| [[REST API Reference|Api-Reference--Rest-Endpoints]] | All endpoints with curl examples |
| [[Cognitive Memory|Memory]] | Getting started with AI agent memory |
| [[Cortex Dashboard|Cortex]] | Real-time neural visualization dashboard |
| Page | What you'll learn |
|:-----|:------------------|
| [[Architecture Overview|Architecture--Overview]] | Module diagram, data flow, threading model |
| [[Core Concepts|Architecture--Core-Concepts]] | HNSW, IVF-PQ, BM25, RRF, SIMD deep-dives |
| [[Memory Architecture|Memory--Architecture]] | How cognitive memory works under the hood |
| [[6-Phase Scoring Pipeline|Memory--Scoring-Pipeline]] | Fused SIMD scoring across memory tiers |
| [[Cortex Dashboard|Cortex]] | Watch your AI's brain think β 12+ live panels |
| [[SVASQ Quantization|Deep-Dives--Svasq-Deep-Dive]] | Our proprietary SIMD-first quantization engine |
| [[Benchmarks|Deep-Dives--Real-Embedding-Benchmarks]] | Empirical sweeps on 4096-dim embeddings |
| Page | What you'll learn |
|:-----|:------------------|
| [[Contributing Guide|Operations--Contributing]] | Development setup and PR process |
| [[JDK API Status|Getting-Started--Jdk-Api-Status]] | Vector API, Panama FFM compatibility |
| [[Roadmap]] | What's planned next |
| [[FAQ]] | Common questions answered |
Spector combines three search modalities β semantic vectors, keyword matching, and cognitive scoring β into a single fused pipeline:
graph LR
A["π€ AI Agent"] --> B["π‘ MCP Server"]
B --> C["β‘ SpectorEngine"]
C --> D["π§ Hybrid Search"]
D --> E["π― RRF Fusion"]
E --> F["π€ LLM Re-ranking"]
F --> G["β¨ Results"]
H["π Document"] --> I["π§© Chunking"]
I --> J["𧬠Embedding"]
J --> C
- Embedded deployment β runs as a library inside your JVM. No Docker, no servers, no network hops.
- Agent-native β 13 MCP tools for search, memory, and cognitive operations. Connect Claude Desktop or Cursor in one config line.
- Cognitive memory β the only system combining power-law decay, Two-Factor strengthening (Bjork & Bjork), emotional valence, and Hebbian association in a single scoring formula.
- Zero GC pressure β all vector data and headers live off-heap via Project Panama. The JVM garbage collector never sees memory records.
- SIMD everywhere β vector distance, quantization, and scoring use Java Vector API (AVX2/AVX-512/NEON) for hardware-accelerated computation.
tip: New here? Start with Quick Start to build and run your first search in under 5 minutes. Want to connect an AI agent? See the MCP Server Guide.
| Language | Java 25 |
| License | Apache 2.0 Β· BSL 1.1 (memory module) |
| Modules | 25 Maven modules |
| Dependencies | Zero (JDK only) |
| SIMD | AVX2 / AVX-512 / NEON |
| GPU | CUDA via Panama FFM |
| MCP | Built-in, 13 agent-ready tools |
| Distributed | gRPC fan-out + consistent hashing |
Built with β‘ by Spectrayan Β· GitHub Β· Apache 2.0 Β· BSL 1.1 (memory)
- Home
- Getting Started
-
Cognitive Memory
- Overview
- Getting Started
- Use Cases & Configuration
- API Reference
- Architecture
- The 6-Phase Scoring Pipeline
- Cognitive Profiles
- Salience & Importance
-
Biological Systems
- Overview
- Cortex β Tier Stores
- Hippocampus β Sleep Consolidation
- Synapse β Tags & Scoring
- Dopamine β Surprise Detection
- Amygdala β Emotional Valence
- 3-Layer Cognitive Graph
- Habituation β Anti-Filter Bubble
- Inhibition β Suppression
- Interference β Deduplication
- Prospective β Future Intents
- Metamemory β Self-Reflection
- Sync β Persistence & Replication
- Performance & Internals
- Cognitive Evaluation
-
Architecture
- System Overview
- Core Concepts
- Ingestion Pipeline
- RAG Pipeline
- MCP Integration
- Distributed Mode
- Event Notifications
- Namespace Sharding
- GPU Acceleration
- Performance Tuning
- Test Framework & LLM Judge
- JDK API Status
- Security & Data
- Deep Dives
- Cortex Dashboard
-
Community
- Contributing
- FAQ
- Roadmap
- π¬ Labs