Premium Plugins

Extend Your Database with Premium Plugins

GPU acceleration, in-database machine learning, and adaptive caching. Add or remove at any time - no downtime, no migration.

GPU Acceleration

CUDA-based GPU acceleration for HNSW vector similarity search. Batch distance computation, brute-force KNN, graph construction, SQ8 quantized distance, priority scheduling, multi-query batching, and exact re-ranking - all in-process on database nodes with zero network hops.

100xMax speedup (brute-force KNN)
sm_60+CUDA compute capability
0msExtra network latency
64Concurrent queries per kernel launch

Supported Operations

Batch Distance

Compute distances between query vector and candidate set

10-50x speedup

Brute-Force KNN

Exact nearest neighbor search over full dataset

50-100x speedup

Graph Construction

Build HNSW graph layers in parallel

5-20x speedup

SQ8 Quantized

Distance computation on scalar-quantized vectors

20-80x speedup

Priority Scheduling

Multi-stream CUDA scheduler with Critical, Normal, and Background priority lanes and anti-starvation promotion

3 priority levels

Multi-Query Batching

Coalesces up to 64 concurrent search queries into a single GPU kernel launch, amortizing transfer overhead

64 queries/batch

GPU Re-Ranking

Exact float32 re-ranking of approximate search candidates with rank delta tracking for quality metrics

10x candidate pool

Tier Comparison

FeatureStandard
$799/mo
Pro
$1499/mo
Enterprise
$3999/mo
Batch Distance
Brute-Force KNN
Multi-Query Batching
Graph Construction-
SQ8 Quantized-
Priority Scheduling--
GPU Re-Ranking--
Infrastructure
GPU Memory (VRAM)2 GB4 GB8 GB
Max Concurrent Queries32128512

Feature Deep-Dives

ML Engine

Run machine learning directly inside your database — 13 algorithms including regression, clustering, classification, dimensionality reduction, anomaly detection, and ensemble methods. Pure Rust, zero external services.

13 Built-in Algorithms

LinearRegressionRegression
LogisticRegressionClassification
ElasticNetRegression
KMeansClustering
MiniBatchKMeansClustering
DBSCANClustering
PCADim. Reduction
TruncatedSVDDim. Reduction
GaussianNBClassification
MultinomialNBClassification
DecisionTreeClassification
IsolationForestAnomaly Detection
RandomForestEnsemble

Example Workflow

ml-predict.js
// Train a KMeans model on user behavior
db.ml.train({
  algorithm: "KMeans",
  source: "user_events",
  features: ["session_duration",
    "pages_viewed", "actions_count"],
  params: { k: 5 }
})

// Predict cluster for new data
db.ml.predict({
  model: "user_events_kmeans",
  input: {
    session_duration: 120,
    pages_viewed: 8,
    actions_count: 15
  }
})

Tier Comparison

FeatureStandard
$99/mo
Pro
$199/mo
Enterprise
$299/mo
AlgorithmsAll 13All 13All 13
Max Training Samples1M10MUnlimited
Max Features5,00010,000Unlimited
Max Iterations1,0001,00010,000
Model Persistence
Batch Prediction API--
Custom Iteration Limits----

Intelligent Cache

Production-grade 3-tier adaptive caching with ML-inspired eviction scoring (7 signals), automatic workload classification (5 classes), and predictive pre-warming. L1 memory, L2 SSD, L3 distributed — sub-millisecond latency, zero config.

< 1msL1 memory cache latency
3Cache tiers (L1 + L2 + L3)
7ML eviction scoring signals
5Auto-detected workload classes

L1 In-Memory

moka hash map

<1ms

latency

L2 SSD

LZ4 compressed

1-5ms

latency

L3 Distributed

TCP mesh

5-20ms

latency

ML-Inspired Eviction Scoring - 7 Signals

A production-grade eviction system that replaces simple LRU/LFU with a 7-signal composite scoring function. Higher scores mean the entry stays longer.

Recency20%

Exponential decay based on time since last access

Frequency25%

Log-scaled access count normalized across entry ages

Cost20%

Recompute cost - cold tier queries score higher than L1 hits

Predicted Value25%

Future access probability from the Access Pattern Predictor

Size Efficiency10%

Inverse size - smaller entries are more space-efficient

Age Decay0%

Optional - prevents zombie entries cached once and never evicted

Settling Boost2x

New entries get a score boost during settling period (30s)

Cache Types

Documents

Individual document lookups by ID

Cold Partitions

Frequently accessed cold-tier data promoted to cache

Query Results

Cached results for repeated aggregate/filter queries

Aggregations

Pre-computed aggregation pipeline results

Metadata

Collection schemas, index definitions, and config - tiny but accessed on every query

Tier Comparison

FeatureStandard
$299/mo
Pro
$599/mo
Enterprise
$999/mo
L1 In-Memory Cache
L2 SSD Cache
L3 Distributed Cache----
Eviction StrategyLRU + FrequencyML Scoring (7 signals)ML Scoring (7 signals)
Workload Classification--5 classes5 classes
Predictive Pre-Warming----
Admission ControlSize-basedSize + ValueSize + Value + Workload
Cache TypesDocs + QueryAll 5 typesAll 5 types
Cross-Node Coordination----
Infrastructure
L1 Memory (max entries)100K500K1M
L2 SSD Storage10 GB50 GB200 GB
L3 Distributed----Unlimited

Feature Deep-Dives

Plugin Pricing at a Glance

Mix and match plugins across your clusters. Add or remove at any time with no downtime.

PluginTier 1Tier 2Tier 3
GPU Acceleration$799/mo
Standard
$1499/mo
Pro
$3999/mo
Enterprise
ML Engine$99/mo
Standard
$199/mo
Pro
$299/mo
Enterprise
Intelligent Cache$299/mo
Standard
$599/mo
Pro
$999/mo
Enterprise

Build AI Applications on a Database That Gets It

One database for your vectors, your context, your agent memory, and your application data. Start free. Scale without limits.

We use cookies to improve your experience and analyze site usage. Read our Privacy Policy for more information.