NeurondB

PostgreSQL AI Vector Extension

NeurondB is a PostgreSQL extension that builds GPU-accelerated vector search, model inference, hybrid retrieval, and RAG orchestration into the database. Use this documentation to deploy NeurondB, operate its background workers, and embed ML pipelines in SQL.

Key Capabilities

Vector Search

HNSW, IVF, product quantization, and custom distance metrics for billion-scale similarity search.

ML Inference

ONNX runtime integration, GPU offload, and batch execution for deep learning workloads in SQL.

Hybrid Retrieval

Blend keyword, metadata, and vector signals to deliver highly relevant multimodal results.

RAG Pipelines

In-database retrieval augmented generation with prompt templates, metadata policies, and observability.

Documentation Library

Getting Started

Install NeurondB on PostgreSQL 16–18, verify GPU support, and apply baseline configuration.

Installation
Build from source or install packages.
Quick Start
Load sample data and run first vector searches.
Configuration
GUC parameters for CPU/GPU execution paths.

Core Features

Learn how NeurondB models vectors, maintains indexes, and tunes recall versus latency.

Vector Types
Supported dimensionality and storage formats.
Distance Metrics
Cosine, L2 (Euclidean), inner product (dot product), and hybrid scoring.
Indexing
HNSW, IVF, PQ, and adaptive index selection.
Quantization
Reduce memory footprint with scalar and vector quantization.
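
For readers new to the distance metrics listed above, the following is a minimal Python sketch of how cosine distance, L2 distance, and inner product are conventionally defined. It is illustrative only: the function names are hypothetical, and it does not show NeurondB's SQL operators or its internal implementation.

```python
import math


def dot(a, b):
    """Inner (dot) product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def l2_distance(a, b):
    """Euclidean (L2) distance: square root of the summed squared differences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def cosine_distance(a, b):
    """1 minus cosine similarity; assumes neither vector is all zeros."""
    norm_a = math.sqrt(dot(a, a))
    norm_b = math.sqrt(dot(b, b))
    return 1.0 - dot(a, b) / (norm_a * norm_b)


# Hypothetical 2-dimensional vectors, chosen only for illustration.
query = [1.0, 0.0]
doc = [0.0, 1.0]

print(l2_distance(query, doc))
print(cosine_distance(query, doc))
print(dot(query, doc))
```

Cosine distance ignores vector magnitude, while L2 distance and inner product do not, which is why the choice of metric usually follows from how the embeddings were trained and whether they are normalized.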

ML & Embeddings

Generate, store, and serve embeddings alongside model lifecycle management.

Embeddings
Transform text, audio, and images into dense vectors.
Inference
Deploy ONNX models with GPU batching and caching.
Model Management
Version control, approvals, and rollback workflows.

Hybrid Search & Reranking

Combine text search, BM25, and neural rerankers for production retrieval pipelines.

Hybrid Overview
Architectures for multi-signal retrieval.
Reranking
Cross-encoder and LLM reranking playbooks.
RAG Workflows
Orchestrate retrieval augmented generation end to end.
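
As a rough illustration of multi-signal retrieval, the sketch below merges a keyword ranking and a vector ranking with reciprocal rank fusion (RRF), a widely used general-purpose fusion technique. Whether NeurondB uses RRF is not stated in this documentation; the function name, the document IDs, and the constant `k=60` are assumptions made for the example.

```python
def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs into one list.

    Each document receives 1 / (k + rank) from every list it appears in
    (rank starts at 1); documents are returned by descending total score,
    with ties broken by document ID for determinism.
    """
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical result lists, best match first.
keyword_hits = ["a", "b", "c"]  # e.g. from a BM25 or full-text query
vector_hits = ["b", "c", "d"]   # e.g. from a nearest-neighbor query

fused = reciprocal_rank_fusion([keyword_hits, vector_hits])
print(fused)  # ['b', 'c', 'a', 'd']
```

Because RRF works on ranks rather than raw scores, it avoids having to normalize keyword scores and vector distances onto a common scale. A reranker can then be applied to the top of the fused list.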

Background Workers

Operational guidance for queue execution, auto-tuning, and index maintenance workers.

Worker Overview
Understand worker architecture and lifecycles.
Queue Worker (neuranq)
Batch ingestion and asynchronous scoring.
Auto-Tuner (neuranmon)
Automated index health and GPU utilization tuning.
Index Maintenance (neurandefrag)
Defragment and rebalance vector indexes online.

API Reference

Browse SQL functions, operators, and data types exported by NeurondB.

SQL Functions
Query, indexing, and analytics procedures.
Data Types
Custom vector, tensor, and metadata types.
Operators
Similarity, distance, and hybrid scoring operators.

Quick Links

Getting Started Guide

Bootstrap NeurondB on PostgreSQL 16–18 with CPU and GPU execution paths.

Installation Reference

Source builds, package installations, upgrades, and validation scripts.

GitHub Repository

Source code, issues, and contribution guide for NeurondB.
