ProductEARLY ACCESS

Onyx Forge | AI Product Factory

Build, train, evaluate, and deploy vertical AI products. From domain expertise to production-ready deployment packages — with cryptographic receipts at every stage. Runs locally on your hardware.

127.0.0.1:8765 — Onyx Forge
Onyx Forge Dashboard — Data Ingest view showing document processing pipeline, extractors, and vector store configuration

Data Ingest view — Forge running locally on Mac Studio, parsing documents into RAG-ready embeddings

Dashboard

Full lifecycle, one dashboard

Eleven modules covering every stage of AI product development — from data ingestion to production monitoring. Each module produces verifiable artifacts.

OverviewProjectsData IngestData StudioModel HubTrainingEvaluationPackagingLineageMonitoringSettings
Pipeline

From expertise to deployment

Six stages. Each produces verifiable artifacts. Every artifact is cryptographically signed. No black boxes — just auditable, reproducible AI product engineering.

Data Ingest

01

Parse, chunk, embed, and store documents into the vector database. Supports PDF, DOCX, HTML, JSON, CSV, TXT, and MD. Multiple extraction backends — Fast (text-only), Docling (multimodal), and NV-Ingest (NVIDIA NIM). Outputs RAG-ready vector embeddings.

Data Studio

02

Clean, deduplicate, augment, and format your training data. Built-in quality checks, bias detection, and dataset versioning ensure your models are trained on reliable, representative data.

Model Hub

03

Choose from a curated catalog of base models — Llama, Mistral, Qwen, DeepSeek, Nemotron — or bring your own. Automated benchmarking helps you select the right architecture for your domain.

Training

04

Fine-tune using NVIDIA NeMo, Unsloth, or MLX — whichever framework is optimal for your use case. PEFT, LoRA, QLoRA, full fine-tuning — Forge selects the right approach for your data and compute budget.

Evaluation

05

Seven benchmark dimensions: accuracy, latency, safety, bias, regulatory alignment, robustness, and domain coverage. Automated evaluation suites produce quantitative scores and detailed failure analysis.

Packaging

06

Production-ready deployment packages: NIM containers, GGUF quantized models, or Python wheels. Each package includes model cards, deployment guides, and cryptographic integrity receipts.

Document Extractors

Multi-backend extraction

Choose the extraction backend that fits your format, quality, and licensing requirements. From fast text-only to NVIDIA NIM-powered multimodal extraction.

BackendLicenseFormatsStatus
Fast (text-only)Apache 2.0PDF, DOCX, TXT, MD, HTMLready
Docling (multimodal)Apache 2.0PDF, DOCX, PPTX, Imagesavailable
NV-Ingest (NVIDIA NIM)NVIDIA AI EnterprisePDF, DOCX, Images, Tableslicense required
NVIDIA Stack

Built on the full NVIDIA AI platform

Every component of Forge runs on NVIDIA infrastructure — from training frameworks to inference engines to deployment containers.

NeMo

Model customization framework

NIM

Optimized inference microservices

TensorRT-LLM

LLM inference acceleration

Megatron-Core

Large-scale training

Milvus

Vector database

NV-EmbedQA

Embedding models

RTX 5090 + 5080 · CUDA 12.8 · DGX-class training infrastructure
Deployment

Three deployment targets

Ship your vertical AI product the way your customers need it — as optimized containers, local models, or embedded libraries.

NIM Container

NVIDIA-optimized inference container with TensorRT-LLM acceleration. Enterprise-ready with built-in security, monitoring, and API compatibility. Deploy on DGX, cloud GPU instances, or your own hardware.

GGUF (Quantized)

Local-first deployment with quantized model formats. Run on consumer hardware, edge devices, or air-gapped environments. Multiple quantization levels for precision-vs-performance tradeoffs.

Python Wheel

Lightweight distribution for embedding into existing Python applications. Import your model as a library — no container orchestration required. Ideal for CI/CD and automated workflows.

Cryptographic Intelligence

Every artifact, verifiably yours

Forge doesn't just build models — it builds a chain of cryptographic evidence that proves what was built, how it was trained, and how it performed. Regulators and auditors get the receipts they need.

01

Ed25519 Receipts

Every model version, every training run, every evaluation result is signed with Ed25519. Cryptographic proof that your model is exactly what was trained and tested.

02

Merkle-Batched Audit Trails

Immutable, tamper-evident logs of the entire build pipeline. Merkle tree batching provides efficient verification while maintaining cryptographic integrity across millions of pipeline events.

Bring your expertise. We'll build the product.

Forge is in early access. If you have deep domain expertise and want to build a vertical AI product, let's talk.

Request early access