Building Production-Ready AI Features - AI Factory 201 v1.3
Who this is for
- Developers integrating Gen AI features into production applications
- Data teams building pipelines for AI-driven Knowledge Bases
- AI and platform teams configuring model serving and observability
- Hybrid Manager users enabling Sovereign AI workloads at scale
General Goals
By completing this path, you will:
- Build multi-step AI Assistants and advanced Structures
- Design Hybrid Knowledge Bases and multi-source RAG pipelines
- Deploy and manage GPU-powered Model Serving
- Implement observability and monitoring for AI-driven features
- Learn patterns for production-grade governance and performance tuning
Modules by Focus Area
1. Advanced Assistant & Structure Design
Goals:
- Implement Assistants with complex personas and memory
- Create advanced multi-step Structures
- Integrate Tools and external data flows
Estimated Time: ~30–45 min
Modules:
- Assistants — Design assistants with personas, memory, and tool usage patterns.
- Structures — Learn how to chain retrieval and actions into reliable flows.
- Create an Assistant — Build a production‑ready assistant with tools and KBs.
- Create a Structure — Implement multi‑step workflows with validations and branching.
- Create a Tool — Wrap internal services and APIs as safe, governed tools.
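The Structure pattern above — chaining steps with validations and branching — can be sketched in plain Python. The class and function names here (`Step`, `Structure`, `retrieve`, `summarize`) are illustrative placeholders, not the Gen AI Builder API:

```python
from dataclasses import dataclass, field
from typing import Callable

@dataclass
class Step:
    """One step in a flow: a run function plus an optional output validator."""
    name: str
    run: Callable[[dict], dict]
    validate: Callable[[dict], bool] = lambda ctx: True

@dataclass
class Structure:
    """Executes steps in order; a failed validation branches to an error exit."""
    steps: list[Step] = field(default_factory=list)

    def execute(self, context: dict) -> dict:
        for step in self.steps:
            context = step.run(context)
            if not step.validate(context):
                context["error"] = f"validation failed at step '{step.name}'"
                break
        return context

# Example flow: retrieve -> summarize, guarded by "retrieval found something".
def retrieve(ctx: dict) -> dict:
    ctx["passages"] = [p for p in ctx["corpus"] if ctx["query"] in p]
    return ctx

def summarize(ctx: dict) -> dict:
    ctx["answer"] = " / ".join(ctx["passages"])
    return ctx

flow = Structure(steps=[
    Step("retrieve", retrieve, validate=lambda ctx: bool(ctx["passages"])),
    Step("summarize", summarize),
])

result = flow.execute({"query": "GPU", "corpus": ["GPU sizing notes", "billing FAQ"]})
```

The validator hook is what makes multi-step flows reliable: a bad intermediate result stops the chain with a structured error instead of propagating into later steps.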
2. Data Engineering & Hybrid Knowledge Bases
Goals:
- Design and manage Hybrid Knowledge Bases
- Tune multi-source RAG pipelines
- Implement metadata filtering and hybrid search
Estimated Time: ~30–45 min
Modules:
- Hybrid Knowledge Base Best Practices — Patterns to balance freshness, coverage, and cost.
- Manage Knowledge Bases — Operate KBs: re‑embedding, source lifecycle, and access control.
- Create a Retriever — Tune search behavior, filters, and ranking at scale.
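Hybrid search with metadata filtering, as covered in this focus area, can be sketched as a blend of a vector-similarity score and a keyword-overlap score applied after a metadata filter. The weighting (`alpha`), document shape, and field names below are illustrative assumptions, not a specific retriever implementation:

```python
import math

def cosine(a, b):
    """Cosine similarity between two toy embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb) if na and nb else 0.0

def keyword_score(query, text):
    """Fraction of query terms that appear in the document text."""
    q = set(query.lower().split())
    t = set(text.lower().split())
    return len(q & t) / len(q) if q else 0.0

def hybrid_search(query, query_vec, docs, source=None, alpha=0.6):
    # Metadata filter first, then blend vector and keyword scores.
    candidates = [d for d in docs if source is None or d["source"] == source]
    scored = [
        (alpha * cosine(query_vec, d["vec"])
         + (1 - alpha) * keyword_score(query, d["text"]), d)
        for d in candidates
    ]
    return [d for _, d in sorted(scored, key=lambda s: s[0], reverse=True)]

docs = [
    {"text": "GPU sizing for model serving", "vec": [0.9, 0.1], "source": "wiki"},
    {"text": "holiday schedule", "vec": [0.1, 0.9], "source": "hr"},
    {"text": "model serving runbook", "vec": [0.8, 0.2], "source": "wiki"},
]
top = hybrid_search("model serving", [1.0, 0.0], docs, source="wiki")
```

Filtering on metadata before scoring is the key cost lever: it shrinks the candidate set so the more expensive similarity computation only runs over documents that can actually match.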
3. Model Serving with KServe
Goals:
- Deploy GPU-powered models
- Tune runtime and resource settings
- Understand the Model Serving lifecycle
Estimated Time: ~45–60 min
Modules:
- Model Serving Concepts — Understand how models are deployed and served in Hybrid Manager.
- Configure ServingRuntime — Define runtimes to host families of models with shared settings.
- Deploy a NIM Container — Use NVIDIA NIM images for high‑performance inference.
- Update GPU Resources — Right‑size GPU/CPU for performance and cost.
- Verify Model Deployments — Validate endpoints and GPU usage in Hybrid Manager.
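A KServe deployment of the kind these modules cover is typically expressed as an InferenceService manifest that references a ServingRuntime and requests GPU resources. This is a minimal sketch: the names (`example-llm`, `example-serving-runtime`), model format, and sizing values are placeholders — consult the Hybrid Manager documentation for the exact fields your runtime expects.

```yaml
apiVersion: serving.kserve.io/v1beta1
kind: InferenceService
metadata:
  name: example-llm
spec:
  predictor:
    model:
      modelFormat:
        name: huggingface        # placeholder; depends on your ServingRuntime
      runtime: example-serving-runtime
      resources:
        requests:
          cpu: "4"
          memory: 16Gi
        limits:
          nvidia.com/gpu: "1"    # right-size per the Update GPU Resources module
```

Separating the ServingRuntime (shared runtime settings for a family of models) from the InferenceService (one deployed model) is what lets you tune GPU/CPU sizing per model without duplicating runtime configuration.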
4. Observability & Monitoring
Goals:
- Implement observability for AI pipelines and Model Serving
- Monitor performance and resource usage
- Enable production readiness checks
Estimated Time: ~20–30 min
Modules:
- Observability for Model Serving — Metrics and logs for models and pipelines.
- Monitor InferenceService — Track health and performance of serving endpoints.
- Hybrid Manager Observability — Platform‑level dashboards and alerts.
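A production readiness check of the kind this focus area describes can be sketched as a small function that evaluates a window of inference records against SLO thresholds. The record shape, thresholds, and function name are illustrative, not a Hybrid Manager API:

```python
import math

def slo_check(records, p95_budget_ms=500.0, error_budget=0.01):
    """Return p95 latency, error rate, and an overall health verdict
    for a window of inference records (nearest-rank percentile)."""
    latencies = sorted(r["latency_ms"] for r in records)
    idx = max(0, math.ceil(0.95 * len(latencies)) - 1)
    p95 = latencies[idx]
    error_rate = sum(1 for r in records if r["status"] >= 500) / len(records)
    return {
        "p95_ms": p95,
        "error_rate": error_rate,
        "healthy": p95 <= p95_budget_ms and error_rate <= error_budget,
    }

# 99 fast successes plus one slow server error.
records = [{"latency_ms": 100 + i, "status": 200} for i in range(99)]
records.append({"latency_ms": 2000, "status": 500})
report = slo_check(records)
```

Percentile latency rather than the mean is the metric that matters here: a single slow outlier barely moves the average but is exactly what an endpoint health check needs to surface.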
Next steps
After completing this 201 Path:
- Continue to AI Factory 301 Path — advanced patterns for scaling AI apps, multi-agent orchestration, embedding pipelines, and advanced governance.
Related learning resources
- AI Factory Concepts
- Hybrid Manager: Using Gen AI Builder
- Sovereign AI Explained
- Model Serving Concepts
- Structures
By mastering this 201 Path, you’ll be ready to deploy and scale Sovereign AI applications — with full control over your models, pipelines, and production observability — using EDB PG AI.