GPU Recommendations for Default NIM Models v1.3
Overview
From Hybrid Manager, there are two primary consumers of AI models:
- PG.AI Knowledge Base (AIDB Postgres extension) for creating and maintaining AI Knowledge Bases.
- PG.AI GenAI Builder (containerized Griptape) for building agentic AI assistants.
Default NIM Models
Model type | NIM model | NVIDIA NIM documented resource requirements
---|---|---
Text completion | llama-3.3-70b-instruct | 4 × L40S
Text embeddings | arctic-embed-l | 1 × L40S
Image embeddings | nvclip | 1 × L40S
OCR | paddleocr | 1 × L40S
Text reranking | llama-3.2-nv-rerankqa-1b-v2 | 1 × L40S
Minimum GPU Requirement
Based on the default models above, running all five concurrently requires a minimum of 8 × L40S GPUs (4 + 1 + 1 + 1 + 1).
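As a sanity check, the minimum can be derived by summing the per-model GPU counts from the table above (a minimal sketch; the model names and counts are taken directly from that table):

```python
# Per-model L40S GPU requirements, from the default NIM models table above.
default_models = {
    "llama-3.3-70b-instruct": 4,        # text completion
    "arctic-embed-l": 1,                # text embeddings
    "nvclip": 1,                        # image embeddings
    "paddleocr": 1,                     # OCR
    "llama-3.2-nv-rerankqa-1b-v2": 1,   # text reranking
}

# Running all defaults concurrently requires the sum of their GPU counts.
total_gpus = sum(default_models.values())
print(total_gpus)  # 8
```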
Cloud Mappings
- AWS EKS: recommend a node group with 2 × `g6e.12xlarge` nodes.
- GCP GKE: recommend a node pool with 2 × `a2-highgpu-4g` nodes.
Note: GCP does not offer L40S GPUs. The recommended A2 nodes with A100 GPUs are supported and documented for the NIM models listed above.
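The two-node recommendation follows from the number of GPUs attached to each instance type (a sketch; the GPU-per-node figures are assumptions based on the AWS and GCP instance documentation, which lists 4 × L40S per g6e.12xlarge and 4 × A100 per a2-highgpu-4g):

```python
import math

MIN_GPUS = 8  # minimum derived from the default NIM models above

# GPUs attached to each recommended instance type (per cloud provider docs).
gpus_per_node = {
    "g6e.12xlarge": 4,    # AWS: 4 x L40S
    "a2-highgpu-4g": 4,   # GCP: 4 x A100
}

# Nodes needed = ceiling of (required GPUs / GPUs per node).
for instance, gpus in gpus_per_node.items():
    nodes = math.ceil(MIN_GPUS / gpus)
    print(f"{instance}: {nodes} nodes")  # 2 nodes for each instance type
```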