On-Premises RAG LLM Hosting with NVIDIA AI Blueprint
We invite you to a hands-on lunch session showcasing a fully operational, enterprise-level RAG LLM deployment running on your own hardware. This live demonstration will feature real document processing, interactive query handling, and comprehensive visibility into the system architecture and performance monitoring. Please feel free to share this opportunity with colleagues interested in private LLM hosting and implementation strategies (we can set this up for your team within half a day, as long as you have the GPUs).
Interactive demonstration with lunch provided! Researchers are also welcome to join! We look forward to your feedback.
Please register promptly at https://forms.office.com/e/ZdUMiUJHGZ?origin=lprLink so we can prepare adequate catering. To avoid waste, please let us know if you need to cancel your registration.
Questions? Contact supercomputing@tue.nl
🎯 Key Propositions
- Complete Solution Out-of-the-Box: 15+ pre-integrated enterprise components, including document processing, vector databases, embedding models, and multimodal AI capabilities
- Superior Performance: 2x improved throughput (1,201 vs 613 tokens/sec), 15x faster multimodal data extraction, and 50% fewer incorrect answers compared to traditional implementations
- Multimodal Intelligence: Advanced capabilities to understand and process text, tables, charts, images, and audio files from enterprise documents
- On-Premises Control: Full data sovereignty with enterprise-grade security, compliance, and customization while leveraging NVIDIA's optimized AI stack
- Low Cost, Up-to-Date: requires only hardware spending, and ongoing maintenance effort is low
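For attendees new to RAG, the core retrieve-then-generate loop can be sketched in a few lines. This is a toy illustration using a stdlib-only bag-of-words retriever and hard-coded documents; the actual Blueprint uses GPU-accelerated embedding models, a vector database, and an on-premises LLM endpoint.

```python
from collections import Counter
import math

# Toy document store; the real deployment ingests enterprise documents.
docs = [
    "GPUs accelerate embedding and generation workloads",
    "Vector databases store document embeddings for retrieval",
    "RAG grounds LLM answers in retrieved enterprise documents",
]

def embed(text):
    # Stand-in for a neural embedding model: bag-of-words token counts.
    return Counter(text.lower().split())

def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query, k=1):
    # Rank documents by similarity to the query and keep the top k.
    q = embed(query)
    return sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)[:k]

def answer(query):
    # In production, this augmented prompt is sent to the hosted LLM;
    # here we simply show the prompt that RAG constructs.
    context = "\n".join(retrieve(query))
    return f"Context:\n{context}\n\nQuestion: {query}"

print(answer("How does RAG ground answers?"))
```

The point of the sketch: answers are grounded in retrieved documents rather than the model's parametric memory, which is what enables the accuracy and data-sovereignty claims above.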
Schedule
- Thu 18 Sep: On-Premises RAG LLM Hosting with NVIDIA AI Blueprint, 12:00 - 13:30, location TBD