InferX — Serverless GPU Inference Platform for Production Workloads

Tenant Namespace Model Name Type GPU Count vRam (GB) CPU Memory (GB) Standby State Snapshot Nodes Revision Actions
GPU Pageable Pinned
public Qwen IntelliAsk-Qwen3-32B-450-Merged text2text 2 58.0 12.0 80.0 Mem File File Normal ['computeinstance-e00r2jrqynf83a8b4f'] 76 Open