InferX — Serverless GPU Inference Platform for Production Workloads

Tenant Namespace Model Name Type GPU Count vRam (GB) CPU Memory (GB) Standby State Snapshot Nodes Revision Actions
GPU Pageable Pinned
public ActionAnalytics CR-70B text2text 4 71.0 20.0 80.0 Mem File File Normal ['computeinstance-e00r2jrqynf83a8b4f'] 54 Open