Eight cards, one chassis
Eight NVIDIA RTX PRO 6000 Blackwell Server Edition GPUs in a single 4U rackmount — 768 GB of GDDR7 across PCIe Gen5, enough headroom to serve 70B-class models to 100–500 concurrent users without leaving the rack.
8× NVIDIA RTX PRO 6000 Blackwell Server Edition (96 GB GDDR7 each) in a 4U PCIe rackmount reference platform. Available through EMARQUE.

Manufacturer-defined features from the published datasheet.
Giga Computing G493-class reference platform — the same hardware that ships into hyperscaler labs — supplied through EMARQUE with local commissioning, warranty handling, and Tier-1 support in Malaysia.
MIG-style partitioning per card lets you carve the server into isolated inference workloads — separate teams, separate models, separate quotas on shared hardware. No noisy-neighbour fights, full observability via the BMC.
Up to 60 TB U.2 NVMe on PCIe Gen5 keeps the eight GPUs fed; dual 25 GbE on-board with optional 100 GbE or InfiniBand for scale-out clusters. No PCIe contention, no storage starvation under sustained load.
Validated for 200–240 V AC on dual hot-swap 2 kW Titanium PSUs, redundant cooling, ASPEED BMC for remote management — runs reliably in Malaysian DC environments without bespoke power conditioning.
GPU count (2 / 4 / 8), memory (256 GB – 2 TB), storage (8 – 60 TB), networking (25 / 100 GbE / InfiniBand), and CPU choice (AMD EPYC or Intel Xeon) — all selectable at quote without changing the chassis or rebuilding the BOM.
The four sub-systems that determine real-workload behaviour. We tune each before delivery.
Tell us your workload. EMARQUE sizes the AI Server and sends a quote.
Workload categories documented in the manufacturer's reference materials. Sizing is confirmed with your technical team during scoping.
Run a private RAG stack with a 70B-class model against your document store, code repos, and ticketing system. Splitting the eight GPUs four ways (two cards per endpoint) gives four logical inference endpoints, each sized for 100+ concurrent users: the right shape for finance, legal, engineering, and ops to share one server.
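The four-way split can be sanity-checked with simple arithmetic. The sketch below assumes 16-bit weights (2 bytes per parameter) and counts weights only, ignoring KV cache, activations, and runtime overhead, so it gives a lower bound rather than a sizing guarantee. The function names are illustrative and not part of any vendor tool.

```python
GPU_COUNT = 8        # cards in the chassis (from the spec above)
GPU_MEMORY_GB = 96   # GDDR7 per card (from the spec above)


def weight_memory_gb(params_billion: float, bytes_per_param: float) -> float:
    """Approximate memory for model weights only, in decimal GB."""
    return params_billion * bytes_per_param


def split_gpus(gpu_count: int, endpoints: int) -> list[list[int]]:
    """Partition GPU indices 0..gpu_count-1 into equal contiguous groups."""
    if endpoints <= 0 or gpu_count % endpoints != 0:
        raise ValueError("gpu_count must divide evenly across endpoints")
    per_endpoint = gpu_count // endpoints
    return [
        list(range(i * per_endpoint, (i + 1) * per_endpoint))
        for i in range(endpoints)
    ]


groups = split_gpus(GPU_COUNT, 4)      # two cards per endpoint
weights_gb = weight_memory_gb(70, 2)   # assumption: 70B parameters, 16-bit
headroom_gb = len(groups[0]) * GPU_MEMORY_GB - weights_gb
```

Under these assumptions each two-card endpoint holds 140 GB of weights and leaves 52 GB for KV cache and runtime. Real concurrency depends on context length, quantisation, and the serving stack, which is why sizing is confirmed during scoping.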
TGI / vLLM / Triton serving multiple fine-tuned variants of a base model. PCIe Gen5 isolation per card means no NVLink coherence overhead — the right architecture when each request fits on one GPU and you want predictable per-tenant throughput rather than tightly coupled training.
Eight independent cards parallelise cleanly across camera or audio streams: object detection on fifty 4K streams, speech-to-text on 200 concurrent calls, or pose estimation across a factory floor. Each stream gets a dedicated GPU slice with consistent latency.
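One simple way to spread streams over the cards is round-robin assignment. The sketch below is a hypothetical illustration, not the scheduler of any particular video or speech framework, and it assumes every stream costs roughly the same.

```python
def assign_streams(stream_count: int, gpu_count: int) -> dict[int, list[int]]:
    """Map each GPU index to the stream IDs it serves, round-robin."""
    if gpu_count <= 0:
        raise ValueError("gpu_count must be positive")
    assignment: dict[int, list[int]] = {gpu: [] for gpu in range(gpu_count)}
    for stream_id in range(stream_count):
        assignment[stream_id % gpu_count].append(stream_id)
    return assignment


# Fifty camera streams over eight cards, as in the example above.
plan = assign_streams(50, 8)
streams_per_gpu = [len(plan[gpu]) for gpu in range(8)]
```

With 50 streams on 8 cards, two cards carry seven streams and the other six carry six each. If streams differ in resolution or frame rate, a weighted assignment would be a better fit than plain round-robin.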
Run production inference on six cards during business hours, then reallocate all eight to LoRA / QLoRA fine-tuning runs overnight. The BMC and its Redfish API make the rebalance scriptable; no separate dev cluster is required.
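The day/night pattern reduces to a small scheduling rule. The sketch below only decides which cards go to which role at a given hour. The business-hours window (08:00 to 20:00) and the `Allocation` type are assumptions made for illustration, and the step that actually drains and restarts workloads is left out.

```python
from dataclasses import dataclass

TOTAL_GPUS = 8
BUSINESS_START_HOUR = 8   # assumption: local business hours start at 08:00
BUSINESS_END_HOUR = 20    # assumption: and end at 20:00


@dataclass(frozen=True)
class Allocation:
    inference: tuple[int, ...]
    fine_tuning: tuple[int, ...]
    unassigned: tuple[int, ...]


def allocation_for_hour(hour: int) -> Allocation:
    """Return the GPU role split for a given local hour (0-23)."""
    if not 0 <= hour <= 23:
        raise ValueError("hour must be in 0..23")
    all_gpus = tuple(range(TOTAL_GPUS))
    if BUSINESS_START_HOUR <= hour < BUSINESS_END_HOUR:
        # Daytime: six cards serve inference; the rest stay unassigned here.
        return Allocation(all_gpus[:6], (), all_gpus[6:])
    # Overnight: every card is handed to fine-tuning.
    return Allocation((), all_gpus, ())
```

A cron job or orchestrator could call `allocation_for_hour` and then apply the result with whatever tooling the site already uses. How the two daytime spare cards are used is a site decision.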
Configurable. Final BOM, GPU mix, RAM and storage, and networking topology are confirmed in writing at quotation.
Server-optimised variant of the RTX PRO 6000 Blackwell: same Blackwell silicon and 96 GB GDDR7 memory, with a passive heatsink (300 W TDP) that relies on server chassis airflow. Targets enterprise rackmount deployments where the active 600 W variant would be impractical.
EMARQUE supplies the AI Server on the Giga Computing G493-class 4U PCIe reference platform validated for the RTX PRO 6000 SE thermal envelope. Equivalent reference designs from Supermicro or ASUS are available on request. Final BOM is documented at quotation.
Yes. GPU count (2 / 4 / 8 cards), memory (256 GB – 2 TB), storage (8 – 60 TB NVMe), networking (25 GbE / 100 GbE / InfiniBand), and OS are all configurable. CPU choice (AMD EPYC vs Intel Xeon) is customer-selectable within the manufacturer's published configuration matrix.
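The option ranges above can be expressed as a small validation check before a configuration is sent for quotation. This is a hypothetical sketch: the `ServerConfig` fields and option strings are illustrative, 2 TB is treated as 2048 GB, OS choice is left out, and the manufacturer's configuration matrix remains the authority on valid combinations.

```python
from dataclasses import dataclass

GPU_COUNTS = {2, 4, 8}
NETWORKS = {"25GbE", "100GbE", "InfiniBand"}
CPUS = {"AMD EPYC", "Intel Xeon"}
MEMORY_GB_RANGE = (256, 2048)   # assumption: 2 TB taken as 2048 GB
STORAGE_TB_RANGE = (8, 60)


@dataclass
class ServerConfig:
    gpus: int
    memory_gb: int
    storage_tb: int
    network: str
    cpu: str


def validate(cfg: ServerConfig) -> list[str]:
    """Return a list of problems; an empty list means the config is in range."""
    errors = []
    if cfg.gpus not in GPU_COUNTS:
        errors.append(f"gpus must be one of {sorted(GPU_COUNTS)}")
    if not MEMORY_GB_RANGE[0] <= cfg.memory_gb <= MEMORY_GB_RANGE[1]:
        errors.append("memory_gb must be between 256 and 2048")
    if not STORAGE_TB_RANGE[0] <= cfg.storage_tb <= STORAGE_TB_RANGE[1]:
        errors.append("storage_tb must be between 8 and 60")
    if cfg.network not in NETWORKS:
        errors.append(f"network must be one of {sorted(NETWORKS)}")
    if cfg.cpu not in CPUS:
        errors.append(f"cpu must be one of {sorted(CPUS)}")
    return errors
```

For example, `validate(ServerConfig(8, 1024, 60, "100GbE", "AMD EPYC"))` returns an empty list, while a three-GPU, 128 GB request reports two problems.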
Manufacturer warranty applies — Giga Computing / OEM warranty on the chassis and components; NVIDIA's warranty entitlement on the RTX PRO 6000 SE GPUs. EMARQUE handles local warranty claim processing through the manufacturer's authorised channel.
EMARQUE handles in-country delivery, customs, commissioning, acceptance testing, and Tier-1 support response in Malaysia. Tier-2/3 escalation routes to the manufacturer per the warranty entitlement. Optional service contracts for extended response SLAs and on-site engineering visits can be added.
Step into NVIDIA DGX B200 or NVIDIA DGX B300 (or HGX B200 / B300 OEM platforms from Dell, Giga Computing, or Supermicro). Those configurations are presented on the respective NVIDIA DGX product pages — each shows both NVIDIA-branded DGX systems and HGX OEM alternatives.
Manufacturer specifications and warranty terms apply. EMARQUE issues a formal quotation through your Key Account Manager.
Model size, concurrency, latency budget, deployment site. EMARQUE returns a quote in MYR within one Malaysian business day, sized to the workload — not the salesperson’s quota.