Home News

company news about Nvidia Boasts 7 Chips in Production for Vera Rubin Platform, Including Groq 3 LPU

All Products

Rack Storage Server
(179)

Huawei Fusion Server
(31)

Dell Poweredge Server
(59)

H3C Server
(31)

Datacom Switches
(96)

WLAN Device
(21)

Smart Wireless Router
(17)

Hard Drive HDD
(78)

Internal Hard Drive SSD
(16)

Geforce Graphic Card
(27)

INTEL CPU Processor
(20)

Server Memory RAM
(6)

Refurbished Storage Server
(6)

SFP Transceiver Module
(4)

Fibre Channel Switch
(125)

Certification

Customer Reviews

The sales staff of Beijing Qianxing Jietong Technology Co.,Ltd are very professional and patient. They can provide quotations quickly. The quality and packaging of the products are also very good. Our cooperation is very smooth.

—— 《Festfing DV》LLC

When I was looking for intel CPU and Toshiba SSD urgently, Sandy from Beijing Qianxing Jietong Technology Co., Ltd gave me a lot of help and got me the products I needed quickly. I really appreciate her.

—— Kitty Yen

Sandy of Beijing Qianxing Jietong Technology Co.,Ltd is a very careful salesman, who can remind me of configuration errors in time when I buy a server. The engineers are also very professional and can quickly complete the testing process.

—— Strelkin Mikhail Vladimirovich

We are very happy with our experience working with Beijing Qianxing Jietong. The product quality is excellent, and delivery is always on time. Their sales team is professional, patient, and very helpful with all our questions. We truly appreciate their support and look forward to a long-term partnership. Highly recommended!

—— Ahmad Navid

Quality： “Great experience with my supplier. The MikroTik RB3011 was already used, but it was in very good condition and everything works perfectly. Communication was fast and smooth, and all my concerns were addressed quickly. Very reliable supplier—highly recommended.”

—— Geran Colesio

I'm Online Chat Now

Company News

Nvidia Boasts 7 Chips in Production for Vera Rubin Platform, Including Groq 3 LPU

Nvidia announced a key hardware update at its GPU Technology Conference (GTC) in San Jose today, barely two months after it acquired chip startup Groq and all its intellectual property for $20 billion. Even with the deal only recently finalized, Groq’s Language Processing Unit (LPU) is already in mass production, and is being integrated into Nvidia’s full Vera Rubin chip stack — which now includes a total of seven new chips that have entered production.

Groq was founded in 2016 by former Google engineers who were part of the original Tensor Processing Unit (TPU) team. The company designs custom ASIC chips built specifically for fast, low-latency AI inference processing. Ian Buck, Nvidia’s vice president and general manager of accelerated computing, stated that combining the “extreme flops” of Rubin GPUs with the strong bandwidth of Groq LPUs will create a uniquely powerful solution for AI workloads.

“GPUs have large memory and strong floating-point performance, delivering high throughput and fast token rates for the mainstream market, and they excel at general AI tasks,” Buck said in a press briefing the previous day. “But the LPU is optimized solely for extreme low-latency token generation, capable of pushing thousands of tokens per second.”

“The tradeoff is that it takes multiple chips to reach that level of performance,” he added. Each Groq 3 LPU has just 500 MB of SRAM, just 1/500 the memory capacity of Rubin GPUs, according to Buck. “But the bandwidth is exceptional — Rubin GPUs offer up to 22TBps, while Groq LPUs reach 150TB per second.”

Nvidia is working to combine the two processors, Buck confirmed, to unify the GPU’s decoding operations with the LPU’s low-latency work, allowing the two to run as one unified system rather than separate components.

The Groq 3 LPX rack that Nvidia unveiled at GTC will be deployed alongside NVL72 racks, delivering dedicated capacity for AI inference and agentic AI workloads. Per Nvidia’s presentation, the Groq 3 LPX rack can hold up to 256 LPU accelerators, equipped with 128GB of SRAM and a staggering 40 petabytes per second of SRAM memory bandwidth. The rack delivers up to 640TB per second of scale-up bandwidth in total, and Nvidia notes it could eventually scale to house more than 1,000 LPUs.

Pairing a Groq 3 LPX rack with a Rubin NVL72 system enables customers to generate one million tokens for just $45 on a 1 trillion-parameter GPT model with a 400k token context window, according to Nvidia. That figure represents 35 times more tokens than the Rubin NVL72 system can generate on its own.

Groq 3 LPUs are not the only new chips Nvidia is leveraging to boost AI inference capacity. The company also announced a dedicated rack for its Vera CPUs — the ARM-based processors paired with two Rubin GPUs to build the superchips at the core of Nvidia’s NVL72 and NVL8 systems.

As CPUs have emerged as a key bottleneck for AI inference and agentic AI workloads, enterprises are increasingly demanding greater CPU resources. In response, Nvidia has launched a standalone CPU-only rack, named the Vera CPU Rack, which features 256 Vera CPUs connected to 400TB of LPDDR5x memory operating at 300TBps.

The Vera CPU Rack also comes equipped with a Spectrum-X Ethernet spine and 64 BlueField-4 data processing units (DPUs). These DPUs coordinate with GPUs in NVL72 systems via Nvidia’s NVLink-C2C interconnect, delivering 1.8TBps of coherent bandwidth — seven times the bandwidth of PCIe Gen 6, per the company.

Nvidia states the Vera rack can support 22,500 concurrent CPU environments, meeting the massive CPU demand required to run AI inference and agentic workloads smoothly. The rack uses liquid cooling and is built on Nvidia’s MGX reference architecture, which the company highlights is backed by 80 ecosystem partners, and it will be distributed through Nvidia’s global partner network.

Nvidia also announced a new rack full of BlueField-4 DPUs, one of the seven new chips that Nvidia touted as making up the new AI supercomputer. The BlueField-4 STX is the first rack-scale implementation of Nvidia’s new CMX (context memory storage) platform, which expands GPU memory from HBM into primary NVMe storage. It unveiled CMX in January, and Nvidia’s storage partners, such as VAST Data, which presented on its CMX storage offering at its conference a few weeks ago, are beginning to adopt it via the Nvidia STX reference architecture.

“The STX is a high-bandwidth, shared layer optimized for storing and retrieving the massive key value cache data generated by agentic workflows,” Buck said. “This is a reference architecture. While Nvidia is not going to be providing it directly, we’re providing [the reference architecture] to all of our storage partners and the entire storage ecosystem so that they can build the next generation of storage for AI factories that has 4x the performance per watt, double the pages per second for enterprise data, and delivering 5x the tokens per second of context memory necessary for AI factories running agentic workflows.”

Cloudian, DDN, Dell Technologies, Everpure (forerly Pure Storage), Hitachi Vantara, HPE, IBM, MinIO, NetApp, Nutanix, and WEKA are all building new storage on the BlueField-4 STX reference architecture, Nvidia said, while companies like CoreWeave, Crusoe, IREN, Lambda, Mistral AI, Nebius, Oracle Cloud Infrastructure (OCI), and Vultr are adopting it.

All told, Nvidia is showcasing seven new chips at GTC that each have a role for powering AI in the Vera Rubin platform. This includes Vera CPU, Rubin GPU, NVLink 6 Switch, ConnectX-9 SuperNIC, BlueField-4 DPU, Groq 3 LPU, and SpectrumX CPO, the new co-packaged optics Ethernet switch that delivers 200 Gbps connectivity over silicon photonics. Nvidia announced the SpectrumX chip at GTC 2025, and it’s now in production, CEO Jensen Huang said in his keynote.

Beijing Qianxing Jietong Technology Co., Ltd.
Sandy Yang/Global Strategy Director
WhatsApp / WeChat: +86 13426366826
Email: yangyd@qianxingdata.com
Website: www.qianxingdata.com/www.storagesserver.com

Business Focus:
ICT Product Distribution/System Integration & Services/Infrastructure Solutions
With 20+ years of IT distribution experience, we partner with leading global brands to deliver reliable products and professional services.
“Using Technology to Build an Intelligent World”Your Trusted ICT Product Service Provider!

Pub Time : 2026-03-18 14:05:18 >> News list

Contact Details

Beijing Qianxing Jietong Technology Co., Ltd.

Contact Person: Ms. Sandy Yang

Tel: 13426366826

company news about Nvidia Boasts 7 Chips in Production for Vera Rubin Platform, Including Groq 3 LPU

Rack Storage Server

Huawei Fusion Server

Dell Poweredge Server

H3C Server

Datacom Switches

WLAN Device

Smart Wireless Router

Hard Drive HDD

Internal Hard Drive SSD

Geforce Graphic Card

INTEL CPU Processor

Server Memory RAM

Refurbished Storage Server

SFP Transceiver Module

Fibre Channel Switch

Rack Storage Server

12 Bays 1U Rackmount Server Lenovo ThinkSystem SR630 Rack Server

ThinkSystem SR250 V2 4SFF Rack Storage Server Intel Xeon E-2378G Processor

Intel C621A Rack Storage Server Inspur NF5180M6 1U Rack Mount Server

Huawei Fusion Server

FusionServer 5288 V6 4U Rack Server 32 DDR4 DIMMs 44 3.5 Inch Hard Disks

Ultra High Density Huawei Fusion Server 1U Network Storage Server 1288H V5

New Gen OceanStor 5310 Huawei Rack Server Hybrid Flash Storage