Jessie A Ellis
Feb 27, 2026 18:05
NVIDIA offers free GPU-accelerated endpoints for Alibaba's 397B-parameter Qwen3.5 vision-language model, enabling developers to build multimodal AI agents.
NVIDIA has rolled out free GPU-accelerated endpoints for Alibaba's Qwen3.5 vision-language model, giving developers immediate access to the 397-billion-parameter system via Blackwell-architecture hardware. The move positions both tech giants to capture the growing market for multimodal AI agents capable of understanding and navigating user interfaces.
The Qwen3.5 model, which Alibaba released on February 16, 2026, represents a significant architectural shift in large language models. Despite its 397B total parameters, only 17 billion activate per forward pass, a 4.28% activation rate achieved through a hybrid mixture-of-experts (MoE) design combined with Gated Delta Networks. This efficiency translates into real cost savings: Alibaba claims the system runs 60% cheaper and handles large workloads eight times more efficiently than its predecessor.
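The quoted activation rate follows directly from the two parameter counts; a quick arithmetic check:

```python
# Rough arithmetic behind the quoted 4.28% activation rate:
# 17B parameters active out of 397B total per forward pass.
total_params_b = 397   # total parameters, in billions
active_params_b = 17   # parameters activated per forward pass, in billions

activation_rate = active_params_b / total_params_b
print(f"{activation_rate:.2%}")  # → 4.28%
```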
Technical Specs Worth Noting
The model supports an input context length of 256K tokens, extensible to 1 million, enough to process roughly two hours of video content natively. It handles 200+ languages and runs 512 experts per layer, with 11 experts (10 routed plus 1 shared) activated per token across 60 layers.
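To illustrate how those numbers fit together, here is a minimal top-k routing sketch in plain Python. The shared expert is modeled as an always-on extra index; this is a toy under stated assumptions, not Qwen3.5's actual routing code:

```python
import random

def route_token(logits, k_routed=10, n_shared=1):
    """Select the top-k routed experts for one token by router score;
    shared experts are always active. Toy sketch, not Qwen3.5's code."""
    n = len(logits)
    routed = sorted(range(n), key=lambda i: logits[i], reverse=True)[:k_routed]
    shared = list(range(n, n + n_shared))  # shared experts as extra indices
    return routed + shared

random.seed(0)
logits = [random.gauss(0, 1) for _ in range(512)]  # 512 routed experts per layer
active = route_token(logits)
print(len(active))  # 11 experts active per token (10 routed + 1 shared)
```

With 11 of 512 experts firing per layer, only about 2% of each layer's experts touch any given token, which is where the low overall activation rate comes from.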
Developers can access Qwen3.5 through NVIDIA's build.nvidia.com platform with free registration in the NVIDIA Developer Program. The API follows OpenAI-compatible conventions, making integration straightforward for teams already working with similar tool-calling patterns.
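A minimal sketch of what an OpenAI-compatible vision request to such an endpoint might look like. The base URL and model identifier below are assumptions for illustration, not confirmed values from NVIDIA's catalog:

```python
import json

# Assumed endpoint and model id; check build.nvidia.com for actual values.
BASE_URL = "https://integrate.api.nvidia.com/v1/chat/completions"

payload = {
    "model": "qwen/qwen3.5",  # placeholder model identifier
    "messages": [
        {
            "role": "user",
            "content": [
                {"type": "text",
                 "text": "Describe the button in this screenshot."},
                {"type": "image_url",
                 "image_url": {"url": "data:image/png;base64,..."}},
            ],
        },
    ],
    "max_tokens": 256,
}

body = json.dumps(payload)
# An actual call would POST `body` to BASE_URL with a Bearer token obtained
# through the free NVIDIA Developer Program registration, e.g. via urllib
# or the OpenAI Python SDK pointed at the NVIDIA base URL.
print(json.loads(body)["model"])
```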
Production Deployment Options
For enterprises moving beyond experimentation, NVIDIA NIM packages the model as containerized inference microservices. These can run on premises, in cloud environments, or across hybrid deployments. The NeMo framework provides fine-tuning capabilities for domain-specific applications; NVIDIA specifically highlights a medical visual QA tutorial demonstrating training on a radiological dataset.
Alibaba has continued expanding the Qwen3.5 family since the initial launch. On February 24, the company released three additional variants: Qwen3.5-122B-A10B, Qwen3.5-35B-A3B, and Qwen3.5-27B, offering smaller-footprint options for different deployment scenarios.
Alibaba, trading at a market cap of around $372 billion as of February 27, has positioned Qwen3.5 against GPT-5.2, Claude Opus 4.5, and Gemini 3 Pro on benchmark performance. The open-weight models remain available on Hugging Face Hub and ModelScope for developers who prefer self-hosting over NVIDIA's managed endpoints.
Image source: Shutterstock

