Close Menu
Cryprovideos
    What's Hot

    Solana Crypto Consolidation Heats Up as Forward Industries Targets Rival – Here Is What Happened – BlockNews

    June 16, 2026

    Crypto Miner MARA Buys 1,000 Bitcoin – U.At present

    June 16, 2026

    XRP Levels ‘Spectacular Comeback’ Following Main Sentiment Stoop: Santiment 

    June 16, 2026
    Facebook X (Twitter) Instagram
    Cryprovideos
    • Home
    • Crypto News
    • Bitcoin
    • Altcoins
    • Markets
    Cryprovideos
    Home»Markets»NVIDIA CUDA 13.1 Drops CUB Boilerplate with New Single-Name API
    NVIDIA CUDA 13.1 Drops CUB Boilerplate with New Single-Name API
    Markets

    NVIDIA CUDA 13.1 Drops CUB Boilerplate with New Single-Name API

    By Crypto EditorJanuary 22, 2026No Comments3 Mins Read
    Share
    Facebook Twitter LinkedIn Pinterest Email


    Felix Pinkston
    Jan 21, 2026 21:57

    NVIDIA simplifies GPU growth with CUB single-call API in CUDA 13.1, eliminating repetitive two-phase reminiscence allocation code with out efficiency loss.

    NVIDIA CUDA 13.1 Drops CUB Boilerplate with New Single-Name API

    NVIDIA has shipped a big quality-of-life improve for GPU builders with CUDA 13.1, introducing a single-call API for the CUB template library that eliminates the clunky two-phase reminiscence allocation sample builders have labored round for years.

    The change addresses a long-standing ache level. CUB—the C++ template library powering high-performance GPU primitives like scans, kinds, and histograms—beforehand required builders to name every perform twice: as soon as to calculate required reminiscence, then once more to truly run the algorithm. This meant each CUB operation seemed one thing like this verbose dance of reminiscence estimation, allocation, and execution.

    PyTorch’s codebase tells the story. The framework wraps CUB calls in macros particularly to cover this two-step invocation, a workaround widespread throughout manufacturing codebases. Macros obscure management stream and complicate debugging—a trade-off groups accepted as a result of the choice was worse.

    Zero Overhead, Much less Code

    The brand new API cuts straight to the purpose. What beforehand required express reminiscence allocation now matches in a single line, with CUB dealing with momentary storage internally. NVIDIA’s benchmarks present the streamlined interface introduces zero efficiency overhead in comparison with the guide strategy—reminiscence allocation nonetheless occurs, slightly below the hood by way of asynchronous allocation embedded inside gadget primitives.

    Critically, the outdated two-phase API stays obtainable. Builders who want fine-grained management over reminiscence—reusing allocations throughout a number of operations or sharing between algorithms—can proceed utilizing the present sample. However for almost all of use circumstances, the single-call strategy ought to turn into the default.

    The Setting Argument

    Past simplifying fundamental calls, CUDA 13.1 introduces an extensible “env” argument that consolidates execution configuration. Builders can now mix customized CUDA streams, reminiscence sources, deterministic necessities, and tuning insurance policies by a single type-safe object fairly than juggling a number of perform parameters.

    Reminiscence sources—a brand new utility for allocation and deallocation—may be handed by this setting argument. NVIDIA gives default sources, however builders can substitute their very own customized implementations or use CCCL-provided options like gadget reminiscence swimming pools.

    Presently, the setting interface helps core algorithms together with DeviceReduce operations (Cut back, Sum, Min, Max, ArgMin, ArgMax) and DeviceScan operations (ExclusiveSum, ExclusiveScan). NVIDIA is monitoring further algorithm assist by way of their CCCL GitHub repository.

    Sensible Implications

    For groups sustaining GPU-accelerated functions, this replace means much less wrapper code and cleaner integration. The CUB library already serves as a foundational part of NVIDIA’s CUDA Core Compute Libraries, and simplifying its API reduces friction for builders constructing customized CUDA kernels.

    The timing aligns with broader business motion towards extra accessible GPU programming. As AI workloads drive demand for optimized GPU code, decreasing boundaries to utilizing high-performance primitives issues.

    CUDA 13.1 is offered now by NVIDIA’s developer portal. Groups at the moment utilizing macro wrappers round CUB calls ought to consider migrating to the native single-call API—it delivers the identical abstraction with out the debugging complications.

    Picture supply: Shutterstock




    Supply hyperlink

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

    Related Posts

    NVIDIA BioNeMo Permits LoRA Superb-Tuning for Biotech Fashions

    June 16, 2026

    US Buyers’ Fairness Publicity Tops Ranges Seen Earlier than Previous Bear Markets

    June 16, 2026

    Nvidia's New MoE Kernels Promise 93% Speedup for AI Coaching

    June 16, 2026

    xAI Launches Grok Construct Agent Dashboard for Builders

    June 16, 2026
    Latest Posts

    Crypto Miner MARA Buys 1,000 Bitcoin – U.At present

    June 16, 2026

    Bitcoin Tops $65K on US-Iran Deal, However Merchants Stay Skeptical – Decrypt

    June 16, 2026

    BTC, ETH, SOL worth information: Bitcoin again below $67,000 as merchants warn of Trump reversal

    June 16, 2026

    High Bitcoin (BTC) Worth Predictions After the US-Iran Peace Rally

    June 16, 2026

    Bitcoin Big Technique Pads Money Cushion for Second Straight Week, Buys BTC – Decrypt

    June 16, 2026

    Bitcoin Has Gained at Each FIFA World Cup: Will the 2030 Cycle Maintain?

    June 16, 2026

    Bitcoin Whales Full Promote-Off as Value Bounces Again From $65,000 – U.Immediately

    June 16, 2026

    Technique Buys 1,587 BTC for $100M, Lowers Common Price Foundation

    June 16, 2026

    CryptoVideos.net is your premier destination for all things cryptocurrency. Our platform provides the latest updates in crypto news, expert price analysis, and valuable insights from top crypto influencers to keep you informed and ahead in the fast-paced world of digital assets. Whether you’re an experienced trader, investor, or just starting in the crypto space, our comprehensive collection of videos and articles covers trending topics, market forecasts, blockchain technology, and more. We aim to simplify complex market movements and provide a trustworthy, user-friendly resource for anyone looking to deepen their understanding of the crypto industry. Stay tuned to CryptoVideos.net to make informed decisions and keep up with emerging trends in the world of cryptocurrency.

    Top Insights

    Binance Analysis Highlights Bitcoin's Rising Position in DeFi

    March 15, 2025

    SEC Delays Choices On BlackRock And Franklin Crypto ETFs

    September 12, 2025

    Dogwifhat Value Prediction: As WIF Pumps 5%, This Solana Layer-2 Crypto Presale Closes On $16 Million – Greatest Crypto To Purchase Now?

    January 29, 2025

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    • Home
    • Privacy Policy
    • Contact us
    © 2026 CryptoVideos. Designed by MAXBIT.

    Type above and press Enter to search. Press Esc to cancel.