Close Menu
Cryprovideos
    What's Hot

    Google: Quantum Computers Need 20x Fewer Qubits to Crack Crypto – Bitbo

    March 31, 2026

    Will agentic commerce unlock a two-layer funds stack for AI-native transactions?

    March 31, 2026

    Jordi Visser Says Bitcoin Was Constructed For This New Fed Disaster

    March 31, 2026
    Facebook X (Twitter) Instagram
    Cryprovideos
    • Home
    • Crypto News
    • Bitcoin
    • Altcoins
    • Markets
    Cryprovideos
    Home»Markets»NVIDIA CUDA 13.1 Drops CUB Boilerplate with New Single-Name API
    NVIDIA CUDA 13.1 Drops CUB Boilerplate with New Single-Name API
    Markets

    NVIDIA CUDA 13.1 Drops CUB Boilerplate with New Single-Name API

    By Crypto EditorJanuary 22, 2026No Comments3 Mins Read
    Share
    Facebook Twitter LinkedIn Pinterest Email


    Felix Pinkston
    Jan 21, 2026 21:57

    NVIDIA simplifies GPU growth with CUB single-call API in CUDA 13.1, eliminating repetitive two-phase reminiscence allocation code with out efficiency loss.

    NVIDIA CUDA 13.1 Drops CUB Boilerplate with New Single-Name API

    NVIDIA has shipped a big quality-of-life improve for GPU builders with CUDA 13.1, introducing a single-call API for the CUB template library that eliminates the clunky two-phase reminiscence allocation sample builders have labored round for years.

    The change addresses a long-standing ache level. CUB—the C++ template library powering high-performance GPU primitives like scans, kinds, and histograms—beforehand required builders to name every perform twice: as soon as to calculate required reminiscence, then once more to truly run the algorithm. This meant each CUB operation seemed one thing like this verbose dance of reminiscence estimation, allocation, and execution.

    PyTorch’s codebase tells the story. The framework wraps CUB calls in macros particularly to cover this two-step invocation, a workaround widespread throughout manufacturing codebases. Macros obscure management stream and complicate debugging—a trade-off groups accepted as a result of the choice was worse.

    Zero Overhead, Much less Code

    The brand new API cuts straight to the purpose. What beforehand required express reminiscence allocation now matches in a single line, with CUB dealing with momentary storage internally. NVIDIA’s benchmarks present the streamlined interface introduces zero efficiency overhead in comparison with the guide strategy—reminiscence allocation nonetheless occurs, slightly below the hood by way of asynchronous allocation embedded inside gadget primitives.

    Critically, the outdated two-phase API stays obtainable. Builders who want fine-grained management over reminiscence—reusing allocations throughout a number of operations or sharing between algorithms—can proceed utilizing the present sample. However for almost all of use circumstances, the single-call strategy ought to turn into the default.

    The Setting Argument

    Past simplifying fundamental calls, CUDA 13.1 introduces an extensible “env” argument that consolidates execution configuration. Builders can now mix customized CUDA streams, reminiscence sources, deterministic necessities, and tuning insurance policies by a single type-safe object fairly than juggling a number of perform parameters.

    Reminiscence sources—a brand new utility for allocation and deallocation—may be handed by this setting argument. NVIDIA gives default sources, however builders can substitute their very own customized implementations or use CCCL-provided options like gadget reminiscence swimming pools.

    Presently, the setting interface helps core algorithms together with DeviceReduce operations (Cut back, Sum, Min, Max, ArgMin, ArgMax) and DeviceScan operations (ExclusiveSum, ExclusiveScan). NVIDIA is monitoring further algorithm assist by way of their CCCL GitHub repository.

    Sensible Implications

    For groups sustaining GPU-accelerated functions, this replace means much less wrapper code and cleaner integration. The CUB library already serves as a foundational part of NVIDIA’s CUDA Core Compute Libraries, and simplifying its API reduces friction for builders constructing customized CUDA kernels.

    The timing aligns with broader business motion towards extra accessible GPU programming. As AI workloads drive demand for optimized GPU code, decreasing boundaries to utilizing high-performance primitives issues.

    CUDA 13.1 is offered now by NVIDIA’s developer portal. Groups at the moment utilizing macro wrappers round CUB calls ought to consider migrating to the native single-call API—it delivers the identical abstraction with out the debugging complications.

    Picture supply: Shutterstock




    Supply hyperlink

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

    Related Posts

    Will agentic commerce unlock a two-layer funds stack for AI-native transactions?

    March 31, 2026

    ‘I’d Slightly Be a Bond’: Goldman Sachs Government Says Treasuries Are a Higher Commerce Than Equities in Present Market Atmosphere – The Every day Hodl

    March 31, 2026

    LangChain MongoDB Partnership Delivers Full AI Agent Stack for Enterprise Groups

    March 31, 2026

    Qubic Reveals How Its Dogecoin Mining Launch Will Work

    March 31, 2026
    Latest Posts

    Jordi Visser Says Bitcoin Was Constructed For This New Fed Disaster

    March 31, 2026

    Bitcoin Worth Faces Rising Promote Stress As Downtrend Nears Six-Month Streak

    March 31, 2026

    Practically 7 Million Bitcoin is Sitting in a Quantum Minefield, Together with Satoshi’s

    March 31, 2026

    CZ Downplays Quantum Menace to Crypto – Right here Is Why Bitcoin Isn’t Doomed – BlockNews

    March 31, 2026

    F2Pool Founder Offered Thai Condominium Purchased for two,900 BTC for Simply 7 – Bitbo

    March 31, 2026

    Bitcoin Promote-Offs Are Ramping Up As Value Struggles, However The place Is All That BTC Going To?

    March 31, 2026

    Google's New Quantum Analysis Renews Push To Safe Bitcoin

    March 31, 2026

    Bitcoin Bombshell: Google’s 2029 Quantum Warning Sparks New Concern

    March 31, 2026

    CryptoVideos.net is your premier destination for all things cryptocurrency. Our platform provides the latest updates in crypto news, expert price analysis, and valuable insights from top crypto influencers to keep you informed and ahead in the fast-paced world of digital assets. Whether you’re an experienced trader, investor, or just starting in the crypto space, our comprehensive collection of videos and articles covers trending topics, market forecasts, blockchain technology, and more. We aim to simplify complex market movements and provide a trustworthy, user-friendly resource for anyone looking to deepen their understanding of the crypto industry. Stay tuned to CryptoVideos.net to make informed decisions and keep up with emerging trends in the world of cryptocurrency.

    Top Insights

    Monitoring The Bitmine Crypto Technique: How A lot Bitcoin And Ethereum Does The Firm Maintain? | Bitcoinist.com

    March 26, 2026

    Coinbase Unveils ‘Tremendous App’ To Develop Crypto Entry–Particulars

    July 17, 2025

    DeFi lending on Liquidium hits 4-month excessive as Bitcoin soars previous $100K

    December 7, 2024

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    • Home
    • Privacy Policy
    • Contact us
    © 2026 CryptoVideos. Designed by MAXBIT.

    Type above and press Enter to search. Press Esc to cancel.