    Notion Slashes AI Embedding Costs 80% After Ditching Spark for Ray

    By Crypto Editor, April 10, 2026


    James Ding
    Apr 09, 2026 16:48

    Notion migrated from Spark on EMR to Ray, cutting embedding costs 80% and improving query latency 10x. Uber and Salesforce shared similar AI infrastructure wins.


    Notion has cut its AI embedding pipeline costs by more than 80% after migrating from Apache Spark to Ray, the distributed computing framework backed by Anyscale. The productivity software company also achieved a 10x improvement in query latency while consolidating three separate jobs per region into one.

    The migration details emerged at Ray Day Seattle on April 9, 2026, where ML engineers from Notion, Uber, Salesforce, and Apple shared hard-won lessons about scaling AI infrastructure.

    What Notion Actually Changed

    Mickey Liu, a software engineer on Notion’s search platform team, walked through the overhaul. Their original setup used a three-step Spark pipeline running on Amazon EMR: data chunking, third-party API calls for embedding generation, and writes to a vector store.

    The pain points were predictable but severe. Double compute costs. Third-party API rate limits throttling throughput. Debugging nightmares when failures occurred across tools: driver and executor logs weren’t even persisted in YARN.

    The new architecture streams Kafka data directly into a Ray cluster that handles CPU chunking, GPU embedding generation, and vector store writes in a single pipeline. No intermediate S3 handoffs. What started as the backend for a Q&A feature in 2023 now powers all of Notion AI and custom agents.
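
The shape of that single pipeline can be sketched in plain Python. This is only an illustration under stated assumptions, not Notion's code: the `events` list stands in for a Kafka topic, the hash-based `embed` function stands in for a GPU embedding model, a dict stands in for the vector store, and every name here is hypothetical. The point it shows is that each record flows through chunking, embedding, and writing in one pass, with nothing staged to intermediate storage between steps.

```python
import hashlib
from typing import Iterable, Iterator


def chunk(doc_id: str, text: str, size: int = 40) -> Iterator[tuple[str, str]]:
    """Split a document into fixed-size character chunks (the CPU-bound stage)."""
    for start in range(0, len(text), size):
        yield f"{doc_id}:{start // size}", text[start:start + size]


def embed(chunk_text: str, dim: int = 4) -> list[float]:
    """Toy deterministic 'embedding' derived from a hash.

    Assumption: a real pipeline would run a GPU model at this step.
    """
    digest = hashlib.sha256(chunk_text.encode("utf-8")).digest()
    return [byte / 255.0 for byte in digest[:dim]]


def run_pipeline(events: Iterable[tuple[str, str]], store: dict) -> int:
    """Chunk, embed, and write each event in one pass, with no intermediate files."""
    written = 0
    for doc_id, text in events:
        for key, piece in chunk(doc_id, text):
            store[key] = embed(piece)  # write straight to the (in-memory) store
            written += 1
    return written


# Hypothetical stream of (doc_id, text) events standing in for a Kafka topic.
events = [("page-1", "a" * 100), ("page-2", "short note")]
vector_store = {}
count = run_pipeline(events, vector_store)
```

In a distributed setting the three stages would run on different workers (CPU for chunking, GPU for embedding), but the data flow stays the same: one job, one pass.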

    Uber and Salesforce Report Similar Gains

    Uber’s Peng Zhang detailed how their Michelangelo ML platform evolved from TensorFlow/Horovod to Ray with PyTorch. The standout move: separating CPU data-loading nodes from GPU training nodes in a heterogeneous cluster design. The result? GPU utilization jumped 20%, and training time dropped roughly 50% in select pipelines.
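
The idea of decoupling data loading from training can be illustrated with a small producer-consumer sketch. This is not Uber's design: threads and a bounded queue stand in for separate CPU and GPU nodes, summing a batch stands in for a training step, and all names are hypothetical. It shows only the general pattern, in which loaders prefetch batches independently so the training loop pulls ready batches instead of waiting on data preparation.

```python
import queue
import threading

SENTINEL = None  # marks the end of one loader's shard


def loader(shard, out_q):
    """Producer: prepares batches and pushes them to the shared queue."""
    for batch in shard:
        out_q.put(batch)  # blocks when the prefetch buffer is full
    out_q.put(SENTINEL)


def train(shards, prefetch=4):
    """Consumer: pulls prefetched batches until every loader has finished."""
    batches = queue.Queue(maxsize=prefetch)
    threads = [threading.Thread(target=loader, args=(s, batches)) for s in shards]
    for t in threads:
        t.start()

    finished, steps, total = 0, 0, 0
    while finished < len(threads):
        batch = batches.get()
        if batch is SENTINEL:
            finished += 1
            continue
        total += sum(batch)  # stand-in for a training step on an accelerator
        steps += 1

    for t in threads:
        t.join()
    return steps, total


# Two hypothetical data shards, each a list of small batches.
shards = [[[1, 2], [3, 4]], [[5, 6]]]
steps, total = train(shards)
```

The bounded queue is the design choice worth noting: it lets loaders run ahead of the trainer without letting them consume unbounded memory.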

    Salesforce tackled a different beast: summarizing documents up to 200,000 tokens long (roughly a short novel) with P95 latency under 15 seconds. Their team used Ray to chunk documents and run parallel inference across a distributed actor pool with vLLM, then merge the results. They landed on 1-2 GPU data parallelism as the sweet spot after running scaling experiments directly on Ray.
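
The chunk, parallel-inference, and merge steps follow a map-reduce pattern, which can be sketched with the Python standard library. This is a simplified stand-in, not Salesforce's implementation: a thread pool replaces the Ray actor pool, `summarize_chunk` is a placeholder for an LLM call, and the chunk size, worker count, and all function names are assumptions made for illustration.

```python
from concurrent.futures import ThreadPoolExecutor


def split_tokens(tokens, chunk_size):
    """Split a token list into consecutive chunks of at most chunk_size tokens."""
    return [tokens[i:i + chunk_size] for i in range(0, len(tokens), chunk_size)]


def summarize_chunk(tokens):
    """Placeholder summarizer: keeps the first three tokens of the chunk.

    Assumption: a real system would send the chunk to an LLM here.
    """
    return " ".join(tokens[:3])


def summarize_document(tokens, chunk_size=1000, workers=2):
    """Map: summarize chunks in parallel. Reduce: merge partials in document order."""
    chunks = split_tokens(tokens, chunk_size)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # Executor.map returns results in input order, so the merge stays ordered.
        partials = list(pool.map(summarize_chunk, chunks))
    return " | ".join(partials)


# Hypothetical 2,500-token document.
doc = [f"t{i}" for i in range(2500)]
summary = summarize_document(doc)
```

Here `workers` plays the role of the degree of data parallelism; finding the right value for a real workload is the kind of scaling experiment the team described.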

    Why This Matters Beyond These Companies

    Robert Nishihara, Ray’s co-creator and Anyscale co-founder, opened the event by framing the core problem: AI infrastructure keeps getting harder. Multimodal data processing, reinforcement learning workloads, and multi-node LLM inference are pushing existing tools past their limits.

    Every speaker landed on the same conclusion from different angles: their previous tooling ran out of road.

    Apple engineers Charlie Chen and Haocheng Bian highlighted foundation model training challenges: massive unstructured data, billion-plus parameters, and sparse architectures like Mixture of Experts. Traditional engines fail because data pipelines and training frameworks run in separate environments with no shared context.

    What’s Next

    Ray Day Seattle kicked off Anyscale’s 2026 “Ray on the Road” tour, which spans eight cities across three countries. The company is also running invite-only customer roundtables at each stop to preview its product roadmap.

    For teams hitting similar walls with Spark or other distributed frameworks, Notion’s full technical writeup is available on their engineering blog under “Two Years of Vector Search at Notion.” The 80% cost reduction and 10x latency improvement offer a concrete benchmark for anyone evaluating similar migrations.
