Close Menu
Cryprovideos
    What's Hot

    Who Truly Owns a Tokenized Asset? The IMF Desires an Reply

    July 3, 2026

    Right here Is Why Ethereum May Be Getting ready for a Larger Restoration After Defending $1,500 Twice – BlockNews

    July 3, 2026

    Bitcoin ETFs Snap Shedding Streak With $221M Influx – Bitbo

    July 3, 2026
    Facebook X (Twitter) Instagram
    Cryprovideos
    • Home
    • Crypto News
    • Bitcoin
    • Altcoins
    • Markets
    Cryprovideos
    Home»Markets»This Frankenstein AI Merges Claude Opus, GLM and Qwen – Decrypt
    This Frankenstein AI Merges Claude Opus, GLM and Qwen – Decrypt
    Markets

    This Frankenstein AI Merges Claude Opus, GLM and Qwen – Decrypt

    By Crypto EditorApril 22, 2026No Comments4 Mins Read
    Share
    Facebook Twitter LinkedIn Pinterest Email


    Briefly

    • AI engineer Kyle Hessling merged two of Jackrong’s Claude Opus 4.6 and GLM-5.1 distilled finetunes right into a single “frankenmerge.”
    • A post-merge “heal fine-tune” was required to repair garbled code output attributable to the layer boundary between the 2 independently-trained fashions.
    • The mannequin over-reasons on some duties, nevertheless it’s a solvable drawback.

    You thought Qwopus was cool as a result of it merged Qwen and Opus? Effectively, Kyle Hessling, an AI engineer with a number of data and free time simply took that recipe and threw GLM—probably the greatest reasoning fashions on the market—into the combo. The result’s an 18 billion parameter frankenmerge that matches on an affordable GPU and outperforms Alibaba’s latest 35B mannequin.

    For individuals who do not know, parameters are the numerical values baked right into a neural community throughout coaching, like dials {that a} neural community can alter — the extra of them, the extra data and complexity the mannequin can deal with, and the extra reminiscence it must run.

    Hessling, an AI infrastructure engineer, stacked two of Jackrong’s Qwen3.5 finetunes on high of one another: layers 0 by means of 31 from Qwopus 3.5-9B-v3.5, which distills Claude 4.6 Opus’s reasoning model into Qwen as a base mannequin, and layers 32 by means of 63 from Qwen 3.5-9B-GLM5.1-Distill-v1, skilled on reasoning knowledge from z.AI’s GLM-5.1 instructor mannequin on high of the identical Qwen base.

    The speculation: Give the mannequin Opus-style structured planning within the first half of the reasoning and GLM’s drawback decomposition scaffold within the second—64 layers complete, in a single mannequin.

    The method is known as a passthrough frankenmerge—no mixing, no averaging of weights, simply uncooked layer stacking. Hessling needed to write his personal merge script from scratch as a result of present instruments do not help Qwen 3.5’s hybrid linear/full consideration structure. The ensuing mannequin handed 40 out of 44 functionality exams, beating Alibaba’s Qwen 3.6-35B-A3B MoE—which requires 22 GB of VRAM—whereas operating on simply 9.2 GB in Q4_K_M quantization.

    An NVIDIA RTX 3060 handles it high quality… theoretically.

    Hessling explains that making this mannequin wasn’t straightforward. The uncooked merge used to throw garbled code. Besides, the check fashions he printed went form of viral amongst fanatics.

    Hessling’s remaining repair was a “heal fine-tune”—principally a QLoRA (a little bit of code that’s embedded into the mannequin like an appendix and closely situations the ultimate output) focusing on all consideration and projections.

    We tried it, and though the concept of getting Qwen, Claude Opus, and GLM 5.1 operating regionally in our potato is past tempting, in actuality we discovered that the mannequin is so good at reasoning by means of issues that it finally ends up overthinking.

    When examined it on an M1 MacBook operating an MLX quantized model (a mannequin optimized to run on Macs). When prompted to generate our standard check sport, the reasoning chain ran so lengthy it hit the token restrict and gave us a pleasant lengthy piece of reasoning with no working lead to a zero shot interplay. That is a daily-use blocker for anybody desirous to run this regionally on shopper {hardware} for any severe software.

    We went a bit softer and issues nonetheless had been difficult. A easy “write a Snake sport” immediate took over 40 minutes in reasoning… plenty of it.

    You may see the leads to our Github repository.

    This can be a recognized rigidity within the Qwopus lineage: Jackrong’s v2 finetunes had been constructed to deal with Qwen 3.5’s tendency towards repetitive inside loops and “suppose extra economically.” Stacking 64 layers of two reasoning distills seems to amplify that habits on sure prompts.

    That is a solvable drawback, and the open-source neighborhood will probably resolve it. What issues right here is the broader sample: a pseudonymous developer publishes specialised finetunes with full coaching guides, one other fanatic stacks them with a customized script, runs 1,000 therapeutic steps, and lands a mannequin that outperforms a 35 billion parameter launch from one of many world’s largest AI labs. The entire thing matches in a small file.

    That is what makes open-source price watching—not simply the large labs releasing weights, however the layer-by-layer options, the specialization occurring beneath the radar. The hole between a weekend venture and a frontier deployment is narrower the extra builders be part of the neighborhood.

    Jackrong has since mirrored Hessling’s repository, and the mannequin had gathered over three thousand downloads inside its first two weeks of availability.

    Day by day Debrief E-newsletter

    Begin daily with the highest information tales proper now, plus authentic options, a podcast, movies and extra.



    Supply hyperlink

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

    Related Posts

    Who Truly Owns a Tokenized Asset? The IMF Desires an Reply

    July 3, 2026

    South Korea 24-hour Buying and selling Revolutionizes KRW/USD Market

    July 3, 2026

    Man Drains $85,100 From East Coast Financial institution Accounts by Impersonating Official Prospects – Right here's How He Obtained Caught – The Day by day Hodl

    July 3, 2026

    FLOKI Value Prediction: The three.85% Bounce Is a Lure Till Quantity Says In any other case

    July 3, 2026
    Latest Posts

    Bitcoin ETFs Snap Shedding Streak With $221M Influx – Bitbo

    July 3, 2026

    Irish Authorities Seize One other 500 Bitcoin in Legal Proceeds

    July 3, 2026

    Dwell updates: Extra bitcoin is now held at a loss than at a revenue

    July 3, 2026

    Crypto ETF Demand Weakens as Bitcoin and Ether Funds Publish H1 Outflows

    July 3, 2026

    Constancy Warns Bitcoin Faces Key Check – U.In the present day

    July 3, 2026

    Will Markets React When $2 Billion Bitcoin Choices Expire In the present day? 

    July 3, 2026

    Metaplanet Provides 2,823 Bitcoin in Q2 as Shopping for Tempo Cools – Decrypt

    July 3, 2026

    Bitwise Says Technique Now Much less Essential Determine in Bitcoin

    July 3, 2026

    CryptoVideos.net is your premier destination for all things cryptocurrency. Our platform provides the latest updates in crypto news, expert price analysis, and valuable insights from top crypto influencers to keep you informed and ahead in the fast-paced world of digital assets. Whether you’re an experienced trader, investor, or just starting in the crypto space, our comprehensive collection of videos and articles covers trending topics, market forecasts, blockchain technology, and more. We aim to simplify complex market movements and provide a trustworthy, user-friendly resource for anyone looking to deepen their understanding of the crypto industry. Stay tuned to CryptoVideos.net to make informed decisions and keep up with emerging trends in the world of cryptocurrency.

    Top Insights

    Solana Altcoin Jumps As Crypto Big Coinbase Pronounces Buying and selling Assist – The Every day Hodl

    January 14, 2026

    Hilbert Group buys Enigma Nordic in $32 million deal to spice up crypto buying and selling edge

    December 20, 2025

    Litecoin Crypto Builds Shortage Narrative Towards $1000 Goal – Right here Is What To Watch – BlockNews

    April 6, 2026

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    • Home
    • Privacy Policy
    • Contact us
    © 2026 CryptoVideos. Designed by MAXBIT.

    Type above and press Enter to search. Press Esc to cancel.