In brief
- Alibaba upgraded Qwen Deep Research with one-click webpage and podcast generation.
- In testing, Qwen and Gemini tied for accuracy, each outperforming ChatGPT and Grok.
- Overall, Qwen won on research depth and shareable web output, whereas Gemini led on multimedia quality.
Qwen, the dedicated AI research group inside the Chinese tech giant Alibaba, launched a major upgrade to its AI chatbot last week, enabling users to generate comprehensive research documents on any topic.
You can then easily convert these documents into clean webpages or multi-speaker podcasts with just a few clicks.
Qwen Chat is similar to ChatGPT, DeepSeek, or Claude in terms of UI and is available worldwide for free.
Qwen Deep Research just got a major upgrade. ⚡️
It now creates not only the report, but also a live webpage 🌐 and a podcast 🎙️ – Powered by Qwen3-Coder, Qwen-Image, and Qwen3-TTS.
Your insights, now visual and audible. ✨
👉 https://t.co/wESb7vfAnD pic.twitter.com/eRvjKU222O (Qwen, @Alibaba_Qwen, October 21, 2025)
The new functionality runs on three open-source models working in concert: Qwen3-Coder handles web structure, Qwen-Image generates inline graphics, and Qwen3-TTS powers dynamic audio narration.
Despite relying on open-source models, the end-to-end experience (research execution, web deployment, and audio generation) is hosted and operated by Qwen as a managed service.
The workflow begins inside Qwen Chat, where users pose research questions. After asking a few clarifying questions, the AI runs web searches, analyzes data from public sources, and generates a comprehensive report with citations.

From there, two new options appear. “Web Dev” produces a live, professional-grade webpage, complete with inline graphics, that is automatically deployed and hosted by Qwen.
“Podcast,” meanwhile, offers an audio discussion featuring dynamic multi-speaker narration, with 17 host voices and 7 co-host options.

Testing the models
To assess how Qwen stacked up as a research tool, we ran the same complex research query across Qwen, Gemini, ChatGPT, and Grok. The task, which can be reviewed on our GitHub repo, was to analyze philosophical and scientific arguments for and against God’s existence. Each model generated a full research report. The evaluation involved five criteria: accuracy of claims and citations, information provided, clarity of explanation, intellectual richness, and overall quality.
TL;DR: Qwen Deep Research wins for analytical depth, citations, and its unique auto-generated webpages, making it ideal for academics and creators. It’s also the best all-in-one free alternative for researchers. But Gemini still leads in audio and video quality, while ChatGPT and Grok remain fine for casual use but lack Qwen’s reach and Google’s polish.
Here’s a more in-depth review:
Accuracy: Were philosophical positions and scientific claims represented correctly, with proper source attribution?
Qwen nailed the details. When discussing the cosmological argument, it properly cited academic sources like Bertrand Russell’s “Why I Am Not a Christian” and the debate between William Lane Craig and Peter Atkins, with specific references. Unlike other AI research tools such as Perplexity or Grok, most of its sources are reputable and academic, and often the original source itself. It included links from Stanford, Princeton, Oxford, and Drew, but added pertinent analysis from Quora and Facebook when relevant.
Gemini matched this precision with 94 numbered citations, some of which were duplicated when referenced in different parts of the report.
Gemini also distinguished correctly between concepts. Both models avoided sloppy errors, such as conflating biblical literalism with general theism.
ChatGPT relied heavily on the Stanford Encyclopedia of Philosophy, but sometimes oversimplified. Grok gave accurate summaries but with vaguer attribution, saying things like “traced to Plato, Aristotle” without naming specific works.
Result: Qwen and Gemini were the best.
Information Provided: How thorough was the research?
Qwen was the only model to include a section called “Critiques of Atheism: The Burden of Proof and the Nature of Evidence.” This section examined a type of debate none of the others touched. It distinguished between “weak atheism” (skepticism toward God claims) and “gnostic atheism” (the positive assertion that God doesn’t exist), and cited specific atheist thinkers, such as Gary Whittenberger and his “beyond a reasonable doubt” standard.

Here’s an example passage from Qwen: “One of the most contentious issues is the burden of proof. Bertrand Russell famously illustrated this with his teapot analogy: just as he could not prove that a tiny teapot does not orbit the sun between Earth and Mars, he argued that theists could not prove that God does exist.”
No other model went this deep into burden-of-proof debates, probably because they were not central to the topic. Gemini came close, with strong coverage of consciousness arguments and the “God-of-the-gaps” critique. ChatGPT included pragmatic arguments like Pascal’s Wager and explored real-world implications for ethics and policy. Grok kept it concise, at about one-third the length of Qwen’s report, but added a helpful summary table.
Result: Qwen was the most exhaustive.
Clarity: How was the research expressed?
Grok used a clean table to organize arguments by type (Philosophical vs. Scientific, For vs. Against). Its section breaks were explicit: “Philosophical Arguments,” “Scientific Arguments,” “Unexpected Detail.” Anyone could scan it quickly.
ChatGPT used lots of parenthetical clarifications, making complex ideas more digestible. Example: “if God’s existence is even possible (i.e., logically coherent), then God exists necessarily.” The “(i.e., logically coherent)” helps readers who aren’t philosophy majors.
Qwen and Gemini, on the other hand, were more academic in their style. Qwen organized the content under formal headings like “Theistic Arguments for God’s Existence: Cosmological and Teleological Foundations,” which made the report feel very dense to read, despite its accuracy. Gemini used Roman numerals (I. Introduction, II. Philosophical Arguments), which looked structured but required closer reading.
Both Qwen and Gemini target researchers doing serious work. ChatGPT and Grok target broader audiences.
Result: ChatGPT presented information the most clearly, followed by Grok.
Variety of sources: Does the research draw from varied traditions, disciplines, and perspectives?
Qwen integrated technical philosophy (kalām, PSR, modal S5 logic) with live scientific debates (Big Bang singularities, quantum fluctuations, DNA functionality). Its explanations were specific and gave background examples for each position and argument.
For instance, when explaining theistic arguments for God’s existence, Qwen built a table to make it easier to understand the premises, critiques, and proponents of the most relevant arguments.

Gemini matched this by covering consciousness arguments that most models ignored. It also warned against “God-of-the-gaps” reasoning more explicitly than its rivals.
ChatGPT brought unique value with its massive “Implications” section, exploring how the debate shapes science education policy, bioethics laws, and personal attitudes toward death. This was less academic and more pragmatic, but still relevant to understanding the nature of the investigation.
Grok covered the major arguments but with less detail. It mentioned fine-tuning and the anthropic principle, but didn’t cite specific values or discuss them in much depth.
Result: Qwen and Gemini were the best.
Quality: Taking everything together (rigor, coherence, scholarly value), which research would you want to cite?
Both Qwen and Gemini produced reports you could submit to your professor. Qwen’s unique strength was balancing depth on both theistic lines and atheistic critiques, including that burden-of-proof section. Gemini’s strength was integrating scientific frontiers (consciousness, evolution, cosmology) with philosophical arguments.
ChatGPT delivered substantial pedagogical value, making it great for teaching or understanding implications. Grok worked as a reliable primer or quick reference.
In other words, ChatGPT and Grok are probably the ones you’ll use if you just want to know something quickly for a conversation, to impress your nerd date, or to refresh your knowledge before a presentation on something you already know.
Final Scores:
- Qwen: 9/10
- Gemini: 9/10
- ChatGPT: 8/10
- Grok: 6/10
The podcast battle: Qwen vs Gemini
Qwen’s podcast feature puts it head-to-head with Google’s NotebookLM and Gemini, which pioneered AI-generated Audio Overviews.
Unlike Gemini, Qwen offers a large selection of host voices to choose from. The structure is solid: two AI hosts have an actual conversation about your research, not just a text-to-speech read-through.

That said, the voice quality is inconsistent. Some voices are natural, but most of them sound robotic, with weird accents. During testing, one of the male hosts kept saying “oh oh oh” repeatedly because he was impressed. My wife passed by and asked if I was watching porn.
With some trial and error, you can find a decent voice that works smoothly, and the quality improves considerably.
But Gemini and NotebookLM crush Qwen here. Google’s Audio Overviews feature, launched in NotebookLM in September 2024 and expanded to Gemini in March 2025, sounds remarkably human. The speech patterns are natural, with back-and-forth banter and even humor.
Gemini’s podcasts feel human and more engaging.
Gemini also offers video generation, which is a big advantage for those who prefer an audiovisual approach to understanding a topic rather than reading long chunks of text.
Qwen cannot do this. In fact, none of the other models we tested can.
If you want full multimedia, including audio, video, and web, Gemini is the most complete package.
The webpage advantage
Beyond research quality, Qwen’s killer feature is the auto-generated webpage. No other model does this.
After your research finishes, you can turn it into a live, hosted website. Not a PDF or a Google Doc, but a real webpage with headers, formatted tables, and citations embedded as links.
The UI looks like Kimi’s: it features clean typography and responsive design, and is instantly shareable.

ChatGPT users have to copy and paste into website builders.
Gemini keeps everything in Docs. Grok spits out text. Only Qwen automatically generates web-ready output.
That workflow advantage is nice to have.
