刘洋的博客分享 http://blog.sciencenet.cn/u/outcrop 管中窥豹,盲人摸象

博文

第一个商业级扩散大语言模型(diffusion large language models)发布

已有 1106 次阅读 2025-2-27 11:19 |个人分类:计算机应用技术|系统分类:海外观察

Firefox_Screenshot_2025-02-27T03-03-22.452Z.png

Inception Labs(官网地址:https://www.inceptionlabs.ai/)宣布发布了第一个商业级扩散大语言模型(Introducing Mercury, the first commercial-scale diffusion large language model),一个字:

快!

Firefox_Screenshot_2025-02-27T03-09-26.675Z.png

非常快!

官网介绍是:

Our Vision — Next Generation LLMs Powered By Diffusion

Current large language models are autoregressive, meaning that they generate text left to right, one token at a time. Generation is inherently sequential—a token cannot be generated until all the text that comes before it has been generated—and generating each token requires evaluating a neural network with billions of parameters. Frontier LLM companies are betting on test-time computation to increase reasoning and error-correction capabilities, but generating long reasoning traces comes at the price of ballooning inference costs and unusable latency. A paradigm shift is needed to make high-quality AI solutions truly accessible.

Diffusion models provide such a paradigm shift. These models operate with a “coarse-to-fine” generation process, where the output is refined from pure noise over a few “denoising” steps, as illustrated in the video above.

Because diffusion models are not restricted to only considering previous output, they are better at reasoning and at structuring their responses. And because diffusion models can continually refine their outputs, they can correct mistakes and hallucinations. For these reasons, diffusion powers all of the most prominent AI solutions for video, image, and audio generation, including Sora, Midjourney, and Riffusion. However, applications of diffusion to discrete data such as text and code have never been successful. Until now.

AI的迭代演进很快。



https://blog.sciencenet.cn/blog-1750-1475048.html

上一篇:开源MiniCPM-Llama3-V-2_5性能不错
下一篇:人民大学发布大语言扩散模型Large Language Diffusion Models
收藏 IP: 43.128.115.*| 热度|

3 池德龙 刘进平 郑永军

该博文允许注册用户评论 请点击登录 评论 (0 个评论)

1/0 | 总计:0 | 首页 | 上一页 | 跳转

Archiver|手机版|科学网 ( 京ICP备07017567号-12 )

GMT+8, 2025-3-15 23:07

Powered by ScienceNet.cn

Copyright © 2007-2025 中国科学报社

返回顶部