Training Large Language Models To Reason In Continuous Latent Space Deep Papers podcast

Artwork

Science Tech Math Business Arize AI

内容由Arize AI提供。所有播客内容（包括剧集、图形和播客描述）均由 Arize AI 或其播客平台合作伙伴直接上传和提供。如果您认为有人在未经您许可的情况下使用您的受版权保护的作品，您可以按照此处概述的流程进行操作https://zh.player.fm/legal。

Deep Papers « »
Training Large Language Models to Reason in Continuous Latent Space

11M ago 24:58

分享

MP3•单集首页

内容由Arize AI提供。所有播客内容（包括剧集、图形和播客描述）均由 Arize AI 或其播客平台合作伙伴直接上传和提供。如果您认为有人在未经您许可的情况下使用您的受版权保护的作品，您可以按照此处概述的流程进行操作https://zh.player.fm/legal。

LLMs have typically been restricted to reason in the "language space," where chain-of-thought (CoT) is used to solve complex reasoning problems. But a new paper argues that language space may not always be the best for reasoning. In this paper read, we cover an exciting new technique from a team at Meta called Chain of Continuous Thought—also known as "Coconut." In the paper, "Training Large Language Models to Reason in a Continuous Latent Space" explores the potential of allowing LLMs to reason in an unrestricted latent space instead of being constrained by natural language tokens.
Read a full breakdown of Coconut on our blog, or join us live for the next paper reading.

Learn more about AI observability and evaluation, join the Arize AI Slack community or get the latest on LinkedIn and X.

… continue reading

59集单集

#Science #Tech #Math #Business #Arize AI

Artwork

Training Large Language Models to Reason in Continuous Latent Space

33 subscribers

published 11M ago

分享

MP3•单集首页

内容由Arize AI提供。所有播客内容（包括剧集、图形和播客描述）均由 Arize AI 或其播客平台合作伙伴直接上传和提供。如果您认为有人在未经您许可的情况下使用您的受版权保护的作品，您可以按照此处概述的流程进行操作https://zh.player.fm/legal。

LLMs have typically been restricted to reason in the "language space," where chain-of-thought (CoT) is used to solve complex reasoning problems. But a new paper argues that language space may not always be the best for reasoning. In this paper read, we cover an exciting new technique from a team at Meta called Chain of Continuous Thought—also known as "Coconut." In the paper, "Training Large Language Models to Reason in a Continuous Latent Space" explores the potential of allowing LLMs to reason in an unrestricted latent space instead of being constrained by natural language tokens.
Read a full breakdown of Coconut on our blog, or join us live for the next paper reading.

Learn more about AI observability and evaluation, join the Arize AI Slack community or get the latest on LinkedIn and X.

… continue reading

59集单集

#Science #Tech #Math #Business #Arize AI

所有剧集

×

欢迎使用Player FM

Player FM正在网上搜索高质量的播客，以便您现在享受。它是最好的播客应用程序，适用于安卓、iPhone和网络。注册以跨设备同步订阅。

收听超过500个主题

边探索边听这个节目