Artwork

内容由Shimin Zhang & Dan Lasky, Shimin Zhang, and Dan Lasky提供。所有播客内容(包括剧集、图形和播客描述)均由 Shimin Zhang & Dan Lasky, Shimin Zhang, and Dan Lasky 或其播客平台合作伙伴直接上传和提供。如果您认为有人在未经您许可的情况下使用您的受版权保护的作品,您可以按照此处概述的流程进行操作https://zh.player.fm/legal
Player FM -播客应用
使用Player FM应用程序离线!

Claude Opus 4.5, Olmo 3, and a Paper on Diffusion + Auto Regression

47:45
 
分享
 

Manage episode 521719471 series 3703995
内容由Shimin Zhang & Dan Lasky, Shimin Zhang, and Dan Lasky提供。所有播客内容(包括剧集、图形和播客描述)均由 Shimin Zhang & Dan Lasky, Shimin Zhang, and Dan Lasky 或其播客平台合作伙伴直接上传和提供。如果您认为有人在未经您许可的情况下使用您的受版权保护的作品,您可以按照此处概述的流程进行操作https://zh.player.fm/legal

In this episode of Artificial Developer Intelligence, hosts Shimin and Dan explore the latest advancements in AI models, including the release of Claude Opus 4.5 and Gemini 3. They discuss the implications of these models on software engineering, the rise of open-source models like Olmo 3, and the enhancements in the Claude Developer Platform. The conversation also delves into the challenges of relying on AI for coding tasks, the potential pitfalls of the AI bubble, and the future of written exams in the age of AI.

Takeaways

  • Claude Opus 4.5 setting benchmarks, enhance usability and reduce token consumption.
  • The introduction of open-source models like Olmo 3 is a significant development in AI.
  • The future of written exams may be challenged by AI's ability to generate human-like responses.
  • Relying too heavily on AI can lead to a lack of critical thinking and problem-solving skills.
  • The AI bubble is at 25s to midnight
  • Recent research suggests that AI models can improve their performance through emulating query based search.
  • The importance of prompt engineering in AI interactions is highlighted.

Resources Mentioned
Introducing Claude Opus 4.5
Build with Nano Banana Pro, our Gemini 3 Pro Image model
Andrej Karpathy's Post about Nano Banana Pro
Olmo 3: Charting a path through the model flow to lead open-source AI
Introducing advanced tool use on the Claude Developer Platform
TiDAR: Think in Diffusion, Talk in Autoregression
SSRL: SELF-SEARCH REINFORCEMENT LEARNING
Mira Murati's Thinking Machines seeks $50 billion valuation in funding talks, Bloomberg News reports
Boom, bubble, bust, boom. Why should AI be different?
Nvidia didn’t save the market. What’s next for the AI trade?

Chapters

  • (00:00) - Introduction to Artificial Developer Intelligence
  • (01:25) - Claude Opus 4.5
  • (07:02) - Exploring Gemini 3 and Image Models
  • (11:24) - Olmo 3 and The Rise of Open Flow Models
  • (15:46) - Innovations in AI Tools and Platforms
  • (19:33) - Research Insights: Diffusion and Auto-Regression Models
  • (23:39) - Advancements in AI Output Efficiency
  • (25:45) - Exploring Self Search Reinforcement Learning
  • (27:48) - The Dilemma of Language Models
  • (30:11) - Prompt Engineering and Search Integration
  • (32:55) - Dan's Rants on AI Limitations
  • (38:17) - 2 Minutes to Midnight
  • (46:41) - Outro

Connect with ADIPod
  continue reading

4集单集

Artwork
icon分享
 
Manage episode 521719471 series 3703995
内容由Shimin Zhang & Dan Lasky, Shimin Zhang, and Dan Lasky提供。所有播客内容(包括剧集、图形和播客描述)均由 Shimin Zhang & Dan Lasky, Shimin Zhang, and Dan Lasky 或其播客平台合作伙伴直接上传和提供。如果您认为有人在未经您许可的情况下使用您的受版权保护的作品,您可以按照此处概述的流程进行操作https://zh.player.fm/legal

In this episode of Artificial Developer Intelligence, hosts Shimin and Dan explore the latest advancements in AI models, including the release of Claude Opus 4.5 and Gemini 3. They discuss the implications of these models on software engineering, the rise of open-source models like Olmo 3, and the enhancements in the Claude Developer Platform. The conversation also delves into the challenges of relying on AI for coding tasks, the potential pitfalls of the AI bubble, and the future of written exams in the age of AI.

Takeaways

  • Claude Opus 4.5 setting benchmarks, enhance usability and reduce token consumption.
  • The introduction of open-source models like Olmo 3 is a significant development in AI.
  • The future of written exams may be challenged by AI's ability to generate human-like responses.
  • Relying too heavily on AI can lead to a lack of critical thinking and problem-solving skills.
  • The AI bubble is at 25s to midnight
  • Recent research suggests that AI models can improve their performance through emulating query based search.
  • The importance of prompt engineering in AI interactions is highlighted.

Resources Mentioned
Introducing Claude Opus 4.5
Build with Nano Banana Pro, our Gemini 3 Pro Image model
Andrej Karpathy's Post about Nano Banana Pro
Olmo 3: Charting a path through the model flow to lead open-source AI
Introducing advanced tool use on the Claude Developer Platform
TiDAR: Think in Diffusion, Talk in Autoregression
SSRL: SELF-SEARCH REINFORCEMENT LEARNING
Mira Murati's Thinking Machines seeks $50 billion valuation in funding talks, Bloomberg News reports
Boom, bubble, bust, boom. Why should AI be different?
Nvidia didn’t save the market. What’s next for the AI trade?

Chapters

  • (00:00) - Introduction to Artificial Developer Intelligence
  • (01:25) - Claude Opus 4.5
  • (07:02) - Exploring Gemini 3 and Image Models
  • (11:24) - Olmo 3 and The Rise of Open Flow Models
  • (15:46) - Innovations in AI Tools and Platforms
  • (19:33) - Research Insights: Diffusion and Auto-Regression Models
  • (23:39) - Advancements in AI Output Efficiency
  • (25:45) - Exploring Self Search Reinforcement Learning
  • (27:48) - The Dilemma of Language Models
  • (30:11) - Prompt Engineering and Search Integration
  • (32:55) - Dan's Rants on AI Limitations
  • (38:17) - 2 Minutes to Midnight
  • (46:41) - Outro

Connect with ADIPod
  continue reading

4集单集

所有剧集

×
 
Loading …

欢迎使用Player FM

Player FM正在网上搜索高质量的播客,以便您现在享受。它是最好的播客应用程序,适用于安卓、iPhone和网络。注册以跨设备同步订阅。

 

快速参考指南

版权2025 | 隐私政策 | 服务条款 | | 版权
边探索边听这个节目
播放