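Content provided by The Mad Botter. All podcast content (including episodes, graphics, and podcast descriptions) is uploaded and provided directly by The Mad Botter or its podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here: https://zh.player.fm/legal.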
605: The Democrats Behind DeepSeek
DeepSeek has everyone freaking out; we'll look at what's legitimately fascinating, what bits have been an overreaction, and the big mistake that made this all possible.
Plus, there's some bad news for Java fans.
Sponsored By:
- Bitcoin Well: Bitcoin sent directly to your wallet is the safest way to buy Bitcoin. Immediate settlement, direct to self-custody. Supports the Lightning Network.
- Coder QA: Take $2 a month off for the lifetime of your membership and contribute to our show directly. Promo code: jarjar
Links:
- 💥 Gets Sats with Strike — Strike is a lightning-powered app that lets you quickly and cheaply grab sats in over 100 countries. Easily integrates with Fountain.fm. Set up your Strike account, and you have one of the world's best ways to buy sats.
- 🇨🇦 Bitcoin Well — The fastest and safest way to buy Bitcoin in Canada and the USA. With self-custody built in. 🥇
- 📻 Boost with Fountain.FM — Boost with Fountain.FM and kick the tires on the Podcasting 2.0 revolution! 🚀
- Report: 88% of companies are contemplating leaving Oracle Java — 72% of respondents were already thinking about it when surveyed in 2023.
- State of Java 2025 - Azul Report — Insights from over 2,000 Java users across six continents to reveal 2025 Java trends that are shaping key areas of enterprise technology.
- Nvidia sheds almost $600 billion in market cap, biggest drop ever — The sell-off, which hit much of the U.S. tech sector, was sparked by concerns about increased competition from Chinese AI lab DeepSeek.
- DeepSeek’s AI Model Tests Limits of US Curbs on Nvidia Chips
- DeepSeek-V3 Technical Report — We present DeepSeek-V3, a strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token. To achieve efficient inference and cost-effective training, DeepSeek-V3 adopts Multi-head Latent Attention (MLA) and DeepSeekMoE architectures, which were thoroughly validated in DeepSeek-V2. Furthermore, DeepSeek-V3 pioneers an auxiliary-loss-free strategy for load balancing and sets a multi-token prediction training objective for stronger performance. We pre-train DeepSeek-V3 on 14.8 trillion diverse and high-quality tokens, followed by Supervised Fine-Tuning and Reinforcement Learning stages to fully harness its capabilities. Comprehensive evaluations reveal that DeepSeek-V3 outperforms other open-source models and achieves performance comparable to leading closed-source models. Despite its excellent performance, DeepSeek-V3 requires only 2.788M H800 GPU hours for its full training. In addition, its training process is remarkably stable.
- Deepseek: The Quiet Giant Leading China’s AI Race
- DeepSeek’s Popular AI App Is Explicitly Sending US Data to China — Amid ongoing fears over TikTok, Chinese generative AI platform DeepSeek says it’s sending heaps of US user data straight to its home country, potentially setting the stage for greater scrutiny.
- DeepSeek hit with large-scale cyberattack, says it's limiting registrations — DeepSeek on Monday said it would temporarily limit user registrations “due to large-scale malicious attacks” on its services.
- Satya Nadella on X — Jevons paradox strikes again! As AI gets more efficient and accessible, we will see its use skyrocket, turning it into a commodity we just can't get enough of.
- Jevons paradox - Wikipedia
- Sam Altman on X — deepseek's r1 is an impressive model, particularly around what they're able to deliver for the price. we will obviously deliver much better models and also it's legit invigorating to have a new competitor! we will pull up some releases.
- Biden Got Freaked Out About AI and National Security After Watching the Newest 'Mission: Impossible' Movie — Speaking to The Associated Press, deputy White House chief of staff Bruce Reed recalled that while Biden has grown concerned over the use of AI to generate fake images of himself or clone a user's voice, it was a screening of "Mission: Impossible -- Dead Reckoning Part One" at Camp David that particularly alarmed the president.