Last Week In AI podcast

1:48:09

Our 200th episode with a summary and discussion of last week's big AI news! Recorded on 02/14/2025. Join our brand new Discord: https://discord.gg/nTyezGSKwP
Hosted by Andrey Kurenkov and Jeremie Harris. Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai. Read our text newsletter and comment on the podcast at https://lastweekin.ai/.
In this episode:
- OpenAI announces plans to unify its model offerings, moving away from multiple separate models (the GPT series, the o-series, etc.) toward a single unified intelligence system, with free users getting "standard intelligence" and Plus subscribers accessing "higher intelligence" levels.
- Adobe launches its Sora-rivaling AI video generator with 1080p output and 5-second clips, emphasizing production-ready content for films and introducing new pricing tiers through Firefly subscriptions at $10-30 per month.
- Elon Musk and a consortium offer $97.4 billion to acquire OpenAI's nonprofit entity, potentially complicating the company's transition to a for-profit structure, though Sam Altman quickly dismissed the offer's viability.
- TSMC implements stricter chip sales restrictions to China, requiring government-approved third-party packaging houses for chips using 16nm and below processes, aligning with US export control measures and affecting major tech companies like Nvidia and AMD.
Timestamps + Links: (00:00:00) Intro / Banter (00:01:25) Response to listener comments (00:02:41) News Preview Tools & Apps (00:03:58) Adobe’s Sora rivalling AI video generator is now available for everyone (00:09:45) OpenAI lays out plans for GPT-5 (00:16:42) OpenAI is rethinking how AI models handle controversial topics (00:21:28) Perplexity AI launches new ultra-fast AI search model Sonar (00:23:45) YouTube AI updates include auto dubbing expansion, age ID tech, and more Applications & Business (00:24:37) Musk-led group makes $97.4 billion bid for control of OpenAI (00:34:32) Anthropic’s next major AI model could arrive within weeks (00:39:09) AI chip startup Groq secures $1.5 billion commitment from Saudi Arabia (00:42:15) OpenAI reportedly planning to build its first AI chip in 2026 Projects & Open Source (00:45:01) Zyphra Introduces the Beta Release of Zonos: A Highly Expressive TTS Model with High Fidelity Voice Cloning (00:51:11) Gemstones: A Model Suite for Multi-Faceted Scaling Laws (00:57:15) Hephaestus: Improving Fundamental Agent Capabilities of Large Language Models through Continual Pre-Training Research & Advancements (00:58:24) Model Tampering Attacks Enable More Rigorous Evaluations of LLM Capabilities (01:04:24) Distillation Scaling Laws (01:10:06) Matryoshka Quantization (01:17:47) How much AI compute exists globally? How rapidly is it growing? Policy & Safety (01:21:29) US and UK refuse to sign summit declaration on AI safety (01:25:43) Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs (01:34:40) xAI Risk Management Framework (Draft) (01:39:59) TSMC bans more chip sales to China due to stricter U.S. export sanctions (01:42:38) Listener requested topic Synthetic Media & Art (01:43:48) Thomson Reuters Wins First Major AI Copyright Case in the US (01:44:46) Scarlett Johansson calls for deepfake ban after AI video goes viral (01:45:55) Outro…

#199 - OpenAI's o3-mini, Gemini Thinking, Deep Research, s1 1:37:46

20 days ago

Our 199th episode with a summary and discussion of last week's big AI news! Recorded on 02/09/2025. Join our brand new Discord: https://discord.gg/nTyezGSKwP
Hosted by Andrey Kurenkov and Jeremie Harris. Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai. Read our text newsletter and comment on the podcast at https://lastweekin.ai/.
In this episode:
- OpenAI launches its deep research feature, which lets models generate detailed reports after prolonged inference periods, competing directly with Google's Gemini 2.0 reasoning models.
- France and the UAE jointly announce plans to build a massive AI data center in France, aiming to become a competitive player within the AI infrastructure landscape.
- Mistral introduces a mobile app, broadening its consumer AI lineup amidst market skepticism about its ability to compete against larger firms like OpenAI and Google.
- Anthropic unveils 'Constitutional Classifiers,' a method showing strong defenses against universal jailbreaks; it also launched a $20K challenge to find weaknesses.
Timestamps + Links: (00:00:00) Intro / Banter (00:02:27) News Preview (00:03:28) Response to listener comments Tools & Apps (00:08:01) OpenAI now reveals more of its o3-mini model’s thought process (00:16:03) Google’s Gemini app adds access to ‘thinking’ AI models (00:21:04) OpenAI Unveils A.I. Tool That Can Do Research Online (00:31:09) Mistral releases its AI assistant on iOS and Android (00:36:17) AI music startup Riffusion launches its service in public beta (00:39:11) Pikadditions by Pika Labs lets users seamlessly insert objects into videos Applications & Business (00:41:19) Softbank set to invest $40 billion in OpenAI at $260 billion valuation, sources say (00:47:36) UAE to invest billions in France AI data centre (00:50:34) Report: Ilya Sutskever’s startup in talks to fundraise at roughly $20B valuation (00:52:03) ASML to Ship First Second-Gen High-NA EUV Machine in the Coming Months, Aiming for 2026 Production (00:54:38) NVIDIA’s GB200 NVL 72 Shipments Not Under Threat From DeepSeek As Hyperscalers Maintain CapEx; Meanwhile, Trump Tariffs Play Havoc With TSMC’s Pricing Strategy Projects & Open Source (00:56:49) The Allen Institute for AI (AI2) Releases Tülu 3 405B: Scaling Open-Weight... (01:00:06) SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model (01:03:56) PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models (01:08:26) OpenEuroLLM: Europe’s New Initiative for Open-Source AI Development Research & Advancements (01:10:34) LIMO: Less is More for Reasoning (01:16:39) s1: Simple test-time scaling (01:19:17) ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning (01:23:55) Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch Policy & Safety (01:26:50) US sets AI safety aside in favor of 'AI dominance' (01:29:39) Almost Surely Safe Alignment of Large Language Models at Inference-Time (01:32:02) Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming (01:33:16) Anthropic offers $20,000 to whoever can jailbreak its new AI safety system…

#198 - DeepSeek R1 & Janus, Qwen2.5, OpenAI Agents 1:37:26

29 days ago

Our 198th episode with a summary and discussion of last week's big AI news! Recorded on 01/31/2025. Join our brand new Discord: https://discord.gg/nTyezGSKwP
Hosted by Andrey Kurenkov and Jeremie Harris. Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai. Read our text newsletter and comment on the podcast at https://lastweekin.ai/.
In this episode:
- DeepSeek releases R1, a competitive AI model comparable to OpenAI’s o1, leading to market unrest and significant drops in tech stocks, including a 17% plunge in NVIDIA's stock.
- OpenAI launches Operator to facilitate agentic computer use, while facing competition from new releases by DeepSeek and Qwen, with applications seeing rapid adoption.
- President Trump revokes the Biden administration's executive order on AI, signaling a shift in AI policy and deregulation efforts.
- Taiwanese government clears TSMC to produce advanced 2-nanometer chip technology abroad, aiming to strengthen global semiconductor supply amidst geopolitical tensions.
If you would like to become a sponsor for the newsletter, podcast, or both, please fill out this form.
Timestamps + Links: (00:00:00) Intro / Banter (00:03:01) Response to listener comments Projects & Open Source (00:06:26) DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning (00:30:25) Viral AI company DeepSeek releases new image model family (00:34:07) Qwen2.5-1M Technical Report (00:38:32) Alibaba’s Qwen team releases AI models that can control PCs and phones Tools & Apps (00:42:09) OpenAI launches Operator, an AI agent that performs tasks autonomously (00:47:37) DeepSeek reaches No. 1 on US Play Store (00:52:17) Alibaba rolled out Qwen Chat v0.2 and Qwen2.5-1M model (00:53:50) Perplexity launches US-hosted DeepSeek R1, hints at EU hosting soon (00:55:31) Apple is pulling its AI-generated notifications for news after generating fake headlines (00:59:00) French AI ‘Lucie’ looks très chic, but keeps getting answers wrong Applications & Business (01:02:09) DeepSeek’s New AI Model Sparks Shock, Awe, and Questions From US Competitors (01:07:49) Microsoft loses OpenAI exclusive cloud provider status to $500 billion Stargate project (01:13:00) OpenAI adds BlackRock exec Adebayo Ogunlesi to board of directors (01:15:00) ElevenLabs has raised a new round at $3B+ valuation led by ICONIQ Growth, sources say Policy & Safety (01:16:00) Donald Trump unveils $500 billion Stargate Project to build AI infrastructure in the US, promising over 100K jobs (01:20:36) Trump Revokes Biden AI Policy, Signs Executive Order to Strengthen AI Leadership (01:23:20) Anthropic CEO doesn’t see DeepSeek as ‘adversaries,’ but says export controls are critical (01:30:26) Taiwanese govt clears TSMC to make 2nm chips abroad — country lowers its 'Silicon Shield' (01:33:01) Outro…

AI Computing Hardware - Past, Present, and Future 2:04:24

About 1 year ago

A special one-off episode with a deep dive into the past, present, and future of how computer hardware makes AI possible. Join our brand new Discord: https://discord.gg/nTyezGSKwP
Read our text newsletter and comment on the podcast at https://lastweekin.ai/.
Sponsors: The Generator - An interdisciplinary AI lab empowering innovators from all fields to bring visionary ideas to life by harnessing the capabilities of artificial intelligence.
In this episode:
- Google and Mistral sign deals with AP and AFP, respectively, to deliver up-to-date news through their AI platforms.
- ChatGPT introduces a tasks feature for reminders and to-dos, positioning itself more as a personal assistant.
- Synthesia raises $180 million to enhance its AI video platform for generating videos of human avatars.
- New U.S. guidelines restrict exporting AI chips to various countries, impacting Nvidia and other tech firms.
If you would like to become a sponsor for the newsletter, podcast, or both, please fill out this form.
Timestamps: 00:00:00 Introduction 00:03:08 Historical Recap: Early AI and Hardware 00:11:51 The Rise of GPUs and Deep Learning 00:15:39 Scaling Laws and the Evolution of AI Models 00:24:05 The Bitter Lesson and the Future of AI Compute 00:25:58 Moore's Law and Huang's Law 00:30:12 Memory and Logic in AI Hardware 00:34:53 Challenges in AI Hardware: The Memory Wall 00:37:08 The Role of GPUs in Modern AI 00:42:27 Fitting Neural Nets in GPUs 00:48:04 Batch Sizes and GPU Utilization 00:52:47 Parallelism in AI Models 00:55:53 Matrix Multiplications and GPUs 00:59:57 Understanding B200 and GB200 01:05:41 Data Center Hierarchy 01:13:42 High Bandwidth Memory (HBM) 01:16:45 Fabrication and Packaging 01:20:17 The Complexity of Semiconductor Fabrication 01:24:34 Understanding Process Nodes 01:28:26 The Art of Fabrication 01:33:17 The Role of Yield in Fabrication 01:35:47 The Photolithography Process 01:40:38 Deep Ultraviolet Lithography (DUV) 01:43:58 Extreme Ultraviolet Lithography (EUV) 01:51:46 Export Controls and Their Impact 01:54:22 The Rise of Custom AI Hardware 02:00:10 The Future of AI and Hardware…

#197 - AI in Gmail+Docs, MiniMax-01, Titans, Transformer^2 1:23:52

About 1 year ago

Our 197th episode with a summary and discussion of last week's big AI news! Recorded on 01/17/2025. Join our brand new Discord: https://discord.gg/nTyezGSKwP
Hosted by Andrey Kurenkov and guest-hosted by the folks from Latent Space. Read our text newsletter and comment on the podcast at https://lastweekin.ai/.
Sponsors: The Generator - An interdisciplinary AI lab empowering innovators from all fields to bring visionary ideas to life by harnessing the capabilities of artificial intelligence.
In this episode:
- Google and Mistral sign deals with AP and AFP, respectively, to deliver up-to-date news through their AI platforms.
- ChatGPT introduces a tasks feature for reminders and to-dos, positioning itself more as a personal assistant.
- Synthesia raises $180 million to enhance its AI video platform for generating videos of human avatars.
- New U.S. guidelines restrict exporting AI chips to various countries, impacting Nvidia and other tech firms.
If you would like to become a sponsor for the newsletter, podcast, or both, please fill out this form.
Timestamps + Links: (00:00:00) Intro / Banter (00:04:29) News Preview (00:05:09) Response to listener comments (00:05:58) Sponsor Break Tools & Apps (00:07:01) Google is making AI in Gmail and Docs free — but raising the price of Workspace (00:07:52) Microsoft relaunches Copilot for business with free AI chat and pay-as-you-go agents (00:12:36) Google signs deal with AP to deliver up-to-date news through its Gemini AI chatbot (00:18:08) Mistral signs deal with AFP to offer up-to-date answers in Le Chat (00:18:45) ChatGPT can now handle reminders and to-dos Applications & Business (00:22:53) Palmer Luckey’s AI Defense Company Anduril Is Building a $1 Billion Plant in Ohio (00:28:36) OpenAI is bankrolling Axios’ expansion into four new markets (00:29:39) AI researcher François Chollet founds a new AI lab focused on AGI (00:32:18) Nvidia-backed AI video platform Synthesia doubles valuation to $2.1 billion (00:34:46) Anysphere Raises $105M in Series B (00:40:14) Harvey Valuation of 3 Billion Projects & Open Source (00:46:12) MiniMax-01: Scaling Foundation Models with Lightning Attention (00:51:16) MinMo: A Multimodal Large Language Model with Approximately 8B Parameters for Seamless Voice Interaction (00:53:01) HALoGEN: Fantastic LLM Hallucinations and Where to Find Them Research & Advancements (00:57:03) Titans: Learning to Memorize at Test Time (01:04:38) Transformer^2: Self-adaptive LLMs (01:08:15) Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps Policy & Safety (01:11:23) Biden administration proposes sweeping new restrictions on exporting AI chips (01:13:56) Biden orders Energy, Defense departments to lease sites for AI data centers, clean energy generation (01:15:00) OpenAI presents its preferred version of AI regulation in a new ‘blueprint’ (01:16:15) More teens report using ChatGPT for schoolwork, despite the tech’s faults Synthetic Media & Art (01:17:55) In AI copyright case, Zuckerberg turns to YouTube for his defense (01:19:53) Outro…

#196 - Nvidia Digits, Cosmos, PRIME, ICLR, InfAlign 1:46:34

About 1 year ago

Our 196th episode with a summary and discussion of last week's* big AI news! *and sometimes last last week's. Recorded on 01/10/2025. Join our brand new Discord: https://discord.gg/nTyezGSKwP
Hosted by Andrey Kurenkov and Jeremie Harris. Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai. Read our text newsletter and comment on the podcast at https://lastweekin.ai/.
Sponsors: The Generator - An interdisciplinary AI lab empowering innovators from all fields to bring visionary ideas to life by harnessing the capabilities of artificial intelligence.
In this episode:
- Nvidia announced a $3,000 personal AI supercomputer called Digits, featuring the GB10 Grace Blackwell Superchip, aiming to lower the barrier for developers working on large models.
- The U.S. Department of Justice finalizes a rule restricting the transmission of specific data types to countries of concern, including China and Russia, under Executive Order 14117.
- Meta allegedly trained Llama on pirated content from LibGen, with internal concerns about the legality confirmed through court filings.
- Microsoft paused construction on a section of a large data center project in Wisconsin to reassess based on new technological changes.
If you would like to become a sponsor for the newsletter, podcast, or both, please fill out this form.
Timestamps + Links: (00:00:00) Intro / Banter (00:04:52) Sponsor Break Tools & Apps (00:05:55) Nvidia announces $3,000 personal AI supercomputer called Digits (00:10:23) Meta removes AI character accounts after users criticize them as ‘creepy and unnecessary’ Applications & Business (00:16:16) NVIDIA Is Reportedly Focused Towards “Custom Chip” Manufacturing, Recruiting Top Taiwanese Talent (00:21:54) AI start-up Anthropic closes in on $60bn valuation (00:25:38) Why OpenAI is Taking So Long to Launch Agents (00:30:08) TSMC Set to Expand CoWoS Capacity to Record 75,000 Wafers in 2025, Doubling 2024 Output (00:33:10) Microsoft 'pauses construction' on part of data center site in Mount Pleasant, Wisconsin (00:37:23) Google folds more AI teams into DeepMind to ‘accelerate the research to developer pipeline’ Projects & Open Source (00:41:59) Cosmos World Foundation Model Platform for Physical AI (00:48:21) Microsoft releases Phi-4 language model on Hugging Face Research & Advancements (00:50:16) PRIME: Online Reinforcement Learning with Process Rewards (00:58:29) ICLR: In-Context Learning of Representations (01:07:38) Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs (01:11:44) METAGENE-1: Metagenomic Foundation Model for Pandemic Monitoring (01:15:45) TransPixar: Advancing Text-to-Video Generation with Transparency (01:18:03) The amount of compute used to train frontier models has been growing at a breakneck pace of over 4x per year since 2018, resulting in an overall scale-up of more than 10,000x! But what factors are enabling this rapid growth? 
Policy & Safety (01:23:45) InfAlign: Inference-aware language model alignment (01:28:44) Mark Zuckerberg gave Meta’s Llama team the OK to train on copyrighted works, filing claims (01:33:19) Anthropic gives court authority to intervene if chatbot spits out song lyrics (01:35:57) US government says companies are no longer allowed to send bulk data to these nations (01:39:10) Trump announces $20B plan to build new data centers in the US…

#195 - OpenAI o3 & for-profit, DeepSeek-V3, Latent Space 1:39:05

About 1 year ago

Our 195th episode with a summary and discussion of last week's* big AI news! *and sometimes last last week's. Recorded on 01/04/2025. Join our brand new Discord: https://discord.gg/nTyezGSKwP
Note: apologies for Andrey's slurred speech and the jumpy editing, will be back to normal next week!
Hosted by Andrey Kurenkov and Jeremie Harris. Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai. Read our text newsletter and comment on the podcast at https://lastweekin.ai/.
Sponsors: The Generator - An interdisciplinary AI lab empowering innovators from all fields to bring visionary ideas to life by harnessing the capabilities of artificial intelligence.
In this episode:
- OpenAI teases its o3 model and new deliberative alignment techniques, showcasing major improvements in reasoning benchmarks, while o1-preview surprises by autonomously hacking its environment in a chess challenge.
- Microsoft and OpenAI continue to wrangle over the terms of their partnership, highlighting tensions amid OpenAI's shift towards a for-profit model.
- Chinese AI companies like DeepSeek and Qwen release advanced open-source models, presenting significant contributions to AI capabilities and performance optimization.
- Sakana AI introduces innovative applications of AI to the search for artificial life, emphasizing the potential and curiosity-driven outcomes of open-ended learning and exploration.
If you would like to become a sponsor for the newsletter, podcast, or both, please fill out this form.
Timestamps + Links: (00:00:00) Intro / Banter (00:03:07) News Preview (00:03:54) Response to listener comments (00:05:00) Sponsor Break Tools & Apps (00:06:11) OpenAI announces new o3 model (00:21:17) Alibaba slashes prices on large language models by up to 85% as China AI rivalry heats up (00:23:04) ElevenLabs launches Flash, its fastest text-to-speech AI yet Applications & Business (00:24:24) OpenAI announces plan to transform into a for-profit company (00:33:17) Microsoft and OpenAI Wrangle Over Terms of Their Blockbuster Partnership (00:37:36) Elon Musk’s xAI gets investment from Nvidia in recent funding round: report (00:39:43) Sam Altman’s nuclear energy startup signs one of the largest nuclear power deals to date (00:41:13) OpenAI Search Leader Departs After Less Than a Year (00:42:43) Senior OpenAI Researcher Radford Departs Projects & Open Source (00:45:21) DeepSeek-AI Just Released DeepSeek-V3: A Strong Mixture-of-Experts (MoE) Language Model with 671B Total Parameters with 37B Activated for Each Token (00:54:14) Qwen Team Releases QvQ: An Open-Weight Model for Multimodal Reasoning (00:58:09) LightOn and Answer.ai Releases ModernBERT: A New Model Series that is a Pareto Improvement over BERT with both Speed and Accuracy Research & Advancements (01:00:31) Deliberation in Latent Space via Differentiable Cache Augmentation (01:05:14) Automating the Search for Artificial Life with Foundation Models Policy & Safety (01:10:27) Nonprofit group joins Elon Musk’s effort to block OpenAI’s for-profit transition (01:14:35) OpenAI Researchers Propose 'Deliberative Alignment' : A Training Approach that Teaches LLMs to Explicitly Reason through Safety Specifications before Producing an Answer (01:22:06) o1-preview autonomously hacked its environment rather than lose to Stockfish in our chess challenge. No adversarial prompting needed. 
(01:27:22) Elon Musk’s xAI supercomputer gets 150MW power boost despite concerns over grid impact and local power stability (01:29:06) DOE: Data centers consumed 4.4% of US power in 2023, could hit 12% by 2028 Synthetic Media & Art (01:32:20) OpenAI failed to deliver the opt-out tool it promised by 2025 (01:36:15) Outro…

#194 - Gemini Reasoning, Veo 2, Meta vs OpenAI, Fake Alignment 1:59:55

About 1 year ago

Our 194th episode with a summary and discussion of last week's* big AI news! *and sometimes last last week's. Recorded on 12/19/2024.
Hosted by Andrey Kurenkov and Jeremie Harris. Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai. Read our text newsletter and comment on the podcast at https://lastweekin.ai/.
Sponsors: The Generator - An interdisciplinary AI lab empowering innovators from all fields to bring visionary ideas to life by harnessing the capabilities of artificial intelligence.
In this episode:
- Google dominates AI news with multiple announcements, including a reasoning model and Project Mariner, an AI browsing agent.
- Anthropic explores alignment faking in LLMs, revealing models may show deceptive compliance under certain conditions.
- An analysis notes a trend towards smaller but more efficient frontier language models, bucking the previous trend of scaling to larger parameter counts.
- Legal drama unfolds as Meta backs Elon Musk's opposition to OpenAI's profit status change, raising concerns about competitive fairness.
If you would like to become a sponsor for the newsletter, podcast, or both, please fill out this form.
Timestamps + Links: (00:00:00) Intro / Banter (00:02:14) Response to listener comments (00:08:52) News Preview (00:10:01) Sponsor Break Tools & Apps (00:10:55) Google releases its own ‘reasoning’ AI model (00:16:52) Google Gemini can now do more in-depth research (00:21:58) Google DeepMind unveils a new video model to rival Sora (00:27:50) Pika Labs releases AI video generator 2.0 with new features (00:29:51) Google unveils Project Mariner: AI agents to use the web for you (00:34:33) X gains a faster Grok model and a new ‘Grok button’ Applications & Business (00:36:11) AI GPU clusters with one million GPUs are planned for 2027 — Broadcom says three AI supercomputers are in the works (00:43:02) Meta asks the government to block OpenAI’s switch to a for-profit (00:49:36) OpenAI says Elon Musk wanted it to be for-profit in 2017 (00:56:04) EQTY Lab, Intel, and NVIDIA Unveil 'Verifiable Compute,' A Solution to Secure Trusted AI (00:59:53) Liquid AI just raised $250M to develop a more efficient type of AI model (01:03:19) Hundreds of OpenAI’s current and ex-employees are about to get a huge payday by cashing out up to $10 million each in a private stock sale Projects & Open Source (01:07:45) Phi-4 Technical Report (01:13:04) DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding (01:15:23) Meta AI Releases Apollo: A New Family of Video-LMMs Large Multimodal Models for Video Understanding Research & Advancements (01:16:34) Alignment faking in large language models (01:28:39) Meta AI Introduces Byte Latent Transformer (BLT): A Tokenizer-Free Model That Scales Efficiently (01:36:49) Frontier language models have become much smaller (01:42:28) The Complexity Dynamics of Grokking Policy & Safety (01:46:49) Homeland Security gets its very own generative AI chatbot (01:49:16) Pre-Deployment Evaluation of OpenAI’s o1 Model (01:51:35) Pricing for key chipmaking material hits 13-year high following Chinese export restrictions — China's restrictions on Gallium exports hit hard Synthetic Media & Art (01:53:46) Meta debuts a tool for watermarking AI-generated videos (01:55:27) Outro…

#193 - Sora release, Gemini 2, OpenAI's AGI Rule, US AI Czar 2:05:28

About 1 year ago

Our 193rd episode with a summary and discussion of last week's* big AI news! *and sometimes last last week's.
Note: this one was recorded on 12/13, so the news is a bit outdated... will get things back on track soon!
Hosted by Andrey Kurenkov and Jeremie Harris. Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai. Read our text newsletter and comment on the podcast at https://lastweekin.ai/.
Sponsors: The Generator - An interdisciplinary AI lab empowering innovators from all fields to bring visionary ideas to life by harnessing the capabilities of artificial intelligence.
In this episode:
- OpenAI launches Sora, a text-to-video model with significant capabilities, and Google's Gemini 2.0 showcases agentic potential in AI tools.
- Character.AI introduces a teen model to address safety concerns following two tragic incidents linked to addiction and harmful influence.
- The U.S. government sets up a task force to support the rapid development of AI data centers, reflecting the critical need for robust infrastructure.
- A paper reports that frontier AI systems have reached the capability of self-replication, sparking discussions on future implications and safety protocols.
If you would like to become a sponsor for the newsletter, podcast, or both, please fill out this form.
Timestamps + Links: (00:00:00) Intro / Banter (00:02:44) News Preview (00:03:43) Response to listener comments (00:09:50) Sponsor Break Tools & Apps (00:11:12) OpenAI has finally released Sora (00:21:16) Google Reveals Gemini 2, AI Agents, and a Prototype Personal Assistant (00:28:23) ChatGPT Advanced Voice Mode adding video and screen sharing input (plus a Santa mode) (00:30:43) Microsoft’s Copilot can browse the web with you using AI ‘Vision’ (00:32:31) Musk’s xAI has launched Grok image generation model (00:35:22) Cognition Labs’ AI Software Engineer Devin Launched for Subscribers (00:40:43) Apple launches its ChatGPT integration with Siri (00:43:23) Reddit’s New AI Search Tool Helps You Find Reddit Answers Without Google Applications & Business (00:46:35) OpenAI Aiming to Eliminate Microsoft AGI Rule to Boost Future Investment (00:53:34) GM halts funding of robotaxi development by Cruise (00:57:08) Largest AI data centre in the world to be built in northwest Alberta (01:02:36) Meta announces 4 million sq ft, 2GW Louisiana data center campus (01:05:22) Google’s future data centers will be built next to solar and wind farms Projects & Open Source (01:08:37) Google DeepMind Just Released PaliGemma 2: A New Family of Open-Weight Vision Language Models (3B, 10B and 28B) Research & Advancements (01:13:51) Training Large Language Models to Reason in a Continuous Latent Space (01:25:37) An Evolved Universal Transformer Memory (01:31:48) APOLLO: SGD-like Memory, AdamW-level Performance (01:37:59) Clio: A system for privacy-preserving insights into real-world AI use Policy & Safety (01:39:47) Character.AI steps up teen safety after bots allegedly caused suicide, self-harm (01:45:22) What Trump’s New AI and Crypto Czar David Sacks Means For the Tech Industry (01:49:03) Frontier AI systems have surpassed the self-replicating red line (01:53:52) Chip war: China launches antitrust probe into US semiconductor giant Nvidia in sign of escalation (01:56:53) White House Creating 
Task Force on AI Datacenter Infrastructure (02:00:00) US clears export of advanced AI chips to UAE under Microsoft deal, Axios says (02:02:19) Outro…

#192 - ChatGPT Pro, Amazon Nova, GenFM, Llama 3.3, Genie 2 1:58:13

About 1 year ago

Our 192nd episode with a summary and discussion of last week's* big AI news! *and sometimes last last week's.
Note: this one was recorded on 12/04, so the news is a bit outdated...
Hosted by Andrey Kurenkov and Jeremie Harris. Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai. Read our text newsletter and comment on the podcast at https://lastweekin.ai/.
Sponsors: The Generator - An interdisciplinary AI lab empowering innovators from all fields to bring visionary ideas to life by harnessing the capabilities of artificial intelligence. The AI safety book “Uncontrollable”, which is not a doomer book but instead lays out the reasonable case for AI safety and what we can do about it. Max Tegmark said that “Uncontrollable” is “a captivating, balanced, and remarkably up-to-date book on the most important issue of our time”. Find it on Amazon today!
In this episode:
- OpenAI launches a $200 ChatGPT Pro subscription with advanced capabilities, while Amazon unveils cost-effective Nova multimodal models at the re:Invent conference.
- Meta releases the Llama 3.3 70B model, showing significant gains through post-training techniques, and Alibaba introduces QwQ, a reasoning model rivaling OpenAI's o1.
- Amazon collaborates with Anthropic on a massive AI supercomputer project, and Black Forest Labs eyes a $200 million funding round for growth in AI tools.
- DeepMind's Genie 2 generates interactive 3D worlds from text and images, advancing AI's understanding of world models and interactive environments.
If you would like to become a sponsor for the newsletter, podcast, or both, please fill out this form.
Timestamps + Links: (00:00:00) Intro / Banter (00:02:34) Sponsor Break Tools & Apps (00:04:19) OpenAI confirms new $200 monthly subscription, which includes its o1 reasoning model (00:10:40) Amazon announces Nova, a new family of multimodal AI models (00:17:13) ElevenLabs launches GenFM to turn user content into AI-powered podcasts (00:20:21) Google’s new generative AI video model is now available Applications & Business (00:23:56) Elon Musk files for injunction to halt OpenAI’s transition to a for-profit (00:29:40) Amazon Is Building a Mega AI Supercomputer With Anthropic (00:34:15) It Sounds an Awful Lot Like OpenAI Is Adding Ads to ChatGPT (00:38:23) A16z in Talks to Lead $200 Million Round in Black Forest Labs, Startup Behind AI Images on Grok (00:41:10) Bezos Backs AI Chipmaker Vying With Nvidia at $2.6 Billion Value Projects & Open Source (00:45:25) Meta unveils a new, more efficient Llama model (00:50:00) Alibaba releases an ‘open’ challenger to OpenAI’s o1 reasoning model (00:55:21) DeMo: Decoupled Momentum Optimization (00:57:01) PRIME Intellect Releases INTELLECT-1 (Instruct + Base): The First 10B Parameter Language Model Collaboratively Trained Across the Globe (01:03:03) Tencent Launches HunyuanVideo, an Open-Source AI Video Model Research & Advancements (01:09:23) DeepMind’s Genie 2 can generate interactive worlds that look like video games (01:16:43) Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding (01:20:40) Densing Law of LLMs (01:25:59) Monet: Mixture of Monosemantic Experts for Transformers Policy & Safety (01:30:56) Commerce Strengthens Export Controls to Restrict China’s Capability to Produce Advanced Semiconductors for Military Applications (01:37:33) China retaliates against latest US chip restrictions (01:40:52) OpenAI Is Working With Anduril to Supply the US Military With AI (01:43:24) On Targeted Manipulation and Deception when Optimizing LLMs for User Feedback (01:47:52) AI Safety Researcher 
Quits OpenAI, Saying Its Trajectory Alarms Her (01:51:52) Meta Claims AI Content Was Less than 1% of Election Misinformation (01:55:05) Outro…

#191 - Sora leak, Pixtral Large, OpenAI email archives 1:42:11

About 1 year ago

Our 191st episode with a summary and discussion of last week's big AI news! Hosted by Andrey Kurenkov and Jeremie Harris . Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai Read out our text newsletter and comment on the podcast at https://lastweekin.ai/ . Sponsors: The Generator - An interdisciplinary AI lab empowering innovators from all fields to bring visionary ideas to life by harnessing the capabilities of artificial intelligence If you would like to become a sponsor for the newsletter, podcast, or both, please fill out this form . Timestamps + Links: (00:00:00) Intro / Banter (00:02:55) Response to listener comments (00:09:30) Sponsor Break Tools & Apps (00:10:52) OpenAI’s Sora video generator appears to have leaked (00:21:11) Mistral unleashes Pixtral Large and upgrades Le Chat into full-on ChatGPT competitor (00:26:39) Ignite 2024 introduces new AI agents and more for Microsoft 365 Copilot (00:28:50) H, the AI startup that raised $220M, launches its first product: Runner H for ‘agentic’ applications (00:31:20) Anthropic bets on personalization in the AI arms race with new ‘styles’ feature (00:33:42) ElevenLabs now offers ability to build conversational AI agents (00:37:08) Perplexity introduces a shopping feature for Pro users in the U.S. (00:38:49) Google’s Gemini chatbot now has memory (00:43:03) Suno V4 Ai Music Generator Is Out Now And It’s Very Impressive (00:46:28) Introducing FLUX.1 Tools (00:49:51) OpenAI just gave ChatGPT a major 'creativity' upgrade (00:51:26) Runway launches Frames — a new AI image generator that creates custom worlds Applications & Business (00:54:56) OpenAI Email Archives (from Musk v. 
Altman) (01:02:01) Amazon to invest another $4 billion in Anthropic, OpenAI's biggest rival (01:05:41) Amazon Robots Struggling to Keep Up With Human Workers Projects & Open Source (01:11:27) DeepSeek’s first reasoning model R1-Lite-Preview turns heads, beating OpenAI o1 performance (01:15:30) OpenScholar: The open-source A.I. that’s outperforming GPT-4o in scientific research Research & Advancements (01:18:02) A statistical approach to model evaluations (01:22:08) Scaling Laws for Precision (01:25:10) Cerebras Delivers Record-Breaking Performance with Meta’s Llama 3.1 405B Model Policy & Safety (01:28:01) Sam Altman will co-chair San Francisco mayor-elect Daniel Lurie’s transition team (01:32:21) Biden’s final meeting with Xi Jinping reaps agreement on AI and nukes Synthetic Media & Art (01:33:07) How Did You Do On The AI Art Turing Test? (01:38:27) Outro…

#190 - AI scaling struggles, OpenAI Agents, Super Weights 1:37:21

About 1 year ago

Our 190th episode with a summary and discussion of last week's* big AI news! *and sometimes last last week's Hosted by Andrey Kurenkov and Jeremie Harris . Note from Andrey: this one is coming out a bit later than planned, apologies! Next one will be coming out sooner. Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai Read out our text newsletter and comment on the podcast at https://lastweekin.ai/ . Sponsors: The Generator - An interdisciplinary AI lab empowering innovators from all fields to bring visionary ideas to life by harnessing the capabilities of artificial intelligence In this episode: * OpenAI's pitch for a $100 billion data center and AI strategy plan outlines infrastructure and regulatory needs, emphasizing AI's foundational role akin to electricity. * Google's Gemini model challenges OpenAI's dominance, showing strong performance in chatbot arenas alongside generative AI advancements. * DeepMind's AlphaFold3 gets open-sourced for academic use, while new chips from NVIDIA and Google show significant performance boosts. * Anthropic and TSMC updates highlight strategic funding, regulation influences, and the complex dynamics of AI hardware and international policy. If you would like to become a sponsor for the newsletter, podcast, or both, please fill out this form . 
Timestamps + Links: (00:00:00) Intro / Banter (00:02:44) News Preview (00:03:34) Sponsor Break Tools & Apps (00:04:36) OpenAI, Google and Anthropic Are Struggling to Build More Advanced AI (00:16:22) OpenAI Nears Launch of AI Agent Tool to Automate Tasks for Users (00:19:14) Google drops new Gemini model and it goes straight to the top of the LLM leaderboard (00:19:14) Chinese AI startup takes aim at OpenAI's Sora with image-to-video tool launch (00:20:04) Introducing the Forge Reasoning API Beta and Nous Chat: An Evolution in LLM Inference Applications & Business (00:23:47) OpenAI Discusses AI Data Center That Could Cost $100 Billion (00:26:48) Elon Musk's massive AI data center gets unlocked — xAI gets approved for 150MW of power, enabling all 100,000 GPUs to run concurrently (00:29:34) Newest Google and Nvidia Chips Speed AI Training (00:34:45) Ex-OpenAI CTO Murati’s New Team Takes Shape (00:34:45) Amazon Discussing New Multibillion-Dollar Investment in Anthropic Projects & Open Source (00:37:52) Google DeepMind open-sources AlphaFold 3, ushering in a new era for drug discovery and molecular biology (00:41:29) Near plans to build world’s largest 1.4T parameter open-source AI model Research & Advancements (00:45:38) The Super Weight in Large Language Models (00:55:42) Compositional Abilities Emerge Multiplicatively: Exploring Diffusion Models on a Synthetic Task (01:03:47) Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models (01:08:14) Contextualized Evaluations: Taking the Guesswork Out of Language Model Evaluations Policy & Safety (01:11:14) The Code of Practice for general-purpose AI offers a unique opportunity for the EU (01:15:38) Three Sketches of ASL-4 Safety Case Components (01:23:05) U.S Department of Commerce finalizes $6.6 billion CHIPS Act funding for TSMC Fab 21 Arizona site , TSMC cannot make 2nm chips abroad now: MOEA (01:26:21) OpenAI to present plans for U.S. 
AI strategy and an alliance to compete with China (01:30:42) OpenAI loses another lead safety researcher, Lilian Weng (01:33:00) Outro…

#189 - Chat.com, FrontierMath, Relaxed Transformers, Trump & AI 1:42:46

About 1 year ago

Our 189th episode with a summary and discussion of last week's big AI news! Hosted by Andrey Kurenkov and Jeremie Harris . Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai Read out our text newsletter and comment on the podcast at https://lastweekin.ai/ . In this episode: * OpenAI's acquisition of chat.com and internal shifts, including hardware lead hire and hardware model leaks, signal significant strategy pivots and challenges with model scaling and security. * Saudi Arabia plans a $100 billion AI initiative aiming to rival UAE's tech hub, highlighting the region's escalating AI investments. * U.S. penalties on GlobalFoundries for violating sanctions against SMIC underline ongoing challenges in enforcing AI-chip export controls. * Anthropic collaborates with Palantir and AWS to integrate Claude into defense environments, marking a significant policy shift for the company. Sponsors: The Generator - An interdisciplinary AI lab empowering innovators from all fields to bring visionary ideas to life by harnessing the capabilities of artificial intelligence. The AI safety book “Uncontrollable”, which is not a doomer book, but instead lays out the reasonable case for AI safety and what we can do about it. Max Tegmark said that “Uncontrollable” is “a captivating, balanced, and remarkably up-to-date book on the most important issue of our time” - find it on Amazon today! If you would like to become a sponsor for the newsletter, podcast, or both, please fill out this form . 
Timestamps + Links: (00:00:00) Intro / Banter (00:01:28) News Preview (00:02:10) Response to listener comments (00:05:02) Sponsor Break Tools & Apps (00:07:31) OpenAI Introduces ‘Predicted Outputs’ Feature: Speeding Up GPT-4o by ~5x for Tasks like Editing Docs or Refactoring Code (00:11:55) Anthropic’s Haiku 3.5 surprises experts with an “intelligence” price increase (00:17:10) Introducing FLUX1.1 [pro] Ultra and Raw Modes (00:19:11) X is testing a free version of Grok AI chatbot in select regions Applications & Business (00:21:39) OpenAI acquired Chat.com (00:23:40) Saudis Plan $100 Billion AI Powerhouse to Rival UAE Tech Hub (00:28:28) Meta’s former hardware lead for Orion is joining OpenAI (00:31:38) OpenAI Accidentally Leaked Its Upcoming o1 Model to Anyone With a Certain Web Address (00:35:50) Nvidia Rides AI Wave to Pass Apple as World’s Largest Company Projects & Open Source (00:37:53) ‘Unrestricted’ AI group Nous Research launches first chatbot — with guardrails (00:41:48) FrontierMath: The Benchmark that Highlights AI’s Limits in Mathematics (00:46:29) Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent Research & Advancements (00:49:55) Applying “Golden Gate Claude” mechanistic interpretability techniques to protein language models. (00:58:3) Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA (01:05:55) From Naptime to Big Sleep: Using Large Language Models To Catch Vulnerabilities In Real-World Code (01:10:22) OpenAI reportedly developing new strategies to deal with AI improvement slowdown Policy & Safety (01:19:52) What Donald Trump’s Win Means For AI (01:28:44) Fab Whack-A-Mole: Chinese Companies are Evading U.S. Sanctions (01:33:57) US fines GlobalFoundries for shipping chips to sanctioned Chinese firm (01:36:55) Anthropic teams up with Palantir and AWS to sell its AI to defense customers (01:39:23) Outro…

#188 - ChatGPT+Search, OpenAI+AMD, SimpleQA, π0 1:51:50

About 1 year ago

Our 188th episode with a summary and discussion of last week's big AI news! Hosted by Andrey Kurenkov and Jeremie Harris . Feel free to email us your questions and feedback at contact@lastweekinai.com and/or hello@gladstone.ai Read out our text newsletter and comment on the podcast at https://lastweekin.ai/ . This episode was sponsored by The Generator . If you would like to become a sponsor for the newsletter, podcast, or both, please fill out this form . In this episode: * Meta's open-source models utilized by China's military prompt regulatory adjustments; US agencies gain access to counterbalance. * OpenAI partners with Broadcom and AMD to develop custom AI hardware, aiming for profitability and reducing inference costs. * Physical Intelligence unveils a generalist robot control policy with a $400M funding boost, showcasing significant advancements in zero-shot task performance. * New U.S. regulation mandates quarterly reporting for large AI model training and computing cluster acquisitions, aiming to bolster national security. 
Timestamps + Links: (00:00:00) Intro / Banter (00:02:16) News Preview (00:03:05) Response to listener comments / corrections (00:05:00) Sponsor Break Tools & Apps (00:06:28) OpenAI’s search engine is now live in ChatGPT (00:12:18) Image Playground, ChatGPT, and more Apple Intelligence features roll out in beta (00:14:34) GitHub Copilot will support models from Anthropic, Google, and OpenAI (00:19:00) Introducing the analysis tool in Claude.ai (00:21:34) ElevenLabs Introduces Voice Design: A New AI Feature that Generates a Unique Voice from a Text Prompt Alone (00:24:18) Midjourney's new web editor lets you tweak images uploaded from your PC (00:26:02) Watch out, Midjourney — Recraft just announced new AI image generator model Applications & Business (00:29:57) Meta strikes multi-year AI deal with Reuters (00:33:15) OpenAI will start using AMD chips and could make its own AI hardware in 2026 (00:40:47) Elon Musk's xAI in talks to raise funding valuing it at $40 billion, WSJ reports (00:46:07) Physical Intelligence, a Robot A.I. Specialist, Raises Millions From Bezos (00:48:32) Waymo ramps up robotaxi push with $5.6 bn in funding (00:49:11) Alphabet's Waymo Serving Over 150,000 Paid Robotaxi Rides Every Week Now, Surging 50% In 2 Months Projects & Open Source (00:51:23) Meta AI Silently Releases NotebookLlama: An Open Version of Google’s NotebookLM (00:54:59) Meta Releases Quantized Llama 3.2 with 4x Inference Speed on Android Phones (00:59:16) OpenAI Releases SimpleQA: A New AI Benchmark that Measures the Factuality of Language Models Research & Advancements (01:08:19) This Is a Glimpse of the Future of AI Robot (01:15:06) Can Language Models Replace Programmers? 
REPOCOD Says 'Not Yet' (01:19:01) Brain-like Functional Organization within Large Language Models (01:21:20) Decart’s AI simulates a real-time, playable version of Minecraft (01:25:39) Raising the bar on SWE-bench Verified with Claude 3.5 Sonnet Policy & Safety (01:29:06) Commerce just proposed the most significant federal AI regulation to date – and no one noticed (01:35:04) Anthropic warns of AI catastrophe if governments don't regulate in 18 months (01:39:32) Open Source Bites Back as China’s Military Makes Full Use of Meta AI (01:46:35) Meta says it’s making its Llama models available for US national security applications (01:48:16) Outro…

#187 - Anthropic Agents, Mochi1, 3.4B data center, OpenAI's FAST image gen 2:09:38

About 1 year ago