show episodes
 
Artwork
 
Save As Jpeg is a podcast about artist life and what it means to be an artist in today's world. We discuss a wide range of art subjects, but also branch out into movies, anime, video games, and whatever else that may sound fun. Come hang out! Host: Kito, He's an awesome tri-color shiba inu. IG: @kitos_Shiba_pride Host: Mikayla aka FatTackyCats - Webcomic & Animator Fattackycats@gmail.com Twitter & IG: @Fattackycats Fattackycats.tumblr.com Host: Joshua aka First1stclass - An Illustrator & dig ...
  continue reading
 
Artwork
 
We are living in a time where a JPEG can sell for a million dollars, celebrities openly endorse Ponzi schemes and when what you've invented doesn't matter nearly as much as what you say you've invented. As snake oil increasingly becomes our new currency, regulators and lawmakers are asleep at the wheel while pay-to-play journalists pump out puff pieces from their slurp juice-induced hangovers. Join us as we explore the dizzying, unending roster of these 2020s-era rackets. Welcome to the age ...
  continue reading
 
A daily update on the latest AI Research Papers. We provide a high level overview of a handful of papers each day and will link all papers in the description for further reading. This podcast is created entirely with AI by PocketPod. Head over to https://pocketpod.app to learn more.
  continue reading
 
Artwork
 
Music Matters Media covers all genres of music from Pop, R&B, and Hip-Hop to EDM, Pop-Punk, Alternative Rock, and more! Listen as we discuss music news, album reviews, concert reviews, discover up-and-coming artists, and talk about all things music. You can now stream the #MusicMattersMedia podcast on Apple Podcasts, Spotify, Google Podcasts, Stitcher, Soundcloud, iHeartRadio, Amazon Music, Audible, Pandora, and any other platform podcasts are hosted. Make sure to subscribe to catch up on ol ...
  continue reading
 
Artwork
 
Capture breathtaking portraits and unforgettable weddings with J-P Visual Voices! This podcast is your one-stop shop for actionable tips, inspiring stories, and essential business insights to elevate your photography game. Whether you're a seasoned pro or just starting out, you'll find valuable advice on posing techniques, lighting setups, creative editing workflows, and essential gear recommendations. J-P Visual Voices goes beyond the technical, delving into the artistic aspects of portrait ...
  continue reading
 
The CNBC-TV18 Special Podcast covers trends that have an impact on the daily lives of the masses. Experts from sectors like economy, healthcare, banking, finance, autos and technology will discuss the latest developments in various fields. Tune in to the CNBC-TV18 Special Podcast for more.
  continue reading
 
Artwork

1
Mega Digitizing

Mega Digitizing

Unsubscribe
Unsubscribe
每月
 
We provide you the best digitized experience that meets up your standards. We bring the timely, and affordable digitizing service experience in the USA. We are focused on providing quality embroidery items, gifts, text digitizing, 3d Puff digitizing, Applique digitizing, and custom digitizing services with the highest levels of customer satisfaction.
  continue reading
 
Loading …
show series
 
xGen-MM (BLIP-3): A Family of Open Large Multimodal ModelsJPEG-LM: LLMs as Image Generators with Canonical Codec RepresentationsAutomated Design of Agentic SystemsTurboEdit: Instant text-based image editingSurgical SAM 2: Real-time Segment Anything in Surgical Video by Efficient Frame PruningFine-tuning Large Language Models with Human-inspired Lea…
  continue reading
 
Have you ever hesitated to lend out something valuable to a friend, even when they once helped you in a similar situation? Join us on this episode of JP Visual Voices as we navigate Jackie’s predicament of whether to let a friend borrow his newly upgraded, high-end photography gear. Jackie’s dilemma raises important questions about balancing gratit…
  continue reading
 
Albums discussed are Oasis ‘What’s The Story Morning Glory’ (12:11) and Hootie And The Blowfish’s ‘Cracked Rear View’ (43:20). We also lead the podcast off talking about the upcoming Linkin Park news. For all of Mutlu’s tour dates and tickets visit https://www.mutlusounds.com/ This episode was recorded on September 1, 2024 To suggest an album for C…
  continue reading
 
The Beatles ‘White Album,’ Lana Del Rey ‘Ultraviolence,’ Nas & DJ Premier ‘Define My Name’ and Aerosmith Retires Albums discussed are The Beatles ‘White Album’ (8:30) and Lana Del Rey’s ‘Ultraviolence’ (50:01). We also talk about the new song from Nas and DJ Premier called ‘Define My Name’ (1:14:20). As well, we discuss Aerosmith’s retirement from …
  continue reading
 
The AI Scientist: Towards Fully Automated Open-Ended Scientific DiscoveryMed42-v2: A Suite of Clinical LLMsMutual Reasoning Makes Smaller LLMs Stronger Problem-SolversControlNeXt: Powerful and Efficient Control for Image and Video GenerationCogVideoX: Text-to-Video Diffusion Models with An Expert TransformerFruitNeRF: A Unified Neural Radiance Fiel…
  continue reading
 
Dive into the latest episode of the Music Matters Media podcast as we unpack Billie Eilish's highly anticipated third album, 'Hit Me Hard and Soft.' From her evolving sound with brother Finneas to the echoes of her past work, this is one album breakdown you don't want to miss. Tune in for our track-by-track analysis and discover our top picks from …
  continue reading
 
Albums discussed are Nick Drake’s ‘Pink Moon’ (2:45) and Jurassic 5’s ‘Quality Control’ (41:29). We also talk about the new song from Honest AV w/ Mod Son called ‘I’d Rather Overdose’ (27:20). For all of Mutlu’s tour dates and tickets visit https://www.mutlusounds.com/ This episode was recorded on July 28th, 2024. To suggest an album for CLRC do an…
  continue reading
 
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language ModelsLLaVA-OneVision: Easy Visual Task TransferAn Object is Worth 64x64 Pixels: Generating 3D Object via Image DiffusionMedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for MedicineIPAdapter-Instruct: Resolving Ambiguity in Image-based Co…
  continue reading
 
Can a photographer maintain their passion and personal life without one overshadowing the other? Join us on JP Visual Voices as we unpack the Photographer's Dilemma, featuring the story of Dave, a 27-year-old photographer from Detroit who's caught between his love for capturing moments and the demands of his new relationship. You'll learn how to na…
  continue reading
 
Unlock the secrets to perfectly exposed photos with a deep dive into the exposure triangle. This episode is your roadmap to understanding the intricate relationship between aperture, shutter speed, and ISO. Learn how these three fundamental elements work together to control the amount of light reaching your camera's sensor. From capturing sharp ima…
  continue reading
 
SAM 2: Segment Anything in Images and VideosGemma 2: Improving Open Language Models at a Practical SizeCoarse Correspondence Elicit 3D Spacetime Understanding in Multimodal Language ModelImproving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuningOmniParser for Pure Vision Based GUI AgentSF3D: Stable Fast 3D Mesh Reconstructi…
  continue reading
 
On this episode of the Music Matters Media Podcast, we dive into the fourth studio album by blues rock musician Gary Clark Jr., 'JPEG Raw'. Join us for an in-depth discussion on the album's stylistic evolution compared to Clark's previous works, featuring epic guest appearances by Stevie Wonder and George Clinton. We highlight standout musical mome…
  continue reading
 
Diffree: Text-Guided Shape Free Object Inpainting with Diffusion ModelLAMBDA: A Large Model Based Data AgentAMEX: Android Multi-annotation Expo Dataset for Mobile GUI AgentsBetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth EstimationVery Large-Scale Multi-Agent Simulation in AgentScopeData Mixture Inference: What do BPE Tok…
  continue reading
 
Ready to kickstart your photography journey without breaking the bank? Join me, Widji, as I guide you through the process of finding affordable used photography gear. On this episode of JP Visual Voices, you'll learn the ins and outs of researching and selecting the best cameras and lenses while exploring platforms like eBay and Facebook Marketplac…
  continue reading
 
Albums discussed are Robyn’s ‘Body Talk’ (24:20) and JPEG MAFIA & Danny Brown’s ‘Scaring The Hoes’ (48:30). We also talk about the new song from Olive Jones called ‘Summer Rain’ (15:13). For all of Mutlu’s tour dates and tickets visit https://www.mutlusounds.com/ This episode was recorded on July 7th, 2024. To suggest an album for CLRC do any of th…
  continue reading
 
Ever wondered why wedding photographers keep raving about gear despite YouTube gurus telling you otherwise? In our inaugural episode of JP Visual Voices, I, Weedji, pull back the curtain on the real-world challenges of stepping into the wedding photography scene. You'll gain firsthand insights into the financial investments needed, the hurdles of b…
  continue reading
 
Ever wondered how a simple Walmart trip could kickstart a photography career? Join me, Ouija, on JP Visual Voices as I share my unconventional journey from social event snapper to professional photographer. Learn how a spontaneous maternity shoot for my best friend taught me the ropes and discover the invaluable role YouTube played in my education.…
  continue reading
 
OpenDevin: An Open Platform for AI Software Developers as Generalist AgentsVILA^2: VILA Augmented VILAHumanVid: Demystifying Training Data for Camera-controllable Human Image AnimationPERSONA: A Reproducible Testbed for Pluralistic AlignmentSV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View ConsistencyScalify: scale propagation for…
  continue reading
 
Scaling Laws with Vocabulary: Larger Models Deserve Larger VocabulariesScaling Retrieval-Based Language Models with a Trillion-Token DatastoreShape of Motion: 4D Reconstruction from a Single VideoStreetscapes: Large-scale Consistent Street View Generation Using Autoregressive Video DiffusionUnderstanding Reference Policies in Direct Preference Opti…
  continue reading
 
Qwen2 Technical ReportLearning to Refuse: Towards Mitigating Privacy Risks in LLMsThe Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-DeterminismQ-Sparse: All Large Language Models can be Fully Sparsely-ActivatedGRUtopia: Dream General Robots in a City at Scale
  continue reading
 
Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models -- The Story Goes OnVideo Diffusion Alignment via Reward GradientsMultimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language ModelQ-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank GradientsMAVIS: Math…
  continue reading
 
Unveiling Encoder-Free Vision-Language ModelsFunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMsAriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM AgentsRULE: Reliable Multimodal RAG for Factuality in Medical Vision Language ModelsChartGemma: Visual Instruction-…
  continue reading
 
On this week's episode, we unpack Dua Lipa's much-anticipated third studio album, 'Radical Optimism'. It's her first full-length release in four years since 'Future Nostalgia', and we're here to explore every detail. Join us as we delve into the album's unique sonic landscape and the evolved lyrical maturity that sets this record apart.We'll share …
  continue reading
 
Diffusion Forcing: Next-token Prediction Meets Full-Sequence DiffusionLet the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language ModelsPlanetarium: A Rigorous Benchmark for Translating Text to Structured Planning LanguagesInternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Co…
  continue reading
 
We-Math: Does Your Large Multimodal Model Achieve Human-like Mathematical Reasoning?ROS-LLM: A ROS framework for embodied AI with task feedback and structured reasoningMMEvalPro: Calibrating Multimodal Benchmarks Towards Trustworthy and Efficient EvaluationLiteSearch: Efficacious Tree Search for LLMWavelets Are All You Need for Autoregressive Image…
  continue reading
 
Albums discussed are Pinegrove’s ‘Skylight’ (04:22) and System of a Down’s ‘Toxicity’ (33:00). We also talk about the new song from Spacey Jane called ‘One Bad Day’ (52:22). This episode was recorded on January 28th, 2024. To suggest an album for CLRC do any of the following: * Leave a review on Apple Podcasts with the artist and title (five stars …
  continue reading
 
Scaling Synthetic Data Creation with 1,000,000,000 PersonasHuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at ScaleLLaRA: Supercharging Robot Learning Data for Vision-Language PolicyDirect Preference Knowledge Distillation for Large Language ModelsGaussianDreamerPro: Text to Manipulable 3D Gaussians with Highly Enh…
  continue reading
 
OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and UnderstandingStep-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMsMUMU: Bootstrapping Multimodal Image Generation from Text-to-Image DataSimulating Classroom Education with LLM-Empowered AgentsSeaKR: Self-aware Knowledge Retrieval for Adaptive Retrieval …
  continue reading
 
The FineWeb Datasets: Decanting the Web for the Finest Text Data at ScaleYouDream: Generating Anatomically Controllable Consistent Text-to-3D AnimalsDiffusionPDE: Generative PDE-Solving Under Partial ObservationAligning Diffusion Models with Noise-Conditioned PerceptionUnlocking Continual Learning Abilities in Language Models…
  continue reading
 
DreamBench++: A Human-Aligned Benchmark for Personalized Image GenerationBigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex InstructionsCambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMsEvaluating D-MERIT of Partial-annotation on Information RetrievalLong Context Transfer from Language to Vision…
  continue reading
 
LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMsJudging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-JudgesComplexity of Symbolic Representation in Working Memory of Transformer Correlates with the Complexity of a TaskTowards Retrieval Augmented Generation over Large Video LibrariesStylebreeder: Exploring …
  continue reading
 
XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement LearningMake It Count: Text-to-Image Generation with an Accurate Number of ObjectsChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code GenerationNeedle In A Multimodal HaystackBABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Hay…
  continue reading
 
Depth Anything V2An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual PixelsTransformers meet Neural Algorithmic ReasonersSamba: Simple Hybrid State Space Models for Efficient Unlimited Context Language ModelingOpenVLA: An Open-Source Vision-Language-Action ModelAlleviating Distortion in Image Generation via Multi-Resolut…
  continue reading
 
NaRCan: Natural Refined Canonical Image with Integration of Diffusion Prior for Video EditingMotionClone: Training-Free Motion Cloning for Controllable Video GenerationWhat If We Recaption Billions of Web Images with LLaMA-3?Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with NothingPowerInfer-2: Fast Large Language Model I…
  continue reading
 
Albums discussed are Jalen Ngonda ‘Come Around And Love Me’ (04:10) and Wine Lips ‘Super Mega Ultra’ (21:55). We also talk about the new song from Falling In Reverse called ‘Ronald’ (34:22). Spike’s mic sounds a little off because it was accidentally recorded with his laptop mic not his real mic. It’s still fine but that’s why it’s a little weird. …
  continue reading
 
An Image is Worth 32 Tokens for Reconstruction and GenerationMcEval: Massively Multilingual Code EvaluationZero-shot Image Editing with Reference ImitationThe Prompt Report: A Systematic Survey of Prompting TechniquesTextGrad: Automatic "Differentiation" via Text
  continue reading
 
Autoregressive Model Beats Diffusion: Llama for Scalable Image GenerationHusky: A Unified, Open-Source Language Agent for Multi-Step ReasoningVript: A Video Is Worth Thousands of WordsLighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View SynthesisVALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text …
  continue reading
 
Mixture-of-Agents Enhances Large Language Model CapabilitiesWildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the WildCRAG -- Comprehensive RAG BenchmarkGenAI Arena: An Open Evaluation Platform for Generative ModelsLarge Language Model Confidence Estimation via Black-Box Access
  continue reading
 
ShareGPT4Video: Improving Video Understanding and Generation with Better CaptionsBitsFusion: 1.99 bits Weight Quantization of Diffusion ModelStep-aware Preference Optimization: Aligning Preference with Denoising Performance at Each StepBuffer of Thoughts: Thought-Augmented Reasoning with Large Language ModelsSF-V: Single Forward Video Generation Mo…
  continue reading
 
Apple announced new Siri features and Apple Intelligence today, Interestingly, Apple already released a paper, titled "Ferret-UI," on how it all works - a multimodal vision-language model capable of understanding widgets, icons, and text on an iOS mobile screen, and reasoning about their spatial relationships and functional meanings. https://arxiv.…
  continue reading
 
Block Transformer: Global-to-Local Language Modeling for Fast InferenceParrot: Multilingual Visual Instruction TuningMobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent CollaborationOuroboros3D: Image-to-3D Generation via 3D-aware Recursive DiffusionLiveSpeech: Low-Latency Zero-shot Text-to-Speech via Autore…
  continue reading
 
Albums discussed are Bon Jovi’s ‘New Jersey (14:45) and Freddie Gibbs & Madlib’s ‘Bandana’ (43:55). We also talk about the new song from Madlib, Your Old Droog and Black Thought called ‘REEKYOD.’ (1:02:40 ) To suggest an album for CLRC do any of the following: * Leave a review on Apple Podcasts with the artist and title (five stars always helps). *…
  continue reading
 
Seed-TTS: A Family of High-Quality Versatile Speech Generation ModelsTo Believe or Not to Believe Your LLMI4VGen: Image as Stepping Stone for Text-to-Video GenerationSelf-Improving Robust Preference OptimizationGuiding a Diffusion Model with a Bad Version of Itself
  continue reading
 
Welcome to the continuation of our Sum 41 series! Listen as we rank all eight studio albums from least favorite to our favorite Sum 41 record of all time, providing in-depth thoughts on each one. We’ll also reflect on how Sum 41 has influenced our music taste over the years. Don't miss this final chapter of our Sum 41 content, and be sure to check …
  continue reading
 
MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding BenchmarkLearning Temporally Consistent Video Depth from Video Diffusion PriorsShow, Don't Tell: Aligning Language Models with Demonstrated FeedbackArtificial Generational Intelligence: Cultural Accumulation in Reinforcement LearningZeroSmooth: Training-free Diffuser Adaptati…
  continue reading
 
Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space DualityVideo-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video AnalysisPerplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference ModelsKaleido Diffusion: Improving Conditional Diffusion Models with Au…
  continue reading
 
AI Papers Podcast for 06/04/2024 DITTO-2: Distilled Diffusion Inference-Time T-Optimization for Music GenerationGECO: Generative Image-to-3D within a SECOndPLA4D: Pixel-Level Alignments for Text-to-4D Gaussian SplattingDevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code RepositoriesParrot: Efficient Serving of LLM-b…
  continue reading
 
AI Papers Podcast for 06/03/2024 Jina CLIP: Your CLIP Model Is Also Your Text RetrieverSimilarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered ThoughtsMotionLLM: Understanding Human Behaviors from Human Motions and VideosXwin-LM: Strong and Scalable Alignment Practice for LLMsMOFA-Video: Controllable Image Animati…
  continue reading
 
Sum 41 fans, get ready for an unforgettable episode! Sum 41 has announced their final headlining world tour, ‘Tour Of The Setting Sum’, celebrating the release of their last album ‘Heaven :x: Hell’, and their farewell as a band. They’re hitting major stops across the globe, ending with their ultimate farewell in Toronto at Scotiabank Arena on Janua…
  continue reading
 
MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model SeriesT2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward FeedbackLLMs achieve adult human performance on higher-order theory of mind tasksNearest Neighbor Speculative Decoding for LLM Generation and AttributionZipper: A Multi-Tower Decoder Ar…
  continue reading
 
Loading …

快速参考指南