Skip to main content
← All Podcasts
Latent Space

Latent Space

Deep technical AI engineering content. The go-to podcast for AI builders.

187 episodes curated

ShareShare

Episodes

Apr 27, 2024· 53 min

WebSim, WorldSim, and The Summer of Simulative AI — with Joscha Bach of Liquid AI, Karan Malhotra of Nous Research, Rob Haisfield of WebSim.ai

We are 200 people over our 300-person venue capacity for AI UX 2024 , but you can subscribe to our YouTube for the video recaps. Our next event, and largest EVER, is the AI Engineer World’s Fair . See you there! Parental advisory: Adult language used in the first 10 mins of this podcast . Any accounting of Generative AI that ends with RAG as its “final form” is seriously lacking in imagination and missing out on its full potential. While AI generation is very good for “spicy autocomplete” and “reasoning and retrieval with in context learning”, there’s a lot of untapped potential for simulative

Apr 19, 2024· 52 min

High Agency Pydantic > VC Backed Frameworks — with Jason Liu of Instructor

We are reuniting for the 2nd AI UX demo day in SF on Apr 28. Sign up to demo here ! And don’t forget tickets for the AI Engineer World’s Fair — for early birds who join before keynote announcements ! About a year ago there was a lot of buzz around prompt engineering techniques to force structured output. Our friend Simon Willison tweeted a bunch of tips and tricks, but the most iconic one is Riley Goodside making it a matter of life or death : Guardrails ( friend of the pod and AI Engineer speaker ), Marvin ( AI Engineer speaker ), and jsonformer had also come out at the time. In June 2023, Ja

Apr 11, 2024· 56 min

Supervise the Process of AI Research — with Jungwon Byun and Andreas Stuhlmüller of Elicit

Maggie, Linus, Geoffrey, and the LS crew are reuniting for our second annual AI UX demo day in SF on Apr 28. Sign up to demo here ! And don’t forget tickets for the AI Engineer World’s Fair — for early birds who join before keynote announcements! It’s become fashionable for many AI startups to project themselves as “the next Google” - while the search engine is so 2000s, both Perplexity and Exa referred to themselves as a “ research engine ” or “ answer engine ” in our NeurIPS pod . However these searches tend to be relatively shallow, and it is challenging to zoom up and down the ladders of a

Apr 6, 2024· 2h 45min

Latent Space Chats: NLW (Four Wars, GPT5), Josh Albrecht/Ali Rohde (TNAI), Dylan Patel/Semianalysis (Groq), Milind Naphade (Nvidia GTC), Personal AI (ft. Harrison Chase — LangFriend/LangMem)

Our next 2 big events are AI UX and the World’s Fair . Join and apply to speak/sponsor! Due to timing issues we didn’t have an interview episode to share with you this week, but not to worry, we have more than enough “weekend special” content in the backlog for you to get your Latent Space fix, whether you like thinking about the big picture, or learning more about the pod behind the scenes, or talking Groq and GPUs, or AI Leadership, or Personal AI. Enjoy! AI Breakdown The indefatigable NLW had us back on his show for an update on the Four Wars , covering Sora, Suno, and the reshaped GPT-4 Cl

Mar 29, 2024· 42 min

Presenting the AI Engineer World's Fair — with Sam Schillace, Deputy CTO of Microsoft

TL;DR: You can now buy tickets , apply to speak , or join the expo for the biggest AI Engineer event of 2024. We’re gathering *everyone* you want to meet - see you this June. In last year’s the Rise of the AI Engineer we put our money where our mouth was and announced the AI Engineer Summit , which fortunately went well: With ~500 live attendees and over ~500k views online , the first iteration of the AI Engineer industry affair seemed to be well received. Competing in an expensive city with 3 other more established AI conferences in the fall calendar, we broke through in terms of in-person ex

Mar 22, 2024· 41 min

Why Google failed to make GPT-3 + why Multimodal Agents are the path to AGI — with David Luan of Adept

Our next SF event is AI UX 2024 - let’s see the new frontier for UX since last year ! Last call: we are recording a preview of the AI Engineer World’s Fair with swyx and Ben Dunphy, send any questions about Speaker CFPs and Sponsor Guides you have! Alessio is now hiring engineers for a new startup he is incubating at Decibel: Ideal candidate is an “ex-technical co-founder type”. Reach out to him for more! David Luan has been at the center of the modern AI revolution: he was the ~30th hire at OpenAI, he led Google's LLM efforts and co-led Google Brain, and then started Adept in 2022, one of the

Mar 14, 2024· 52 min

Making Transformers Sing - with Mikey Shulman of Suno

Giving computers a voice has always been at the center of sci-fi movies; “I’m sorry Dave, I’m afraid I can’t do that” wouldn’t hit as hard if it just appeared on screen as a terminal output, after all. The first electronic speech synthesizer, the Voder, was built at Bell Labs 85 years ago (1939!), and it’s…. something: We will not cover the history of Text To Speech (TTS), but the evolution of the underlying architecture has generally been Formant Synthesis → Concatenative Synthesis → Neural Networks. Nowadays, state of the art TTS is just one API call away with models like Eleven Labs and Ope

Mar 9, 2024· 1h 48min

Top 5 Research Trends + OpenAI Sora, Google Gemini, Groq Math (Jan-Feb 2024 Audio Recap) + Latent Space Anniversary with Lindy.ai, RWKV, Pixee, Julius.ai, Listener Q&A!

We will be recording a preview of the AI Engineer World’s Fair soon with swyx and Ben Dunphy, send any questions about Speaker CFPs and Sponsor Guides you have! Alessio is now hiring engineers for a new startup he is incubating at Decibel: Ideal candidate is an ex-technical co-founder type (can MVP products end to end, comfortable with ambiguous prod requirements, etc). Reach out to him for more! Thanks for all the love on the Four Wars episode ! We’re excited to develop this new “swyx & Alessio rapid-fire thru a bunch of things” format with you, and feedback is welcome . Jan 2024 Recap The fi

Mar 6, 2024· 1h 20min

Open Source AI is AI we can Trust — with Soumith Chintala of Meta AI

Speaker CFPs and Sponsor Guides are now available for AIE World’s Fair — join us on June 25-27 for the biggest AI Engineer conference of 2024 ! Soumith Chintala needs no introduction in the ML world — his insights are incredibly accessible across Twitter , LinkedIn , podcasts , and conference talks (in this pod we’ll assume you’ll have caught up on the History of PyTorch pod from last year and cover different topics). He’s well known as the creator of PyTorch, but he's more broadly the Engineering Lead on AI Infra, PyTorch, and Generative AI at Meta. Soumith was one of the earliest supporters

Feb 28, 2024· 1h 10min

A Brief History of the Open Source AI Hacker - with Ben Firshman of Replicate

This Friday we’re doing a special crossover event in SF with Dylan Patel of SemiAnalysis ( previous guest !), and we will do a live podcast on site. RSVP here . Also join us on June 25-27 for the biggest AI Engineer conference of the year ! Replicate is one of the most popular AI inference providers, reporting over 2 million users as of their $40m Series B with a16z . But how did they get there? The Definitive Replicate Story (warts and all) Their overnight success took 5 years of building, and it all started with arXiv Vanity , which was a 2017 vacation project that scrapes arXiv PDFs and re-

Feb 16, 2024· 1h 2min

Truly Serverless Infra for AI Engineers - with Erik Bernhardsson of Modal

We’re writing this one day after the monster release of OpenAI’s Sora and Gemini 1.5 . We covered this on Alex Volkov ‘s ThursdAI space , so head over there for our takes. IRL: We’re ONE WEEK away from Latent Space: Final Frontiers , the second edition and anniversary of our first ever Latent Space event ! Also: join us on June 25-27 for the biggest AI Engineer conference of the year ! Online: All three Discord clubs are thriving. Join us every Wednesday/Friday ! Almost 12 years ago, while working at Spotify, Erik Bernhardsson built one of the first open source vector databases, Annoy , based

Feb 8, 2024· 1h 3min

Cloud Intelligence at the speed of 5000 tok/s - with Ce Zhang and Vipul Ved Prakash of Together AI

Our first ever demo day aimed for 15-20 people and ended up ballooning to >200 and covered in the news . We are now running the 2024 edition in SF on Feb 23 : Latent Space Final Frontiers , a startup and research competition in “The Autonomous Workforce”, ​”Beyond Transformers & GPUs”, and “​Embodied AI”. RSVP here ! You can find all LS online/IRL events on our new calendar . Super Early Bird tickets have just gone on sale for AI Engineer World’s Fair, June 25-27 ! Today we have the honor of hosting two of Together AI ’s co-founders: Ce Zhang (CTO) and Vipul Ved Prakash (CEO). This is a rare o

Feb 1, 2024· 58 min

Why StackOverflow usage is down 50% — with David Hsu of Retool

We are announcing the second edition of our Latent Space demo day event in SF on 2/23: Final Frontiers , a startup and research competition in “The Autonomous Workforce”, ​”Beyond Transformers & GPUs”, and “​Embodied AI”. RSVP here ! The first one was aimed for 15-20 people and ended up blowing up to >200 and covered in the Information - let’s see what a year of growth (and competition) does to the local events space in 2024. You can find all Latent Space events here , and of course get in touch with us to host your own AI Engineer meetups like AI Engineering Singapore . In our December 2023 r

Jan 25, 2024· 1h 8min

The Four Wars of the AI Stack (Dec 2023 Audio Recap)

Note for Latent Space Community members: we have now soft-launched meetups in Singapore , as well as two new virtual paper club/meetups for AI in Action and LLM Paper Club . We’re also running Latent Space: Final Frontiers , our second annual demo day hackathon from last year . Edit from March 2024: We did a followup on the Four Wars on the AI Breakdown . For the first time, we are doing an audio version of monthly AI Engineering recap that we publish on Latent Space! This month it’s “The Four Wars of the AI Stack”; you can find the full recap with all the show notes here: https://latent.space

Jan 19, 2024· 1h 11min

How to train your own Large Multimodal Model — with Hugo Laurençon & Leo Tronchon of HuggingFace M4

Latent Space is heating up! Our paper club ran into >99 person Discord limits, oops. We are also introducing 2 new online meetups: LLM Paper Club Asia for Asia timezone (led by Ivan), and AI in Action: hands-on application of AI (led by KBall). To be notified of all upcoming Latent Space events, subscribe to our new Luma calendar ( sign up for individual events, or hit the RSS icon to sync all events to calendar ). In the halcyon open research days of 2022 BC ( Before-ChatGPT ), DeepMind was the first to create a SOTA multimodal model by taking a pre-existing LLM ( Chinchilla 80B - now dead ?)