Kapa AI with Emil Sorensen and Finn Bauer - Weaviate Podcast #50!
Weaviate Podcast - En podkast av Weaviate
Kategorier:
Hey everyone, thank you so much for watching the 50th (!!!) Weaviate Podcast with Emil Sorensen and Finn Bauer from Kapa AI! Are you curious about taking either your, or your company's, specific information and putting into a Vector DB + LLM system? Emil and Finn are doing this at the highest level, taking the documentation of software companies like Weaviate and building these LLM-augmetnted assistant systems for them. This podcast takes a complete tour from Data Ingestion to Cleaning, Chunking, LLM latency, and emerging trends in LLMs such as cheap fine-tuning with LoRA or Long Context Windows such as GPT-4 32K, MPT-7B 65K, or Anthropic Claude's 100k. I learned so much from speaking with Emil and Finn! Please let us know any questions you have or ideas you would like to discuss! Check out Kapa here! https://www.kapa.ai/ Chapters 0:00 Welcome Emil and Finn! 0:42 Origin Story of Kapa 2:08 Data Ingestion 5:10 Data Cleaning 6:20 Slack / Discord / Forum Ingestion 9:05 Testing Models on Support QA 11:14 Selling Kapa to Weaviate and friends 12:37 Hallucinations in LLMs 14:06 Trends in Open-Source LLMs 15:20 Long Input LLMs (32K, 65K, 100K, …) 16:54 Retrieval-Augmentation for Long Input LLMs 18:08 Fine-Tuning LLMs 23:00 As much or as refined content as possible? 24:40 Adding Docs from Integrations 26:15 Generative Feedback Loops 29:00 What in AI excites you the most?