Support the show on Patreon! http://patreon.com/aiinsideshow
INTERVIEW
- Introduction and background on AI Inside podcast
- Discussion of the recent AI oversight Senate hearing Jeff testified at
- Introduction of guest Rich Skrenta from Common Crawl Foundation
- Overview of Common Crawl and its goals to archive the open web
- Discussion of how Common Crawl data is used to train AI models
- News publishers wanting content removed from Common Crawl
- Debate around copyright, fair use, and AI's "right to read"
- Mechanics of how Common Crawl works and what it archives
- Concerns about restricting AI access to data for training
- Risk of regulatory capture and only big companies being able to use AI
- Discussion of recent court ruling related to web scraping
- Hopes for Common Crawl's growth and evolution
NEWS BITES
- Interesting device announcement from CES - Rabbit R1 with Perplexity AI integration
- Study on actual risk of AI automating jobs away in the near future
=====路SUPPORT路=====
馃敂 PATREON: http://www.patreon.com/aiinsideshow
=====路ALL OUR SHOWS路=====
AI INSIDE PODCAST: http://www.aiinside.show
TECHSPLODER PODCAST: http://www.techsploder.com
ANDROID FAITHFUL PODCAST: http://www.androidfaithful.com
=====路CONTACT路=====
BUSINESS AND SPONSORSHIP INQUIRIES: jason(at)yellowgoldstudios(dot)com

