The Data Exchange with Ben Lorica

Technology Podcasts

A series of informal conversations with thought leaders, researchers, practitioners, and writers on a wide range of topics in technology, science, and of course big data, data science, artificial intelligence, and related applications. Anchored by Ben Lorica (@BigData), the Data Exchange also features a roundup of the most important stories from the worlds of data, machine learning, and AI. Detailed show notes for each episode can be found at https://thedataexchange.media/. The Data Exchange podcast is a production of Gradient Flow [https://gradientflow.com/].

Location:

United States

Twitter:

@bigdata

Language:

English

Contact:

4156476844


Episodes

From Preparation to Recovery: Mastering AI Incident Response

7/18/2024
Andrew Burt is co-founder of both Luminos.Law and Luminos.ai, entities building tools to help companies mitigate and manage AI risks. We dive into the critical topic of AI incident response, highlighting its unique challenges compared to traditional software incidents. Subscribe to the Gradient Flow Newsletter: https://gradientflow.substack.com/ Subscribe: Apple • Spotify • Overcast • Pocket Casts • AntennaPod • Podcast Addict • Amazon • RSS. Detailed show notes can be found on The Data Exchange web site.

Duration:00:34:38

Unlocking the Power of Unstructured Data

7/11/2024
Chang She is CEO and co-founder of LanceDB, an open-source database designed for multimodal AI applications, offering scalable vector search, streaming training data, and interactive exploration of large AI datasets. In this episode we discuss Lance, an open-source columnar data format that tackles the unique challenges posed by modern AI and machine learning workloads.

Duration:00:49:32

Postgres: The Swiss Army Knife of Databases

7/3/2024
Ajay Kulkarni and Mike Freedman are the co-founders of Timescale, a startup that provides an enhanced version of PostgreSQL optimized for time-series analytics, AI applications, and scalable relational workloads.

Duration:00:50:50

Supercharging AI with Graphs

6/27/2024
Philip Rathle, CTO of Neo4j, joins the podcast to discuss the rising popularity of graph-enhanced retrieval augmented generation (GraphRAG). He also discusses the potential impact of the new GQL graph query language standard. [Link to the demo that Philip showed.]

Duration:00:43:58

Monthly Roundup: SB 1047, GraphRAG, and AI Avatars in the Workplace

6/20/2024
Paco Nathan is the founder of Derwen, a boutique consultancy focused on Data and AI. This episode is part of our series of monthly roundups. It covers the proposed California Senate Bill 1047 for regulating AI models, including its feasibility and potential unintended consequences. We also discuss the rising popularity of graph retrieval augmented generation (GraphRAG) techniques to mitigate hallucinations in large language models, while acknowledging the current limitations and future potential of integrating symbolic and statistical AI approaches. Additionally, we explore the concept of AI avatars in the workplace, highlighting the challenges and ethical considerations surrounding digital twins and agent-based systems.

Duration:00:36:59

Fine-tuning and Preference Alignment in a Single Streamlined Process

6/13/2024
Jiwoo Hong and Noah Lee of KAIST AI are co-authors of ORPO: Monolithic Preference Optimization without Reference Model.

Duration:00:35:32

TinyML, Sensor-Driven AI, and Advances in Large Language Models

6/6/2024
In this episode, Pete Warden introduces his company, Useful Sensors, which focuses on developing AI solutions for consumer electronics and appliances. [This episode originally aired on Generative AI in the Real World, a podcast series I’m hosting for O’Reilly.]

Duration:00:25:23

Machine Unlearning: Techniques, Challenges, and Future Directions

5/30/2024
Ken Liu, Ph.D. student in Computer Science at Stanford, is the author of Machine Unlearning in 2024. We explore the concept of machine unlearning, a process of removing specific data points from trained AI models.

Duration:00:49:36

Unleashing the Power of AI Agents

5/23/2024
Joao (Joe) Moura is the founder of crewAI, an open-source platform that simplifies the development and deployment of AI agents, allowing users to build autonomous systems for various tasks using multiple large language models.

Duration:00:38:47

Monthly Roundup: Llama 3, Agents, Evaluation Metrics, Cyc, TikTok, and more

5/16/2024
Paco Nathan is the founder of Derwen, a boutique consultancy focused on Data and AI. This episode is part of our series of monthly roundups. It covers Llama 3 and other recent LLMs, the rise of open foundation models, the evolution of AI agents, and the importance of data engineering. We also explore the limitations of leaderboards in evaluating AI models and touch upon the ethical and societal implications of AI development.

Duration:00:41:58

LLMs for Data Access: Unlocking Insights with Text-to-SQL

5/9/2024
Gunther Hagleitner is co-founder of Waii, a startup that provides an API enabling businesses to integrate text-to-SQL functionality into their products.

Duration:00:43:22

2024 Artificial Intelligence Index

5/2/2024
In this episode we explore the latest developments in artificial intelligence with a focus on the 2024 Artificial Intelligence Index Report, edited by Nestor Maslej from Stanford’s Institute for Human-Centered Artificial Intelligence.

Duration:00:53:50

DBRX and the Future of Open LLMs

4/25/2024
In this episode, Hagay Lupesko, Senior Director of Engineering at Databricks MosaicAI, delves into the creation and aspirations behind DBRX, an innovative open Large Language Model (LLM) designed to bridge the gap between quality and cost-effectiveness for AI applications.

Duration:00:45:45

Monthly Roundup: New LLMs, GTC 2024, Constraint-Driven Innovation, Model Safety, and GraphRAG

4/18/2024
Paco Nathan is the founder of Derwen, a boutique consultancy focused on Data and AI. This episode is part of our series of monthly roundups. It covers recently released large language models, Constraint-Driven Innovation, highlights from GTC 2024, and Lessons from the First AI Workload Security Exploit.

Duration:00:37:01

Automating Software Upgrades: How to Combine AI and Expert Developers

4/11/2024
Steve Pike is a co-founder of Infield.ai, a startup building tools to help companies upgrade and maintain open source software dependencies, ensuring they stay up-to-date with the latest releases, features, and security fixes.

Duration:00:36:27

Generative AI in the Industrial Sphere

4/4/2024
Chetan Gupta is the Head of AI Research at Hitachi. This episode explores the applications and challenges of generative AI in industrial settings.

Duration:00:44:04

The Intersection of LLMs, Knowledge Graphs, and Query Generation

3/28/2024
Semih Salihoglu is an Associate Professor at the University of Waterloo and co-creator of Kuzu, an open source embeddable property graph database management system. This episode explores the use of large language models (LLMs) for generating queries across different query languages, such as SQL and Cypher for graphs.

Duration:00:57:45

Unlocking the Potential of Private Data Collaboration

3/21/2024
Sadegh Riazi is CEO and co-founder of Pyte, a startup offering secure, encrypted data collaboration solutions that enable partners to maximize insights without compromising privacy or data integrity.

Duration:00:36:14

Frontiers of AI: From Text-to-Video Models to Knowledge Graphs

3/14/2024
Paco Nathan is the founder of Derwen, a boutique consultancy focused on Data and AI. This episode explores recent developments in AI, including text-to-video models like Sora, frameworks for productionizing AI models, analyses of systems like Google’s Gemini, techniques to improve foundation models, AMD’s software innovations for AI acceleration, and knowledge graph augmentations of language models.

Duration:00:33:35

Adaptive, Specialized, and Accessible: Where AI Systems Are Heading Next

3/7/2024
Jerry Kaplan is the author of the new book “Generative Artificial Intelligence: What Everyone Needs to Know”.

Duration:00:43:01