The Data Exchange with Ben Lorica

Technology Podcasts

A series of informal conversations with thought leaders, researchers, practitioners, and writers on a wide range of topics in technology, science, and of course big data, data science, artificial intelligence, and related applications. Anchored by Ben Lorica (@BigData), the Data Exchange also features a roundup of the most important stories from the worlds of data, machine learning, and AI. Detailed show notes for each episode can be found at https://thedataexchange.media/. The Data Exchange podcast is a production of Gradient Flow [https://gradientflow.com/].

Location:

United States

Twitter:

@bigdata

Language:

English

Contact:

4156476844


Episodes

Building and Deploying Foundation Models for Enterprises

6/1/2023
Jonas Andrulis is the Founder & CEO of Aleph Alpha, a startup that provides enterprise software solutions backed by their own large language models and multimodal models. Subscribe to the Gradient Flow Newsletter: https://gradientflow.substack.com/ Subscribe: Apple • Spotify • Stitcher • Google • AntennaPod • Podcast Addict • Amazon • RSS. Detailed show notes can be found on The Data Exchange web site.

Duration:00:34:09

Building Robust AI Infrastructure for Critical Solutions

5/25/2023
Alex Remedios, founder of Treebeardtech, leads a London-based consulting firm dedicated to assisting machine learning teams in constructing dependable, secure, and adaptable cloud infrastructures crucial for delivering business-critical artificial intelligence solutions.

Duration:00:31:40

Machine Learning for High-Risk Applications

5/18/2023
Patrick Hall is co-founder of BNH and a visiting faculty member of decision sciences at the George Washington University School of Business. Agus Sudjianto is EVP, Head of Corporate Model Risk at Wells Fargo. We explore several topics covered in the new book Machine Learning for High-Risk Applications, co-authored by Patrick and with a foreword by Agus.

Duration:00:46:25

Boosting Perception With Synthetic Data

5/11/2023
Omar Maher is Director of Product Marketing at Parallel Domain, a startup that is advancing machine perception capabilities by harnessing the power of synthetic data. We delve into the growing adoption of synthetic data and the factors driving its use. We discuss major developments in synthetic data generation and its overlap with Generative AI. The conversation also covers data privacy, intellectual property, the generation of structured data like LiDAR, the current state of adoption, and key research directions to overcome existing challenges.

Duration:00:35:34

Revolutionizing B2B: Unleashing the Power of AI and Data

5/4/2023
Simon Chan is the General Partner at Firsthand Alliance, a venture capital fund focused on the future of B2B and enterprise software. We explore the evolution of AI, cloud computing, and business collaboration tools, revealing how a new generation of generative AI technologies is enabling applications to generate content and drive transformative innovation across various industries.

Duration:00:43:08

AI Metadata

4/27/2023
Gev Sogomonian is co-author of AimStack, an open-source, self-hosted AI metadata tracker that logs all your AI metadata, such as experiments and prompts, and provides a user-friendly UI for comparing and observing them. It also offers an SDK for programmatically querying tracked metadata.

Duration:00:31:30

The 2023 AI Index

4/20/2023
Raymond Perrault is a Distinguished Computer Scientist at SRI International, and Co-Director of the Steering Committee for the AI Index, an annual report that tracks, collates, distills, and visualizes data relating to AI, to help inform decision-makers and teams to take meaningful action for responsible and ethical AI.

Duration:00:43:38

Custom Foundation Models

4/13/2023
Hagay Lupesko is VP Engineering at MosaicML, a startup that enables teams to easily train large AI models on their data and in their own secure environment. We discuss the evolution of cloud-based machine learning (from “traditional” ML through LLMs), his experience building machine learning applications at leading technology companies, and the need for companies to build their own custom foundation models.

Duration:00:38:06

Uncovering and Highlighting AI Trends

4/6/2023
Jakub Zavrel is the Founder and CEO at Zeta Alpha, a premier Neural Discovery Platform that utilizes cutting-edge Neural Search technology to enhance the way you and your team uncover, arrange, and disseminate knowledge. Our conversation focuses on the latest developments in artificial intelligence, taking inspiration from their recent viral article featuring the 100 most cited AI papers of 2022.

Duration:00:49:02

How Data and AI Happened

3/30/2023
Chris Wiggins is a Professor at Columbia University and the Chief Data Scientist at the NYTimes. He is also co-author of a fascinating new historical exploration of how data has been used as a tool in shaping society, from the census to eugenics to Google search. How Data Happened traces the trajectory of data and explores new mathematical and computational techniques that serve to shape people, ideas, society, and economies.

Duration:00:48:48

Blazing fast bulk data transfers between any cloud

3/23/2023
Paras Jain and Sarah Wooders are graduate students at UC Berkeley’s Sky Computing Lab. They are part of the team behind Skyplane, an open source project that accelerates wide-area transfers in the cloud via overlay routing and parallelism.

Duration:00:31:29

Exhaustion of High-Quality Data Could Slow Down AI Progress in Coming Decades

3/16/2023
Pablo Villalobos is a Staff Researcher at Epoch, and lead author of the recent paper “Will we run out of data? An analysis of the limits of scaling datasets in Machine Learning”. We discuss the key findings in this paper, as well as a related study Pablo conducted on scaling laws.

Duration:00:33:09

Generating high-fidelity and privacy-preserving synthetic data

3/9/2023
Jinsung Yoon (Senior Research Scientist) and Sercan Arik (Staff Research Scientist and Manager) are part of the Google team behind EHR-Safe, a set of tools for generating highly realistic and privacy-preserving synthetic Electronic Health Records.

Duration:00:35:47

How technology is disrupting the venture capital industry

3/2/2023
Brandon Jenkins is Co-founder and COO of Fundrise, the largest direct-to-individuals alternative investment platform in the country. Our conversation centered on their recent foray into technology investing, specifically startup companies in the data infrastructure space.

Duration:00:36:19

Running Machine Learning Workloads On Any Cloud

2/23/2023
Zongheng Yang is a researcher in the Sky Computing Lab at UC Berkeley, a multi-year research initiative that utilizes distributed systems, programming languages, security, and machine learning to separate the services that a company requires from the choice of a specific cloud. He provides a detailed overview and update on SkyPilot, a groundbreaking intercloud broker that views the cloud ecosystem as a unified and integrated entity rather than a collection of disparate, largely incompatible clouds. SkyPilot enables users to run Machine Learning and Data Science batch jobs on any cloud, realize substantial cost savings, access the best hardware across clouds, and enjoy higher resource availability.

Duration:00:37:11

2023 Trends in Data Engineering and Infrastructure

2/16/2023
Jesse Anderson, Evan Chan, and I delve into the current developments and possibilities within the realm of data engineering and platforms. As the foundation for artificial intelligence and machine learning, data plays a crucial role in the advancement of these technologies. Download a copy of the FREE Report: https://gradientflow.com/2023trendsreport/

Duration:00:45:47

Preparing for the Implementation of the EU AI Act and Other AI Regulations

2/9/2023
This week we discuss AI regulations with Gabriela Zanfir-Fortuna, VP for Global Privacy at the Future of Privacy Forum, and Andrew Burt, Managing Partner at BNH, the first law firm focused on AI and Analytics.

Duration:00:36:38

The Open Source Stack Unleashing a Game-Changing AI Hardware Shift

2/2/2023
Dylan Patel is the Chief Analyst at SemiAnalysis, a boutique semiconductor research and consulting firm focused on the semiconductor supply chain from chemical inputs to fabs to design IP and strategy. In this episode, we discuss the emerging open source software stack for PyTorch that makes it easier and more accessible to implement non-Nvidia backends (see his recent post).

Duration:00:41:55

Data Science and AI in Context

1/26/2023
Peter Norvig (of Google and Stanford) and Alfred Spector (of MIT) are part of the team of authors behind the must-read book Data Science in Context: Foundations, Challenges, Opportunities. We discussed their recent book and took a deep dive into their Data Science Analysis Rubric, and we also talked about trending topics in AI including looming regulations, synthetic data, and Large Language and Foundation Models.

Duration:00:47:40

Evaluating Language Models

1/19/2023
Percy Liang is Associate Professor of Computer Science and Statistics, and Director of the new Center for Research on Foundation Models at Stanford University. We discussed a new suite of tools (HELM) designed to help users and researchers understand language models in their totality. We also discuss recent trends in AI including the rise of Generative AI and Foundation Models. Download a copy of our FREE 2023 Trends in Data and AI Report: https://gradientflow.com/2023trendsreport/

Duration:00:45:36