The Data Exchange with Ben Lorica-logo

The Data Exchange with Ben Lorica

Technology Podcasts

A series of informal conversations with thought leaders, researchers, practitioners, and writers on a wide range of topics in technology, science, and of course big data, data science, artificial intelligence, and related applications. Anchored by Ben Lorica (@BigData), the Data Exchange also features a roundup of the most important stories from the worlds of data, machine learning and AI. Detailed show notes for each episode can be found on https://thedataexchange.media/ The Data Exchange podcast is a production of Gradient Flow [https://gradientflow.com/].

A series of informal conversations with thought leaders, researchers, practitioners, and writers on a wide range of topics in technology, science, and of course big data, data science, artificial intelligence, and related applications. Anchored by Ben Lorica (@BigData), the Data Exchange also features a roundup of the most important stories from the worlds of data, machine learning and AI. Detailed show notes for each episode can be found on https://thedataexchange.media/ The Data Exchange podcast is a production of Gradient Flow [https://gradientflow.com/].

Location:

United States

Description:

A series of informal conversations with thought leaders, researchers, practitioners, and writers on a wide range of topics in technology, science, and of course big data, data science, artificial intelligence, and related applications. Anchored by Ben Lorica (@BigData), the Data Exchange also features a roundup of the most important stories from the worlds of data, machine learning and AI. Detailed show notes for each episode can be found on https://thedataexchange.media/ The Data Exchange podcast is a production of Gradient Flow [https://gradientflow.com/].

Twitter:

@bigdata

Language:

English

Contact:

4156476844


Episodes

Practical Machine Learning and Deep learning

5/12/2022
Sebastian Raschka is lead author of a new book from Packt entitled “Machine Learning with PyTorch and Scikit-Learn”. He is also an Assistant Professor of Statistics at the University of Wisconsin (Madison), and serves as the Lead AI Educator at Grid.ai. Download a FREE copy of our recent NLP Industry Survey Results: https://gradientflow.com/2021nlpsurvey/ Subscribe: Apple • Android • Spotify • Stitcher • Google • AntennaPod • RSS. Detailed show notes can be found on The Data Exchange web...

Duration:00:48:27

Machine Learning for Optimization

5/5/2022
This week’s guests are Ade Fajemisin (Postdoctoral Researcher) and Donato Maragno (PhD Student) of the University of Amsterdam. They were co-authors of a recent paper (“Optimization with Constraint Learning: A Framework and Survey”) that explores how machine learning can be used to learn constraints in optimization problems. Download the FREE Report: Trends in Data, Machine Learning, and AI → https://gradientflow.com/2022trendsreport?utm_source=DEpodcast Subscribe: Apple • Android •...

Duration:00:26:25

Efficient Scaling of Language Models

4/28/2022
This week’s guests are Barret Zoph and Liam Fedus, research scientists at Google Brain. Our conversation centered around Large Language Models (LLM), specifically recent work by Barret, Liam, and their collaborators on efficient scaling of large language models. Download a FREE copy of our recent NLP Industry Survey Results: https://gradientflow.com/2021nlpsurvey/ Subscribe: Apple • Android • Spotify • Stitcher • Google • AntennaPod • RSS. Detailed show notes can be found on The Data...

Duration:00:27:06

Data Science at Stitch Fix

4/21/2022
Olivia Liao is Senior Director of Data Science at Stitch Fix, a company that uses data science and expert stylists to deliver personalization at scale. We discuss how they blend data science and domain expertise, how they tune recommendations in light of logistics and supply chain constraints, and how they incorporate new developments in large language models, multimodal models and Responsible AI. Download a FREE copy of our recent NLP Industry Survey Results:...

Duration:00:30:58

The 2022 AI Index

4/14/2022
Jack Clark is co-director of the AI Index Steering Committee. In this episode we discuss key findings of the fifth edition of the AI Index. The report uses multiple metrics (benchmarks, publications, patents, legislation, etc.) to track progress in AI (mainly deep learning) in key areas that include computer vision, speech recognition, and language models. Download the FREE Report: Trends in Data, Machine Learning, and AI →...

Duration:00:45:13

Why You Need A Time-Series Database

4/7/2022
This week’s guests are Ajay Kulkarni (CEO) and Mike Freedman (CTO), co-founders of Timescale, the startup behind the popular relational database for time-series and analytics. Mike is also a Professor of Computer Science at Princeton University. Our conversation took place a few weeks after Timescale raised a massive funding round and achieved unicorn status. Download the FREE Report: 2022 Data Engineering Survey Report →...

Duration:00:45:46

Data Science at Shopify

3/31/2022
This week’s guest is Wendy Foster, Director of Engineering & Data Science at Shopify. We discussed applications of data science within Shopify, how they organize their data teams, the lifecycle of a data science project within the company, and how they approach emerging challenges like Responsible AI, large language models, and multimodal models. Download the FREE Report: Trends in Data, Machine Learning, and AI → https://gradientflow.com/2022trendsreport?utm_source=DEpodcast Subscribe:...

Duration:00:35:28

An AI Risk Management Framework

3/24/2022
This week’s guests are Elham Tabassi of the National Institute of Standards and Technology (NIST) and Andrew Burt, Managing Partner of BNH.ai, the first law firm focused on AI compliance, risk mitigation, and related topics. We discuss the new NIST framework – “AI Risk Management Framework” – intended for voluntary use to manage risks in the design, development and use of AI products and systems. Download the FREE Report: Trends in Data, Machine Learning, and AI →...

Duration:00:30:55

An open source and end-to-end library for causal inference

3/17/2022
This week’s guests are Amit Sharma (Principal Researcher) and Emre Kiciman (Senior Principal Researcher) of Microsoft Research. We talk about practical applications of causal inference, a set of tools and techniques that enable data teams to draw causal conclusions based on data. Amit and Emre are part of the team behind DoWhy, a new open source library for estimating causal effects based on historical data alone, particularly useful when we cannot run an experiment because of time, expense,...

Duration:00:39:57

The Graph Intelligence Stack

3/10/2022
Leo Meyerovich is founder and CEO of Graphistry, a startup building tools to democratize visual graph intelligence and graph machine learning. Leo and I recently wrote a well-received post (“What Is Graph Intelligence?”) making the case for why companies need to revisit graph analytics and graph intelligence. Download the FREE Report: Trends in Data, Machine Learning, and AI → https://gradientflow.com/2022trendsreport?utm_source=DEpodcast Subscribe: Apple • Android • Spotify • Stitcher •...

Duration:00:37:21

NLP and Language Models in Healthcare and the Life Sciences

3/3/2022
This week’s guests are Dia Trambitas-Miron (Head of Product) and David Talby (CTO) of John Snow Labs, the startup behind the popular open source project, Spark NLP. The company also has a suite of products including an NLP platform targeted specifically for the healthcare, pharmaceutical, and biotech sectors. Download a FREE copy of our recent NLP Industry Survey Results: https://gradientflow.com/2021nlpsurvey/ Subscribe: Apple • Android • Spotify • Stitcher • Google • AntennaPod •...

Duration:00:37:44

Delivering Continuous Intelligence at Scale

2/24/2022
Simon Crosby is CTO of Swim.ai, a startup building tools (based on the Swim open source project) for next-generation data and AI applications. Swim is one of several projects (along with Ray and Akka) contributing to interest in the Actor Model for building large-scale machine learning and data applications and infrastructure. Download the FREE Report: Trends in Data, Machine Learning, and AI → https://gradientflow.com/2022trendsreport?utm_source=DEpodcast Subscribe: Apple • Android •...

Duration:00:31:22

Imperceptible NLP Attacks

2/17/2022
Nicholas Boucher is a PhD at Cambridge University where his focus is on security including on topics like homomorphic encryption, voting systems, and adversarial machine learning. He is the lead author of a fascinating new paper – “Bad Characters: Imperceptible NLP Attacks” – which provides a taxonomy of attacks against text-based NLP models, that are based on Unicode and other encoding systems. Download a FREE copy of our recent NLP Industry Survey Results:...

Duration:00:44:53

Evolving Data Science Training Programs

2/10/2022
This week’s guest is Anjali Samani, Director of Data Science and Data Intelligence at SalesForce. We first met during the early days of Faculty, one of the leading data science and AI startups in Europe. Anjali helped design and lead the early Fellowship programs at Faculty (these are intensive bootcamps that turn STEM PhDs and turn them into industrial data scientists). Download the FREE Report: Trends in Data, Machine Learning, and AI →...

Duration:00:33:52

Building Machine Learning Infrastructure at Netflix and beyond

2/3/2022
Savin Goyal is CTO and co-founder of Outerbounds, a startup building infrastructure to help teams streamline how they build machine learning applications. Prior to starting Outerbounds, Savin and team worked at Netflix, where they were instrumental in the creation and release of Metaflow, an open source Python framework that addresses some of the challenges data scientists face around scalability and version control. Download the FREE Report: Trends in Data, Machine Learning, and AI →...

Duration:00:35:06

Democratizing NLP

1/27/2022
Moshe Wasserblat is a Senior Principal Engineer at Intel, where he serves as a Research Manager focused on NLP and Deep Learning. Download a FREE copy of our recent NLP Industry Survey Results: https://gradientflow.com/2021nlpsurvey/ Subscribe: Apple • Android • Spotify • Stitcher • Google • AntennaPod • RSS. Detailed show notes can be found on The Data Exchange web site. Subscribe to The Gradient Flow Newsletter.

Duration:00:43:32

Machine Learning at Discord

1/20/2022
Gaurav Chakravorty, is a Senior Manager at Discord, where he leads the team responsible for machine learning models in the area of search and notification. Prior to discord Gaurav was a manager at Google where he led the team responsible for personalized podcast recommendations. Download the FREE Report: Trends in Data, Machine Learning, and AI → https://gradientflow.com/2022trendsreport?utm_source=DEpodcast Subscribe: Apple • Android • Spotify • Stitcher • Google • AntennaPod •...

Duration:00:40:14

Applications of Knowledge Graphs

1/13/2022
This week's guest is Mike Tung, founder and CEO of Diffbot, a startup that crawls the web and offers one of the most comprehensive knowledge graphs accessible through a variety of simple interfaces. Detailed show notes can be found on The Data Exchange web site. Download the FREE Report: Trends in Data, Machine Learning, and AI → https://gradientflow.com/2022trendsreport?utm_source=DEpodcast Subscribe: Apple • Android • Spotify • Stitcher • Google • AntennaPod • RSS.

Duration:00:39:48

Key AI and Data Trends for 2022

1/6/2022
In this episode of the Data Exchange, our special correspondent and managing editor Jenn Webb organized a mini-panel composed of myself and my podcast co-organizer Mikio Braun. This conversation took place as we were assembling our list of trends for 2022. Download the FREE Report: Trends in Data, Machine Learning, and AI → https://gradientflow.com/2022trendsreport?utm_source=DEpodcast Subscribe: Apple • Android • Spotify • Stitcher • Google • AntennaPod • RSS.

Duration:00:36:49

Large Language Models

12/30/2021
This episode features conversations with two experts who have helped train and release models that can recognize, predict, and generate human language on the basis of very large text-based data sets. First is an excerpt of my conversation with Connor Leahy, AI Researcher at Aleph Alpha GmbH, and founding member of EleutherAI, (pronounced “ee-luther”) a collective of researchers and engineers building resources and models for researchers who work on natural language models. Next up is an...

Duration:00:41:13