SELECT * FROM data.lake;-logo

SELECT * FROM data.lake;

Technology Podcasts

Welcome to "SELECT * FROM data.lake;," the podcast where data enthusiasts and tech aficionados gather each week for deep dives into the ever-expansive world of data lakes and data lakehouses. Join your host, Alex Merced, as he embarks on a journey to unlock the untapped potential of these powerful data repositories. Follow on twitter: - Alex @amdatalakehouse blogs: - https://www.dremio.com/blogs Other Podcasts to Check Out: - Gnarly Data Waves - DataNation - In the Grand Schema of Things

Location:

United States

Description:

Welcome to "SELECT * FROM data.lake;," the podcast where data enthusiasts and tech aficionados gather each week for deep dives into the ever-expansive world of data lakes and data lakehouses. Join your host, Alex Merced, as he embarks on a journey to unlock the untapped potential of these powerful data repositories. Follow on twitter: - Alex @amdatalakehouse blogs: - https://www.dremio.com/blogs Other Podcasts to Check Out: - Gnarly Data Waves - DataNation - In the Grand Schema of Things

Language:

English


Episodes
Ask host to enable sharing for playback control

What's new in Dremio 25.0?

4/18/2024
Alex Merced talks about what's new in Dremio 25.0. Find several blogs on the new features and release at dremio.com/blog! https://bio.alexmerced.com/data

Duration:00:08:17

Ask host to enable sharing for playback control

18 - What is a Semantic or Metrics Layer?

4/10/2024
Alex Merced discusses what is a Semantic Layer? https://bit.ly/am-dremio-lakehouse-laptop

Duration:00:07:23

Ask host to enable sharing for playback control

17 - What is Data Ops?

3/28/2024
Alex Merced discusses what DataOps is. Build a Data Lakehouse on Your Laptop Deploy Deploy into Production

Duration:00:08:20

Ask host to enable sharing for playback control

16 - Apache Iceberg Writes in Python are Now a Thing!

2/21/2024
Alex Merced discusses the newest release of pyIceberg and its implications. Create a Data Lakehouse on Your LaptopDeploy a Dremio Lakehouse into ProductionFollow Me on Social, Links Here

Duration:00:03:31

Ask host to enable sharing for playback control

15 - Data Lakehouse FUD (Conversations around Apache Iceberg, Apache Hudi and Delta Lake)

2/2/2024
Read my reflections in this blog: https://amdatalakehouse.substack.com/p/table-format-fud-thinking-through Follow me on social: https://bio.alexmerced.com/data

Duration:00:13:23

Ask host to enable sharing for playback control

Bonus: New Youtube Channel, State of the Data Lakehouse

1/20/2024
Find all my data resources below: https://bio.alexmerced.com/data Listen to the State of the Data Lakehouse Podcast Here: https://em360tech.com/podcast/dremio-state-data-lakehouse?utm_source=podcasts&utm_medium=podcast&utm_content=content&utm_campaign=alexmercedcontent&utm_term=iceberg+lakehouse+nessie

Duration:00:03:29

Ask host to enable sharing for playback control

14 - Data Trends - Decentralization (Lakehouse, Virtualization, Data Mesh)

1/9/2024
Get Started with Dremio for Free Today: https://bit.ly/am-dremio-get-started-podcasts Try Out Dremio/Apache Iceberg on your Laptop: https://bit.ly/am-dremio-lakehouse-laptop

Duration:00:06:03

Ask host to enable sharing for playback control

13 - Game Changer: Dremio's Incremental Reflections and Reflection Recommendations

12/20/2023
Alex Merced discusses some of the new innovations at Dremio in particular Reflection Recommendations and Incremental Refreshes that aim to make managing an easy to use data lakehouse at scale so much easier that it already is with Dremio. Get Started with Dremio for Free Today: https://bit.ly/am-dremio-get-started-podcasts Try Out Dremio/Apache Iceberg on your Laptop: https://bit.ly/am-dremio-lakehouse-laptop

Duration:00:10:15

Ask host to enable sharing for playback control

12 - Dremio's Columnar Cloud Cache (C3)

10/31/2023
Alex Merced discusses caching and Dremio's Columnar Cloud Cache feature. Get Started with Dremio for Free Today: https://bit.ly/am-dremio-get-started-podcasts Try Out Dremio/Apache Iceberg on your Laptop: https://bit.ly/am-dremio-lakehouse-laptop

Duration:00:06:33

Ask host to enable sharing for playback control

Call for Speaker - Subsurface 2024 (Data Lakhouse Conference)

10/30/2023
Submit your talks here: https://www.dremio.com/subsurface/ Get Started with Dremio for Free Today: https://bit.ly/am-dremio-get-started-podcasts Try Out Dremio/Apache Iceberg on your Laptop: https://bit.ly/am-dremio-lakehouse-laptop

Duration:00:02:10

Ask host to enable sharing for playback control

11 - Data Reflections Makes Data Virtualization Scale

10/20/2023
Alex Merced discusses how Dremio's Data Reflections features make life easier for Data Engineers and Data Analysts. - Makes it easier for data engineers to optimize performance - Makes it easier for data analysts to take advantage of those optimizations In this talk I also discuss how Dremio Reflections succeeds when federating sources (Data Virtualization) over other approaches like Materialized Views and Indexes. Get Started with Dremio for Free Today: https://bit.ly/am-dremio-get-started-podcasts Try Out Dremio/Apache Iceberg on your Laptop: https://bit.ly/am-dremio-lakehouse-laptop

Duration:00:21:41

Ask host to enable sharing for playback control

10 - The Architecture and Potenial of Nessie Catalogs and Transactional Data Lakehouses

8/23/2023
Alex Merced discusses the internals of how Project Nessie catalogs work and the potential they hold to cataloging and managing data lakehouses. Try out Nessie first hand with the following resources: In just a few commands, you can have everything you need to practice ingestion and querying with popular data software. Just install Docker and then run the commands in the image. You can also follow the directions in this blog: https://lnkd.in/eDiC8fc6 Also try out this video series: https://lnkd.in/gp843ErM Get Started with Dremio for Free Today: https://bit.ly/am-dremio-get-started-podcasts Try Out Dremio/Apache Iceberg on your Laptop: https://bit.ly/am-dremio-lakehouse-laptop

Duration:00:08:24

Ask host to enable sharing for playback control

9 - On-Prem Data Lake Modernization and Migration

7/7/2023
Alex Merced discusses what is Hadoop and On-Prem data lake, and how Dremio can help data lake modernization and migration efforts. Follow Alex on twitter @amdatalakehouse Get Started with Dremio for Free Today: https://bit.ly/am-dremio-get-started-podcasts Try Out Dremio/Apache Iceberg on your Laptop: https://bit.ly/am-dremio-lakehouse-laptop

Duration:00:10:45

Ask host to enable sharing for playback control

8 - Big Picture on Apache Iceberg Table Optimization

6/16/2023
Alex Merced explains the big picture of Apache Iceberg table optimization. Get Started with Dremio for Free Today: https://bit.ly/am-dremio-get-started-podcasts Try Out Dremio/Apache Iceberg on your Laptop: https://bit.ly/am-dremio-lakehouse-laptop

Duration:00:11:17

Ask host to enable sharing for playback control

7 - Open-Source Nessie Catalogs

5/31/2023
Alex Merced discusses what is Project Nessie and its role in the world of Apache Iceberg catalogs. Intro to Data as Code Article/TutorialIntro to Nessie with SparkIntro to Nessie with Jupyter NotebookMulti-Table Transactions TutorialData as Code: ML Reproducability Get Started with Dremio for Free Today: https://bit.ly/am-dremio-get-started-podcasts Try Out Dremio/Apache Iceberg on your Laptop: https://bit.ly/am-dremio-lakehouse-laptop

Duration:00:12:33

Ask host to enable sharing for playback control

6 - Apache Iceberg Table Migration

5/8/2023
Alex Merced discusses the considerations and planning when migrating your existing data to Apache Iceberg tables. Follow Alex @amdatalakehouse on twitter Get Started with Dremio for Free Today: https://bit.ly/am-dremio-get-started-podcasts Try Out Dremio/Apache Iceberg on your Laptop: https://bit.ly/am-dremio-lakehouse-laptop

Duration:00:17:19

Ask host to enable sharing for playback control

5 - Apache Iceberg Tables Optimization Strategies

3/23/2023
Dipankar Mazumdar discusses how to optimize Apache Iceberg tables Get Started with Dremio for Free Today: https://bit.ly/am-dremio-get-started-podcasts Try Out Dremio/Apache Iceberg on your Laptop: https://bit.ly/am-dremio-lakehouse-laptop

Duration:00:10:41

Ask host to enable sharing for playback control

4 - Apache Iceberg Metadata (Metadata.json, Manifest Lists, Manifests)

3/15/2023
Alex Merced explains the Apache Iceberg table metadata structure. Check out the Data as Code demo at the 19 minute mark of this video: https://youtu.be/FzOkbCvyE0I

Duration:00:10:36

Ask host to enable sharing for playback control

3 - What is an Apache Iceberg Catalog?

3/10/2023
Alex Merced discusses what is an Apache Iceberg catalog. Links: - Guide to try out Dremio Arctic in a Jupyter Notebook with Spark locally: https://github.com/developer-advocacy-dremio/quick-guides-from-dremio/blob/main/arcticexercise.md - Guide to PySpark settings for different catalogs: https://github.com/developer-advocacy-dremio/quick-guides-from-dremio/blob/main/icebergpyspark.md Get Started with Dremio for Free Today: https://bit.ly/am-dremio-get-started-podcasts Try Out Dremio/Apache Iceberg on your Laptop: https://bit.ly/am-dremio-lakehouse-laptop

Duration:00:20:58

Ask host to enable sharing for playback control

2 - What is a Data Lake Table Format?

2/22/2023
Alex Merced explains what is a Data Lake table format. Register for the Subsurface Live! Conference dremio.com/subsurface Get Started with Dremio for Free Today: https://bit.ly/am-dremio-get-started-podcasts Try Out Dremio/Apache Iceberg on your Laptop: https://bit.ly/am-dremio-lakehouse-laptop

Duration:00:16:44