
Data Lake
Brian Murray
“Data Lake: Strategies and Best Practices for Storing, Managing, and Analyzing Big Data” is a comprehensive guide to understanding and implementing a data lake architecture. As the volume, velocity, and variety of generated data increase, organizations need to store and analyze large amounts of data to gain insights and make informed decisions.
This book covers the key concepts and principles of data lakes, including data ingestion, data transformation, and data governance. It also provides practical guidance on designing and implementing a data lake solution, including choosing the right technologies and tools, setting up security and access controls, and implementing data quality and data lineage.
Readers will learn about the different types of data lake architectures, including centralized and decentralized architectures, and the pros and cons of each. They will also discover best practices for managing and optimizing data lake performance, including data partitioning and compression, and techniques for data processing, such as batch processing and stream processing.
This book is a must-read for data architects, data engineers, data scientists, and anyone who wants to learn about data lake strategies and best practices for storing, managing, and analyzing big data. With its comprehensive coverage and practical guidance, this book is an essential resource for anyone working with big data.
Duration - 4h 21m.
Author - Brian Murray.
Narrator - Ray Collins.
Published Date - Tuesday, 23 January 2024.
Copyright - © 2024 Brian Murray.
Location:
United States
Language:
English
Opening Credits
Duration: 00:00:14
I Introduction
Duration: 00:16:06
II Designing a data lake
Duration: 00:25:44
III Data ingestion and management
Duration: 00:34:16
IV Data processing and analysis
Duration: 00:29:21
V Data lake and cloud
Duration: 00:44:34
VI Data lake and big data technologies
Duration: 00:29:14
VII Data lake management and operations
Duration: 00:46:29
VIII Case studies
Duration: 00:24:11
IX Future of data lake
Duration: 00:05:43
X Conclusion
Duration: 00:05:07
Ending Credits
Duration: 00:00:13