Technology Podcasts

Datacast follows the narrative journey of data practitioners and researchers to unpack the career lessons they learned along the way. James Le hosts the show.

Datacast follows the narrative journey of data practitioners and researchers to unpack the career lessons they learned along the way. James Le hosts the show.


United States


Datacast follows the narrative journey of data practitioners and researchers to unpack the career lessons they learned along the way. James Le hosts the show.






Episode 81: Research, Engineering, and Product in Machine Learning with Aarti Bagul

Timestamps New York UniversityACM chapterWomen in ComputingStanford UniversityCS 230CheXNetMURALanding AIThreshold Venture FellowshipAI FundSnorkel AISnorkelAarti’s Contact Info LinkedInTwitterGoogle ScholarPeople Andrew NgJohn LangfordDavid SontagBooks and Papers “The Art of Doing Science & Engineering“Deep Medicine: How AI Can Make Healthcare Human Again“CheXNet: Radiologist-Level Pneumonia Detection on Chest X-Rays with Deep Learning“MURA: Large Dataset for Abnormality Detection in...


Episode 80: Creating The Sense of Sight with Alberto Rizzoli

Timestamps Bayes Business SchoolSingularity UniversityAipolyV7 LabsAnnotationblog postDataset ManagementModel Automationnot losing sight of the ‘ideal customer’Alberto’s Contact Info WebsiteLinkedInTwitterMediumV7’s Resources WebsiteSoftware 2.0 BlogAcademy TutorialsDocumentationLinkedInTwitterMentioned Content Articles “7 Things We Looked for in a Video Labeling Tool“The Biggest Mistake I’ve Ever Made: Losing Sight of the Ideal CustomerTalks “An AI Narrator for the Blind“If The Blind...


Episode 79: Analytics Culture, Digital Contracting, and Data Angels with Jessica Cherny

Timestamps an immigrant familyRussiaData ScienceMobile Developers of BerkeleyData Science Society at BerkeleyCITRIS and the Banatao InstituteTAMID GroupAccel ScholarIroncladdigital contractingContract Lifecycle Managementbenchmark report analyzing economic trends caused by COVID-19building a data analytics culture from the ground upData Angels Communityusing databeautyfashionJessica’s Contact Info LinkedInTwitterData AngelsMentioned Content Resources How to use contract data during...


Episode 78: Open-Source Investing and Data Product Management with Julia Schottenstein

Timestamps StanfordQatalyst PartnersNew Enterprise AssociatesMetabaseSentrythe Series B round for Anyscalethe seed round for DatafoldMetabase application to help investors pick winning open-source startupsdbt CloudJulia’s Contact Info LinkedInTwitterdbt’s Resources Slack CommunityCoalesce 2021 Replaysdbt LearnGitHubEvents and MeetupsMentioned Content People Tristan HandyAli GhodsiDan LevineBook “Working Backwards: Insights, Stories, and Secrets from Inside AmazonNotes My conversation...


Episode 77: Delivering Modern Data Engineering with Einat Orr

Timestamps Tel Aviv UniversityCompugenFlash NetworksCorrelixSimilarWebTreeverselakeFSwhy data versioning-as-an-Infrastructure mattersdata meshensure data quality in a data lake environmentthe data engineering ecosystemEinat’s Contact Info LinkedInTwitterEmailMentioned Content lakeFS WebsiteGitHub@lakeFSTreeverseSlackBlog Posts Why We Built lakeFS: Atomic and Versioned Data Lake OperationsData Versioning — Does It Mean What You Think It Means?How To Manage Your Data The Way You Manage...


Episode 76: Modern Data Collaboration and Social Entrepreneurship with Prukalpa Sankar

Timestamps Nanyang Tech UniversityGoldman SachsSocialCopsQuora answerHow Big Data Can Influence Decisions That Actually MatterAtlanDataOps Culture CodeData Catalog 3.0a key value propdata qualitydata governancemodern data platformsthe trendsPeople-as-a-MoatPrukalpa’s Contact Info LinkedInTwitterMentioned Content Atlan (Twitter | LinkedIn | Facebook | Instagram | YouTube | Documentation) “Empowering Organizations to Become Masters of Their DataAtlan LabsHumans of Data InterviewsThe...


Episode 75: Commoditizing Data Integration Pipelines with Michel Tricot

Timestamps EPITA — School of Engineering and Computer ScienceSiemens Corporate ResearchFactSet Research SystemsMurexRapleafLiveRamprideOS ride-hail platformAirbytethe pivotthe visionAirbyte’s approach to building a connector manufacturing plantlist of challengesproduct roadmapbusiness modelsMichel’s Contact Info TwitterLinkedInGitHubMentioned Content Airbyte (Docs | Community | GitHub | Twitter | LinkedIn) HandbookRecipesCommunity CallOffice HoursConnector ContestBlog Posts “The Hard...


Episode 74: The Next Generation of Business Intelligence with Cindi Howson

Timestamps University of MarylandDow Chemicalthe Jones Business Schoolher MBA ThesisDeloitteBI ScorecardThe Data Warehousing InstituteGartnerMagic Quadrant for Analytics and BI PlatformsCritical CapabilitiesThoughtSpotdata-driven cultureSearchIQSpotIQThoughtSpot OneThoughtSpot EmbraceA New Era in Analytics and BI6 Top Trends and Predictions for Data, Analytics, and AI in 2021the movement of Data For GoodThe Data Chief Podcastthe challenges that keep women out of techCindi’s...


Episode 73: Datasets for Software 2.0 with Taivo Pungas

Timestamps blog post written in EstonianUniversity of TartuSkypeTransferWiseETH ZurichStarship TechnologiesData Specification ManifestoThe Two Loops Of Building Algorithmic ProductsVeriffdeveloped automation-heavy productsData Loops Are The Bottleneck In Applied AIYour AI Team Needs DataOpsDatasets Carve The Terrain of AITaivo's Contact WebsiteTwitterLinkedInMediumGoogle ScholarMentioned Content Blog Posts Data Specification ManifestoBuilding Automation-Heavy ProductsData Loops Are The...


Episode 72: Folding Data with Gleb Mezhanskiy

Timestamps Carnegie Mellon UniversityAutodeskLyftPhantom AutoDatafoldData Diffdata monitoringthe modern analytics stackData Quality meetupsGleb’s Contact Info LinkedInDatafoldTwitterLinkedInData Quality MeetupsMentioned Content Course Harvard’s CS50: Introduction to Computer ScienceBlog Posts Modern Analytics StackChoosing Data Warehouse for Analytics3 Ways To Be Wrong About Open-Source Data Warehousing SoftwareBuy Not BuildDatafold Raises a $2.1M Seed Round Led by NEADatafold + dbt:...


Episode 71: Trusted AI with Saishruthi Swaminathan

Timestamps Sri Sairam Engineering CollegeTata Consultancy ServicesSan Jose State Universitydisaster managementCenter for Open Source Data and AI TechnologiesDigital Discrimination: Cognitive Bias in Machine LearningElyradata visualization with Pythonwith RSaishruthi’s Contact Info TwitterLinkedInMediumGitHubCourseraMentioned Content Talks “Digital Discrimination: Cognitive Bias in Machine LearningProjects AI Fairness 360AI Explainability 360Adversarial Robustness ToolkitModel Asset...


Episode 70: Machine Learning Testing with Mohamed Elgendy

Timestamps Cairo University3D Business Analyst: The Ultimate Hands-On Guide to Mastering Business AnalysisBusiness Analysis for Beginners: Jump-Start Your BA Career in 4 WeeksTwilioAmazonMachine Learning universitySynapse Tech CorporationRakutenKolenaKolenaMowglyDeep Learning For Vision SystemsMohamed’s Contact Info LinkedInTwitterWebsiteYouTubeGitHubKolenaMentioned Content People Andrew TraskFrancois CholletLex FridmanBooks “Mindset“OutliersNotes My conversation with Mohamed was...


Episode 69: DataPrepOps, Active Learning, and Team Management with Jennifer Prendki

Show Notes Louis Pasteur UniversityParis-Sud UniversitySorbonne UniversityDuke UniversityQuantlab FinancialYuMeAyasdiWalmart LabsReview Analysis: An Approach to Leveraging User-Generated Content in the Context of RetailAtlassianAgile for Data Science TeamsFigure EightAlectioresponsible AIfight biasincrease accessibilitycreate more opportunities in AIJennifer’s Contact Info LinkedInTwitterMediumAlectio’s Resources WebsiteTwitterLinkedInWhat Is Alectio?Is Big Data Dragging Us Towards...


Episode 68: Threat Intelligence, Venture Stamina, and Data Investing with Sarah Catanzaro

Show Notes StanfordCenter for International Security and CooperationCenter for Advanced Defense StudiesCyveillancePalantirMattermarkCanvas VenturesstaminaAmplify PartnersSeries A round for OctoMLApache TVMthe seed round for Einblickseed round for Metaphor DataSeries A round for RunwayProjects To KnowSarah’s Contact Info Amplify PageTwitterLinkedInMediumAmplify Partners’ Resources WebsiteTeamPortfolioBlogMentioned Content Blog Posts Our Investment in OctoMLAnnouncing Our Investment in...


Episode 67: Model Observability, AI Bias, and ML Infrastructure Ecosystem with Aparna Dhinakaran

Show Notes UC BerkeleyEnergy and Sustainable Technologies labTubeMogulUberCornell UniversityMonitorMLEswarY-Combinatorthe acquisition of MonitorML by Arize AIongoingML Observabilityblogseriesdata preparationmodel buildingmodel validationmodel servingin The Amazing RaceAparna’s Contact Info TwitterLinkedInMediumForbes ColumnWebsiteGithubGoogle ScholarArize’s Resources WebsiteMediumLinkedInTwitterMentioned Content Blog Posts ML Infrastructure Tools for Data PreparationML Infrastructure...


Episode 66: Monitoring Models in Production with Emeli Dral

Show Notes Peoples’ Friendship University of RussiaYandex School of Data AnalysisRamblerYandexYandex Data FactoryMechanica AIMachine Learning and Data AnalysisBig Data EssentialsMoscow Institute of Physics and TechnologyYandex School of Data AnalysisHarbour.SpaceGraduate School of Management — St. Petersburg State UniversityEvidently AIPart 1Part 2Part 3Part 4Part 5Emeli’s Contact Info LinkedInTwitterCourseraGitHubMediumEvidently AI’s...


Episode 65: Chaos Theory, High-Frequency Trading, and Experimentations at Scale with David Sweet

Show Notes Duke UniversityUniversity of Maryland, College ParkTopology in Chaotic Scatteringfractal dimensionshigher-dimensional chaotic scatteringK Desktop Environmenta print bookAndamookaThales Fund ManagementLehman BrothersKCG/GETCOTeza TechnologiesGalaxy Digital Tradingoptimization of high-frequency trading systemsInstagram3Red PartnersTuning UpDavid’s Contact Info WebsiteLinkedInTwitterMentioned Content Publications Topology In Chaotic ScatteringFractal Dimension of...


Episode 64: Improving Access to High-Quality Data with Fabiana Clemente

Show Notes the University of LisbonNovabaseVodafoneODYSAIHabit AnalyticsLisbon School of Economics and ManagementNOVA IMS Information Management SchoolYDatadifferent techniques to generate synthetic dataher blog series on generating synthetic tabular datanovel design and optimization techniquesDifferential PrivacyThe Cost of Poor Data Qualitymodel explainabilityWhen Machine Learning Meets PrivacyFabiana’s Contact Info LinkedInMediumTwitterYData’s...


Episode 63: Real-World Transfer Learning with Azin Asgarian

Show Notes a girls-only high school in TehranUniversity of TehranUniversity of TorontoBabak TaatiDavid FleetBarriers to Adoption of Information Technology in HealthcareSubspace Selection to Suppress Confounding Source Domain Information in AAM Transfer LearningToronto Rehabilitation Institutealgorithmic biasesolder adults with dementiaGeorgianinjury prediction modela hybrid instance-based transfer learning postTractable AIAzin’s Contact Info WebsiteTwitterLinkedInGoogle...


Episode 62: Leading Organizations Through Analytics Transformations with Gordon Wong

Show Notes Rutgers UniversityAB Initio SoftwareSmarter Travel MediaClickSquaredCervelloFitbitSnowflakeezCaterZipcaredXHubSpotData Hierarchy of NeedsSnowflakeGordon’s Contact Info LinkedInMentioned Content People Tristan HandyMichael KaminskyAnalytics EngineeringBarr MosesData ObservabilityBook “Start With Why