Datacast-logo

Datacast

Technology Podcasts

Datacast follows the narrative journey of data practitioners and researchers to unpack the career lessons they learned along the way. James Le hosts the show.

Datacast follows the narrative journey of data practitioners and researchers to unpack the career lessons they learned along the way. James Le hosts the show.

Location:

United States

Description:

Datacast follows the narrative journey of data practitioners and researchers to unpack the career lessons they learned along the way. James Le hosts the show.

Language:

English

Contact:

5852868783


Episodes

Episode 73: Datasets for Software 2.0 with Taivo Pungas

9/26/2021
Timestamps blog post written in EstonianUniversity of TartuSkypeTransferWiseETH ZurichStarship TechnologiesData Specification ManifestoThe Two Loops Of Building Algorithmic ProductsVeriffdeveloped automation-heavy productsData Loops Are The Bottleneck In Applied AIYour AI Team Needs DataOpsDatasets Carve The Terrain of AITaivo's Contact WebsiteTwitterLinkedInMediumGoogle ScholarMentioned Content Blog Posts Data Specification ManifestoBuilding Automation-Heavy ProductsData Loops Are The...

Duration:01:14:55

Episode 72: Folding Data with Gleb Mezhanskiy

9/17/2021
Timestamps Carnegie Mellon UniversityAutodeskLyftPhantom AutoDatafoldData Diffdata monitoringthe modern analytics stackData Quality meetupsGleb’s Contact Info LinkedInDatafoldTwitterLinkedInData Quality MeetupsMentioned Content Course Harvard’s CS50: Introduction to Computer ScienceBlog Posts Modern Analytics StackChoosing Data Warehouse for Analytics3 Ways To Be Wrong About Open-Source Data Warehousing SoftwareBuy Not BuildDatafold Raises a $2.1M Seed Round Led by NEADatafold + dbt:...

Duration:01:07:52

Episode 71: Trusted AI with Saishruthi Swaminathan

9/8/2021
Timestamps Sri Sairam Engineering CollegeTata Consultancy ServicesSan Jose State Universitydisaster managementCenter for Open Source Data and AI TechnologiesDigital Discrimination: Cognitive Bias in Machine LearningElyradata visualization with Pythonwith RSaishruthi’s Contact Info TwitterLinkedInMediumGitHubCourseraMentioned Content Talks “Digital Discrimination: Cognitive Bias in Machine LearningProjects AI Fairness 360AI Explainability 360Adversarial Robustness ToolkitModel Asset...

Duration:01:07:54

Episode 70: Machine Learning Testing with Mohamed Elgendy

8/29/2021
Timestamps Cairo University3D Business Analyst: The Ultimate Hands-On Guide to Mastering Business AnalysisBusiness Analysis for Beginners: Jump-Start Your BA Career in 4 WeeksTwilioAmazonMachine Learning universitySynapse Tech CorporationRakutenKolenaKolenaMowglyDeep Learning For Vision SystemsMohamed’s Contact Info LinkedInTwitterWebsiteYouTubeGitHubKolenaMentioned Content People Andrew TraskFrancois CholletLex FridmanBooks “Mindset“OutliersNotes My conversation with Mohamed was...

Duration:00:59:56

Episode 69: DataPrepOps, Active Learning, and Team Management with Jennifer Prendki

8/16/2021
Show Notes Louis Pasteur UniversityParis-Sud UniversitySorbonne UniversityDuke UniversityQuantlab FinancialYuMeAyasdiWalmart LabsReview Analysis: An Approach to Leveraging User-Generated Content in the Context of RetailAtlassianAgile for Data Science TeamsFigure EightAlectioresponsible AIfight biasincrease accessibilitycreate more opportunities in AIJennifer’s Contact Info LinkedInTwitterMediumAlectio’s Resources WebsiteTwitterLinkedInWhat Is Alectio?Is Big Data Dragging Us Towards...

Duration:01:21:56

Episode 68: Threat Intelligence, Venture Stamina, and Data Investing with Sarah Catanzaro

7/14/2021
Show Notes StanfordCenter for International Security and CooperationCenter for Advanced Defense StudiesCyveillancePalantirMattermarkCanvas VenturesstaminaAmplify PartnersSeries A round for OctoMLApache TVMthe seed round for Einblickseed round for Metaphor DataSeries A round for RunwayProjects To KnowSarah’s Contact Info Amplify PageTwitterLinkedInMediumAmplify Partners’ Resources WebsiteTeamPortfolioBlogMentioned Content Blog Posts Our Investment in OctoMLAnnouncing Our Investment in...

Duration:01:13:44

Episode 67: Model Observability, AI Bias, and ML Infrastructure Ecosystem with Aparna Dhinakaran

6/28/2021
Show Notes UC BerkeleyEnergy and Sustainable Technologies labTubeMogulUberCornell UniversityMonitorMLEswarY-Combinatorthe acquisition of MonitorML by Arize AIongoingML Observabilityblogseriesdata preparationmodel buildingmodel validationmodel servingin The Amazing RaceAparna’s Contact Info TwitterLinkedInMediumForbes ColumnWebsiteGithubGoogle ScholarArize’s Resources WebsiteMediumLinkedInTwitterMentioned Content Blog Posts ML Infrastructure Tools for Data PreparationML Infrastructure...

Duration:00:45:34

Episode 66: Monitoring Models in Production with Emeli Dral

6/9/2021
Show Notes Peoples’ Friendship University of RussiaYandex School of Data AnalysisRamblerYandexYandex Data FactoryMechanica AIMachine Learning and Data AnalysisBig Data EssentialsMoscow Institute of Physics and TechnologyYandex School of Data AnalysisHarbour.SpaceGraduate School of Management — St. Petersburg State UniversityEvidently AIPart 1Part 2Part 3Part 4Part 5Emeli’s Contact Info LinkedInTwitterCourseraGitHubMediumEvidently AI’s...

Duration:00:43:27

Episode 65: Chaos Theory, High-Frequency Trading, and Experimentations at Scale with David Sweet

5/30/2021
Show Notes Duke UniversityUniversity of Maryland, College ParkTopology in Chaotic Scatteringfractal dimensionshigher-dimensional chaotic scatteringK Desktop Environmenta print bookAndamookaThales Fund ManagementLehman BrothersKCG/GETCOTeza TechnologiesGalaxy Digital Tradingoptimization of high-frequency trading systemsInstagram3Red PartnersTuning UpDavid’s Contact Info WebsiteLinkedInTwitterMentioned Content Publications Topology In Chaotic ScatteringFractal Dimension of...

Duration:00:54:46

Episode 64: Improving Access to High-Quality Data with Fabiana Clemente

5/18/2021
Show Notes the University of LisbonNovabaseVodafoneODYSAIHabit AnalyticsLisbon School of Economics and ManagementNOVA IMS Information Management SchoolYDatadifferent techniques to generate synthetic dataher blog series on generating synthetic tabular datanovel design and optimization techniquesDifferential PrivacyThe Cost of Poor Data Qualitymodel explainabilityWhen Machine Learning Meets PrivacyFabiana’s Contact Info LinkedInMediumTwitterYData’s...

Duration:00:53:25

Episode 63: Real-World Transfer Learning with Azin Asgarian

5/5/2021
Show Notes a girls-only high school in TehranUniversity of TehranUniversity of TorontoBabak TaatiDavid FleetBarriers to Adoption of Information Technology in HealthcareSubspace Selection to Suppress Confounding Source Domain Information in AAM Transfer LearningToronto Rehabilitation Institutealgorithmic biasesolder adults with dementiaGeorgianinjury prediction modela hybrid instance-based transfer learning method.blog postTractable AIAzin’s Contact Info WebsiteTwitterLinkedInGoogle...

Duration:01:02:39

Episode 62: Leading Organizations Through Analytics Transformations with Gordon Wong

4/27/2021
Show Notes Rutgers UniversityAB Initio SoftwareSmarter Travel MediaClickSquaredCervelloFitbitSnowflakeezCaterZipcaredXHubSpotData Hierarchy of NeedsSnowflakeGordon’s Contact Info LinkedInMentioned Content People Tristan HandyMichael KaminskyAnalytics EngineeringBarr MosesData ObservabilityBook “Start With Why

Duration:01:13:03

Episode 61: Meta Reinforcement Learning with Louis Kirsch

4/18/2021
Show Notes Hasso Plattner InstituteDifferentiable Convolutional Neural Network Architectures for Time Series ClassificationTransfer Learning for Speech Recognition on a BudgetUniversity College LondonDavid BarberModular Networks: Learning to Decompose Neural ComputationScaling Neural Networks Through SparsityCharacteristics of Machine Learning Research with ImpactContemporary Challenges in Artificial Intelligencepart 1 on universal AIpart 2 on active inferenceIDSIAJürgen Schmidhubera very...

Duration:01:00:02

Episode 60: Algorithms and Data Structures for Massive Datasets with Dzejla Medjedovic

4/5/2021
Show Notes Sarajevo School of Science and TechnologyStony Brook UniversityUpper and Lower Bounds on Sorting and Searching in External MemoryDon’t Thrash: How to Cache Your Hash on FlashThe batched predecessor problem in external memoryundergraduategraduateInternational University of SarajevoAlgorithms and Data Structures for Massive DatasetsDzejla’s Contact Info LinkedInTwitterGoogle ScholarMentioned Content Papers “Upper and Lower Bounds on Sorting and Searching in External Memory“Don’t...

Duration:00:53:55

Episode 59: Bridging The Gap Between Data and Models with Willem Pienaar

3/24/2021
Show Notes Stellenbosch UniversitySystems AnywhereINDEFFGojekClockworkMerlinFeastTuringCloud Next 2018KubeCon 2019Feastproduct roadmapTecton’s recent backing of FeastWillem’s Contact Info TwitterLinkedInGitHubMentioned Content Feast feast.dev#Feastdocs.feast.devfeast-dev/feaststackoverflow.com/questions/tagged/feastwiki.lfaidata.foundation/display/FEAST/Feast+Home@feast_devArticle An Introduction to Gojek’s Machine Learning PlatformIntroducing Feast: An Open-Source Feature Store For...

Duration:00:46:01

Episode 58: Deep Learning Meets Distributed Systems with Jim Dowling

3/19/2021
Show Notes Trinity College Dublindynamic software architecturethe K-Component modelcollaborativereinforcement learningonline optimization problems in dynamic systemsMySQLRISE Research Institute of Swedensearch algorithmwalk topologyGradientTVGliveDivision of Software and Computer SystemsSchool of Electrical Engineering and Computer ScienceKTH Royal Institute of TechnologyDistributed SystemsDeep Learning on Big DataKTH Royal Institute of TechnologyDistributed TensorFlowHopsFSHopsworksFeature...

Duration:00:59:22

Episode 57: Building Data Science Projects with Pier Paolo-Ippolito

3/6/2021
Show Notes University of Southamptonhis final undergraduate projectthe AI SocietyCausal Reasoning in Machine LearningFidessaDigital-DandelionSAS InstituteTowards Data Sciencehis blog postshis Augmented Reality Personal Business CardTensorFlow.jsml5.jsPlotlyR ShinyStreamlitrPier’s Contact Info WebsiteLinkedInTwitterGitHubMediumPatreonKaggleMentioned Content “Alleviate Children’s Health Issues Through Games and Machine Learning“Causal Reasoning in Machine LearningAndrej KarpathyCassie...

Duration:00:53:31

Episode 56: Apprehending Quantum Computation with Alba Cervera-Lierta

2/21/2021
Timestamps The University of BarcelonaOperational Approach to Bell Inequalities: Application to QutritsUniversity of OxfordUniversity of Madridmaximal entanglement and the fundamental symmetries of high-energy physicsMultipartite Entanglement in Spin Chains and The HyperdeterminantQuanticQuantum Computation: Playing The Quantum SymphonyExact Ising Model Simulation On A Quantum ComputerTeach Me QISKit challenge from IBMQuantum Circuits For the Maximally Entangled StatesData Re-Uploading For...

Duration:01:16:38

Episode 55: Making Apache Spark Developer-Friendly and Cost-Effective with Jean-Yves Stephan

2/11/2021
Timestamps Ecole PolytechniqueStanfordMachine LearningMining Massive DatasetsLiveRampDatabricksData Mechanicsthe launch blog postPros and Cons of Running Apache Spark on KubernetesSpark on Kubernetes Made EasyData Mechanics Delightcustomized Spark UIopen-sourced how to be successful with Apache Spark in 2021the Y Combinator program in summer 2019His Contact Info TwitterLinkedInData MechanicsHis Recommended Resources Jure LeskovecJeff BezosMatei Zaharia“Designing For Data-Intensive...

Duration:00:52:01

Episode 54: Information Retrieval Research, Data Science For Space Missions, and Open-Source Software with Chris Mattmann

2/4/2021
Timestamps University of Southern CaliforniaNASA Jet Propulsion LabDr. Nenad MedvidovićSoftware Connectors For Highly-Distributed And Voluminous Data-Intensive SystemsApache Software FoundationApache TikaJérôme CharronTika In ActionJukka ZittingUSC Viterbi School of EngineeringSoftware ArchitecturesInformation Retrieval and Web Search EnginesContent Detection and Analysis for Big Datathis USC articleInformation Retrieval and Data Science groupMEMEXXDATAObject-Oriented Data...

Duration:01:22:42