Data-Based Projections

Data-Based Projections

Data is often the basis for how we see the world, and how the world sees us. Understanding these data-based projections is the focus of this podcast, which discusses topics related to data analytics, machine learning, and data science. Produced and hosted by Jim Harris.

Episodes

July 21, 2022 26 mins

Machine learning (ML) can provide unique analytical insights, as well as help automate some operational and decision-making processes more efficiently and effectively than non-ML alternatives. However, ML is also among the buzziest of buzzwords, and many are overselling and oversimplifying its usage. 

Do not let anyone frame a data analysis, business problem, or process improvement as an ML use case. Ins...

Mark as Played

Label Making. That is my simple two-word definition of Machine Learning. Machine Learning is Label Making. ML is LM. 

Especially supervised machine learning, which creates either numerical labels (using regression algorithms) to make predictions about a continuous data value (such as sale or stock prices), or categorical labels (using classification algorithms) to assign data to pre-defined groups also c...

Mark as Played

Based on one of my presentations, this episode provides a five-part vendor-neutral framework for evaluating the critical capabilities of a cloud data analytics solution: Deploy, Store, Optimize, Analyze, Govern. 

 

This episode is sponsored by: Vertica.com

 

Extended Show Notes: ocdqblog.com/dbp

 

Follow Jim Harris on Twitter: @ocdqblog

 

Email Jim Harris: ocdqblog.com/contact

 

Other ways to listen: bit.ly/listen-dbp 

 

Mark as Played
April 23, 2022 29 mins

A decade ago, just before the beginning of the data science hype cycle was the big data hype cycle. At that time I had the privilege of sitting down with Ph.D. Statistician Dr. Thomas C. Redman (aka the “Data Doc”). 

We discussed whether data quality matters less in larger data sets, if statistical outliers represent business insights or data quality issues, statistical sampling errors versus measurement calibration errors, mistaki...

Mark as Played
April 10, 2022 12 mins

Before you get started on any data analytics effort, you need to have at least preliminary answers to three questions: (1) What problem are we trying to solve?, (2) What data can we apply to that problem?, and (3) What analytical techniques can we apply to that data?  

 

This episode is sponsored by: Vertica.com

 

Extended Show Notes: ocdqblog.com/dbp

 

Follow Jim Harris on Twitter: @ocdqblog

 

Email Jim Harris: ocdqb...

Mark as Played
April 6, 2022 9 mins

In time for opening day of the 2022 Major League Baseball (MLB) season, I discuss the initial results of my Baseball Data Analysis Challenge.   

See the extended show notes for links to my input data, my results as a Microsoft Excel file, and my SQL scripts on GitHub.   

I used logistic regression machine learning classification models to calculate win probabilities for the Boston Red Sox across nine (9) game metrics, and a Naïve B...

Mark as Played

Why don’t more machine learning models graduate to production? Paige Roberts stops by to help explore this topic and drop some knowledge about how to get more machine learning models deployed in production.  

 

This episode is sponsored by: Vertica.com

 

Extended Show Notes: ocdqblog.com/dbp

 

Follow Jim Harris on Twitter: @ocdqblog

 

Email Jim Harris: ocdqblog.com/contact

 

Other ways to listen: bit.ly/listen-dbp 

 

Mark as Played
March 29, 2022 32 mins

Back in 2012, Harvard Business Review declared Data Scientist was The Sexiest Job of the 21st Century. Less than a year later, I recorded a podcast discussion with an actual data scientist and Ph.D. Statistician, Dr. Melinda Thielbar, during which she discussed what a data scientist actually does and provided a straightforward explanation of key concepts, such as signal-to-noise ratio, how statistical results should be presented an...

Mark as Played

Data Analytics, Machine Learning, and Data Science — those are the three things that this podcast focuses its discussions on. This episode provides my definitions in descending order of their complexity in terms of the depth of required knowledge, competencies, and practical, demonstrable skills related to computer science and programming, mathematics and statistics, critical thinking and overall approach to solving p...

Mark as Played
March 25, 2022 5 mins

Hello, World! Welcome to Episode Zero! Okay, technically it’s the first episode, but I’m a geek who thinks all indexes should start at 0 not 1. Anyway, this is more of a meta-episode introducing the host, explaining what the podcast is about, and letting you know what to expect from future episodes.

The focus of this podcast is to discuss topics related to data analytics, machine learning, and data science. The goal is to provide a...

Mark as Played

Popular Podcasts

    Current and classic episodes, featuring compelling true-crime mysteries, powerful documentaries and in-depth investigations.

    The Nikki Glaser Podcast

    Every week comedian and infamous roaster Nikki Glaser provides a fun, fast-paced, and brutally honest look into current pop-culture and her own personal life.

    Stuff You Should Know

    If you've ever wanted to know about champagne, satanism, the Stonewall Uprising, chaos theory, LSD, El Nino, true crime and Rosa Parks, then look no further. Josh and Chuck have you covered.

    Crime Junkie

    If you can never get enough true crime... Congratulations, you’ve found your people.

    Start Here

    A straightforward look at the day's top news in 20 minutes. Powered by ABC News. Hosted by Brad Mielke.

Advertise With Us
Music, radio and podcasts, all free. Listen online or download the iHeart App.

Connect

© 2024 iHeartMedia, Inc.