From the course: Problem Identification and Solution Design for Data Scientists

Unlock the full course today

Join today to access over 24,800 courses taught by industry experts.

The provenance of the data

The provenance of the data

- [Instructor] The details we just discussed matter, but I'm never going to ask enough clarifying questions in a single meeting to get a complete understanding of everything. So you have to have an overall goal to keep you organized. And here it is. You want to understand the provenance of the data. This word gets more use in auction houses and museums to describe the change in ownership of an object over time. But I think it's the perfect word to describe what we need at this stage. I asked the team to walk me through the process step-by-step, starting from when the data first gets generated. For example, it could be the very first purchase of a new retail customer who joins a loyalty program, or someone that just applied for a credit card, or visited a medical office for the first time. When possible, I've been known to pretend that I'm a new customer myself to check out what the process is like. Then I want to know every step of the process from the initial one to when I get to see…

Contents