From the course: Problem Identification and Solution Design for Data Scientists
Unlock the full course today
Join today to access over 24,800 courses taught by industry experts.
Metadata vs. data
From the course: Problem Identification and Solution Design for Data Scientists
Metadata vs. data
- [Instructor] Here's a common dilemma. You've got to prepare an estimate of the level of effort for data preparation, either for an internal budget or for an external statement of work. But you don't have access to all of the data yet. You're waiting on a badge, or you're waiting on a laptop, or you're waiting on a login, or access to the data. Even if you're internal, there are countless potential roadblocks. Count on it. It's always something. You can't let this hold up the whole project, especially when it's so common. So what can you do? The business understanding phase shouldn't require that you have every bit of data, and an internal plan, in an external SOW shouldn't require this either. I know it's desirable, I get it, but you have to have a plan to work around a problem that's so common. So first, start with metadata and data dictionaries. Ask your colleagues to give you any summary reports that have been run. These things are usually easier to get, and metadata is never…