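Recently I ran into the concept of Data Vault at work, so I have started a new round of learning. As a Java developer, I mostly deal with code and implement business logic, so getting a first glimpse into the world of data makes everything feel brand new. As a clueless newbie, the first thing to do is of course to figure out what Data Vault actually is, so let's start from 5W1H.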
What
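The Data Vault model was proposed by Dan Linstedt in the 1990s, and its modeling approach is mainly inspired by complex networks found in nature.
According to Wikipedia, Data Vault is defined as follows: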
Data Vault modeling is a database modeling method that is designed to provide long-term historical storage of data coming in from multiple operational systems. It is also a method of looking at historical data that deals with issues such as auditing, tracing of data, loading speed and resilience to change as well as emphasizing the need to trace where all the data in the database came from. This means that every row in a data vault must be accompanied by record source and load date attributes, enabling an auditor to trace values back to the source.
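For a Java developer, the last sentence of that definition may be easier to picture in code. The sketch below is only an illustration of the idea that every row carries a load date and a record source. The class `AuditedRowSketch`, the nested `CustomerRow`, the `customerNumber` field, and the `describeOrigin` helper are all hypothetical names made up for this example; they are not part of any Data Vault standard or library.

```java
import java.time.Instant;
import java.util.Objects;

// Illustrative sketch only: all names here are hypothetical.
final class AuditedRowSketch {

    /** A row that cannot exist without its two audit attributes. */
    static final class CustomerRow {
        private final String customerNumber; // business key (assumed for the example)
        private final Instant loadDate;      // when the row was loaded
        private final String recordSource;   // which operational system it came from

        CustomerRow(String customerNumber, Instant loadDate, String recordSource) {
            // Reject rows that are missing any of the three values.
            this.customerNumber = Objects.requireNonNull(customerNumber, "customerNumber");
            this.loadDate = Objects.requireNonNull(loadDate, "loadDate");
            this.recordSource = Objects.requireNonNull(recordSource, "recordSource");
        }

        String getCustomerNumber() { return customerNumber; }
        Instant getLoadDate() { return loadDate; }
        String getRecordSource() { return recordSource; }
    }

    /** What an auditor would read off a row to trace it back to its source. */
    static String describeOrigin(CustomerRow row) {
        return row.getCustomerNumber() + " loaded from " + row.getRecordSource()
                + " at " + row.getLoadDate();
    }

    private AuditedRowSketch() { }
}
```

Making both audit fields mandatory in the constructor mirrors the "every row must be accompanied by" wording: a row without a source or a load date simply cannot be built. In a real warehouse these would be table columns rather than Java fields.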
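According to the book 《数据架构 大数据、数据仓库以及DATA VAULT》 (Data Architecture: A Primer for the Data Scientist: Big Data, Data Warehouse and Data Vault):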
Data Vault