DataWorks is an end-to-end big data development and governance platform that provides data warehousing, data lake, and data lakehouse solutions based on big data compute engines, such as MaxCompute, Hologres, E-MapReduce (EMR), AnalyticDB, and CDH. Since 2009, DataWorks has been refining and enhancing Alibaba's big data development methodology to support data mid-end construction. DataWorks partners with public service sectors, state-owned enterprises, and customers across various industries, including finance, retail, Internet, energy, and manufacturing, to improve data application efficiency and facilitate industry digital transformation.
Service architecture
DataWorks has developed and accumulated hundreds of core capabilities over more than ten years. DataWorks provides data modeling, data integration, data development, data governance, data security, and data analysis services. These services deliver end-to-end data governance capabilities to help enterprises reduce data processing costs, increase data value, and unleash data productivity.
Service activation
DataWorks supports only Google Chrome 69 and later and the new Microsoft Edge (Chromium) on PCs.
The first time you use DataWorks, we recommend that you follow the instructions in this section to purchase features and resources. For more information, see Purchase guide.
Recommended configurations
Recommendation reasons
Software: We recommend that you activate DataWorks Professional Edition. This edition provides services such as DataStudio, Operation Center, Data Map, and Data Quality and can meet your requirements for standard data warehouse building.
Resources: We recommend that you purchase a pay-as-you-go serverless resource group. The first time you purchase an edition of DataWorks, the system automatically purchases a serverless resource group that is charged based on the pay-as-you-go billing method. You cannot cancel the purchase operation. In addition, the first time you purchase an edition of DataWorks, the system automatically creates a default virtual private cloud (VPC) and a default vSwitch and associates the resource group with the VPC and vSwitch. For more information, see Activate DataWorks.
Customer use cases
Big Data Center of State Grid Corporation of China (SGCC): DataWorks helped achieve centralized management of petabytes of data for SGCC and 27 subordinate provincial and municipal corporations. DataWorks also helped SGCC accelerate the digital transformation and upgrade of business using the end-to-end governance and monitoring systems for data mid-ends.
Mondelēz International (Fortune Global 500): Mondelēz China used DataWorks Data Modeling to perform end-to-end data model governance. This helped Mondelēz China significantly improve the self-service capability of data mid-ends, delegate data-related decision making, and unleash the digital power of the new retail industry.
iDreamSky (a listed company): iDreamSky replaced the self-developed scheduling system with DataWorks based on open source EMR, which enabled technical personnel in the company to focus more on business and facilitated digital operations of the gaming industry.
For more information about customer use cases, see Customer cases.
Development history
Development history within Alibaba Group
Since 2009, DataWorks has been used to build data mid-ends and data governance capabilities within Alibaba Group over multiple technology phases based on big data compute engines such as MaxCompute and Hologres. DataWorks has more than 50,000 daily active users within Alibaba Group. This indicates that one out of three employees in Alibaba Group uses DataWorks on average. DataWorks supports over 300 data applications and serves more than 100 business units within Alibaba Group.
Development history of DataWorks on the cloud
DataWorks was migrated to the cloud in 2015. Since then, DataWorks has launched services for Alibaba Cloud users based on the big data building methodology accumulated over the years. DataWorks continuously enhances its end-to-end data governance capabilities and is committed to improving data management and enhancing data value by collaborating with customers and partners from various industries and fields.
Learning path
You can quickly learn about the concepts, basic operations, and advanced operations of DataWorks from the Learning Path displayed on the documentation homepage of DataWorks.
Support for DataWorks
You can submit a ticket to contact technical support to obtain pre-sales and after-sales services.