As organizations of all sizes amass more data than ever in the hope of making better business decisions faster, the quality of that data becomes increasingly critical. To help ensure that quality, Talend this week added data preparation tools to its integration platform that promise to make it simpler to collaborate on the cleansing of data using a self-service Data Stewardship application.
Ashley Stirrup, chief marketing officer for Talend, says the Data Stewardship application is designed to make it simpler for the average business user to maintain the quality of the data being added to a larger Big Data lake. Without that capability, organizations will find themselves having to hire dedicated data managers to perform those same functions, says Stirrup.
“Organizations run the risk of finding themselves in specialization hell,” says Stirrup.
To enable the level of self-service required to maintain data quality, the Talend Data Stewardship application includes a pre-configured data dictionary that auto-recognizes the meaning of the raw data stored in the data lake. Organizations can also augment the dictionary with their own vocabulary, such as product codes or names, or crowdsource new data definitions from external sources or the Talend Community.
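To make the idea of a data dictionary that recognizes what raw values mean more concrete, here is a minimal sketch in Python. It is not Talend's API or implementation: the class `SemanticDictionary`, its methods, the built-in patterns and the 80 percent threshold are all hypothetical, chosen only to illustrate how pre-configured types and an organization's own vocabulary might be combined to label a column.

```python
import re
from collections import Counter

# Hypothetical pre-configured entries: semantic type name -> pattern.
# These are illustrative only and are not taken from any Talend product.
BUILTIN_PATTERNS = {
    "email": re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$"),
    "us_zip": re.compile(r"^\d{5}(-\d{4})?$"),
    "iso_date": re.compile(r"^\d{4}-\d{2}-\d{2}$"),
}


class SemanticDictionary:
    """Toy dictionary that guesses the semantic type of raw string values."""

    def __init__(self):
        self.patterns = dict(BUILTIN_PATTERNS)
        self.vocabularies = {}  # type name -> set of known terms

    def add_vocabulary(self, name, terms):
        """Augment the dictionary with an organization's own terms,
        such as product codes or names (matched case-insensitively)."""
        self.vocabularies[name] = {t.strip().lower() for t in terms}

    def classify_value(self, value):
        """Return the semantic type of one value, or None if unknown."""
        text = value.strip()
        # Custom vocabularies are checked before the built-in patterns.
        for name, terms in self.vocabularies.items():
            if text.lower() in terms:
                return name
        for name, pattern in self.patterns.items():
            if pattern.match(text):
                return name
        return None

    def classify_column(self, values, threshold=0.8):
        """Label a column with the most common type among its non-empty
        values, provided that type covers at least `threshold` of them."""
        non_empty = [v for v in values if v and v.strip()]
        if not non_empty:
            return None
        counts = Counter(self.classify_value(v) for v in non_empty)
        counts.pop(None, None)  # unrecognized values do not vote
        if not counts:
            return None
        name, hits = counts.most_common(1)[0]
        return name if hits / len(non_empty) >= threshold else None
```

In this sketch, a business user would call `add_vocabulary("product_code", [...])` once, after which `classify_column` can label columns containing those codes alongside the built-in types. Columns that fall below the threshold are left unlabeled, which is where a human data steward would step in.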
With the new release, Talend is moving to combine data integration and governance into a single workflow. Rather than requiring organizations to acquire a separate set of data quality tools, Talend is making it simpler to federate management of those functions between end users, who better understand what specific data sets represent, and the IT organization tasked with making it easier to combine data residing in multiple applications.