Google Dataflow Programming Tools for Big Data Become Apache Project

Written By
Mike Vizard
Jan 21, 2016
Google, together with Cloudera, data Artisans, Cask and Talend, announced this week that the Dataflow programming model that Google created for developing streaming Big Data applications is now an open source Apache project.

Talend CTO Laurent Bride says this move is significant because it should give IT organizations more freedom to run their Big Data applications wherever they see fit.

Bride says Dataflow is gaining traction because it provides a programming model that enables developers to build Big Data applications that can run on multiple run-time engines. As a result, code developed using Dataflow can run on MapReduce, Apache Spark and Flink engines.
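The idea of writing a pipeline once and handing it to different engines can be illustrated with a toy sketch. This is not the Dataflow SDK: `PIPELINE`, `BatchRunner` and `StreamingRunner` are hypothetical names invented for the illustration, and the two "runners" are simple in-process stand-ins for engines such as Spark or Flink.

```python
from collections import Counter

# Hypothetical transforms: each takes an iterable and returns an iterable.
def split_words(lines):
    """Break each line into lowercase words."""
    for line in lines:
        yield from line.lower().split()

def drop_short(words):
    """Keep only words longer than two characters."""
    return (w for w in words if len(w) > 2)

# The pipeline is defined once, independently of how it will be executed.
PIPELINE = [split_words, drop_short]

class BatchRunner:
    """Stand-in for a batch engine: materializes every stage as a list."""
    def run(self, pipeline, source):
        data = list(source)
        for step in pipeline:
            data = list(step(data))
        return Counter(data)

class StreamingRunner:
    """Stand-in for a streaming engine: chains lazy generators and
    updates the counts one element at a time."""
    def run(self, pipeline, source):
        stream = iter(source)
        for step in pipeline:
            stream = step(stream)
        counts = Counter()
        for word in stream:
            counts[word] += 1
        return counts

lines = ["Big Data on Spark", "big data on Flink"]
batch_counts = BatchRunner().run(PIPELINE, lines)
stream_counts = StreamingRunner().run(PIPELINE, lines)
```

Both runners receive the same `PIPELINE` object and produce the same counts. Only the execution strategy differs, which is the kind of portability Bride describes.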

Longer term, Bride says, Talend is looking at applying its code generation and integration tools to Dataflow in a way that would make it simpler for organizations to marry traditional batch processing applications with modern real-time streaming applications accessing Big Data.

Organizations are starting to appreciate the need for a framework that makes it simpler to both adopt new Big Data processing engines and move between them as needed.

As with most emerging technologies, these engines are being developed faster than the average IT organization can absorb them. The Dataflow programming model in effect provides a layer of abstraction that lets that platform innovation continue without necessarily disrupting the code that has already been written.

Michael Vizard is a seasoned IT journalist with nearly 30 years of experience writing about and editing coverage of enterprise IT issues. He is a contributor to publications including ProgrammableWeb, IT Business Edge, CIO Insight and UBM Tech. He formerly was editorial director for Ziff-Davis Enterprise, where he launched the company’s custom content division, and has also served as editor in chief for CRN and InfoWorld. He has also held editorial positions at PC Week, Computerworld and Digital Review.
