SHARE
Facebook X Pinterest WhatsApp

Gradle: Workflow Tools for Hadoop

Six Mistakes that Lead to Poor Enterprise Software Adoption When deploying Hadoop in a production environment, IT organizations often struggle with building applications that run across a Hadoop cluster. To solve that problem, the developers at LinkedIn created Gradle, a set of workflow tools that make it easier to connect multiple Hadoop jobs within the […]

Written By
MV
Mike Vizard
Aug 14, 2015
Slide Show

Six Mistakes that Lead to Poor Enterprise Software Adoption

When deploying Hadoop in a production environment, IT organizations often struggle with building applications that run across a Hadoop cluster. To solve that problem, the developers at LinkedIn created Gradle, a set of workflow tools that make it easier to connect multiple Hadoop jobs within the context of an application.

This week, LinkedIn announced that it is turning Gradle into an open source project. Alex Bain, senior software engineer for LinkedIn, says that LinkedIn has a vested interest in making Gradle, a plug-in to Hadoop, a bigger part of a rapidly growing Hadoop ecosystem. For example, as the Apache Spark in-memory computing project continues to evolve, Bain says that LinkedIn would like to see open source contributions that extended the reach of Gradle to both Hadoop and Spark.

At the core of Gradle is a domain-specific language called Hadoop DSL, which LinkedIn created to make Hadoop more accessible to its developers who need to work with Hadoop workflow managers such as Azkaban and Apache Oozie. Hadoop DSL is written in Groovy, a language derivative of Java, which provides developers with a consistent method of invoking multiple application development frameworks running on top of Hadoop.

Bain says LinkedIn relies heavily on Hadoop to surface data that’s relevant to its community in real time. LinkedIn can only do that, says Bain, because it found a way to unify all the Hadoop frameworks that developers need to invoke in order to create a production application.

LinkedIn, of course, is no stranger to the open source community. It has previously launched projects such as the Galene search engine, Pinot real-time analytics software, and Burrow monitoring tools for monitoring the Kafka messaging system, which is often deployed on top of Hadoop. In all four cases, IT organizations that choose make use of that software are relying on open source software that is core to how LinkedIn operates.

When it comes to Hadoop, many IT organizations are understandably intimidated by the dizzying array of frameworks that can be used to build Big Data applications. Over time, IT organizations working with Hadoop are going to be working with multiple instances of those frameworks. In the case of Gradle, LinkedIn is providing a convenient place from which to get started building Hadoop applications, and also to ultimately master those frameworks.

MV

Michael Vizard is a seasoned IT journalist, with nearly 30 years of experience writing and editing about enterprise IT issues. He is a contributor to publications including Programmableweb, IT Business Edge, CIOinsight and UBM Tech. He formerly was editorial director for Ziff-Davis Enterprise, where he launched the company’s custom content division, and has also served as editor in chief for CRN and InfoWorld. He also has held editorial positions at PC Week, Computerworld and Digital Review.

Recommended for you...

Python for Machine Learning: A Tutorial
Kihara Kimachia
Jun 20, 2022
Best Performance Testing Tools for 2022
Jenn Fulmer
Jun 14, 2022
Tips for Writing the Perfect Business Requirements Document
Jillian Koskie
May 24, 2022
IT Business Edge Logo

The go-to resource for IT professionals from all corners of the tech world looking for cutting edge technology solutions that solve their unique business challenges. We aim to help these professionals grow their knowledge base and authority in their field with the top news and trends in the technology space.

Property of TechnologyAdvice. © 2025 TechnologyAdvice. All Rights Reserved

Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.