SHARE
Facebook X Pinterest WhatsApp

Syncsort Adds Support for Apache Spark and Kafka

Flawed Integration Can Destroy Data Quality and Reliability When it comes to Big Data applications, a triumvirate of open source technologies has emerged as the most dominant platforms being used. The first is obviously Hadoop, followed closely by Apache Spark in-memory computing clusters and Apache Kafka, a real-time messaging platform. After already adding support for […]

Written By
MV
Mike Vizard
Oct 12, 2015
Slide Show

Flawed Integration Can Destroy Data Quality and Reliability

When it comes to Big Data applications, a triumvirate of open source technologies has emerged as the most dominant platforms being used. The first is obviously Hadoop, followed closely by Apache Spark in-memory computing clusters and Apache Kafka, a real-time messaging platform.

After already adding support for Hadoop, Syncsort is now extending the reach of its extract, transform and load (ETL) software out to Apache Spark and Kafka.

Tendü Yoğurtçu, general manager of Syncsort’s Big Data business, says that while Hadoop is still the most widely deployed of the three open source platforms, interest in Apache Spark is rising sharply. The reason for this, says Yoğurtçu, is that rather than running Big Data analytics applications in batch mode, many organizations want to be able to run those applications in real time using an in-memory platform that makes it possible for them to blend data from multiple sources.

Taken together, Yoğurtçu says the combination of Hadoop, Spark and Kafka is creating the foundation for building more agile data warehouses. Naturally, the degree to which that combination of technologies will replace traditional data warehouses remains to be seen. The one thing that is for certain is that collectively these technologies will serve as a platform for running many of the analytics applications that formerly ran on data warehouse platforms that cost several orders of magnitude more to acquire, manage and deploy.

In fact, to help facilitate that transition, Syncsort has contributed a connector it created for integrating Spark with mainframe systems, where many of those data warehouses run, to the open source community.

Obviously, IT organizations are not going to replace data warehouse platforms that often contain terabytes, sometimes even petabytes, of data overnight. But in the months and years ahead, a lot of data will be bi-directionally moving between those legacy data warehouse systems and clusters running instances of Hadoop and Spark.

MV

Michael Vizard is a seasoned IT journalist, with nearly 30 years of experience writing and editing about enterprise IT issues. He is a contributor to publications including Programmableweb, IT Business Edge, CIOinsight and UBM Tech. He formerly was editorial director for Ziff-Davis Enterprise, where he launched the company’s custom content division, and has also served as editor in chief for CRN and InfoWorld. He also has held editorial positions at PC Week, Computerworld and Digital Review.

Recommended for you...

Top RPA Tools 2022: Robotic Process Automation Software
Jenn Fulmer
Aug 24, 2022
Metaverse’s Biggest Potential Is In Enterprises
Tom Taulli
Aug 18, 2022
The Value of the Metaverse for Small Businesses
IT Business Edge Logo

The go-to resource for IT professionals from all corners of the tech world looking for cutting edge technology solutions that solve their unique business challenges. We aim to help these professionals grow their knowledge base and authority in their field with the top news and trends in the technology space.

Property of TechnologyAdvice. © 2025 TechnologyAdvice. All Rights Reserved

Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.