Cloudera Previews Tool for Analyzing and Optimizing Queries

Written By
Mike Vizard
Nov 19, 2015

One of the most frequently underestimated aspects of embracing Big Data is the impact that shifting to a platform such as Hadoop can have on existing queries. Most of the queries organizations run today have been extended multiple times over the years, usually with little documentation.

Cloudera wants to help IT organizations get a better handle on how their queries are actually structured. To that end, the version 5.5 update to its Hadoop distribution includes a beta of Cloudera Navigator Optimizer, a tool for analyzing and optimizing queries.

The product is based on technology that Cloudera gained via the acquisition of Xplain.io earlier this year.

Ewa Ding, product manager of Cloudera Navigator Optimizer, says the goal is to enable IT organizations to optimize legacy queries as they move to make Hadoop a core element of their modern data warehouse environments.

Ding notes that many organizations are somewhat intimidated by the complexity of existing queries. They rely heavily on the results of those queries to run their business, yet few people in those organizations actually understand how the queries work. In fact, the person who wrote the original query might have long since moved on from that organization.

Organizations often discover that once they move data into Hadoop, query performance can suffer. That issue has almost nothing to do with Hadoop itself; it stems from the way those queries are structured, says Ding.

Most existing queries need to be optimized to one degree or another even if an organization isn’t embracing Hadoop. The challenge is that, without a tool to facilitate that process, manually going through each query to see how it’s written winds up being one of those many IT tasks that never get done.

Michael Vizard is a seasoned IT journalist, with nearly 30 years of experience writing and editing about enterprise IT issues. He is a contributor to publications including Programmableweb, IT Business Edge, CIOinsight and UBM Tech. He formerly was editorial director for Ziff-Davis Enterprise, where he launched the company’s custom content division, and has also served as editor in chief for CRN and InfoWorld. He also has held editorial positions at PC Week, Computerworld and Digital Review.
