Apache Impala vs Cloudera
Last updated: October 04, 2018
Apache Impala is a modern, open source, distributed SQL query engine for Apache Hadoop.
Cloudera helps you become information-driven by leveraging the best of the open source community with the enterprise capabilities you need to succeed with Apache Hadoop in your organization. Designed specifically for mission-critical environments, Cloudera Enterprise includes CDH, the world’s most popular open source Hadoop-based platform, as well as advanced system management and data management tools plus dedicated support and community advocacy from our world-class team of Hadoop developers and experts. Cloudera is your partner on the path to big data.
Apache Impala vs Cloudera in our news:
2018 - Big Data platforms Cloudera and Hortonworks merge
Over the years, Hadoop, the once high-flying open-source platform, gave rise to many companies and an ecosystem of vendors emerged. The problem with Hadoop was the sheer complexity of it. That’s where companies like Hortonworks and Cloudera came in. They packaged it for IT departments that wanted the advantage of a big data processing platform, but didn’t necessarily want to build Hadoop from scratch. These companies offered different ways of helping to attack that complexity, but over time, with all the cloud-based big data solutions, rolling a Hadoop system seemed futile, even with the help of companies like Cloudera and Hortonworks. Today the two companies announced are merging in a deal worth $5.2 billion. The combined companies will boast 2,500 customers, $720 million in revenue and $500 million in cash with no debt, according to the companies.
2015 - Hortonworks acquired dataflow solutions developer Onyara
Hortonworks, a publicly traded company selling a commercial distribution of the Hadoop open-source big data software, announced today that it has acquired Onyara, an early-stage startup whose employees developed Apache NiFi, a piece of open-source software that was first used inside the National Security Agency (NSA). Apache NiFi allows to to deliver sensor data to the right systems and keep track of what was happening to the data. Hortonworks, which itself spun out of Yahoo, has previously acquired XA Secure and SequenceIQ. Now Hortonworks will be selling a new subscription based on the Apache NiFi software, under the name Hortonworks DataFlow.
2015 - Google partners with Cloudera to bring Cloud Dataflow to Apache Spark
Google announced that it has teamed up with the Hadoop specialists at Cloudera to bring its Cloud Dataflow programming model to Apache’s Spark data processing engine. With Google Cloud Dataflow, developers can create and monitor data processing pipelines without having to worry about the underlying data processing cluster. As Google likes to stress, the service evolved out of the company’s internal tools for processing large datasets at Internet scale. Not all data processing tasks are the same, though, and sometimes you may want to run a task in the cloud or on premise or on different processing engines. With Cloud Dataflow — in its ideal state — data analysts will be able use the same system for creating their pipelines, no matter the underlying architecture they want to run them on.
2014 - Enterprise Hadoop provider Hortonworks filed for an IPO
Hortonworks, the company building commercial Hadoop technology, has filed for its initial public offering. The company claims more than $33 million in revenue for the year thus far and nearly $88 million in operating loss. Hortonworks spun off from Yahoo in 2011. It offers a big data processing platform that includes the ability to process various types of data including SQL and NoSQL sources then search across data, or use various analytics tools to visualize the data. Hortonworks has a reputation for being a pure Hadoop offering without any proprietary extensions.
2014 - Cloudera helps to manage Hadoop on Amazon cloud
Hadoop vendor Cloudera announced a new product called Director that will make it easier for customers to manage their Hadoop clusters on the Amazon Web Services cloud. Senior Director of Product Marketing Clarke Patterson acknowledged that has not been easy to date while still maintaining the breadth of capabilities. Although there’s no difference between the cloud version and the on-premises version of the software, he added, the Director interface is designed to be self-service and includes cloud-specific capabilities such as instance-tracking so administrators can keep an eye on whose cloud instances are costing what.
2014 - Cloudera bought data-visualization startup DataPad to take on Pivotal
Cloud-based big data platform Cloudera has acquired a data-visualization startup DataPad which specializes in data analysis using the Python programming language. As Hadoop competition heats up, Cloudera might be ramping up its Python tooling in order to attract more data scientists and developers (DataPad co-founders are known in the data science community for having developed a Python-based data analysis library Pandas). It's not surprising considering the billions of dollars up for play in the commercial Hadoop market. Cloudera, Hortonworks, MapR, Pivotal and more are all trying to win over as many users as they can for their respective flavors of Hadoop and general big data infrastructure. Spreading the cheerleading base beyond IT staff and systems architects, to include the people actually developing applications and doing data analysis within the company, is a good way to help ensure your stuff is the stuff that gets used.
2014 - HP invests $50 million in Hortonworks
Cloud-based Big Data platforms Hortonworks and Cloudera offer commercial versions and enhancement to Apache Hadoop. Their duel have traded news of big-name tech investors for the past year or so. Intel, Google Ventures and In-Q-tel are in the Cloudera camp. Yahoo and HP are on team Hortonworks. Yesterday HP has invested $50 million in Hortonworks and HP’s CTO Martin Fink has joined the Hortonworks board. Under a year-old pact, HP could already resell Hortonworks Data Platform to its customers. In a statement, Hortonworks CEO Rob Bearden said the deal will accelerate our joint customers’ transition to a modern data architecture.