Cloudera alternatives

Cloudera
Cloudera helps you become information-driven by leveraging the best of the open source community with the enterprise capabilities you need to succeed with Apache Hadoop in your organization. Designed specifically for mission-critical environments, Cloudera Enterprise includes CDH, the world’s most popular open source Hadoop-based platform, as well as advanced system management and data management tools plus dedicated support and community advocacy from our world-class team of Hadoop developers and experts. Cloudera is your partner on the path to big data.
Here are the latest news about Cloudera:

2015 Google partners with Cloudera to bring Cloud Dataflow to Apache Spark


Google announced that it has teamed up with the Hadoop specialists at Cloudera to bring its Cloud Dataflow programming model to Apache’s Spark data processing engine. With Google Cloud Dataflow, developers can create and monitor data processing pipelines without having to worry about the underlying data processing cluster. As Google likes to stress, the service evolved out of the company’s internal tools for processing large datasets at Internet scale. Not all data processing tasks are the same, though, and sometimes you may want to run a task in the cloud or on premise or on different processing engines. With Cloud Dataflow — in its ideal state — data analysts will be able use the same system for creating their pipelines, no matter the underlying architecture they want to run them on.



2014 Cloudera helps to manage Hadoop on Amazon cloud


Hadoop vendor Cloudera announced a new product called Director that will make it easier for customers to manage their Hadoop clusters on the Amazon Web Services cloud. Senior Director of Product Marketing Clarke Patterson acknowledged that has not been easy to date while still maintaining the breadth of capabilities. Although there’s no difference between the cloud version and the on-premises version of the software, he added, the Director interface is designed to be self-service and includes cloud-specific capabilities such as instance-tracking so administrators can keep an eye on whose cloud instances are costing what.



2014 Cloudera bought data-visualization startup DataPad


Cloud-based big data platform Cloudera has acquired a data-visualization startup DataPad which specializes in data analysis using the Python programming language. As Hadoop competition heats up, Cloudera might be ramping up its Python tooling in order to attract more data scientists and developers (DataPad co-founders are known in the data science community for having developed a Python-based data analysis library Pandas). It's not surprising considering the billions of dollars up for play in the commercial Hadoop market. Cloudera, Hortonworks, MapR, Pivotal and more are all trying to win over as many users as they can for their respective flavors of Hadoop and general big data infrastructure. Spreading the cheerleading base beyond IT staff and systems architects, to include the people actually developing applications and doing data analysis within the company, is a good way to help ensure your stuff is the stuff that gets used.