Cloud Big Data platforms

Updated: January 03, 2020
Cloud Big Data platforms provide Big Data as a Service

2018. Big Data platforms Cloudera and Hortonworks merge



Over the years, Hadoop, the once high-flying open-source platform, gave rise to many companies and an ecosystem of vendors. The problem with Hadoop was its sheer complexity. That’s where companies like Hortonworks and Cloudera came in. They packaged it for IT departments that wanted the advantages of a big data processing platform but didn’t necessarily want to build Hadoop from scratch. These companies offered different ways of attacking that complexity, but over time, with all the cloud-based big data solutions available, rolling your own Hadoop system came to seem futile, even with the help of companies like Cloudera and Hortonworks. Today the two companies announced that they are merging in a deal worth $5.2 billion. The combined company will have 2,500 customers, $720 million in revenue and $500 million in cash with no debt, according to the companies.


2015. Hortonworks acquired dataflow solutions developer Onyara



Hortonworks, a publicly traded company selling a commercial distribution of the Hadoop open-source big data software, announced today that it has acquired Onyara, an early-stage startup whose employees developed Apache NiFi, a piece of open-source software that was first used inside the National Security Agency (NSA). Apache NiFi makes it possible to deliver sensor data to the right systems and keep track of what happens to the data along the way. Hortonworks, which itself spun out of Yahoo, has previously acquired XA Secure and SequenceIQ. Hortonworks will now sell a new subscription based on the Apache NiFi software, under the name Hortonworks DataFlow.


2014. Big Data as a Service company Qubole raises $13 million



Hadoop-as-a-service startup Qubole has raised a $13 million series B round of venture capital. Qubole is hosted on the Amazon Web Services cloud, but can also run on Google Compute Engine, and behaves as one might expect a cloud-native Hadoop service to behave. It has a graphical user interface and connectors to several common data sources (including cloud object stores), and it takes advantage of cloud capabilities such as autoscaling and spot pricing for compute. What’s interesting about Qubole is that although it originally boasted optimized versions of Hive and other MapReduce-based tools, the company also lets users analyze data using the Facebook-created Presto SQL-on-Hadoop engine, and is working on a service around the increasingly popular and very fast Apache Spark framework.


2014. Enterprise Hadoop provider Hortonworks filed for an IPO



Hortonworks, the company building commercial Hadoop technology, has filed for its initial public offering. The company claims more than $33 million in revenue for the year thus far and an operating loss of nearly $88 million. Hortonworks spun off from Yahoo in 2011. It offers a big data processing platform that can process various types of data, including SQL and NoSQL sources, and then lets users search across the data or use various analytics tools to visualize it. Hortonworks has a reputation for being a pure Hadoop offering without any proprietary extensions.


2013. SAP makes big companies effective with Big Data. Competitors are crying


In recent years, SAP was probably the least innovative IT giant (compared to competitors Oracle, Microsoft and IBM). SAP's own innovative projects mostly failed (remember Business ByDesign), and the only thing SAP could do was buy other companies (SuccessFactors, Sybase, Ariba). But this time SAP is set to outdo all its competitors on the wave of a trendy new technology: Big Data. What is Big Data? Big Data is a set of technologies for fast processing of very large amounts of data, both structured and unstructured. How is Big Data different from what came before? Roughly speaking, instead of a single server working with multiple data warehouses, there is one database running on multiple servers. Perhaps you, as a businessman or manager, don't care, but as a result of this change an IT system can run much faster. For large companies, data management turns into a nightmare. The video above shows how one beautiful database turns into a monstrous IT system that can't even tell you how many units of a product you can sell to a customer right now, because the product is stored in multiple warehouses and can be reserved by other branches, and because your system does not know when the supplier will deliver a new batch, since that is controlled by another system. Big Data promises large companies the opportunity to manage their business in real time. SAP was one of the first to create its own Big Data platform, called SAP HANA, and it recently also announced that its ERP system, SAP Business Suite, can run on top of this platform. Experts say that neither Oracle, nor Microsoft, nor IBM is able to offer such an integrated solution for large companies, and that they are not even close to SAP in this segment. But that is not the end of the bad news for the competition. SAP's ERP system has always relied on database management systems developed by Oracle, Microsoft and IBM. Now SAP will move those customers to its own platform, and they account for about 60% of the large companies in the world.