Google Cloud Dataflow is #20 in Top 10 Big Data platforms

Google Cloud Dataflow
Build, deploy, and run data processing pipelines that scale to solve your key business challenges. Google Cloud Dataflow enables reliable execution for large scale data processing scenarios such as ETL, analytics, real-time computation, and process orchestration.

Positions in ratings


#20 in Top 10 Big Data platforms

Alternatives


The best alternatives to Google Cloud Dataflow are: Apache Spark, Google Cloud Dataproc, Amazon EMR



Latest news about Google Cloud Dataflow


2015. Google partners with Cloudera to bring Cloud Dataflow to Apache Spark



Google has announced a collaboration with Cloudera, the Hadoop specialists, to integrate its Cloud Dataflow programming model into Apache's Spark data processing engine. By bringing Cloud Dataflow to Spark, developers gain the ability to create and monitor data processing pipelines without the need to manage the underlying data processing cluster. This service originated from Google's internal tools for processing large datasets at a massive scale on the internet. However, not all data processing tasks are identical, and sometimes it becomes necessary to run tasks in different environments such as the cloud, on-premises, or on various processing engines. With Cloud Dataflow, data analysts can utilize the same system to create pipelines, regardless of the underlying architecture they choose to deploy them on.