Google Cloud Dataproc is #21 in Top 10 Big Data platforms

Google Cloud Dataproc
Google Cloud Dataproc is a managed Hadoop MapReduce, Spark, Pig, and Hive service designed to process large datasets easily and cost-effectively. You can quickly create managed clusters of any size and turn them off when you are finished, so you pay only for what you use. Cloud Dataproc is integrated with several other Google Cloud Platform products, giving you access to a simple, powerful, and complete data processing platform.

Positions in ratings


#21 in Top 10 Big Data platforms

Alternatives


The best alternative to Google Cloud Dataproc is Google Cloud Dataflow



Latest news about Google Cloud Dataproc


2015. Google launched a new managed Big Data service, Cloud Dataproc



Google is expanding its portfolio of big data services on the Google Cloud Platform with the introduction of Cloud Dataproc. The new service fills the gap between managing the Spark data processing engine or the Hadoop framework directly on virtual machines and using a fully managed service like Cloud Dataflow to orchestrate data pipelines on Google's platform. With Cloud Dataproc, users can deploy a Hadoop cluster in less than 90 seconds, which is considerably faster than other available services. Google charges 1 cent per virtual CPU per hour in the cluster, on top of the standard costs of running the virtual machines and storing data. Users can also add Google's cheaper preemptible instances to their clusters to reduce compute costs. Billing is calculated per minute, with a minimum charge of 10 minutes. Because Dataproc clusters deploy so quickly, users can create ad hoc clusters whenever they need them, while Google takes care of the administrative tasks.
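
To make the pricing above concrete, here is a minimal Python sketch of the Dataproc surcharge alone, using only the figures from the 2015 announcement: 1 cent per virtual CPU per hour, per-minute billing, and a 10-minute minimum. The function name `dataproc_fee` is hypothetical, and rounding partial minutes up to the next whole minute is an assumption, since the announcement does not say how fractions of a minute are handled. VM and storage costs are deliberately left out, and current pricing may differ.

```python
import math

# Figures taken from the 2015 announcement; current pricing may differ.
RATE_USD_PER_VCPU_HOUR = 0.01
MINIMUM_BILLED_MINUTES = 10


def dataproc_fee(vcpus: int, runtime_minutes: float) -> float:
    """Estimate the Dataproc surcharge in USD for one cluster run.

    Excludes the underlying VM and storage charges.
    Assumption: partial minutes are rounded up to a whole minute.
    """
    if vcpus < 0 or runtime_minutes < 0:
        raise ValueError("vcpus and runtime_minutes must be non-negative")
    # Per-minute billing with a 10-minute floor.
    billed_minutes = max(MINIMUM_BILLED_MINUTES, math.ceil(runtime_minutes))
    return vcpus * RATE_USD_PER_VCPU_HOUR * billed_minutes / 60


if __name__ == "__main__":
    # A 24-vCPU cluster running for 45 minutes.
    print(f"45 min: ${dataproc_fee(24, 45):.2f}")
    # The same cluster torn down after 3 minutes is still billed for 10.
    print(f" 3 min: ${dataproc_fee(24, 3):.2f}")
```

Under these assumptions, a short ad hoc job costs the same surcharge whether it runs for 1 minute or 10, which is worth keeping in mind when creating many very short-lived clusters.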