Data Lake platforms
Updated: September 28, 2022
2022. Salesforce built a data lake to transform how customer data moves on the platform
The ultimate goal of pulling customer data together into a customer data platform (CDP) is building more meaningful customer experiences in real time. To make this happen, Salesforce is launching Genie, a real-time data integration platform. It is a data lake that underlies the entire Salesforce platform, with the aim of moving data wherever it’s needed most, and doing it fast. Customer 360 applications (sales, service, commerce, marketing) now have access to an entirely new way of bringing data into Salesforce in real time and at scale. But Genie is more than just a data integration layer. By allowing data to flow faster and more freely, it opens up all kinds of automation possibilities, especially when combined with Einstein for AI and machine learning and Salesforce Flow, the company’s workflow tool.
2022. Cloudera launches its all-in-one SaaS data lakehouse
Cloudera, the Hadoop-centric big data company, is now putting its emphasis on becoming the unified data fabric for hybrid data platforms. The company took its next step in this direction with the launch of Cloudera Data Platform (CDP) One, a data lakehouse as a service (LaaS?). This managed offering is meant to give enterprises a platform that enables self-service analytics and data access for more of their employees. The company calls it the “first all-in-one data lakehouse SaaS offering,” though Databricks, which popularized the lakehouse concept, also offers SaaS-based solutions. It makes for good marketing copy, though, and Cloudera argues that its service is the first to combine compute, storage, ML, streaming analytics and enterprise security.
2022. Dremio raises $160M for its data lake platform
Data lake platform Dremio has raised a $160 million Series E. Dremio, along with competitors like Databricks, is playing in a newly invigorated market. For the longest time, data lakes and data warehouses looked like they would remain useful only for a relatively limited set of use cases. The idea of the lakehouse, which was first popularized by Databricks, is meant to signal that this new class of technologies now allows enterprises to do far more with their data.