News

What’s maybe more exciting, though, is something Databricks calls Project Lightspeed, which the company describes as the next generation of the Spark streaming engine.
First created as part of a research project at UC Berkeley AMPLab, Spark is an open source project in the big data space, built for sophisticated analytics, speed, and ease of use. It unifies critical ...
The Apache Spark community has improved support for Python to such a great degree over the past few years that Python is now a “first-class” language, and no longer a “clunky” add-on as it once was, ...
Databricks Cloud will provide Spark-based streaming analysis as a service Taking on Google, Databricks plans to offer its own cloud service for analyzing live data streams, one based on the Apache ...
Still, Databricks’ announcements today failed to address its in-memory data processing capabilities, which Mueller said was Spark’s biggest strength but also its biggest weakness.
Databricks Cloud handles the metadata, launching and provisioning a Spark Cluster, and makes it easy for that cluster to process an organization's data stored in Amazon's S3 service.