Azure Databricks

Azure Databricks is a cloud-based big data analytics platform. It provides a collaborative environment for data scientists, data engineers and business analysts to work together on big data projects. Azure Databricks on the Federal Science DataHub (FSDH) combines the power of Apache Spark with a collaborative notebook interface, making it easy to build and deploy data pipelines, machine learning models and analytics applications.

Azure Databricks is ideal for:

The FSDH allows users to provision Azure Databricks for research so that they can analyze data at large scale.

Visual Studio Code extension

Using the Databricks Visual Studio (VS) Code extension, users can connect to a Databricks workspace from within VS Code, allowing them to:

Google APIs and services

Users can call Google APIs from Databricks to run tools that are available only on Google Cloud, such as BigQuery or Earth Engine, extending what can be accomplished on the FSDH.
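As a rough illustration of calling a Google API from a Databricks notebook, the sketch below builds a request for the BigQuery REST `jobs.query` endpoint using only the Python standard library. The project ID, SQL text and access token are placeholders, and the helper names `build_query_request` and `run_query` are hypothetical. How the token is obtained is left out, since it depends on how Google credentials are set up for a given FSDH workspace (for instance, a service account key kept in a Databricks secret scope would be one possibility, but that is an assumption).

```python
import json
import urllib.parse
import urllib.request

# Base URL of the BigQuery REST API (v2).
BIGQUERY_API = "https://bigquery.googleapis.com/bigquery/v2"


def build_query_request(project_id: str, sql: str, access_token: str) -> urllib.request.Request:
    """Build, but do not send, a BigQuery jobs.query request.

    project_id and access_token are placeholders supplied by the caller;
    this sketch does not cover obtaining or refreshing the token.
    """
    url = f"{BIGQUERY_API}/projects/{urllib.parse.quote(project_id, safe='')}/queries"
    # useLegacySql=False selects GoogleSQL (standard SQL) syntax.
    body = json.dumps({"query": sql, "useLegacySql": False}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {access_token}",
            "Content-Type": "application/json",
        },
    )


def run_query(request: urllib.request.Request) -> dict:
    """Send the request and return the decoded JSON response."""
    with urllib.request.urlopen(request) as response:
        return json.load(response)


if __name__ == "__main__":
    # Hypothetical values, shown only to illustrate the shape of the call.
    req = build_query_request("my-gcp-project", "SELECT 1 AS answer", "ACCESS_TOKEN_PLACEHOLDER")
    print(req.get_method(), req.full_url)
```

Separating request construction from sending makes it possible to inspect the URL and payload without network access or valid credentials. In practice, Google's official client libraries would usually be a more convenient choice than hand-built REST calls.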

Sample use cases

Page details

2025-09-18