What is Azure Databricks

Azure Databricks in Brief

Content to be covered

  1. Apache Spark-Based Analytics Platform
  2. Apache Spark in Azure Databricks
  3. Enterprise Security
  4. Integration with Azure Services
  5. Integration with Power BI

Introduction

Microsoft Azure provides a cloud-based platform service known as Azure Databricks, which is an Apache Spark-based analytics platform. It provides an interactive workspace that enables collaboration between data scientists, data engineers, and business analysts.

Apache Spark-Based Analytics Platform

Spark in Azure Databricks includes the following components:

  • Spark SQL and DataFrames
  • Streaming
  • MLlib
  • GraphX
  • Spark Core API

Apache Spark in Azure Databricks

Built on Spark, Azure Databricks provides a zero-management cloud platform that includes:

  1. Fully managed Spark Clusters
  2. An interactive workspace for exploration and visualization
  3. A platform for powering Spark-based applications
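
To make "fully managed Spark clusters" a little more concrete, here is a minimal sketch of the kind of cluster description a user hands to the service, leaving provisioning and scaling to the platform. The field names follow the Databricks Clusters REST API as commonly documented; the helper `build_cluster_spec` and every value passed to it (cluster name, runtime version, VM size) are illustrative assumptions, not settings taken from this post.

```python
import json


def build_cluster_spec(name, spark_version, node_type_id,
                       min_workers=1, max_workers=4, idle_minutes=30):
    """Return a dict describing an autoscaling cluster.

    Hypothetical helper: it only assembles the request body and does not
    call any Azure or Databricks service.
    """
    if min_workers > max_workers:
        raise ValueError("min_workers must not exceed max_workers")
    return {
        "cluster_name": name,
        "spark_version": spark_version,
        "node_type_id": node_type_id,
        # The platform adds or removes workers within this range.
        "autoscale": {"min_workers": min_workers, "max_workers": max_workers},
        # Idle clusters are shut down after this many minutes.
        "autotermination_minutes": idle_minutes,
    }


# Placeholder values; check your own workspace for valid runtime and VM names.
spec = build_cluster_spec("demo-cluster", "13.3.x-scala2.12", "Standard_DS3_v2")
print(json.dumps(spec, indent=2))
```

The point of the sketch is that the user states what they want (size range, idle timeout) and never manages individual machines.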

Enterprise Security

Azure Databricks provides enterprise-grade Azure security.

  • Integration with Azure Active Directory enables you to run complete Azure-based solutions using Azure Databricks.
  • Enterprise-grade SLAs.
  • Azure Databricks role-based access enables fine-grained user permissions for notebooks, clusters, jobs, and data.

Integration with Azure Services

Azure Databricks integrates deeply with Azure databases and data stores, such as SQL Data Warehouse, Cosmos DB, Data Lake Storage, and Blob Storage.
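
As a rough illustration of how two of these stores are addressed from Spark code, the sketch below builds the storage URIs that Spark readers accept. It assumes Data Lake Storage Gen2 (the `abfss://` scheme) and Blob Storage (the `wasbs://` scheme); the helper names and the container, account, and path values are hypothetical.

```python
def adls_gen2_uri(container, account, path=""):
    """Build an abfss:// URI for Azure Data Lake Storage Gen2."""
    return f"abfss://{container}@{account}.dfs.core.windows.net/{path.lstrip('/')}"


def blob_uri(container, account, path=""):
    """Build a wasbs:// URI for Azure Blob Storage."""
    return f"wasbs://{container}@{account}.blob.core.windows.net/{path.lstrip('/')}"


# Hypothetical container and storage account names.
lake_path = adls_gen2_uri("raw", "mystorageacct", "/sales/2024")
blob_path = blob_uri("logs", "mystorageacct", "app/events.json")
print(lake_path)
print(blob_path)
```

In a notebook, a URI like `lake_path` would typically be passed to a reader such as `spark.read.parquet(lake_path)` once the cluster has credentials for the storage account.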

Integration with Power BI

Azure Databricks integrates with Power BI. You can use other BI tools as well, such as Tableau Software, via JDBC/ODBC cluster endpoints.
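
For a sense of what a JDBC cluster endpoint looks like to a BI tool, here is a minimal sketch that assembles a connection URL. It assumes the newer Databricks JDBC driver, which uses the `jdbc:databricks://` prefix with token authentication (`AuthMech=3`, `UID=token`); older drivers use a different prefix and extra properties, so treat the exact format as an assumption to verify against your driver's documentation. The function name, hostname, HTTP path, and token below are all placeholders.

```python
def databricks_jdbc_url(server_hostname, http_path, token, port=443):
    """Assemble a JDBC URL for a cluster endpoint (token authentication)."""
    parts = [
        f"jdbc:databricks://{server_hostname}:{port}",
        f"httpPath={http_path}",
        "AuthMech=3",   # username/password style auth
        "UID=token",    # literal user name used with personal access tokens
        f"PWD={token}",
    ]
    return ";".join(parts)


# Placeholder values only; never hard-code a real token.
url = databricks_jdbc_url(
    "adb-0000000000000000.0.azuredatabricks.net",
    "sql/protocolv1/o/0/0000-000000-example",
    "<personal-access-token>",
)
print(url)
```

A BI tool is then pointed at this URL (or the equivalent ODBC settings) and queries the cluster like any other SQL source.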
