YoVDO

Identifying Sensitive Data and Mitigating Risk in Apache Spark and Databricks

Offered By: Databricks via YouTube

Tags

Data Governance Courses Apache Spark Courses Databricks Courses Compliance Courses Risk Mitigation Courses Data Labeling Courses Delta Lake Courses

Course Description

Overview

Discover how to identify sensitive data and mitigate risk in Apache Spark and Databricks environments in this 27-minute video presentation. Learn to leverage BigID's Data Discovery-in-Depth technology to uncover sensitive data elements before building algorithms. Scale discovery and labeling processes to maintain context for all data in your Delta Lake, keeping pace with rapid data growth. Explore how to implement necessary guardrails around your data by understanding its contents. Gain insights on applying BigID's discovery platform to know all data inside Spark, select optimal datasets for analysis, identify sensitive information with relevant compliance policies, and add context to data for better understanding of data scientists' activities. Master techniques for maintaining a unified analytics platform while adhering to policies and regulations governing sensitive information.

Syllabus

Introduction
What is BigID
Data Discovery Intelligence Foundation
Joint Value
BigID
Catalog
Cluster Analysis
Correlation


Taught by

Databricks

Related Courses

Data Processing with Azure
LearnQuest via Coursera
Mejores prácticas para el procesamiento de datos en Big Data
Coursera Project Network via Coursera
Data Science with Databricks for Data Analysts
Databricks via Coursera
Azure Data Engineer con Databricks y Azure Data Factory
Coursera Project Network via Coursera
Curso Completo de Spark con Databricks (Big Data)
Coursera Project Network via Coursera