YoVDO

Apache Spark vs Databricks - On-Premise Performance Comparison

Offered By: Databricks via YouTube

Tags

Apache Spark Courses
Data Science Courses
Databricks Courses
High Performance Computing Courses
Service-Oriented Architecture Courses
Air-Gapped Environments Courses

Course Description

Overview

Explore an on-premise comparison of open-source Apache Spark and the Databricks Runtime Environment (DBR) in this 21-minute conference talk by Booz Allen's Cyber AI team. Discover how the team achieved 10x performance gains on real-world cyber workloads by running DBR in an air-gapped environment. Learn about the challenges of doing data science in sensitive operations and the solutions the team adopted, including a service-oriented architecture for capability deployment and a project architecture focused on high-performance computing. Gain insights into the results of open-source Spark vs. Spark on DBR, along with lessons learned for future on-premise installations in data-sensitive environments.

Syllabus

Intro
The Challenge: Go Fast... On-Premise?
Solution: A Service-Oriented Architecture for Capability Deployment
Project Architecture: Focused on High Performance Computing
Results: Spark Open Source vs Spark DBR
Lessons Learned for Future On-Premise Installs


Taught by

Databricks

Related Courses

Delivering Secure and Compliant Software Components with OCM and GitOps
Linux Foundation via YouTube
Curating and Securing Open Source for GovCloud
Linux Foundation via YouTube
Managing Kubernetes in Air Gap and Offline Environments
Linux Foundation via YouTube
High-Security, Zero-Connectivity and Air-Gapped Clouds - Delivering Complex Software with OCM and Flux
CNCF [Cloud Native Computing Foundation] via YouTube
How to Make Your K8s Cluster Survive Without Internet Access - Airgap Solutions
CNCF [Cloud Native Computing Foundation] via YouTube