Apache Spark vs Databricks - On-Premise Performance Comparison
Offered By: Databricks via YouTube
Course Description
Overview
Explore an on-premise comparison of Databricks and open-source Apache Spark in this 21-minute conference talk by Booz Allen's Cyber AI team. Discover how they achieved 10x performance gains on real-world cyber workloads using the Databricks Runtime (DBR) in an air-gapped environment. Learn about the challenges of implementing data science in sensitive operations and the solutions the team adopted, including a service-oriented architecture for capability deployment and a project architecture focused on high-performance computing. Gain insights into the results of open-source Spark versus Spark on DBR, along with lessons for future on-premise installations in data-sensitive environments.
Syllabus
Intro
The Challenge: Go Fast... On-Premise?
Solution: A Service-Oriented Architecture for Capability Deployment
Project Architecture: Focused on High Performance Computing
Results: Spark Open Source vs Spark DBR
Lessons Learned for Future On-Premise Installs
Taught by
Databricks
Related Courses
Delivering Secure and Compliant Software Components with OCM and GitOps (Linux Foundation via YouTube)
Curating and Securing Open Source for GovCloud (Linux Foundation via YouTube)
Managing Kubernetes in Air Gap and Offline Environments (Linux Foundation via YouTube)
High-Security, Zero-Connectivity and Air-Gapped Clouds - Delivering Complex Software with OCM and Flux (CNCF [Cloud Native Computing Foundation] via YouTube)
How to Make Your K8s Cluster Survive Without Internet Access - Airgap Solutions (CNCF [Cloud Native Computing Foundation] via YouTube)