Analyzing Pwned Passwords with Apache Spark
Offered By: YouTube
Course Description
Overview
Explore the analysis of compromised passwords using Apache Spark in this conference talk from GrrCon 2018. Dive into big data concepts, Resilient Distributed Datasets (RDDs), and DataFrames while examining the current state of password security. Learn about data visualization techniques, password policies, and the benefits and challenges of using Spark for large-scale data processing. Discover insights on common password patterns, lengths, and suffixes, and understand the importance of addressing password reuse and credential stuffing attacks. Gain valuable knowledge on balancing security measures with user experience in the ever-evolving landscape of cybersecurity.
Syllabus
Introduction
Kelly Robinson Introduction
Agenda
Apache Spark
Big Data
RDDs
DataFrames
Performance
State of passwords
TryHunt
Zeppelin
Top Passwords
Lengths
Data visualizations
Suffixes
Transform Data
Password Policies
Spark Benefits
Spark Challenges
Java Stack Traces
Big Data Security
Security is on everyones mind
Nobody security is perfect
Users have bad passwords
Password reuse
Password security
Seamless user experience
Credential stuffing
Wrap up
Related Courses
CS115x: Advanced Apache Spark for Data Science and Data EngineeringUniversity of California, Berkeley via edX Big Data Analytics
University of Adelaide via edX Big Data Essentials: HDFS, MapReduce and Spark RDD
Yandex via Coursera Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames
Yandex via Coursera Introduction to Apache Spark and AWS
University of London International Programmes via Coursera