BinaryPig - Scalable Malware Analytics in Hadoop
Offered By: Black Hat via YouTube
Course Description
Overview
Explore scalable malware analytics using Hadoop in this Black Hat USA 2013 conference talk. Learn how Endgame developed BinaryPig, an open framework built on Apache Hadoop, Apache Pig, and Python, to process and analyze massive amounts of malware data. Discover techniques for handling terabytes of binary data, extracting feature sets for machine learning, and performing large-scale malware studies. Gain insights into the challenges of processing millions of malware samples and how BinaryPig addresses issues of scalability, workflow development, and parallel processing. Examine the architecture, optimizations, and implementations of BinaryPig, including loaders, scripting, and web interface. Delve into general findings, feature extraction methods, clustering results, and icon analysis. Understand the lessons learned and future directions for scalable malware analytics in the face of ever-increasing data volumes.
Syllabus
Intro
Background
Malware data mining is useful
Pre-BinaryPig: Architecture
BinaryPig - Results Exploration
BinaryPig Loaders
Optimizations in BinaryPig
BinaryPig: Loader Implementations
BinaryPig: Scripting
Web Interface
General Findings
Feature Extraction
Feature Depth
Clustering Results **.
ICO Extraction
Icon Features
Lessons Learned
Future work
black hat USA 2013
Taught by
Black Hat
Related Courses
Computer Vision: The FundamentalsUniversity of California, Berkeley via Coursera Einführung in Computer Vision
Technische Universität München (Technical University of Munich) via Coursera 機器學習技法 (Machine Learning Techniques)
National Taiwan University via Coursera Machine Learning for Musicians and Artists
Goldsmiths University of London via Kadenze Прикладные задачи анализа данных
Moscow Institute of Physics and Technology via Coursera