Blurring the Line Between Developer and Data Scientist with PixieDust
Offered By: Devoxx via YouTube
Course Description
Overview
Explore the world of data science through Notebooks and PixieDust in this 52-minute conference talk from Devoxx. Learn how PixieDust, an open-source library, enhances efficiency for data scientists and developers working with Jupyter Notebooks and Apache Spark. Discover features like auto-visualization of Spark DataFrames, real-time Spark Job progress monitoring, seamless cloud service integration, and automated local installation of Python and Scala kernels. Gain insights into using PixieDust for effortless data visualization and exploration without coding, applicable to both Python and Scala environments. Witness a demonstration combining Twitter, Watson Tone Analyzer, Spark Streaming, and real-time visualizations within a Notebook, showcasing the practical applications of this powerful tool.
Syllabus
Blurring the line between Developer and Data scientist with PixieDust by David Taieb
Taught by
Devoxx
Related Courses
Intro to StatisticsStanford University via Udacity Introduction to Data Science
University of Washington via Coursera Passion Driven Statistics
Wesleyan University via Coursera Information Visualization
Indiana University via Independent DCO042 - Python For Informatics
University of Michigan via Independent