The Jupyter Notebook is a web application that allows you to create and share documents that contain live code, equations, visualizations and explanatory text. In PNDA, it supports exploration and presentation of data from HDFS and HBase.
This tutorial demonstrates how to use Jupyter to access data and make sense of data interactively using the Jupyter pyspark interpreter. The data samples used in this lab are networking telemetry data and netflow data and persistent in PNDA platform.
This section explains how to manage PySpark application dependencies.