Example Applications
This repository contains a number of example applications that can be built and run on PNDA. Each application directory contains more detailed information.
Spark Streaming
- Examples of simple Scala-based Spark Streaming applications that consume data from Kafka and populate HBase and OpenTSDB.
Spark
- Example of batch-processing data ingested by Gobblin to produce Parquet datasets optimized for consumption by Impala.
Jupyter
- Example of a notebook for manipulating network data.
H2O
- Example that runs the H2O data science platform as an application on PNDA.
Compound Packages
- Example of a package containing multiple application component types, in this case a Spark application and a related Jupyter notebook.