Example Applications

This repository contains a number of example applications that can be built and run on PNDA. Each application directory contains more detailed information.

Spark Streaming

  • Examples of consuming data from Kafka and populating both HBase and OpenTSDB with simple Scala based Spark Streaming applications.

Spark

  • Example of consuming data ingested by Gobblin on a batch basis and producing Parquet datasets, optimized for consumption by Impala.

Jupyter

  • Example of a notebook for manipulating network data.

H2O

  • Application that runs the H2O data science platform as an application on PNDA.

Compound Packages

  • An example of a package containing multiple application component types, in this case a Spark app and related Jupyter notebook.

results matching ""

    No results matching ""