Data Collection

Pyshark data collector

Collection of executable scripts to start data collection components that individually monitor, pre-process (even label), and store data from pre-defined types of data sources, more or less pre-configured by the respective script for ease of use and demonstration purposes. Each module contains the setup of a single specific data collector for a specific type of data, i.e. script to be started, either in separate python instances or via threads. Alternatively one can also launch these scripts directly via the command line.

Currently, the following (sub-)packaged scripts are provided:

  • Pyshark data collector - Collects live network traffic using pyshark, processes

it and stores it in a CSV file.

daisy.scripts.data_collection.pyshark_data_collector()

Creates a CSV file relay with all needed structures to provide it with data. There is the option to either use a live data capture on the local machine or to use data from a remote machine.