Skip to main content

Azul Runner

Core framework for writing Python plugins for Azul.

It handles filtering and fetching events, setting up data streams, and parsing and validating plugin results before posting output events back to the dispatcher.

Additionally, it allows plugins to be run locally from the commandline, with local files or folders as input data.

Usage

To run a plugin (after installing azul-runner and the plugin as Python packages):

# against local samples
$ azul-plugin-(name) local_file.bin
$ azul-plugin-(name) local_folder/

# against remote dispatcher
$ azul-plugin-(name) --server http://server.address/path

# with custom config
$ azul-plugin-(name) -c KEY VALUE -c KEY2 VALUE2

Plugins may define multiple entrypoints, check the plugins setup.py for details.

Plugin configuration may be passed in as environment variables.

Check ./azul_runner/settings.py for specific options

Plugin Development

Use the project azure-generator to generate boilerplate code for a new plugin.

For most common use cases, inherit from azul_runner.BinaryPlugin and implement the execute method. It is recommended to look at existing plugins in order to implement your plugin. You can also check binary_plugin.py and plugin.py for more properties that can be configured.

Advanced use cases may inherit from azul_runner.Plugin instead.

For more documentation on development process, see here

For more documentation on specific api usage and plugin code, see here

See structure for more information about the structure of azul-runner.

See migration guide to update your plugin in line with api changes in azul-runner.

Example Plugin

from azul_runner import BinaryPlugin, Feature, Job, cmdline_run, State, FeatureType


class LookForThings(BinaryPlugin):
"""Look for things."""

VERSION = "1.0"
SETTINGS = add_settings(
# You can filter assemblyline file types as below.
# These are the files that your plugin will process.
# Note: you can filter by prefix (document/).
# Check azul-bedrock/identify.yaml for valid file types.
filter_data_types={
"content": [
"text/plain",
"document/",
]
}
)
FEATURES = [
Feature("tag", "Custom tag", FeatureType.String),
]

def execute(self, job: Job):
"""Find peanuts."""
data = job.get_data()
# 'data' is a file-like object that supports seeking and being read from
# (The content may be retrieved in parts if the file is large and non-local)
header: bytes = data.read(7)
if header == b"PEANUT:":
# create a tag
self.add_feature_values("tag", "may contain nuts")
# add the next 24 bytes as a child
c = self.add_child_with_data(
relationship={"label": "peanut"},
data=data.read(24),
)
c.add_feature_values("tag", "may be hard to crack")
else:
return State.Label.OPT_OUT

if __name__ == '__main__':
cmdline_run(plugin=LookForThings)

Execution

As the example plugin was not constructed in azure-generator, running them requires directly executing the script they are contained in. A copy can be found in tests/example_plugins.py.

azul-runner$ python tests/example_plugins.py tests/data/peanut.txt
----- LookForThings results -----
COMPLETED

events (2)

event for cmdline_entity:None
{}
output features:
tag: may contain nuts

event for 7c4cd5274277dde41aa3f5e06cfca8c6cc703951a642b6c268cb43a2b345780a:None
{'label': 'peanut'}
child of cmdline_entity
output data streams (1):
24 bytes - EventData(hash='7c4cd5274277dde41aa3f5e06cfca8c6cc703951a642b6c268cb43a2b345780a', label='content')
output features:
tag: may be hard to crack

Feature key:
tag: Custom tag