onefuzz

Telemetry

OneFuzz reports two types of telemetry, both via AppInsights.

OneFuzz records fully featured attributable data is to a user-owned AppInsights instance. This goal of this information is to enable users to perform detailed analysis of their fuzzing tasks.
OneFuzz reports non-attributable minimal set of runtime statistics to Microsoft via a Microsoft managed AppInsights instance. The goal is to provide insight to the efficacy of OneFuzz and fuzzing engines used in OneFuzz. Information regarding the the users of a OneFuzz instance, any applications under test, or any bug details found via fuzzing is not intended to be recorded in in this telemetry.

Who owns OneFuzz Resources

For the purposes of this document, a “OneFuzz instance” is a user-deployed install of OneFuzz in that user’s Azure Subscription.

The user owns and manages all resources used for OneFuzz, including the fuzzing nodes. OneFuzz supports both “managed” nodes, where OneFuzz orchestrates the lifecycle of the fuzzing nodes via Azure VM Scale Sets, and “unmanaged” nodes, where users provide compute however they wish (be that on-premise hardware, third-party clouds, etc).

How telemetry is collected

All telemetry is gathered from two places, the agents that run within fuzzing nodes and the service API running in the Azure Functions instance.

The rust library onefuzz::telemetry provides a detailed set of telemetry types, as well as the function can_share_with_microsoft, which gates if a given telemetry field should be sent to the Microsoft central telemetry instance.
The Python library onefuzzlib.telemetry provides a filtering mechanism to identify a per-object set of filtering records. Each ORM backed table provides a mechanism to identify the field should be sent to the Microsoft central telemetry instance. Example: The onefuzzlib.jobs.Job.telemetry_include implementation describes the set of fields that are to be recorded.

These mechanisms ensure that any only fields intended to be recorded are sent to the central telemetry service.

How to disable sending telemetry to Microsoft

Remove ONEFUZZ_TELEMETRY in the Application settings of the Azure Functions instance in the OneFuzz instance created during deployment.

Users are reminded of how to disable the telemetry during each OneFuzz deployment to Azure.

Data sent to Microsoft

The following describes the information sent to Microsoft if telemetry is enabled.

Definitions of common data types

The following are common data types used in multiple locations:

Instance ID - A randomly generated GUID used to uniquely identify an instance of OneFuzz
Task ID - A randomly generated GUID used to uniquely identify a fuzzing task.
Job ID - A randomly generated GUID used to uniquely identify a job.
Machine ID - A GUID used to identify the machine running the task. When run in Azure, this is the VM Unique ID. When fuzzing is run outside of Azure, this is a randomly generated GUID created once per node.
Scaleset ID - A randomly generated GUID used to uniquely identify a VM scaleset.
Task Type - The type of task being executed. Examples include generic_crash_report or coverage. For a full list, see the enum TaskType.
OS - An enum value describing the OS used (Currently, only Windows or Linux).
Version - A compile-time generated string that specifies the OneFuzz version number based on CURRENT_RELEASE and the sha-1 git revision (See example).
Role - An enum value describing the role of the OneFuzz software in use. Examples include Agent or Proxy. For a full list, see the enum Role.

Data recorded by Agents

Task ID
Job ID
Machine ID
Task Type
Features - A u64 representing the number of ‘features’ in the SanCovcoverage map for a libFuzzer executable.
Covered - A u64 representing the number of ‘features’ in the SanCovcoverage map for a libFuzzer executable that were exercised during fuzzing.
Rate - A float64 that is calculated as (Covered / Features).
Count - Number of executions done by the fuzzing task.
ExecsSecond - The rate of executions per second.
WorkerID - For fuzzers that run multiple copies concurrently on a single VM, this is differentiates telemetry between each instance on the VM.
RunID - A randomly generated GUID used to uniquely identify the execution of a fuzzing target. For fuzzers that restart, such as libfuzzer, this is used to uniquely identify telemetry for each time the fuzzer is started.
VirtualMemory - The amount virtual memory in use by the fuzzing task.
PhysicalMemory - The amount of physical memory in use by the fuzzing task.
CpuUsage - The amount of CPU in use by the fuzzing task.
Crash Found - A flag that indicates that a crash was found.
Crash Report Created - A flag that indicates a crash was found to be reproducible and a report was generated.
Unique Crash Report Created - A flag that indicates that a crash was found to be reproducible and unique in the set of existing reports.
Tool Name - A string that identifies the tool in use for generic tasks. For custom tools, this will record the custom tool name. Examples: In the radamsa template, this is {tools_dir}/radamsa for the generic_generator task and cdb.exe for the generic_analysis task.

The following are AFL specific:

Mode - A string representing the mode of the AFL task. This is unique to parsing AFL stats, and specifies the “target_mode” that AFL is running in. Examples include, but are not limited to: “default”, “qemu”, and “persistent”.
CoveragePaths - A u64 representing paths_total in AFL stats.
CoveragePathsFavored - A u64 representing paths_favored in AFL stats.
CoveragePathsFound - A u64 representing paths_found in AFL stats.
CoveragePathsImported - A u64 representing paths_imported in AFL stats.
Coverage - A float64 representing bitmap_cvg in AFL stats.

Data recorded by the Service

Each time the state of a job changes, the following information is recorded:

Job ID
State - The current state of the job. For a full list, see the enum JobState.

Each time the state of a task changes, the following information is recorded at the service level:

Task ID
Job ID
Task Type
state: The current state of the task. For a full list, see the enum TaskState.
VM count: The number of VMs used for the task.

Each time the state of a scaleset changes, the following information is recorded:

Scaleset ID
OS
VM SKU - The Azure VM Size
Size - The number of VMs in the scalset. For a full list, see the enum ScalesetState.
Spot Instances - A boolean representing if Spot Instances are used in a scaleset.

Each time the state of a pool changes, the following information is recorded:

Pool ID - A randomly generated GUID used to uniquely identify a VM scaleset.
OS
State - The current state of the pool. For a full list, see the enum PoolState.
Managed - A boolean representing if the pool is OneFuzz manages the VMs in use.

Each time the state of a fuzzing node changes, the following information is recorded:

Scaleset ID
Machine ID
State - the current state of the node. For a full list, see the enum NodeState.

Each time the state of a task on a node changes, the following information is recorded:

Task ID
Machine ID
State - the current state of the task on the node. For a full list, see the enum NodeTaskState.

This site is open source. Improve this page.