Report generation date: 2025-11-09

Project overview: dask

High level conclusions

The runtime code coverage of fuzz_fuse covers 6.598% of its statically reachable code. This means there is some place that blocks the fuzzer to continue exploring more code at run time.

Reachability and coverage overview

Functions statically reachable by fuzzers

13 / 4212

Cyclomatic complexity statically reachable by fuzzers

74 / 14648

Runtime code coverage of functions

153 / 4212

Warning: The number of runtime covered functions are larger than the number of reachable functions. This means that Fuzz Introspector found there are more functions covered at runtime than what is considered reachable based on the static analysis. This is a limitation in the analysis as anything covered at runtime is by definition reachable by the fuzzers.
This is likely due to a limitation in the static analysis. In this case, the count of functions covered at runtime is the true value, which means this is what should be considered "achieved" by the fuzzer.

Use the project functions table below to query all functions that were not covered at runtime.

Fuzzers overview

Fuzzer	Fuzzer filename	Functions Reached	Functions unreached	Fuzzer depth	Files reached	Basic blocks reached	Cyclomatic complexity	Details
fuzz_fuse	/src/fuzz_fuse.py	197	7991	4	6	89	623	fuzz_fuse.py

Project functions overview

The following table shows data about each function in the project. The functions included in this table correspond to all functions that exist in the executables of the fuzzers. As such, there may be functions that are from third-party libraries.

For further technical details on the meaning of columns in the below table, please see the Glossary .

Func name	Functions filename	Args	Function call depth	Reached by Fuzzers	Runtime reached by Fuzzers	Combined reached by Fuzzers	Fuzzers runtime hit	Func lines hit %	I Count	BB Count	Cyclomatic complexity	Functions reached	Reached by functions	Accumulated cyclomatic complexity	Undiscovered complexity

Fuzzer details

Fuzzer: fuzz_fuse

Call tree

The calltree shows the control flow of the fuzzer. This is overlaid with coverage information to display how much of the potential code a fuzzer can reach is in fact covered at runtime. In the following there is a link to a detailed calltree visualisation as well as a bitmap showing a high-level view of the calltree. For further information about these topics please see the glossary for full calltree and calltree overview

Full calltree

Call tree overview bitmap:

The distribution of callsites in terms of coloring is

Color Runtime hitcount Callsite count Percentage

red 0 131 43.2%

gold [1:9] 0 0.0%

yellow [10:29] 0 0.0%

greenyellow [30:49] 0 0.0%

lawngreen 50+ 172 56.7%

All colors 303 100

Color	Runtime hitcount	Callsite count	Percentage
red	0	131	43.2%
gold	[1:9]	0	0.0%
yellow	[10:29]	0	0.0%
greenyellow	[30:49]	0	0.0%
lawngreen	50+	172	56.7%
All colors	303	100

Fuzz blockers

The following nodes represent call sites where fuzz blockers occur.

Amount of callsites blocked	Calltree index	Parent function	Callsite	Largest blocked function
61	110	dask.utils.key_split	call site: 00110	dask.utils.key_split
18	85	dask.optimization.fuse	call site: 00085	dask.core.subs
10	246	dask.optimization.default_fused_keys_renamer._enforce_max_key_limit	call site: 00246	dask.utils.key_split
6	22	dask.optimization.fuse	call site: 00022	dask.core.flatten
6	56	dask.optimization.fuse	call site: 00056	.len
4	285	dask.optimization.default_fused_linear_keys_renamer	call site: 00285	dask.utils.key_split
3	40	dask.core.keys_in_tasks	call site: 00040	work.extend
3	49	dask.core.keys_in_tasks	call site: 00049	dask.core.get_dependencies
3	233	dask.utils.key_split	call site: 00233	word.isalpha
3	258	dask.optimization.fuse_linear	call site: 00258	dask.core.flatten
2	33	dask.optimization.fuse	call site: 00033	dask.config.get
2	75	dask.core.subs	call site: 00075	dask.core.subs

Runtime coverage analysis

Covered functions

202

Functions that are reachable but not covered

184

Reachable functions

197

Percentage of reachable functions covered

6.6%

NB: The sum of covered functions and functions that are reachable but not covered need not be equal to Reachable functions . This is because the reachability analysis is an approximation and thus at runtime some functions may be covered that are not included in the reachability analysis. This is a limitation of our static analysis capabilities.

Warning: The number of covered functions are larger than the number of reachable functions. This means that there are more functions covered at runtime than are extracted using static analysis. This is likely a result of the static analysis component failing to extract the right call graph or the coverage runtime being compiled with sanitizers in code that the static analysis has not analysed. This can happen if lto/gold is not used in all places that coverage instrumentation is used.

Function name	source code lines	source lines hit	percentage hit

Files reached

filename	functions hit
/	1
...fuzz_fuse	12
dask.optimization	52
dask.config	2
dask.core	13
dask.utils	128

Analyses and suggestions

Optimal target analysis

Remaining optimal interesting functions

The following table shows a list of functions that are optimal targets. Optimal targets are identified by finding the functions that in combination, yield a high code coverage.

Func name	Functions filename	Arg count	Args	Function depth	instr count	bb count	cyclomatic complexity	Reachable functions	Incoming references	total cyclomatic complexity	Unreached complexity
`dask.array.routines.cov`	dask.array.routines	8	['N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']	27	1	21	11	1158	1	3766	3688
`dask.array._array_expr._overlap.sliding_window_view`	dask.array._array_expr._overlap	4	['N/A', 'N/A', 'N/A', 'N/A']	27	1	5	5	754	0	2400	581
`dask.dataframe.dask_expr._collection.to_numeric`	dask.dataframe.dask_expr._collection	4	['N/A', 'N/A', 'N/A', 'N/A']	14	1	5	5	861	0	2801	495

Implementing fuzzers that target the above functions will improve reachability such that it becomes:

Functions statically reachable by fuzzers

388 / 4212

Cyclomatic complexity statically reachable by fuzzers

1582 / 14648

All functions overview

If you implement fuzzers for these functions, the status of all functions in the project will be:

Func name	Functions filename	Args	Function call depth	Reached by Fuzzers	Runtime reached by Fuzzers	Combined reached by Fuzzers	Fuzzers runtime hit	Func lines hit %	I Count	BB Count	Cyclomatic complexity	Functions reached	Reached by functions	Accumulated cyclomatic complexity	Undiscovered complexity

Fuzz engine guidance

This sections provides heuristics that can be used as input to a fuzz engine when running a given fuzz target. The current focus is on providing input that is usable by libFuzzer.

/src/fuzz_fuse.py

Dictionary

Use this with the libFuzzer -dict=DICT.file flag

Fuzzer function priority

Use one of these functions as input to libfuzzer with flag: -focus_function name

-focus_function=['dask.utils.key_split', 'dask.optimization.fuse', 'dask.optimization.default_fused_keys_renamer._enforce_max_key_limit', 'dask.optimization.default_fused_linear_keys_renamer', 'dask.core.keys_in_tasks', 'dask.optimization.fuse_linear']

Runtime coverage analysis

This section shows analysis of runtime coverage data.

For futher technical details on how this section is generated, please see the Glossary .

Complex functions with low coverage

Func name	Function total lines	Lines covered at runtime	percentage covered	Reached by fuzzers
dask.core._toposort	53	0	0.0%	[]
dask.utils._derived_from	36	0	0.0%	[]
yaml.dump	47	22	46.80%	['fuzz_fuse']
data.append	487	223	45.79%	['fuzz_fuse']
dask.delayed.unpack_collections	75	12	16.0%	['fuzz_fuse']
dask.delayed.to_task_dask	34	0	0.0%	[]
dask.base.visualize_dsk	38	0	0.0%	[]
dask.base.visualize_dsk.label	32	0	0.0%	[]
dask.base.get_scheduler	50	0	0.0%	[]
dask.highlevelgraph.to_graphviz	63	0	0.0%	[]
dask._expr.Expr.rewrite	42	0	0.0%	[]
dask._expr.Expr.simplify_once	35	0	0.0%	[]
dask._expr.Expr._substitute	32	0	0.0%	[]
dask._expr.Expr._to_graphviz	39	0	0.0%	[]
dask._task_spec.convert_legacy_task	31	0	0.0%	[]
dask._task_spec.fuse_linear_task_spec	48	0	0.0%	[]
dask.order.order	68	0	0.0%	[]
dask.order.order.add_to_result	37	0	0.0%	[]
dask.order.order.process_runnables	74	0	0.0%	[]
dask.order.order.path_pop	48	0	0.0%	[]
dask.order._connecting_to_roots	55	0	0.0%	[]
dask.order.diagnostics	31	0	0.0%	[]
res.index.set_levels	144	66	45.83%	['fuzz_fuse']
toolz.merge_sorted	105	28	26.66%	['fuzz_fuse']
itertools.cycle	64	19	29.68%	['fuzz_fuse']
packaging.version.parse	175	21	12.0%	['fuzz_fuse']
pickle.load	569	156	27.41%	['fuzz_fuse']
dask.local.start_state_from_dask	52	1	1.923%	['fuzz_fuse']
dask.local.get_async.fire_tasks	44	3	6.818%	['fuzz_fuse']
p.add_layout	188	102	54.25%	['fuzz_fuse']
b.compute	39	14	35.89%	['fuzz_fuse']
constructor.from_dict	480	203	42.29%	['fuzz_fuse']

Files and Directories in report

This section shows which files and directories are considered in this report. The main reason for showing this is fuzz introspector may include more code in the reasoning than is desired. This section helps identify if too many files/directories are included, e.g. third party code, which may be irrelevant for the threat model. In the event too much is included, fuzz introspector supports a configuration file that can exclude data from the report. See the following link for more information on how to create a config file: link

Files in report

Source file	Reached by	Covered by
	[]	[]
dask.dataframe.io.parquet.arrow	[]	[]
dask.array.utils	[]	[]
dask.callbacks	[]	[]
dask.dataframe.io.orc.core	[]	[]
dask.dataframe.dask_expr._rolling	[]	[]
dask.array.lib	[]	[]
dask.dataframe.api	[]	[]
pyarrow	[]	[]
pandas	[]	[]
dask.dataframe.dask_expr.io.parquet	[]	[]
dask.bag.core	[]	[]
dask.bytes	[]	[]
dask.dataframe.io.parquet.core	[]	[]
dask.dataframe.dask_expr._describe	[]	[]
dask.delayed	[]	[]
dask.array.numpy_compat	[]	[]
importlib_metadata	[]	[]
dask.array._array_expr._map_blocks	[]	[]
dask.tokenize	[]	[]
dask.order	[]	[]
dask.array.einsumfuncs	[]	[]
dask.bag	[]	[]
dask.array.reshape	[]	[]
dask.dataframe.dask_expr._indexing	[]	[]
dask.dataframe.dask_expr._interchange	[]	[]
dask.context	[]	[]
dask.bytes.core	[]	[]
itertools	[]	[]
dask.sizeof	[]	[]
dask.typing	[]	[]
datetime	[]	[]
dask.dataframe	[]	[]
dask.dataframe._pyarrow	[]	[]
dask.dataframe.dask_expr	[]	[]
dask.dataframe.io.io	[]	[]
dask.array._array_expr._io	[]	[]
mmh3	[]	[]
dask.array.chunk_types	[]	[]
dask.array._shuffle	[]	[]
decimal	[]	[]
dask.array.backends	[]	[]
io	[]	[]
dask.dataframe.dask_expr._quantile	[]	[]
typing	[]	[]
weakref	[]	[]
dask.bag.utils	[]	[]
dask.system	[]	[]
html	[]	[]
zipfile	[]	[]
dask.array.blockwise	[]	[]
dask.dataframe.dask_expr._cumulative	[]	[]
dask.utils_test	[]	[]
hashlib	[]	[]
dask.dataframe.dask_expr.io.records	[]	[]
types	[]	[]
dask.multiprocessing	[]	[]
h5py	[]	[]
cachey	[]	[]
dask.dataframe.dask_expr._shuffle	[]	[]
dask.array.stats	[]	[]
dask.dataframe.indexing	[]	[]
jinja2	[]	[]
dask.array._array_expr._expr	[]	[]
dask.array.cupy_entry_point	[]	[]
scipy	[]	[]
tlz	[]	[]
dask.dataframe.dask_expr._backends	[]	[]
dask.dataframe.io.parquet.utils	[]	[]
copy	[]	[]
dask.dataframe.io.orc.arrow	[]	[]
dask.array._array_expr._shuffle	[]	[]
dask._dispatch	[]	[]
dask.conftest	[]	[]
dask.bag.random	[]	[]
dask._pandas_compat	[]	[]
dask.array.reductions	[]	[]
tiledb	[]	[]
dask.array.core	[]	[]
yaml	[]	[]
dask.dataframe.io.parquet	[]	[]
shutil	[]	[]
dask.dataframe.core	[]	[]
webbrowser	[]	[]
timeit	[]	[]
dask.__main__	[]	[]
dask.distributed	[]	[]
dask.dataframe.io.csv	[]	[]
sqlalchemy	[]	[]
dask.dataframe._pyarrow_compat	[]	[]
crick	[]	[]
cupy	[]	[]
dask.dataframe.dask_expr._dummies	[]	[]
importlib	[]	[]
inspect	[]	[]
dask.array._array_expr._collection	[]	[]
dask.array._array_expr._backends	[]	[]
dask.dataframe.accessor	[]	[]
time	[]	[]
dask.dataframe.hyperloglog	[]	[]
dask.array	[]	[]
dask.dataframe.dask_expr._quantiles	[]	[]
dask.datasets	[]	[]
typing_extensions	[]	[]
dask.bag.chunk	[]	[]
dask.ml	[]	[]
dask.diagnostics.profile	[]	[]
functools	[]	[]
pathlib	[]	[]
dask.dataframe.dask_expr._collection	[]	[]
queue	[]	[]
re	[]	[]
dask.dataframe.extensions	[]	[]
dask.threaded	[]	[]
click	[]	[]
dask.backends	[]	[]
dask.base	[]	[]
partd	[]	[]
pytest	[]	[]
pickle	[]	[]
dask._task_spec	[]	[]
dask.local	[]	[]
copyreg	[]	[]
dask.array.api	[]	[]
dask.dataframe.dask_expr.diagnostics._analyze_plugin	[]	[]
dask.layers	[]	[]
dask.blockwise	[]	[]
sys	[]	[]
dask.array._reductions_generic	[]	[]
math	[]	[]
dask.dot	[]	[]
dask.array._array_expr._rechunk	[]	[]
zarr	[]	[]
multiprocessing	[]	[]
binascii	[]	[]
dask.dataframe.multi	[]	[]
atheris	[]	[]
tempfile	[]	[]
...fuzz_fuse	['fuzz_fuse']	[]
dask.array.routines	[]	[]
dask.array.tiledb_io	[]	[]
dask.array._array_expr._reductions	[]	[]
dask.widgets	[]	[]
dask.dataframe.partitionquantiles	[]	[]
dask.rewrite	[]	[]
dask.array.image	[]	[]
dask	[]	[]
dask.array.rechunk	[]	[]
dask.array.lib.stride_tricks	[]	[]
dask._expr	[]	[]
dask.array.random	[]	[]
dask.cli	[]	[]
dask_cudf	[]	[]
dask.task_spec	[]	[]
psutil	[]	[]
dask.diagnostics.profile_visualize	[]	[]
random	[]	[]
dask.array.overlap	[]	[]
dask.dataframe.dask_expr._merge_asof	[]	[]
dask.array.chunk	[]	[]
dask.highlevelgraph	[]	[]
bisect	[]	[]
cupyx	[]	[]
dask.dataframe.dask_expr.io	[]	[]
dask.dataframe.methods	[]	[]
dask.utils	['fuzz_fuse']	[]
threading	[]	[]
toolz	[]	[]
dask._collections	[]	[]
uuid	[]	[]
dask.dataframe.dask_expr._merge	[]	[]
dask.dataframe.dask_expr._repartition	[]	[]
dask.dataframe.io.hdf	[]	[]
dask.config	['fuzz_fuse']	[]
dask.diagnostics.progress	[]	[]
dask.dataframe.dask_expr.diagnostics._analyze	[]	[]
os	[]	[]
dask.dataframe.dask_expr._groupby	[]	[]
dask.array.fft	[]	[]
dask.dataframe.dask_expr._datetime	[]	[]
dask.graph_manipulation	[]	[]
dask.dataframe.dask_expr._expr	[]	[]
json	[]	[]
cloudpickle	[]	[]
collections	[]	[]
operator	[]	[]
dask.dataframe.io.sql	[]	[]
mimesis	[]	[]
dask.array._array_expr.random	[]	[]
fastavro	[]	[]
matplotlib	[]	[]
dask.bag.text	[]	[]
dask.dataframe.io	[]	[]
IPython	[]	[]
dask.dataframe.dask_expr.datasets	[]	[]
dask.dataframe.io.json	[]	[]
dask.dataframe.dispatch	[]	[]
dask.dataframe._compat	[]	[]
dask.dataframe.dask_expr.diagnostics	[]	[]
dask.array._array_expr._slicing	[]	[]
dask.diagnostics	[]	[]
dask.optimization	['fuzz_fuse']	[]
dask.dataframe.tseries.resample	[]	[]
dask.array.wrap	[]	[]
logging	[]	[]
dask.dataframe.rolling	[]	[]
dask.dataframe.tseries	[]	[]
dask.dataframe.io.utils	[]	[]
dask.dataframe._dtypes	[]	[]
dask.widgets.widgets	[]	[]
contextvars	[]	[]
dask.array.ma	[]	[]
dask.bag.avro	[]	[]
dask.dataframe.utils	[]	[]
dask.array._array_expr._blockwise	[]	[]
dask.dataframe.groupby	[]	[]
dask.array.ufunc	[]	[]
dask.array._array_expr._utils	[]	[]
bokeh	[]	[]
dask.array.svg	[]	[]
dask.array._array_expr._creation	[]	[]
dask.dataframe.categorical	[]	[]
dask.array.gufunc	[]	[]
dask.array._array_expr._ufunc	[]	[]
numpy	[]	[]
ipywidgets	[]	[]
dask.array.creation	[]	[]
heapq	[]	[]
dask.array._array_expr._gufunc	[]	[]
platform	[]	[]
cityhash	[]	[]
dask.dataframe.dask_expr._concat	[]	[]
base64	[]	[]
textwrap	[]	[]
dask.dataframe.dask_expr._accessor	[]	[]
glob	[]	[]
codecs	[]	[]
distributed	[]	[]
dask.dataframe.dask_expr.diagnostics._explain	[]	[]
bottleneck	[]	[]
dask.array.optimization	[]	[]
dask.dataframe.dask_expr._util	[]	[]
dask.array.linalg	[]	[]
dask.dataframe.dask_expr.io.bag	[]	[]
dask.cache	[]	[]
dask.hashing	[]	[]
dask.dataframe.dask_expr.io._delayed	[]	[]
dask.dataframe.dask_expr._reductions	[]	[]
dask.dataframe.dask_expr._categorical	[]	[]
dask.array.percentile	[]	[]
dataclasses	[]	[]
dask.dataframe.dask_expr._str_accessor	[]	[]
dask._compatibility	[]	[]
dask.array.slicing	[]	[]
builtins	[]	[]
dask.dataframe.io.demo	[]	[]
	[]	[]
fsspec	[]	[]
packaging	[]	[]
dask.dataframe.backends	[]	[]
dask.dataframe.dask_expr.io.io	[]	[]
dask.dataframe.shuffle	[]	[]
dask.array.dispatch	[]	[]
dask.dataframe.io.orc	[]	[]
dask.bytes.utils	[]	[]
dask.dataframe.io.orc.utils	[]	[]
dask._version	[]	[]
dask.core	['fuzz_fuse']	[]
warnings	[]	[]
atexit	[]	[]
contextlib	[]	[]
traceback	[]	[]
concurrent	[]	[]
dask.array._array_expr	[]	[]
xxhash	[]	[]
fnmatch	[]	[]
dask.array._array_expr._overlap	[]	[]
gc	[]	[]
ast	[]	[]

Directories in report

Directory

Metadata section

This sections shows the raw data that is used to produce this report. This is mainly used for further processing and developer debugging.

Fuzzer Calltree file Program data file Coverage file

fuzz_fuse fuzzerLogFile-fuzz_fuse.data fuzzerLogFile-fuzz_fuse.data.yaml all_cov.json

Table of contents

Project overview: dask

High level conclusions

Reachability and coverage overview

Fuzzers overview

Project functions overview

Fuzzer details

Fuzzer: fuzz_fuse

Call tree

Fuzz blockers

Runtime coverage analysis

Files reached

Analyses and suggestions

Optimal target analysis

Remaining optimal interesting functions

All functions overview

Fuzz engine guidance

/src/fuzz_fuse.py

Dictionary

Fuzzer function priority

Runtime coverage analysis

Complex functions with low coverage

Files and Directories in report

Files in report

Directories in report

Metadata section