Fuzz introspector
For issues and ideas: https://github.com/ossf/fuzz-introspector/issues

Project functions overview

The following table shows data about each function in the project. The functions included in this table correspond to all functions that exist in the executables of the fuzzers. As such, there may be functions that are from third-party libraries.

For further technical details on the meaning of columns in the below table, please see the Glossary .

Func name Functions filename Args Function call depth Reached by Fuzzers Runtime reached by Fuzzers Combined reached by Fuzzers Fuzzers runtime hit Func lines hit % I Count BB Count Cyclomatic complexity Functions reached Reached by functions Accumulated cyclomatic complexity Undiscovered complexity

Fuzzer details

Fuzzer: fuzz_fuse

Call tree

The calltree shows the control flow of the fuzzer. This is overlaid with coverage information to display how much of the potential code a fuzzer can reach is in fact covered at runtime. In the following there is a link to a detailed calltree visualisation as well as a bitmap showing a high-level view of the calltree. For further information about these topics please see the glossary for full calltree and calltree overview

Call tree overview bitmap:

The distribution of callsites in terms of coloring is
Color Runtime hitcount Callsite count Percentage
red 0 131 43.2%
gold [1:9] 0 0.0%
yellow [10:29] 0 0.0%
greenyellow [30:49] 0 0.0%
lawngreen 50+ 172 56.7%
All colors 303 100

Fuzz blockers

The following nodes represent call sites where fuzz blockers occur.

Amount of callsites blocked Calltree index Parent function Callsite Largest blocked function
61 110 dask.utils.key_split call site: 00110 dask.utils.key_split
18 85 dask.optimization.fuse call site: 00085 dask.core.subs
10 246 dask.optimization.default_fused_keys_renamer._enforce_max_key_limit call site: 00246 dask.utils.key_split
6 22 dask.optimization.fuse call site: 00022 dask.core.flatten
6 56 dask.optimization.fuse call site: 00056 .len
4 285 dask.optimization.default_fused_linear_keys_renamer call site: 00285 dask.utils.key_split
3 40 dask.core.keys_in_tasks call site: 00040 work.extend
3 49 dask.core.keys_in_tasks call site: 00049 dask.core.get_dependencies
3 233 dask.utils.key_split call site: 00233 word.isalpha
3 258 dask.optimization.fuse_linear call site: 00258 dask.core.flatten
2 33 dask.optimization.fuse call site: 00033 dask.config.get
2 75 dask.core.subs call site: 00075 dask.core.subs

Runtime coverage analysis

Covered functions
202
Functions that are reachable but not covered
184
Reachable functions
197
Percentage of reachable functions covered
6.6%
NB: The sum of covered functions and functions that are reachable but not covered need not be equal to Reachable functions . This is because the reachability analysis is an approximation and thus at runtime some functions may be covered that are not included in the reachability analysis. This is a limitation of our static analysis capabilities.
Warning: The number of covered functions are larger than the number of reachable functions. This means that there are more functions covered at runtime than are extracted using static analysis. This is likely a result of the static analysis component failing to extract the right call graph or the coverage runtime being compiled with sanitizers in code that the static analysis has not analysed. This can happen if lto/gold is not used in all places that coverage instrumentation is used.
Function name source code lines source lines hit percentage hit

Files reached

filename functions hit
/ 1
...fuzz_fuse 12
dask.optimization 52
dask.config 2
dask.core 13
dask.utils 128

Analyses and suggestions

Optimal target analysis

Remaining optimal interesting functions

The following table shows a list of functions that are optimal targets. Optimal targets are identified by finding the functions that in combination, yield a high code coverage.

Func name Functions filename Arg count Args Function depth hitcount instr count bb count cyclomatic complexity Reachable functions Incoming references total cyclomatic complexity Unreached complexity
dask.array.routines.cov dask.array.routines 8 ['N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A'] 27 0 1 21 11 1158 1 3766 3688
dask.array._array_expr._overlap.sliding_window_view dask.array._array_expr._overlap 4 ['N/A', 'N/A', 'N/A', 'N/A'] 27 0 1 5 5 754 0 2400 581
dask.dataframe.dask_expr._collection.to_numeric dask.dataframe.dask_expr._collection 4 ['N/A', 'N/A', 'N/A', 'N/A'] 14 0 1 5 5 861 0 2801 495

Implementing fuzzers that target the above functions will improve reachability such that it becomes:

Functions statically reachable by fuzzers
9.0%
388 / 4212
Cyclomatic complexity statically reachable by fuzzers
11.0%
1582 / 14648

All functions overview

If you implement fuzzers for these functions, the status of all functions in the project will be:

Func name Functions filename Args Function call depth Reached by Fuzzers Runtime reached by Fuzzers Combined reached by Fuzzers Fuzzers runtime hit Func lines hit % I Count BB Count Cyclomatic complexity Functions reached Reached by functions Accumulated cyclomatic complexity Undiscovered complexity

Runtime coverage analysis

This section shows analysis of runtime coverage data.

For futher technical details on how this section is generated, please see the Glossary .

Complex functions with low coverage

Func name Function total lines Lines covered at runtime percentage covered Reached by fuzzers
dask.core._toposort 53 0 0.0% []
dask.utils._derived_from 36 0 0.0% []
yaml.dump 47 22 46.80% ['fuzz_fuse']
data.append 487 223 45.79% ['fuzz_fuse']
dask.delayed.unpack_collections 75 12 16.0% ['fuzz_fuse']
dask.delayed.to_task_dask 34 0 0.0% []
dask.base.visualize_dsk 38 0 0.0% []
dask.base.visualize_dsk.label 32 0 0.0% []
dask.base.get_scheduler 50 0 0.0% []
dask.highlevelgraph.to_graphviz 63 0 0.0% []
dask._expr.Expr.rewrite 42 0 0.0% []
dask._expr.Expr.simplify_once 35 0 0.0% []
dask._expr.Expr._substitute 32 0 0.0% []
dask._expr.Expr._to_graphviz 39 0 0.0% []
dask._task_spec.convert_legacy_task 31 0 0.0% []
dask._task_spec.fuse_linear_task_spec 48 0 0.0% []
dask.order.order 68 0 0.0% []
dask.order.order.add_to_result 37 0 0.0% []
dask.order.order.process_runnables 74 0 0.0% []
dask.order.order.path_pop 48 0 0.0% []
dask.order._connecting_to_roots 55 0 0.0% []
dask.order.diagnostics 31 0 0.0% []
res.index.set_levels 144 66 45.83% ['fuzz_fuse']
toolz.merge_sorted 105 28 26.66% ['fuzz_fuse']
itertools.cycle 64 19 29.68% ['fuzz_fuse']
packaging.version.parse 175 21 12.0% ['fuzz_fuse']
pickle.load 569 156 27.41% ['fuzz_fuse']
dask.local.start_state_from_dask 52 1 1.923% ['fuzz_fuse']
dask.local.get_async.fire_tasks 44 3 6.818% ['fuzz_fuse']
p.add_layout 188 102 54.25% ['fuzz_fuse']
b.compute 39 14 35.89% ['fuzz_fuse']
constructor.from_dict 480 203 42.29% ['fuzz_fuse']

Files and Directories in report

This section shows which files and directories are considered in this report. The main reason for showing this is fuzz introspector may include more code in the reasoning than is desired. This section helps identify if too many files/directories are included, e.g. third party code, which may be irrelevant for the threat model. In the event too much is included, fuzz introspector supports a configuration file that can exclude data from the report. See the following link for more information on how to create a config file: link

Files in report

Source file Reached by Covered by
[] []
dask.dataframe.io.parquet.arrow [] []
dask.array.utils [] []
dask.callbacks [] []
dask.dataframe.io.orc.core [] []
dask.dataframe.dask_expr._rolling [] []
dask.array.lib [] []
dask.dataframe.api [] []
pyarrow [] []
pandas [] []
dask.dataframe.dask_expr.io.parquet [] []
dask.bag.core [] []
dask.bytes [] []
dask.dataframe.io.parquet.core [] []
dask.dataframe.dask_expr._describe [] []
dask.delayed [] []
dask.array.numpy_compat [] []
importlib_metadata [] []
dask.array._array_expr._map_blocks [] []
dask.tokenize [] []
dask.order [] []
dask.array.einsumfuncs [] []
dask.bag [] []
dask.array.reshape [] []
dask.dataframe.dask_expr._indexing [] []
dask.dataframe.dask_expr._interchange [] []
dask.context [] []
dask.bytes.core [] []
itertools [] []
dask.sizeof [] []
dask.typing [] []
datetime [] []
dask.dataframe [] []
dask.dataframe._pyarrow [] []
dask.dataframe.dask_expr [] []
dask.dataframe.io.io [] []
dask.array._array_expr._io [] []
mmh3 [] []
dask.array.chunk_types [] []
dask.array._shuffle [] []
decimal [] []
dask.array.backends [] []
io [] []
dask.dataframe.dask_expr._quantile [] []
typing [] []
weakref [] []
dask.bag.utils [] []
dask.system [] []
html [] []
zipfile [] []
dask.array.blockwise [] []
dask.dataframe.dask_expr._cumulative [] []
dask.utils_test [] []
hashlib [] []
dask.dataframe.dask_expr.io.records [] []
types [] []
dask.multiprocessing [] []
h5py [] []
cachey [] []
dask.dataframe.dask_expr._shuffle [] []
dask.array.stats [] []
dask.dataframe.indexing [] []
jinja2 [] []
dask.array._array_expr._expr [] []
dask.array.cupy_entry_point [] []
scipy [] []
tlz [] []
dask.dataframe.dask_expr._backends [] []
dask.dataframe.io.parquet.utils [] []
copy [] []
dask.dataframe.io.orc.arrow [] []
dask.array._array_expr._shuffle [] []
dask._dispatch [] []
dask.conftest [] []
dask.bag.random [] []
dask._pandas_compat [] []
dask.array.reductions [] []
tiledb [] []
dask.array.core [] []
yaml [] []
dask.dataframe.io.parquet [] []
shutil [] []
dask.dataframe.core [] []
webbrowser [] []
timeit [] []
dask.__main__ [] []
dask.distributed [] []
dask.dataframe.io.csv [] []
sqlalchemy [] []
dask.dataframe._pyarrow_compat [] []
crick [] []
cupy [] []
dask.dataframe.dask_expr._dummies [] []
importlib [] []
inspect [] []
dask.array._array_expr._collection [] []
dask.array._array_expr._backends [] []
dask.dataframe.accessor [] []
time [] []
dask.dataframe.hyperloglog [] []
dask.array [] []
dask.dataframe.dask_expr._quantiles [] []
dask.datasets [] []
typing_extensions [] []
dask.bag.chunk [] []
dask.ml [] []
dask.diagnostics.profile [] []
functools [] []
pathlib [] []
dask.dataframe.dask_expr._collection [] []
queue [] []
re [] []
dask.dataframe.extensions [] []
dask.threaded [] []
click [] []
dask.backends [] []
dask.base [] []
partd [] []
pytest [] []
pickle [] []
dask._task_spec [] []
dask.local [] []
copyreg [] []
dask.array.api [] []
dask.dataframe.dask_expr.diagnostics._analyze_plugin [] []
dask.layers [] []
dask.blockwise [] []
sys [] []
dask.array._reductions_generic [] []
math [] []
dask.dot [] []
dask.array._array_expr._rechunk [] []
zarr [] []
multiprocessing [] []
binascii [] []
dask.dataframe.multi [] []
atheris [] []
tempfile [] []
...fuzz_fuse ['fuzz_fuse'] []
dask.array.routines [] []
dask.array.tiledb_io [] []
dask.array._array_expr._reductions [] []
dask.widgets [] []
dask.dataframe.partitionquantiles [] []
dask.rewrite [] []
dask.array.image [] []
dask [] []
dask.array.rechunk [] []
dask.array.lib.stride_tricks [] []
dask._expr [] []
dask.array.random [] []
dask.cli [] []
dask_cudf [] []
dask.task_spec [] []
psutil [] []
dask.diagnostics.profile_visualize [] []
random [] []
dask.array.overlap [] []
dask.dataframe.dask_expr._merge_asof [] []
dask.array.chunk [] []
dask.highlevelgraph [] []
bisect [] []
cupyx [] []
dask.dataframe.dask_expr.io [] []
dask.dataframe.methods [] []
dask.utils ['fuzz_fuse'] []
threading [] []
toolz [] []
dask._collections [] []
uuid [] []
dask.dataframe.dask_expr._merge [] []
dask.dataframe.dask_expr._repartition [] []
dask.dataframe.io.hdf [] []
dask.config ['fuzz_fuse'] []
dask.diagnostics.progress [] []
dask.dataframe.dask_expr.diagnostics._analyze [] []
os [] []
dask.dataframe.dask_expr._groupby [] []
dask.array.fft [] []
dask.dataframe.dask_expr._datetime [] []
dask.graph_manipulation [] []
dask.dataframe.dask_expr._expr [] []
json [] []
cloudpickle [] []
collections [] []
operator [] []
dask.dataframe.io.sql [] []
mimesis [] []
dask.array._array_expr.random [] []
fastavro [] []
matplotlib [] []
dask.bag.text [] []
dask.dataframe.io [] []
IPython [] []
dask.dataframe.dask_expr.datasets [] []
dask.dataframe.io.json [] []
dask.dataframe.dispatch [] []
dask.dataframe._compat [] []
dask.dataframe.dask_expr.diagnostics [] []
dask.array._array_expr._slicing [] []
dask.diagnostics [] []
dask.optimization ['fuzz_fuse'] []
dask.dataframe.tseries.resample [] []
dask.array.wrap [] []
logging [] []
dask.dataframe.rolling [] []
dask.dataframe.tseries [] []
dask.dataframe.io.utils [] []
dask.dataframe._dtypes [] []
dask.widgets.widgets [] []
contextvars [] []
dask.array.ma [] []
dask.bag.avro [] []
dask.dataframe.utils [] []
dask.array._array_expr._blockwise [] []
dask.dataframe.groupby [] []
dask.array.ufunc [] []
dask.array._array_expr._utils [] []
bokeh [] []
dask.array.svg [] []
dask.array._array_expr._creation [] []
dask.dataframe.categorical [] []
dask.array.gufunc [] []
dask.array._array_expr._ufunc [] []
numpy [] []
ipywidgets [] []
dask.array.creation [] []
heapq [] []
dask.array._array_expr._gufunc [] []
platform [] []
cityhash [] []
dask.dataframe.dask_expr._concat [] []
base64 [] []
textwrap [] []
dask.dataframe.dask_expr._accessor [] []
glob [] []
codecs [] []
distributed [] []
dask.dataframe.dask_expr.diagnostics._explain [] []
bottleneck [] []
dask.array.optimization [] []
dask.dataframe.dask_expr._util [] []
dask.array.linalg [] []
dask.dataframe.dask_expr.io.bag [] []
dask.cache [] []
dask.hashing [] []
dask.dataframe.dask_expr.io._delayed [] []
dask.dataframe.dask_expr._reductions [] []
dask.dataframe.dask_expr._categorical [] []
dask.array.percentile [] []
dataclasses [] []
dask.dataframe.dask_expr._str_accessor [] []
dask._compatibility [] []
dask.array.slicing [] []
builtins [] []
dask.dataframe.io.demo [] []
[] []
fsspec [] []
packaging [] []
dask.dataframe.backends [] []
dask.dataframe.dask_expr.io.io [] []
dask.dataframe.shuffle [] []
dask.array.dispatch [] []
dask.dataframe.io.orc [] []
dask.bytes.utils [] []
dask.dataframe.io.orc.utils [] []
dask._version [] []
dask.core ['fuzz_fuse'] []
warnings [] []
atexit [] []
contextlib [] []
traceback [] []
concurrent [] []
dask.array._array_expr [] []
xxhash [] []
fnmatch [] []
dask.array._array_expr._overlap [] []
gc [] []
ast [] []

Directories in report

Directory