FuzzBench: 2025-01-23-path-afl report

experiment summary

We show two different aggregate (cross-benchmark) rankings of fuzzers. The first is based on the average of per-benchmark scores, where the score represents the percentage of the highest reached median code coverage on a given benchmark (higher value is better). The second ranking shows the average rank of fuzzers, after we rank them on each benchmark according to their median reached code coverage (lower value is better).
By avg. score
fuzzer average normalized score
aflplusplus 96.38
libfuzzer 93.16
eclipser 82.34
aflsmart 82.22
aflfast 81.55
fairfuzz 78.96
libafl 75.55
centipede 70.86
By avg. rank
fuzzer average rank
aflplusplus 1.86
libfuzzer 3.29
libafl 3.95
aflsmart 4.43
eclipser 4.62
aflfast 5.62
fairfuzz 5.76
centipede 5.81
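As a rough illustration of how the two rankings above could be derived, the sketch below computes both aggregates from per-benchmark median coverage. The medians are copied from the bloaty_fuzz_target and curl_curl_fuzzer_http sample statistics tables in this report, restricted to three fuzzers. The function names, the averaging of tied ranks, and the skipping of benchmarks a fuzzer does not appear on are assumptions for illustration, not FuzzBench's actual implementation.

```python
from statistics import mean

# Median reached coverage per benchmark and fuzzer (subset of this report).
MEDIANS = {
    "bloaty_fuzz_target": {
        "aflplusplus": 6292.0, "aflsmart": 6104.0, "aflfast": 6076.0,
    },
    "curl_curl_fuzzer_http": {
        "aflplusplus": 10963.0, "aflsmart": 10420.0, "aflfast": 10306.0,
    },
}


def average_normalized_score(medians):
    """Average, over benchmarks, of 100 * median / best median on that benchmark."""
    scores = {}
    for per_fuzzer in medians.values():
        best = max(per_fuzzer.values())
        for fuzzer, coverage in per_fuzzer.items():
            scores.setdefault(fuzzer, []).append(100.0 * coverage / best)
    return {fuzzer: mean(values) for fuzzer, values in scores.items()}


def average_rank(medians):
    """Average, over benchmarks, of the rank by median (1 = highest coverage)."""
    ranks = {}
    for per_fuzzer in medians.values():
        ordered = sorted(per_fuzzer.values(), reverse=True)
        for fuzzer, coverage in per_fuzzer.items():
            first = ordered.index(coverage) + 1
            last = first + ordered.count(coverage) - 1
            # Assumption: tied fuzzers share the average of their positions.
            ranks.setdefault(fuzzer, []).append((first + last) / 2)
    return {fuzzer: mean(values) for fuzzer, values in ranks.items()}


if __name__ == "__main__":
    print(average_normalized_score(MEDIANS))
    print(average_rank(MEDIANS))
```

On this two-benchmark subset aflplusplus gets a score of 100 and a rank of 1. The numbers differ from the tables above because those average over all benchmarks and all fuzzers.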
  • Critical difference diagram
    The diagram visualizes the average rank of fuzzers (second ranking above) while showing the significance of the differences as well. What is considered a "critical difference" (CD) is based on the Friedman/Nemenyi post-hoc test. See more in the documentation.
    Note: If a fuzzer does not support all benchmarks, its ranking as shown in this diagram can be lower than it should be. Check the list of supported benchmarks for the fuzzers you are interested in; the list may be specified in the fuzzer's README.md.
  • Median relative code-coverages on each benchmark

    Note: The relative coverage summary table shows the median relative performance of each fuzzer to the experiment maximum. Thus the highest relative performance may not be 100%.
    trial_relative_coverage = trial_coverage / experiment_max_coverage

      libafl aflplusplus libfuzzer eclipser aflsmart centipede aflfast fairfuzz
    FuzzerMedian 97.50 97.00 95.00 95.00 93.00 89.00 93.00 85.00
    FuzzerMean 95.94 93.19 90.10 88.05 79.86 79.61 79.24 76.67
    bloaty_fuzz_target nan 98.00 90.00 95.00 95.00 nan 95.00 81.00
    curl_curl_fuzzer_http 97.00 98.00 88.00 93.00 93.00 nan 92.00 84.00
    freetype2_ftfuzzer 91.00 92.00 79.00 74.00 67.00 57.00 64.00 63.00
    harfbuzz_hb-shape-fuzzer nan 99.00 95.00 98.00 97.00 nan 97.00 85.00
    jsoncpp_jsoncpp_fuzzer 98.00 99.00 100.00 98.00 98.00 98.00 98.00 98.00
    lcms_cms_transform_fuzzer nan 92.00 95.00 78.00 42.00 51.00 30.00 57.00
    libjpeg-turbo_libjpeg_turbo_fuzzer 99.00 82.00 82.00 nan 82.00 82.00 98.00 99.00
    libpcap_fuzz_both nan 89.00 74.00 72.00 1.00 89.00 1.00 1.00
    libpng_libpng_read_fuzzer 95.00 95.00 96.00 95.00 95.00 96.00 93.00 94.00
    libxml2_xml 99.00 99.00 97.00 97.00 97.00 93.00 96.00 89.00
    libxslt_xpath 96.00 99.00 93.00 95.00 95.00 94.00 94.00 95.00
    mbedtls_fuzz_dtlsclient 88.00 70.00 68.00 68.00 69.00 67.00 66.00 71.00
    openssl_x509 99.00 99.00 99.00 99.00 99.00 99.00 99.00 99.00
    openthread_ot-ip6-send-fuzzer 88.00 76.00 76.00 74.00 71.00 71.00 71.00 68.00
    proj4_proj_crs_to_crs_fuzzer nan 86.00 98.00 63.00 10.00 10.00 9.00 10.00
    re2_fuzzer 98.00 99.00 99.00 99.00 99.00 95.00 99.00 99.00
    sqlite3_ossfuzz 99.00 95.00 78.00 90.00 90.00 62.00 90.00 57.00
    systemd_fuzz-link-parser 98.00 99.00 95.00 91.00 91.00 97.00 91.00 86.00
    vorbis_decode_fuzzer 98.00 99.00 99.00 98.00 98.00 89.00 98.00 97.00
    woff2_convert_woff2ttf_fuzzer 97.00 95.00 92.00 nan 92.00 88.00 89.00 81.00
    zlib_zlib_uncompress_fuzzer 95.00 97.00 99.00 96.00 96.00 95.00 94.00 96.00
    • Fuzzers are sorted by "FuzzerMean" (average median relative coverage), highest on the left.
    • Green background = highest relative median coverage.
    • Blue gradient background = greater than 95% relative median coverage.
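The relative coverage formula above can be checked against the bloaty_fuzz_target row of the table. The sketch below uses the median and max columns from that benchmark's sample statistics table in this report. Dividing by a constant does not change which trial is the median, so the median of trial_relative_coverage equals the median coverage divided by the experiment maximum. The function name is hypothetical, and taking the experiment maximum as the largest value in the max column is an assumption.

```python
# (median, max) of reached coverage per fuzzer, copied from the
# bloaty_fuzz_target sample statistics table in this report.
BLOATY = {
    "aflplusplus": (6292.0, 6340.0),
    "aflsmart": (6104.0, 6381.0),
    "eclipser": (6078.0, 6135.0),
    "aflfast": (6076.0, 6335.0),
    "libfuzzer": (5795.5, 5981.0),
    "fairfuzz": (5170.0, 5323.0),
}


def median_relative_coverage(stats):
    """Return {fuzzer: 100 * median coverage / experiment max coverage}."""
    # Assumption: the experiment maximum is the best trial of any fuzzer.
    experiment_max = max(maximum for _, maximum in stats.values())
    return {
        fuzzer: 100.0 * med / experiment_max
        for fuzzer, (med, _) in stats.items()
    }


if __name__ == "__main__":
    for fuzzer, value in median_relative_coverage(BLOATY).items():
        print(f"{fuzzer} {value:.2f}")
```

Truncating each result to a whole percent reproduces the bloaty_fuzz_target row of the table (98, 95, 95, 95, 90, 81). That the report truncates rather than rounds is an inference from these numbers, not something the report states.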

bloaty_fuzz_target summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: libfuzzer, eclipser.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    aflplusplus 82800 20.0 6271.050000 51.508277 6130.0 6249.00 6292.0 6296.75 6340.0
    aflsmart 82800 20.0 6146.000000 146.262561 5784.0 6070.75 6104.0 6271.00 6381.0
    eclipser 82800 11.0 6053.181818 68.930136 5914.0 6022.50 6078.0 6095.50 6135.0
    aflfast 82800 20.0 6077.150000 116.111233 5846.0 6016.00 6076.0 6140.25 6335.0
    libfuzzer 82800 14.0 5831.857143 105.791418 5627.0 5772.00 5795.5 5929.50 5981.0
    fairfuzz 82800 16.0 5189.937500 67.159977 5111.0 5135.25 5170.0 5226.00 5323.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row
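To make the A12 description above concrete, here is a minimal sketch of the Vargha-Delaney A measure for one fuzzer pair. The per-trial coverage values are made up for illustration (the report only lists summary statistics), and the function name is hypothetical.

```python
def vargha_delaney_a12(row_trials, col_trials):
    """Probability that a row trial beats a column trial; ties count half."""
    greater = sum(1 for x in row_trials for y in col_trials if x > y)
    ties = sum(1 for x in row_trials for y in col_trials if x == y)
    return (greater + 0.5 * ties) / (len(row_trials) * len(col_trials))


if __name__ == "__main__":
    # Hypothetical final coverage of three trials per fuzzer.
    row = [6292, 6249, 6340]
    col = [6104, 6271, 6070]
    print(vargha_delaney_a12(row, col))
```

A value of 0.5 means neither fuzzer tends to win, and swapping the two arguments gives one minus the original value.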

curl_curl_fuzzer_http summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: libfuzzer.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    aflplusplus 82800 19.0 10941.894737 97.934726 10710.0 10879.00 10963.0 10995.00 11117.0
    libafl 82800 17.0 10886.470588 74.619131 10654.0 10855.00 10886.0 10938.00 10972.0
    aflsmart 82800 20.0 10385.000000 132.985951 10018.0 10363.25 10420.0 10455.25 10544.0
    eclipser 82800 17.0 10351.470588 163.547057 9942.0 10353.00 10381.0 10436.00 10545.0
    aflfast 82800 20.0 10300.900000 57.898187 10100.0 10277.00 10306.0 10340.75 10375.0
    libfuzzer 82800 12.0 9895.666667 398.731701 9195.0 9685.50 9859.0 10078.00 10557.0
    fairfuzz 82800 19.0 9222.578947 391.462545 8215.0 8926.50 9379.0 9512.00 9714.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row
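The pairwise Mann-Whitney U test mentioned above can be sketched in plain Python as follows. This version uses the normal approximation with tie and continuity corrections, which is only approximate for samples of 20 trials or fewer; the report's own tooling may use an exact method or a library routine. The trial values and the 0.05 significance level are assumptions, since the report does not state the level it uses.

```python
import math


def mann_whitney_u(xs, ys):
    """Return (U, two-sided p-value) using the normal approximation."""
    n1, n2 = len(xs), len(ys)
    n = n1 + n2
    pooled = sorted(list(xs) + list(ys))
    ranks = {}
    tie_term = 0.0
    i = 0
    while i < n:
        j = i
        while j < n and pooled[j] == pooled[i]:
            j += 1
        # Tied values share the average of the 1-based ranks i+1 .. j.
        ranks[pooled[i]] = (i + 1 + j) / 2.0
        group = j - i
        tie_term += group ** 3 - group
        i = j
    u1 = sum(ranks[x] for x in xs) - n1 * (n1 + 1) / 2.0
    u = min(u1, n1 * n2 - u1)
    mu = n1 * n2 / 2.0
    variance = n1 * n2 / 12.0 * ((n + 1) - tie_term / (n * (n - 1)))
    if variance <= 0:
        return u, 1.0  # all values identical: no evidence of a difference
    z = (u - mu + 0.5) / math.sqrt(variance)  # continuity correction
    p_value = min(1.0, math.erfc(-z / math.sqrt(2.0)))
    return u, p_value


if __name__ == "__main__":
    # Hypothetical final coverage per trial for two fuzzers.
    fuzzer_a = [10963, 10879, 10995, 11117, 10710]
    fuzzer_b = [10420, 10363, 10455, 10544, 10018]
    u, p = mann_whitney_u(fuzzer_a, fuzzer_b)
    print(u, p, "significant" if p < 0.05 else "not significant")
```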

freetype2_ftfuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: centipede.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    aflplusplus 82800 19.0 11384.526316 295.498894 10524.0 11313.50 11451.0 11568.00 11797.0
    libafl 82800 19.0 11507.947368 444.269872 10571.0 11271.50 11427.0 11668.00 12428.0
    libfuzzer 82800 16.0 9665.687500 542.188125 8532.0 9416.00 9821.0 9955.75 10520.0
    eclipser 82800 18.0 9312.222222 98.567717 9066.0 9283.00 9317.0 9355.50 9516.0
    aflsmart 82800 20.0 8342.750000 174.588501 7870.0 8314.75 8393.0 8435.50 8535.0
    aflfast 82800 20.0 8059.000000 238.403903 7700.0 7852.50 8061.5 8289.00 8362.0
    fairfuzz 82800 17.0 7936.117647 236.373349 7764.0 7809.00 7863.0 7895.00 8582.0
    centipede 82800 14.0 7190.000000 120.755315 7001.0 7121.50 7192.0 7230.50 7434.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row
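The pairwise unique coverage table described above amounts to a set difference per cell. A minimal sketch, assuming each fuzzer's covered branches are available as a set of branch identifiers (the identifiers and the function name here are made up):

```python
def pairwise_unique_coverage(covered):
    """Return table[row][col] = branches covered by col but not by row."""
    return {
        row: {col: len(covered[col] - covered[row]) for col in covered}
        for row in covered
    }


if __name__ == "__main__":
    # Hypothetical branch ids covered by each fuzzer.
    covered = {
        "fuzzer_a": {1, 2, 3},
        "fuzzer_b": {2, 3, 4, 5},
    }
    table = pairwise_unique_coverage(covered)
    print(table["fuzzer_a"]["fuzzer_b"])  # branches only fuzzer_b reached
```

The diagonal is always zero, and the table is generally not symmetric.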

harfbuzz_hb-shape-fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: fairfuzz, libfuzzer.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    aflplusplus 82800 20.0 10918.400000 48.857904 10813.0 10896.25 10923.0 10956.25 10989.0
    eclipser 82800 20.0 10769.400000 51.530676 10623.0 10744.25 10777.0 10789.50 10868.0
    aflsmart 82800 20.0 10763.700000 42.521945 10652.0 10753.00 10764.5 10791.00 10839.0
    aflfast 82800 20.0 10677.750000 59.487172 10566.0 10633.00 10675.5 10714.50 10791.0
    libfuzzer 82800 14.0 10516.214286 64.768438 10370.0 10500.25 10521.0 10560.75 10599.0
    fairfuzz 82800 14.0 9509.571429 347.646530 9071.0 9259.50 9399.0 9715.75 10155.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row
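The columns of the sample statistics table above (count, mean, std, min, 25%, median, 75%, max) can be reproduced from per-trial coverage values with the standard library. This is a sketch under assumptions: the trial values are made up, std is taken to be the sample standard deviation (n - 1 in the denominator), and quartiles use linear interpolation, which matches the quarter-step values such as 10896.25 seen in the table but is not confirmed by the report.

```python
import statistics


def describe(samples):
    """Summary statistics for the final coverage of one fuzzer's trials."""
    q1, med, q3 = statistics.quantiles(samples, n=4, method="inclusive")
    return {
        "count": float(len(samples)),
        "mean": statistics.mean(samples),
        "std": statistics.stdev(samples),  # assumption: sample std (n - 1)
        "min": min(samples),
        "25%": q1,
        "median": med,
        "75%": q3,
        "max": max(samples),
    }


if __name__ == "__main__":
    # Hypothetical coverage reached by five trials of one fuzzer.
    print(describe([10, 20, 30, 40, 50]))
```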

jsoncpp_jsoncpp_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: libfuzzer, centipede.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libfuzzer 82800 15.0 525.000000 0.000000 525.0 525.00 525.0 525.00 525.0
    aflplusplus 82800 19.0 519.842105 0.374634 519.0 520.00 520.0 520.00 520.0
    centipede 82800 12.0 520.166667 2.167249 518.0 518.00 519.5 522.00 524.0
    eclipser 82800 20.0 516.200000 5.176872 505.0 516.50 518.0 520.00 520.0
    aflfast 82800 20.0 517.000000 3.670652 502.0 517.00 517.0 519.00 519.0
    aflsmart 82800 19.0 516.105263 4.689169 503.0 517.00 517.0 519.00 520.0
    fairfuzz 82800 16.0 516.437500 2.988171 509.0 516.75 517.0 518.25 520.0
    libafl 82800 19.0 517.421053 1.070607 516.0 517.00 517.0 517.50 520.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row
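For a sense of how wide the 95% error bands in the growth plots are at the final snapshot, the sketch below derives an interval from the count, mean and std columns of the eclipser row in the table above. It uses the normal approximation (mean plus or minus 1.96 standard errors), which is an assumption; the plotting library behind the report may compute its bands differently, for example by bootstrapping.

```python
import math


def mean_confidence_interval(count, mean, std, z=1.96):
    """Normal-approximation confidence interval around a sample mean."""
    half_width = z * std / math.sqrt(count)
    return mean - half_width, mean + half_width


if __name__ == "__main__":
    # eclipser on jsoncpp_jsoncpp_fuzzer at time 82800 (from the table above).
    low, high = mean_confidence_interval(count=20, mean=516.2, std=5.176872)
    print(f"{low:.2f} .. {high:.2f}")
```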

lcms_cms_transform_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: libfuzzer, centipede.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libfuzzer 82800 10.0 1990.500000 104.638266 1773.0 1998.0 2020.0 2051.75 2096.0
    aflplusplus 82800 17.0 1888.588235 222.719290 1460.0 1641.0 1976.0 2033.00 2125.0
    eclipser 82800 17.0 1656.823529 137.802955 1398.0 1537.0 1671.0 1753.00 1893.0
    fairfuzz 82800 18.0 1274.277778 397.407600 800.0 901.0 1221.0 1636.25 1940.0
    centipede 82800 8.0 1077.750000 299.600567 777.0 789.0 1088.5 1361.00 1370.0
    aflsmart 82800 20.0 1074.750000 469.254714 650.0 652.0 900.0 1561.00 1791.0
    aflfast 82800 20.0 655.900000 152.566086 476.0 629.5 640.5 644.25 1276.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row
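The "not enough samples" message for this benchmark can be related to the count column of its sample statistics table. In the tables of this report, every flagged fuzzer has a count below 16 and every unflagged fuzzer has 16 or more, which is 80% of the largest count (20). The sketch below encodes that observation; the threshold, the assumed 20 trials per fuzzer, and the function name are inferences from the data shown here, not a documented FuzzBench rule.

```python
# Trial counts at time 82800, copied from the lcms_cms_transform_fuzzer table.
LCMS_COUNTS = {
    "libfuzzer": 10,
    "aflplusplus": 17,
    "eclipser": 17,
    "fairfuzz": 18,
    "centipede": 8,
    "aflsmart": 20,
    "aflfast": 20,
}


def fuzzers_lacking_samples(counts, max_trials=20, min_ratio=0.8):
    """List fuzzers whose trial count falls below min_ratio * max_trials."""
    # Assumption: both defaults are inferred from this report's tables.
    threshold = max_trials * min_ratio
    return [fuzzer for fuzzer, count in counts.items() if count < threshold]


if __name__ == "__main__":
    print(fuzzers_lacking_samples(LCMS_COUNTS))
```

With the counts above this flags libfuzzer and centipede, the same two fuzzers named in the message for this benchmark.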

libjpeg-turbo_libjpeg_turbo_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: libafl, fairfuzz, libfuzzer, centipede.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    fairfuzz 82800 13.0 3072.615385 17.399897 3017.0 3072.0 3079.0 3080.0 3084.0
    libafl 82800 15.0 3079.066667 1.279881 3078.0 3078.0 3079.0 3080.0 3082.0
    aflfast 82800 20.0 3047.050000 30.163545 3007.0 3015.0 3053.5 3078.0 3081.0
    libfuzzer 82800 13.0 2549.461538 1.983910 2546.0 2550.0 2550.0 2550.0 2553.0
    aflplusplus 82800 19.0 2547.631579 2.165654 2544.0 2546.5 2548.0 2548.5 2552.0
    centipede 82800 7.0 2545.000000 2.708013 2541.0 2543.5 2546.0 2546.0 2549.0
    aflsmart 82800 20.0 2544.750000 1.773341 2543.0 2543.0 2545.0 2546.0 2549.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row
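The "ranking by unique code branches covered" plot described above needs two numbers per fuzzer: the total branches it covered, and how many of those no other fuzzer covered. A minimal sketch, assuming covered branches are available as sets of identifiers (the data and the function name are made up for illustration):

```python
def unique_branch_counts(covered):
    """Return {fuzzer: (total branches, branches no other fuzzer covered)}."""
    result = {}
    for fuzzer, branches in covered.items():
        others = set().union(
            *(b for name, b in covered.items() if name != fuzzer)
        )
        result[fuzzer] = (len(branches), len(branches - others))
    return result


if __name__ == "__main__":
    # Hypothetical branch ids covered by each fuzzer.
    covered = {
        "fuzzer_a": {1, 2, 3},
        "fuzzer_b": {3, 4},
        "fuzzer_c": {3, 5, 6},
    }
    print(unique_branch_counts(covered))
```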

libpcap_fuzz_both summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: centipede, libfuzzer, aflfast.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    centipede 82800 11.0 2759.090909 890.255183 101.0 2904.0 3019.0 3076.5 3227.0
    aflplusplus 82800 19.0 2999.842105 144.728582 2719.0 2889.5 2998.0 3081.0 3357.0
    libfuzzer 82800 8.0 2518.750000 61.904881 2454.0 2468.0 2514.0 2560.5 2618.0
    eclipser 82800 19.0 2461.052632 196.678947 1977.0 2403.0 2446.0 2609.0 2706.0
    aflfast 82800 6.0 38.500000 4.929503 34.0 34.0 38.5 43.0 43.0
    aflsmart 82800 20.0 34.000000 0.000000 34.0 34.0 34.0 34.0 34.0
    fairfuzz 82800 16.0 37.375000 4.500000 34.0 34.0 34.0 43.0 43.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libpng_libpng_read_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: libfuzzer.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libfuzzer 82800 10.0 2017.900000 1.100505 2015.0 2018.00 2018.0 2018.00 2019.0
    centipede 82800 17.0 2013.529412 3.710082 2006.0 2013.00 2015.0 2016.00 2018.0
    aflplusplus 82800 18.0 2007.777778 20.377483 1999.0 2002.00 2003.0 2005.00 2089.0
    libafl 82800 17.0 1997.705882 24.881632 1973.0 1980.00 1999.0 2001.00 2084.0
    eclipser 82800 18.0 1983.000000 27.563830 1900.0 1988.25 1993.5 1996.25 1999.0
    aflsmart 82800 20.0 1964.400000 41.639303 1888.0 1926.50 1992.5 1995.25 1998.0
    fairfuzz 82800 18.0 1972.166667 30.020091 1889.0 1955.75 1981.0 1996.00 1999.0
    aflfast 82800 20.0 1940.050000 40.045336 1856.0 1926.25 1946.5 1973.50 1987.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libxml2_xml summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: centipede.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    aflplusplus 82800 18.0 15700.888889 46.247275 15631.0 15656.25 15703.0 15734.50 15796.0
    libafl 82800 16.0 15644.500000 40.073266 15569.0 15616.50 15649.5 15673.75 15701.0
    libfuzzer 82800 16.0 15397.812500 96.679001 15221.0 15306.75 15418.5 15457.00 15554.0
    eclipser 82800 17.0 15366.176471 70.190130 15247.0 15310.00 15373.0 15407.00 15477.0
    aflsmart 82800 20.0 15356.900000 64.464515 15142.0 15341.50 15371.0 15390.50 15431.0
    aflfast 82800 20.0 15282.950000 77.642551 15097.0 15249.50 15282.5 15340.75 15388.0
    centipede 82800 11.0 14674.909091 108.888433 14450.0 14612.00 14706.0 14761.00 14787.0
    fairfuzz 82800 19.0 14034.842105 437.228934 12545.0 13923.00 14065.0 14210.00 14843.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libxslt_xpath summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: libfuzzer, centipede.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    aflplusplus 82800 19.0 11235.526316 84.404692 11112.0 11158.00 11252.0 11315.00 11358.0
    libafl 82800 19.0 11017.473684 54.520504 10925.0 10972.50 11011.0 11051.00 11106.0
    fairfuzz 82800 17.0 10804.470588 200.035346 10162.0 10772.00 10855.0 10906.00 11044.0
    aflsmart 82800 20.0 10844.850000 43.367797 10793.0 10816.50 10826.5 10859.25 10942.0
    eclipser 82800 18.0 10767.611111 123.770252 10424.0 10773.25 10801.5 10834.00 10888.0
    centipede 82800 12.0 10707.000000 103.919549 10550.0 10641.75 10686.0 10793.00 10863.0
    aflfast 82800 20.0 10631.400000 116.451662 10368.0 10570.25 10677.0 10705.50 10796.0
    libfuzzer 82800 13.0 10432.769231 392.426247 9327.0 10417.00 10565.0 10685.00 10828.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

mbedtls_fuzz_dtlsclient summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: fairfuzz, centipede, libfuzzer.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 82800 17.0 3318.235294 399.724675 2714.0 2769.00 3481.0 3610.00 3811.0
    fairfuzz 82800 15.0 2849.466667 159.218119 2754.0 2779.00 2801.0 2807.50 3296.0
    aflplusplus 82800 19.0 2862.736842 342.392212 2721.0 2731.00 2753.0 2774.00 3912.0
    aflsmart 82800 20.0 2709.000000 26.942434 2665.0 2701.75 2705.5 2713.00 2792.0
    eclipser 82800 16.0 2726.562500 280.074030 2496.0 2674.00 2692.5 2715.00 3730.0
    libfuzzer 82800 14.0 2683.142857 17.680311 2651.0 2672.50 2680.0 2689.75 2719.0
    centipede 82800 14.0 2692.142857 193.889285 2610.0 2629.00 2640.5 2652.75 3363.0
    aflfast 82800 20.0 2561.850000 115.700737 2312.0 2580.00 2611.0 2629.75 2658.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

openssl_x509 summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: eclipser, centipede.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    aflplusplus 82800 18.0 5834.777778 1.114374 5832.0 5835.00 5835.0 5835.0 5836.0
    libafl 82800 17.0 5828.823529 3.695188 5820.0 5830.00 5830.0 5831.0 5832.0
    libfuzzer 82800 16.0 5826.500000 6.480741 5816.0 5820.00 5829.5 5832.0 5835.0
    aflsmart 82800 20.0 5826.300000 5.582869 5808.0 5827.00 5828.0 5829.0 5831.0
    eclipser 82800 15.0 5825.266667 4.096456 5817.0 5823.50 5827.0 5828.0 5831.0
    aflfast 82800 20.0 5822.100000 7.496666 5797.0 5820.00 5823.5 5826.5 5830.0
    centipede 82800 12.0 5821.833333 4.895886 5809.0 5821.25 5823.5 5824.0 5829.0
    fairfuzz 82800 16.0 5821.125000 2.753785 5816.0 5819.50 5822.0 5823.0 5825.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

openthread_ot-ip6-send-fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: centipede.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 82800 16.0 3608.250000 237.543400 3044.0 3547.50 3564.5 3637.00 4049.0
    aflplusplus 82800 18.0 3224.166667 206.396606 3042.0 3064.50 3087.5 3421.00 3574.0
    libfuzzer 82800 16.0 3083.625000 23.570815 3038.0 3074.50 3078.0 3091.25 3154.0
    eclipser 82800 16.0 2984.937500 66.946216 2895.0 2918.25 2999.5 3036.75 3073.0
    aflsmart 82800 20.0 2896.650000 46.006035 2828.0 2886.25 2907.0 2912.25 3025.0
    aflfast 82800 20.0 2888.250000 43.931616 2810.0 2865.75 2906.0 2911.75 2974.0
    centipede 82800 13.0 2855.615385 74.386982 2691.0 2796.00 2890.0 2897.00 2955.0
    fairfuzz 82800 19.0 2779.105263 65.278799 2676.0 2745.50 2764.0 2801.50 2912.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

proj4_proj_crs_to_crs_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: centipede, libfuzzer.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libfuzzer 82800 10.0 7578.500000 69.498601 7475.0 7534.25 7574.0 7625.50 7684.0
    aflplusplus 82800 20.0 6686.650000 157.813773 6375.0 6583.25 6680.5 6788.25 7034.0
    eclipser 82800 20.0 4850.350000 238.740175 4471.0 4688.75 4879.5 4962.25 5520.0
    centipede 82800 12.0 822.916667 5.550730 817.0 818.50 822.0 826.50 834.0
    fairfuzz 82800 20.0 779.350000 98.588606 494.0 810.00 816.0 819.00 821.0
    aflsmart 82800 20.0 805.450000 17.101785 740.0 804.75 807.0 815.25 821.0
    aflfast 82800 20.0 738.800000 92.916828 488.0 727.75 740.5 806.25 817.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability that the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p-values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distributions of the two fuzzers in a pair differ significantly.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzer).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer in the column but not by the fuzzer in the row.
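
For readers unfamiliar with the Vargha-Delaney A12 measure used above, the following sketch computes it by direct pair counting. The per-trial coverage values are invented for illustration (the report lists only summary statistics), and the function name is hypothetical.

```python
def vargha_delaney_a12(xs, ys):
    """Estimate P(X > Y) + 0.5 * P(X == Y) over all (x, y) pairs."""
    wins = sum(1.0 for x in xs for y in ys if x > y)
    ties = sum(0.5 for x in xs for y in ys if x == y)
    return (wins + ties) / (len(xs) * len(ys))


# Made-up final coverage values for two fuzzers, one value per trial.
row_fuzzer = [10, 12, 15, 15, 20]
col_fuzzer = [9, 12, 13, 15, 16]

a12 = vargha_delaney_a12(row_fuzzer, col_fuzzer)
print(f"A12(row, col) = {a12:.2f}")
```

A value of 0.5 means neither fuzzer tends to outperform the other; values near 1 favor the row fuzzer and values near 0 favor the column fuzzer. Swapping the arguments yields `1 - a12`.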

re2_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
The following fuzzers do not have enough samples: libfuzzer, centipede.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time (s) count mean std min 25% median 75% max
    libfuzzer 82800 13.0 2883.538462 2.787334 2879.0 2882.00 2883.0 2886.00 2888.0
    aflplusplus 82800 20.0 2868.350000 7.442941 2842.0 2866.50 2870.0 2873.25 2875.0
    aflsmart 82800 20.0 2863.850000 17.502707 2792.0 2863.75 2867.5 2870.50 2876.0
    eclipser 82800 17.0 2849.176471 37.480387 2746.0 2854.00 2867.0 2872.00 2877.0
    fairfuzz 82800 16.0 2849.875000 34.960692 2757.0 2858.75 2864.0 2866.00 2872.0
    aflfast 82800 20.0 2851.500000 30.130418 2778.0 2860.50 2862.5 2866.25 2869.0
    libafl 82800 18.0 2857.555556 6.509922 2844.0 2852.25 2856.5 2864.00 2865.0
    centipede 82800 10.0 2767.000000 23.489951 2733.0 2753.25 2762.5 2775.50 2811.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability that the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p-values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distributions of the two fuzzers in a pair differ significantly.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzer).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer in the column but not by the fuzzer in the row.
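
As a rough illustration of the pairwise Mann-Whitney U test mentioned above, the sketch below computes the U statistic and a two-sided p-value using the tie-corrected normal approximation. This is an assumption made for brevity: statistical packages often use an exact method for small samples, so p-values from this sketch may differ slightly from those in the report. The sample values and the function name are hypothetical.

```python
import math
from collections import Counter


def mann_whitney_u(xs, ys):
    """Return (U for xs, approximate two-sided p-value).

    Assumes at least two observations in total. Uses the normal
    approximation with tie correction and no continuity correction.
    """
    n1, n2 = len(xs), len(ys)
    n = n1 + n2
    # U for xs: pairs where x beats y, with ties counted as one half.
    u = sum(1.0 if x > y else 0.5 if x == y else 0.0 for x in xs for y in ys)
    mean_u = n1 * n2 / 2.0
    # Tie correction: sum of t^3 - t over groups of equal pooled values.
    tie_term = sum(t**3 - t for t in Counter(list(xs) + list(ys)).values())
    variance = n1 * n2 / 12.0 * ((n + 1) - tie_term / (n * (n - 1)))
    if variance <= 0:
        return u, 1.0  # all observations identical: no evidence of a difference
    z = (u - mean_u) / math.sqrt(variance)
    p_value = math.erfc(abs(z) / math.sqrt(2.0))
    return u, p_value


# Made-up coverage values for two clearly separated fuzzers.
high = [20, 21, 22, 23, 24, 25, 26, 27]
low = [10, 11, 12, 13, 14, 15, 16, 17]
u_stat, p_value = mann_whitney_u(high, low)
print(u_stat, p_value < 0.05)
```

With small trial counts, such as the fuzzers flagged as not having enough samples, the normal approximation is coarse and the result should be read with caution.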

sqlite3_ossfuzz summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
The following fuzzers do not have enough samples: centipede, fairfuzz.
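
The report does not state how the 95% confidence band around the mean is computed. One common approach, shown here purely as an assumption, is a percentile bootstrap over the per-trial coverage values at a given time. The trial values and the function name are hypothetical.

```python
import random
import statistics


def bootstrap_mean_ci(samples, confidence=0.95, resamples=2000, seed=0):
    """Percentile-bootstrap confidence interval for the mean of `samples`."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    means = sorted(
        statistics.fmean(rng.choices(samples, k=len(samples)))
        for _ in range(resamples)
    )
    tail = (1.0 - confidence) / 2.0
    lower = means[int(tail * resamples)]
    upper = means[int((1.0 - tail) * resamples) - 1]
    return lower, upper


# Made-up coverage values from eight trials at one point in time.
trials = [100, 104, 98, 110, 102, 107, 95, 101]
low_bound, high_bound = bootstrap_mean_ci(trials)
print(low_bound, statistics.fmean(trials), high_bound)
```

Fewer trials give a wider and less stable interval, which is one reason fuzzers with too few samples are flagged.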
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time (s) count mean std min 25% median 75% max
    libafl 82800 17.0 20801.470588 62.701792 20718.0 20760.00 20791.0 20818.00 20965.0
    aflplusplus 82800 19.0 19983.315789 221.093508 19388.0 19886.00 20031.0 20122.00 20267.0
    aflsmart 82800 20.0 18982.900000 237.661613 18306.0 18933.75 19022.0 19117.00 19303.0
    aflfast 82800 20.0 18918.450000 267.101494 18518.0 18677.25 18938.0 19154.00 19247.0
    eclipser 82800 16.0 18884.375000 348.908751 17767.0 18835.25 18909.5 19083.75 19306.0
    libfuzzer 82800 16.0 16601.625000 402.278987 15760.0 16329.25 16561.0 16898.25 17279.0
    centipede 82800 12.0 13113.083333 482.740559 12453.0 12802.50 13043.5 13350.75 14071.0
    fairfuzz 82800 12.0 12516.333333 1555.113169 10873.0 11305.75 12095.0 13230.75 15546.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability that the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p-values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distributions of the two fuzzers in a pair differ significantly.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzer).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer in the column but not by the fuzzer in the row.

vorbis_decode_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
The following fuzzers do not have enough samples: centipede, eclipser, libfuzzer.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time (s) count mean std min 25% median 75% max
    libfuzzer 82800 11.0 1267.909091 2.211540 1265.0 1266.50 1268.0 1269.50 1272.0
    aflplusplus 82800 19.0 1263.315789 3.127879 1256.0 1262.00 1264.0 1265.00 1268.0
    aflsmart 82800 20.0 1244.100000 19.558011 1199.0 1247.00 1251.5 1254.00 1259.0
    aflfast 82800 20.0 1245.400000 19.491969 1183.0 1246.75 1251.0 1253.75 1258.0
    libafl 82800 17.0 1251.470588 3.659195 1245.0 1249.00 1250.0 1255.00 1259.0
    eclipser 82800 15.0 1248.133333 5.514483 1236.0 1245.00 1248.0 1252.50 1255.0
    fairfuzz 82800 18.0 1228.111111 28.060625 1175.0 1208.25 1238.0 1250.75 1257.0
    centipede 82800 15.0 1143.266667 14.901901 1118.0 1134.00 1144.0 1150.50 1170.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability that the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p-values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distributions of the two fuzzers in a pair differ significantly.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzer).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer in the column but not by the fuzzer in the row.
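
The columns of the sample statistics table above (count, mean, std, min, quartiles, max) can be recomputed from per-trial coverage values. The sketch below assumes the usual pandas-style conventions, namely the sample standard deviation (n - 1 denominator) and linearly interpolated quartiles; the report text does not state them explicitly. The input values and the `describe` helper are hypothetical.

```python
import statistics


def describe(samples):
    """Summary statistics in the same column order as the report's table."""
    q1, median, q3 = statistics.quantiles(samples, n=4, method="inclusive")
    return {
        "count": float(len(samples)),
        "mean": statistics.fmean(samples),
        "std": statistics.stdev(samples),  # sample standard deviation (n - 1)
        "min": float(min(samples)),
        "25%": q1,
        "median": median,
        "75%": q3,
        "max": float(max(samples)),
    }


# Made-up final coverage values from nine trials of one fuzzer.
stats = describe([1, 2, 3, 4, 5, 6, 7, 8, 9])
for name, value in stats.items():
    print(f"{name:>6} {value:.6f}")
```

The `inclusive` method interpolates linearly between the two nearest observations, which explains fractional quartiles such as 1266.50 in the table even though coverage counts are integers.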

woff2_convert_woff2ttf_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
The following fuzzers do not have enough samples: centipede, aflsmart, libfuzzer.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time (s) count mean std min 25% median 75% max
    libafl 82800 18.0 1184.555556 12.147245 1161.0 1174.00 1184.5 1194.75 1204.0
    aflplusplus 82800 16.0 1163.437500 13.038245 1143.0 1156.00 1162.0 1170.50 1189.0
    libfuzzer 82800 12.0 1145.583333 52.879723 1053.0 1111.50 1125.5 1194.00 1212.0
    aflsmart 82800 13.0 1105.461538 29.809438 1047.0 1090.00 1123.0 1127.00 1129.0
    aflfast 82800 20.0 1084.450000 26.198584 1032.0 1065.25 1090.5 1106.25 1124.0
    centipede 82800 15.0 1072.800000 15.279304 1036.0 1066.50 1069.0 1082.00 1098.0
    fairfuzz 82800 17.0 994.941176 30.150602 959.0 982.00 991.0 1001.00 1098.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability that the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p-values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distributions of the two fuzzers in a pair differ significantly.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzer).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer in the column but not by the fuzzer in the row.

zlib_zlib_uncompress_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
The following fuzzers do not have enough samples: centipede, aflsmart, libfuzzer.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time (s) count mean std min 25% median 75% max
    libfuzzer 82800 9.0 469.555556 3.844188 463.0 470.00 471.0 472.0 473.0
    aflplusplus 82800 19.0 462.631579 3.467476 458.0 460.00 462.0 464.0 470.0
    fairfuzz 82800 16.0 457.875000 3.685557 455.0 455.75 457.0 459.0 470.0
    aflsmart 82800 12.0 455.750000 13.948770 416.0 455.75 456.0 462.0 470.0
    eclipser 82800 16.0 451.687500 13.189484 423.0 450.25 455.0 458.0 471.0
    centipede 82800 13.0 454.153846 2.544476 451.0 452.00 454.0 456.0 459.0
    libafl 82800 17.0 452.529412 5.038820 446.0 449.00 451.0 458.0 462.0
    aflfast 82800 20.0 448.100000 15.109774 386.0 449.00 449.0 454.0 460.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability that the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p-values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distributions of the two fuzzers in a pair differ significantly.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzer).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer in the column but not by the fuzzer in the row.
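
The ranking by median reached code coverage for this benchmark can be reproduced from the median column of the zlib table above. The sketch below also expresses each median as a percentage of the best median, following the score definition in the experiment summary. The helper names are hypothetical, and ties are not given averaged ranks here.

```python
# Median reached coverage at the final snapshot, copied from the zlib table.
medians = {
    "libfuzzer": 471.0,
    "aflplusplus": 462.0,
    "fairfuzz": 457.0,
    "aflsmart": 456.0,
    "eclipser": 455.0,
    "centipede": 454.0,
    "libafl": 451.0,
    "aflfast": 449.0,
}


def rank_by_median(medians):
    """Return (rank, fuzzer, median) tuples, best median first."""
    ordered = sorted(medians.items(), key=lambda item: item[1], reverse=True)
    return [(rank, fuzzer, value)
            for rank, (fuzzer, value) in enumerate(ordered, start=1)]


def percent_of_best(medians):
    """Each median as a percentage of the highest median on the benchmark."""
    best = max(medians.values())
    return {fuzzer: 100.0 * value / best for fuzzer, value in medians.items()}


ranking = rank_by_median(medians)
score = percent_of_best(medians)
for rank, fuzzer, value in ranking:
    print(rank, fuzzer, value, f"{score[fuzzer]:.2f}")
```

Averaging these per-benchmark ranks and scores across all benchmarks gives the two aggregate rankings shown at the top of the report.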

experiment data

You can download the raw data for this report here.

Check out the documentation on how to create customized reports using this data. Also see some example Colab notebooks for doing custom analysis on the data here.

Experiment Description: None