FuzzBench: 2024-05-13-new-cov report

experiment summary

We show two different aggregate (cross-benchmark) rankings of fuzzers. The first is based on the average of per-benchmarks scores, where the score represents the percentage of the highest reached median code-coverage on a given benchmark (higher value is better). The second ranking shows the average rank of fuzzers, after we rank them on each benchmark according to their median reached code-covereges (lower value is better).
By avg. score
average normalized score
fuzzer
libafl 98.92
honggfuzz 94.88
aflplusplus 93.35
libfuzzer 93.02
mopt 84.46
afl 81.05
aflsmart 75.81
centipede 68.85
By avg. rank
average rank
fuzzer
aflplusplus 2.22
libafl 3.04
libfuzzer 3.78
honggfuzz 4.26
aflsmart 4.78
mopt 5.22
afl 5.35
centipede 6.65
  • Critical difference diagram
    The diagram visualizes the average rank of fuzzers (second ranking above) while showing the significance of the differences as well. What is considered a "critical difference" (CD) is based on the Friedman/Nemenyi post-hoc test. See more in the documentation.
    Note: If a fuzzer does not support all benchmarks, its ranking as shown in this diagram can be lower than it should be. So please check the list of supported benchmarks for the fuzzer(s) of your interest. The list could be specified in the fuzzer's README.md like this.
  • Median relative code-coverages on each benchmark

    Note: The relative coverage summary table shows the median relative performance of each fuzzer to the experiment maximum. Thus the highest relative performance may not be 100%.
    trial_relative_coverage = trial_coverage / experiment_max_coverage

      libafl aflplusplus honggfuzz libfuzzer afl mopt aflsmart centipede
    FuzzerMedian 98.00 98.00 97.00 93.00 94.00 93.00 94.00 91.00
    FuzzerMean 95.70 94.73 91.83 90.04 82.18 82.04 80.48 80.42
    bloaty_fuzz_target 98.00 98.00 92.00 91.00 94.00 96.00 94.00 nan
    curl_curl_fuzzer_http 98.00 98.00 98.00 90.00 94.00 93.00 94.00 nan
    freetype2_ftfuzzer 90.00 91.00 91.00 77.00 66.00 66.00 67.00 58.00
    harfbuzz_hb-shape-fuzzer 99.00 98.00 96.00 94.00 96.00 97.00 96.00 nan
    jsoncpp_jsoncpp_fuzzer 98.00 99.00 99.00 100.00 98.00 98.00 98.00 98.00
    lcms_cms_transform_fuzzer 94.00 nan 73.00 86.00 67.00 51.00 40.00 37.00
    libjpeg-turbo_libjpeg_turbo_fuzzer 99.00 99.00 99.00 99.00 99.00 99.00 99.00 99.00
    libpcap_fuzz_both 91.00 94.00 89.00 82.00 1.00 1.00 1.00 91.00
    libpng_libpng_read_fuzzer 95.00 95.00 95.00 96.00 94.00 94.00 94.00 96.00
    libxml2_xml 99.00 99.00 98.00 97.00 97.00 97.00 97.00 92.00
    libxslt_xpath 96.00 98.00 97.00 92.00 94.00 93.00 95.00 94.00
    mbedtls_fuzz_dtlsclient 92.00 76.00 73.00 73.00 73.00 73.00 73.00 71.00
    openh264_decoder_fuzzer 98.00 99.00 98.00 98.00 98.00 99.00 98.00 96.00
    openssl_x509 99.00 99.00 98.00 99.00 99.00 99.00 99.00 99.00
    openthread_ot-ip6-send-fuzzer 85.00 73.00 73.00 74.00 70.00 70.00 70.00 69.00
    proj4_proj_crs_to_crs_fuzzer 89.00 87.00 96.00 93.00 9.00 9.00 9.00 10.00
    re2_fuzzer 99.00 99.00 98.00 99.00 99.00 99.00 99.00 95.00
    sqlite3_ossfuzz 99.00 96.00 69.00 78.00 90.00 90.00 91.00 65.00
    stb_stbi_read_fuzzer 95.00 93.00 92.00 87.00 87.00 86.00 87.00 85.00
    systemd_fuzz-link-parser 98.00 99.00 97.00 72.00 92.00 91.00 91.00 nan
    vorbis_decode_fuzzer 98.00 99.00 97.00 99.00 98.00 98.00 98.00 89.00
    woff2_convert_woff2ttf_fuzzer 98.00 98.00 97.00 96.00 93.00 92.00 nan 89.00
    zlib_zlib_uncompress_fuzzer 94.00 97.00 97.00 99.00 nan 96.00 nan 95.00
    • Fuzzers are sorted by "FuzzerMean" (average median relative coverage), highest on the left.
    • Green background = highest relative median coverage.
    • Blue gradient background = greater than 95% relative median coverage.

bloaty_fuzz_target summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libafl 82800 20.0 6343.00 41.929766 6252.0 6324.25 6352.0 6367.00 6402.0
    aflplusplus 82800 20.0 6328.30 69.291224 6149.0 6327.50 6340.0 6368.00 6429.0
    mopt 82800 20.0 6189.05 93.155487 6019.0 6121.00 6188.0 6260.50 6323.0
    aflsmart 82800 20.0 6146.00 146.262561 5784.0 6070.75 6104.0 6271.00 6381.0
    afl 82800 20.0 6040.55 112.988809 5838.0 5956.50 6062.5 6098.75 6233.0
    honggfuzz 82800 20.0 6044.80 190.055560 5739.0 5920.75 5971.0 6222.50 6334.0
    libfuzzer 82800 20.0 5899.05 118.185614 5708.0 5797.75 5905.0 5980.25 6141.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

curl_curl_fuzzer_http summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus 82800 20.0 10892.90 66.383971 10791.0 10839.00 10883.5 10921.00 11018.0
    libafl 82800 20.0 10852.50 71.694160 10682.0 10804.00 10847.0 10917.25 10957.0
    honggfuzz 82800 20.0 10828.55 86.108545 10645.0 10766.00 10845.0 10886.25 10948.0
    aflsmart 82800 20.0 10385.00 132.985951 10018.0 10363.25 10420.0 10455.25 10544.0
    afl 82800 20.0 10364.95 124.397273 10082.0 10344.00 10400.0 10426.75 10513.0
    mopt 82800 20.0 10355.10 76.350232 10128.0 10334.25 10353.0 10406.75 10458.0
    libfuzzer 82800 20.0 9923.25 444.124434 9151.0 9477.25 10020.0 10221.75 10581.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

freetype2_ftfuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    honggfuzz 82800 20.0 11363.35 626.091239 10087.0 11149.50 11475.5 11834.25 12138.0
    aflplusplus 82800 20.0 11381.75 411.452098 10657.0 11001.25 11431.5 11707.75 12014.0
    libafl 82800 20.0 11388.90 686.837443 10085.0 10910.25 11281.5 11976.25 12475.0
    libfuzzer 82800 20.0 9691.85 478.040989 8979.0 9503.25 9650.5 9810.25 10769.0
    aflsmart 82800 20.0 8342.75 174.588501 7870.0 8314.75 8393.0 8435.50 8535.0
    mopt 82800 20.0 8321.65 153.060454 7821.0 8324.75 8357.5 8392.00 8480.0
    afl 82800 20.0 8262.20 189.522558 7862.0 8164.50 8342.5 8407.25 8475.0
    centipede 82800 20.0 7264.35 173.182069 6852.0 7190.50 7299.0 7378.00 7565.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

harfbuzz_hb-shape-fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
error
The following fuzzers do not have enough samples: honggfuzz.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libafl 82800 20.0 11039.65 37.033804 10965.0 11029.5 11045.0 11058.00 11106.0
    aflplusplus 82800 20.0 10860.70 236.201988 9917.0 10891.5 10922.0 10958.75 11010.0
    mopt 82800 20.0 10778.95 52.249880 10651.0 10757.5 10789.5 10796.25 10863.0
    aflsmart 82800 20.0 10763.70 42.521945 10652.0 10753.0 10764.5 10791.00 10839.0
    afl 82800 20.0 10745.00 44.143307 10636.0 10734.0 10757.0 10771.75 10812.0
    honggfuzz 82800 14.0 10686.00 25.495098 10629.0 10672.5 10689.5 10703.50 10719.0
    libfuzzer 82800 20.0 10537.35 68.395348 10421.0 10484.0 10522.0 10600.25 10656.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

jsoncpp_jsoncpp_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libfuzzer 82800 20.0 525.000000 0.000000 525.0 525.00 525.0 525.0 525.0
    honggfuzz 82800 20.0 522.500000 1.051315 521.0 522.00 522.0 524.0 524.0
    aflplusplus 82800 20.0 519.950000 0.223607 519.0 520.00 520.0 520.0 520.0
    centipede 82800 20.0 519.300000 1.592747 517.0 518.75 519.0 519.0 523.0
    mopt 82800 20.0 517.050000 4.430457 502.0 517.00 518.0 520.0 520.0
    afl 82800 20.0 517.400000 2.479389 508.0 517.00 517.0 519.0 520.0
    aflsmart 82800 19.0 516.105263 4.689169 503.0 517.00 517.0 519.0 520.0
    libafl 82800 20.0 517.350000 1.089423 515.0 517.00 517.0 517.5 519.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

lcms_cms_transform_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libafl 82800 20.0 2087.40 78.542245 1869.0 2055.50 2100.0 2118.00 2212.0
    libfuzzer 82800 20.0 1895.95 59.130965 1788.0 1846.25 1903.5 1946.50 1992.0
    honggfuzz 82800 20.0 1438.10 500.925974 694.0 778.00 1631.5 1838.25 1918.0
    afl 82800 20.0 1311.25 440.908736 648.0 876.00 1498.0 1675.50 1776.0
    mopt 82800 20.0 1154.60 428.242602 650.0 697.50 1133.0 1559.00 1736.0
    aflsmart 82800 20.0 1074.75 469.254714 650.0 652.00 900.0 1561.00 1791.0
    centipede 82800 20.0 988.70 247.792167 753.0 811.25 834.0 1228.25 1496.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libjpeg-turbo_libjpeg_turbo_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libfuzzer 82800 20.0 2549.60 1.846761 2546.0 2549.75 2550.0 2550.00 2553.0
    aflplusplus 82800 20.0 2548.95 2.187885 2546.0 2547.00 2548.5 2551.00 2552.0
    centipede 82800 20.0 2546.00 1.555973 2542.0 2545.00 2546.0 2547.00 2550.0
    aflsmart 82800 20.0 2544.75 1.773341 2543.0 2543.00 2545.0 2546.00 2549.0
    libafl 82800 20.0 2544.70 1.031095 2543.0 2544.00 2545.0 2545.25 2546.0
    afl 82800 20.0 2544.60 1.759186 2541.0 2543.75 2544.0 2545.25 2548.0
    honggfuzz 82800 20.0 2543.80 1.765160 2538.0 2543.00 2544.0 2544.00 2548.0
    mopt 82800 20.0 2544.20 1.542384 2540.0 2543.75 2544.0 2545.00 2547.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libpcap_fuzz_both summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
error
The following fuzzers do not have enough samples: afl, mopt, aflplusplus, libafl.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus 82800 5.0 2895.600000 137.583793 2698.0 2849.0 2891.0 2978.0 3062.0
    libafl 82800 3.0 2812.666667 232.508781 2579.0 2697.0 2815.0 2929.5 3044.0
    centipede 82800 20.0 2423.700000 1007.383641 101.0 2647.5 2811.5 2901.0 3035.0
    honggfuzz 82800 20.0 2722.850000 143.003230 2288.0 2701.5 2751.5 2809.0 2950.0
    libfuzzer 82800 19.0 2549.631579 72.861521 2460.0 2503.0 2525.0 2577.0 2706.0
    afl 82800 15.0 38.200000 4.647580 34.0 34.0 34.0 43.0 43.0
    aflsmart 82800 20.0 34.000000 0.000000 34.0 34.0 34.0 34.0 34.0
    mopt 82800 14.0 36.214286 3.826599 34.0 34.0 34.0 37.0 43.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libpng_libpng_read_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libfuzzer 82800 20.0 2017.85 0.875094 2015.0 2018.00 2018.0 2018.00 2019.0
    centipede 82800 20.0 2015.35 2.207046 2009.0 2014.00 2016.0 2017.00 2018.0
    honggfuzz 82800 20.0 2030.00 33.450671 2003.0 2009.75 2011.5 2039.75 2098.0
    aflplusplus 82800 20.0 2007.45 19.701656 1984.0 2003.75 2004.0 2006.25 2088.0
    libafl 82800 20.0 1994.10 10.808866 1971.0 1989.75 1998.0 2001.25 2005.0
    aflsmart 82800 20.0 1964.40 41.639303 1888.0 1926.50 1992.5 1995.25 1998.0
    afl 82800 20.0 1975.30 32.509270 1873.0 1974.75 1985.5 1995.25 2001.0
    mopt 82800 20.0 1969.65 25.915399 1915.0 1945.50 1976.5 1990.75 1999.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libxml2_xml summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus 82800 20.0 15704.00 38.554131 15644.0 15675.75 15707.5 15722.75 15768.0
    libafl 82800 20.0 15616.85 29.388997 15582.0 15596.75 15612.0 15632.00 15697.0
    honggfuzz 82800 20.0 15608.15 38.615888 15543.0 15576.00 15609.0 15640.25 15667.0
    aflsmart 82800 20.0 15356.90 64.464515 15142.0 15341.50 15371.0 15390.50 15431.0
    libfuzzer 82800 20.0 15365.30 76.755319 15224.0 15302.75 15364.5 15404.50 15513.0
    mopt 82800 20.0 15336.60 74.989403 15154.0 15311.25 15343.0 15360.00 15510.0
    afl 82800 20.0 15333.40 55.067805 15255.0 15277.00 15336.5 15355.25 15457.0
    centipede 82800 20.0 14641.10 107.861461 14388.0 14574.25 14654.5 14713.50 14809.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libxslt_xpath summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus 82800 20.0 11189.20 84.190636 11056.0 11138.75 11175.5 11242.00 11337.0
    honggfuzz 82800 20.0 11044.80 89.711585 10860.0 10998.00 11075.5 11106.75 11147.0
    libafl 82800 20.0 10978.30 72.276589 10799.0 10958.50 10993.0 11019.00 11081.0
    aflsmart 82800 20.0 10844.85 43.367797 10793.0 10816.50 10826.5 10859.25 10942.0
    afl 82800 20.0 10733.85 121.023388 10465.0 10730.00 10768.5 10816.25 10846.0
    centipede 82800 19.0 10722.00 125.835430 10468.0 10650.00 10714.0 10821.00 10951.0
    mopt 82800 20.0 10575.40 126.064896 10279.0 10519.00 10613.0 10662.00 10772.0
    libfuzzer 82800 20.0 10453.70 390.147219 9328.0 10435.00 10519.0 10656.75 10911.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

mbedtls_fuzz_dtlsclient summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libafl 82800 20.0 3206.50 386.055014 2694.0 2736.25 3386.0 3505.50 3664.0
    aflplusplus 82800 20.0 2786.90 25.308413 2750.0 2764.00 2785.5 2807.75 2829.0
    aflsmart 82800 20.0 2709.00 26.942434 2665.0 2701.75 2705.5 2713.00 2792.0
    libfuzzer 82800 20.0 2696.80 27.475539 2662.0 2673.25 2693.5 2718.50 2755.0
    mopt 82800 20.0 2685.00 34.546688 2562.0 2672.50 2692.5 2701.25 2726.0
    afl 82800 20.0 2684.95 40.685540 2532.0 2678.25 2691.0 2703.00 2741.0
    honggfuzz 82800 20.0 2685.30 28.077149 2647.0 2665.00 2679.5 2706.50 2750.0
    centipede 82800 20.0 2642.00 19.031829 2616.0 2627.00 2636.5 2659.50 2673.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

openh264_decoder_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    mopt 82800 20.0 9531.75 57.376756 9409.0 9523.75 9543.5 9553.00 9631.0
    aflplusplus 82800 20.0 9520.80 66.542112 9320.0 9520.25 9543.0 9554.25 9581.0
    afl 82800 20.0 9523.10 47.584717 9407.0 9513.50 9533.5 9546.25 9595.0
    aflsmart 82800 20.0 9518.80 44.310626 9411.0 9519.75 9530.0 9547.25 9564.0
    libfuzzer 82800 20.0 9500.30 37.975200 9405.0 9485.00 9507.5 9523.75 9560.0
    libafl 82800 20.0 9443.75 62.804521 9285.0 9433.75 9463.5 9476.25 9521.0
    honggfuzz 82800 20.0 9379.55 322.928455 8036.0 9375.25 9460.5 9505.25 9540.0
    centipede 82800 20.0 9265.70 143.057625 8816.0 9213.25 9285.5 9323.50 9486.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

openssl_x509 summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus 82800 20.0 5832.90 3.024462 5821.0 5833.00 5833.5 5834.00 5835.0
    libfuzzer 82800 20.0 5830.15 11.699190 5818.0 5825.50 5830.0 5832.00 5875.0
    aflsmart 82800 20.0 5826.30 5.582869 5808.0 5827.00 5828.0 5829.00 5831.0
    afl 82800 20.0 5825.95 4.236123 5815.0 5824.00 5827.0 5829.00 5830.0
    mopt 82800 20.0 5823.35 6.507283 5813.0 5815.75 5826.0 5829.00 5830.0
    libafl 82800 20.0 5824.00 5.525063 5808.0 5820.00 5824.0 5829.25 5830.0
    centipede 82800 20.0 5820.50 6.893933 5808.0 5814.00 5820.5 5826.00 5831.0
    honggfuzz 82800 20.0 5813.45 7.126489 5801.0 5808.75 5811.0 5820.25 5822.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

openthread_ot-ip6-send-fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libafl 82800 20.0 3528.90 293.876791 3001.0 3516.00 3557.0 3615.75 4144.0
    libfuzzer 82800 20.0 3086.05 27.277280 3034.0 3075.75 3083.0 3094.25 3169.0
    aflplusplus 82800 20.0 3246.65 261.252907 3047.0 3053.75 3066.5 3577.50 3631.0
    honggfuzz 82800 20.0 3122.55 290.299750 2889.0 2998.25 3047.0 3064.25 3848.0
    mopt 82800 20.0 2889.45 41.043590 2826.0 2832.75 2912.5 2916.00 2936.0
    aflsmart 82800 20.0 2896.65 46.006035 2828.0 2886.25 2907.0 2912.25 3025.0
    afl 82800 20.0 2879.15 42.045433 2808.0 2830.25 2902.0 2911.00 2933.0
    centipede 82800 20.0 2873.50 73.097268 2779.0 2795.75 2877.5 2907.75 3046.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

proj4_proj_crs_to_crs_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    honggfuzz 82800 20.0 7831.95 203.670134 7321.0 7730.75 7849.5 7984.00 8112.0
    libfuzzer 82800 20.0 7573.50 106.292793 7374.0 7498.00 7603.5 7630.25 7788.0
    libafl 82800 20.0 7312.85 156.194978 7028.0 7249.50 7285.0 7430.50 7553.0
    aflplusplus 82800 20.0 7069.30 206.538412 6599.0 6934.25 7065.5 7209.50 7408.0
    centipede 82800 20.0 822.25 4.327087 817.0 820.00 820.5 823.00 830.0
    aflsmart 82800 20.0 805.45 17.101785 740.0 804.75 807.0 815.25 821.0
    mopt 82800 20.0 798.40 22.272003 740.0 798.50 806.5 808.00 820.0
    afl 82800 20.0 788.70 34.581863 720.0 777.50 805.5 807.75 819.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

re2_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libfuzzer 82800 20.0 2883.30 2.002630 2880.0 2882.00 2883.0 2885.00 2887.0
    aflplusplus 82800 20.0 2877.40 4.717270 2864.0 2875.00 2878.0 2880.25 2884.0
    aflsmart 82800 20.0 2863.85 17.502707 2792.0 2863.75 2867.5 2870.50 2876.0
    afl 82800 20.0 2857.50 27.952027 2767.0 2860.50 2867.0 2870.25 2874.0
    mopt 82800 20.0 2846.85 31.551587 2764.0 2847.00 2861.5 2864.25 2868.0
    libafl 82800 20.0 2858.05 4.978639 2849.0 2854.75 2859.5 2862.00 2865.0
    honggfuzz 82800 20.0 2853.65 8.910520 2840.0 2846.75 2852.0 2862.00 2870.0
    centipede 82800 20.0 2765.60 19.277475 2741.0 2752.75 2761.5 2779.25 2798.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

sqlite3_ossfuzz summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libafl 82800 20.0 20759.85 64.795447 20614.0 20721.50 20768.0 20801.25 20873.0
    aflplusplus 82800 20.0 19940.30 566.860430 18215.0 19741.75 20061.5 20292.75 20561.0
    aflsmart 82800 20.0 18982.90 237.661613 18306.0 18933.75 19022.0 19117.00 19303.0
    afl 82800 20.0 18947.25 360.375809 18132.0 18822.25 18991.0 19191.75 19434.0
    mopt 82800 20.0 18882.30 305.529067 18331.0 18657.50 18968.0 19091.50 19473.0
    libfuzzer 82800 20.0 16491.80 366.922565 15742.0 16264.00 16481.0 16738.00 17117.0
    honggfuzz 82800 20.0 14312.90 399.955379 13518.0 14213.75 14403.0 14563.00 14860.0
    centipede 82800 20.0 13673.95 533.791550 12605.0 13360.75 13601.0 14037.75 14686.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

stb_stbi_read_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libafl 82800 20.0 2198.05 54.959770 2107.0 2163.75 2193.0 2240.00 2290.0
    aflplusplus 82800 20.0 2159.35 45.787007 2113.0 2116.00 2148.0 2202.75 2215.0
    honggfuzz 82800 20.0 2127.50 30.768234 2111.0 2113.00 2115.5 2118.25 2199.0
    aflsmart 82800 20.0 2025.70 45.962342 1941.0 2000.50 2007.5 2083.75 2093.0
    libfuzzer 82800 20.0 2023.90 45.396383 1982.0 1998.50 2007.0 2026.25 2116.0
    afl 82800 20.0 2008.75 29.163559 1975.0 2001.75 2003.0 2006.25 2091.0
    mopt 82800 20.0 1997.00 28.771331 1963.0 1980.75 1992.0 2004.00 2072.0
    centipede 82800 20.0 1963.50 10.694317 1952.0 1956.00 1960.0 1964.00 1986.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

vorbis_decode_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libfuzzer 82800 20.0 1266.75 2.221308 1263.0 1265.75 1267.0 1268.00 1274.0
    aflplusplus 82800 20.0 1264.55 2.211810 1261.0 1263.00 1264.0 1265.25 1269.0
    afl 82800 20.0 1251.35 12.482725 1201.0 1250.75 1253.5 1257.00 1261.0
    mopt 82800 20.0 1250.40 9.626717 1218.0 1252.00 1253.0 1254.00 1256.0
    aflsmart 82800 20.0 1244.10 19.558011 1199.0 1247.00 1251.5 1254.00 1259.0
    libafl 82800 20.0 1251.45 3.103055 1247.0 1248.75 1250.5 1254.25 1257.0
    honggfuzz 82800 20.0 1243.70 8.742516 1230.0 1236.25 1246.0 1250.50 1255.0
    centipede 82800 20.0 1144.80 17.644441 1118.0 1132.25 1142.5 1162.00 1179.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

woff2_convert_woff2ttf_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus 82800 20.0 1179.25 12.636101 1157.0 1168.00 1181.0 1189.25 1198.0
    libafl 82800 20.0 1176.85 15.587698 1138.0 1168.25 1181.0 1186.25 1201.0
    honggfuzz 82800 20.0 1169.20 20.995238 1114.0 1168.75 1171.0 1181.00 1195.0
    libfuzzer 82800 20.0 1146.05 50.539693 1071.0 1101.00 1165.0 1188.00 1204.0
    afl 82800 20.0 1111.65 22.697728 1060.0 1103.00 1120.5 1127.00 1136.0
    mopt 82800 20.0 1115.45 18.259749 1068.0 1106.25 1119.5 1128.75 1140.0
    centipede 82800 20.0 1074.50 18.303005 1032.0 1061.00 1082.5 1087.00 1099.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

zlib_zlib_uncompress_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libfuzzer 82800 20.0 467.50 4.872047 461.0 462.75 470.5 472.00 473.0
    honggfuzz 82800 20.0 461.90 4.610058 456.0 458.75 461.0 467.00 470.0
    aflplusplus 82800 20.0 460.80 4.916139 456.0 457.00 460.0 462.25 472.0
    mopt 82800 20.0 456.10 4.024922 449.0 455.00 455.5 458.50 464.0
    centipede 82800 20.0 453.30 3.435113 445.0 451.00 454.0 455.25 462.0
    libafl 82800 20.0 449.35 5.264329 439.0 447.50 449.0 451.00 461.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

experiment data

You can download the raw data for this report here.

Check out the documentation on how to create customized reports using this data. Also see some example Colab notebooks for doing custom analysis on the data here.

Experiment Description:

(None,)