FuzzBench: 2024-05-10-aflpp report

experiment summary

We show two different aggregate (cross-benchmark) rankings of fuzzers. The first is based on the average of per-benchmarks scores, where the score represents the percentage of the highest reached median code-coverage on a given benchmark (higher value is better). The second ranking shows the average rank of fuzzers, after we rank them on each benchmark according to their median reached code-covereges (lower value is better).
By avg. score
average normalized score
fuzzer
aflplusplus 99.44
libafl 97.41
honggfuzz 96.79
libfuzzer 95.38
mopt 84.95
aflsmart 84.71
gfuzz 84.15
afl 81.65
aflfast 79.84
fairfuzz 79.14
eclipser 75.72
centipede 70.19
By avg. rank
average rank
fuzzer
aflplusplus 1.82
libfuzzer 3.82
libafl 4.64
honggfuzz 5.82
aflsmart 5.91
gfuzz 6.00
mopt 6.09
afl 7.27
eclipser 7.55
centipede 9.09
fairfuzz 9.27
aflfast 9.45
  • Critical difference diagram
    The diagram visualizes the average rank of fuzzers (second ranking above) while showing the significance of the differences as well. What is considered a "critical difference" (CD) is based on the Friedman/Nemenyi post-hoc test. See more in the documentation.
    Note: If a fuzzer does not support all benchmarks, its ranking as shown in this diagram can be lower than it should be. So please check the list of supported benchmarks for the fuzzer(s) of your interest. The list could be specified in the fuzzer's README.md like this.
  • Median relative code-coverages on each benchmark

    Note: The relative coverage summary table shows the median relative performance of each fuzzer to the experiment maximum. Thus the highest relative performance may not be 100%.
    trial_relative_coverage = trial_coverage / experiment_max_coverage

      aflplusplus libafl honggfuzz libfuzzer eclipser gfuzz mopt aflsmart centipede afl aflfast fairfuzz
    FuzzerMedian 97.00 95.00 96.00 96.00 95.00 90.00 97.00 95.00 90.00 95.00 94.00 84.00
    FuzzerMean 95.82 93.91 93.45 92.00 89.11 88.90 83.00 82.64 82.33 79.82 78.09 77.27
    bloaty_fuzz_target 98.00 98.00 95.00 89.00 94.00 85.00 97.00 95.00 nan 94.00 93.00 80.00
    freetype2_ftfuzzer 93.00 92.00 91.00 79.00 76.00 83.00 68.00 68.00 58.00 68.00 67.00 64.00
    harfbuzz_hb-shape-fuzzer 98.00 99.00 96.00 94.00 96.00 nan 97.00 96.00 nan 96.00 95.00 84.00
    lcms_cms_transform_fuzzer 94.00 95.00 82.00 87.00 76.00 75.00 72.00 70.00 38.00 41.00 29.00 55.00
    libjpeg-turbo_libjpeg_turbo_fuzzer 99.00 82.00 99.00 99.00 nan 82.00 99.00 99.00 96.00 99.00 99.00 99.00
    libpcap_fuzz_both 85.00 83.00 80.00 75.00 72.00 78.00 1.00 1.00 81.00 1.00 1.00 1.00
    libpng_libpng_read_fuzzer 95.00 95.00 96.00 96.00 95.00 95.00 94.00 95.00 96.00 95.00 94.00 94.00
    openssl_x509 99.00 99.00 99.00 99.00 99.00 99.00 99.00 99.00 99.00 99.00 99.00 99.00
    vorbis_decode_fuzzer 99.00 98.00 98.00 99.00 98.00 99.00 98.00 98.00 90.00 98.00 98.00 97.00
    woff2_convert_woff2ttf_fuzzer 97.00 98.00 95.00 96.00 nan 96.00 91.00 92.00 88.00 91.00 90.00 81.00
    zlib_zlib_uncompress_fuzzer 97.00 94.00 97.00 99.00 96.00 97.00 97.00 96.00 95.00 96.00 94.00 96.00
    • Fuzzers are sorted by "FuzzerMean" (average median relative coverage), highest on the left.
    • Green background = highest relative median coverage.
    • Blue gradient background = greater than 95% relative median coverage.

bloaty_fuzz_target summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
error
The following fuzzers do not have enough samples: aflplusplus, aflsmart, mopt, eclipser, libafl, gfuzz.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus 82800 15.0 6362.933333 53.783251 6262.0 6318.00 6363.0 6414.50 6437.0
    libafl 82800 11.0 6348.454545 40.497811 6289.0 6318.50 6360.0 6380.00 6400.0
    mopt 82800 14.0 6242.357143 84.158831 6046.0 6225.75 6267.5 6291.00 6350.0
    aflsmart 82800 15.0 6143.400000 156.758139 5816.0 6031.00 6149.0 6241.50 6401.0
    honggfuzz 82800 18.0 6094.111111 174.869662 5884.0 5918.25 6142.0 6248.00 6329.0
    afl 82800 18.0 6101.944444 119.949240 5889.0 6070.75 6098.5 6141.75 6338.0
    eclipser 82800 11.0 6053.181818 68.930136 5914.0 6022.50 6078.0 6095.50 6135.0
    aflfast 82800 17.0 6028.411765 77.961576 5893.0 5960.00 6035.0 6065.00 6185.0
    libfuzzer 82800 20.0 5795.800000 110.501536 5592.0 5718.75 5781.0 5869.50 5994.0
    gfuzz 82800 2.0 5480.500000 120.915260 5395.0 5437.75 5480.5 5523.25 5566.0
    fairfuzz 82800 16.0 5189.937500 67.159977 5111.0 5135.25 5170.0 5226.00 5323.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

freetype2_ftfuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
error
The following fuzzers do not have enough samples: honggfuzz, mopt, aflfast, libafl, gfuzz.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus 82800 16.0 11317.312500 469.680135 10099.0 11260.25 11429.5 11614.5 11875.0
    libafl 82800 8.0 11233.125000 573.448326 10421.0 10942.25 11301.5 11468.0 12222.0
    honggfuzz 82800 15.0 11146.933333 587.165767 9966.0 10638.00 11211.0 11643.5 11874.0
    gfuzz 82800 2.0 10230.000000 445.477272 9915.0 10072.50 10230.0 10387.5 10545.0
    libfuzzer 82800 20.0 9686.850000 575.195187 8786.0 9311.25 9719.5 10053.5 11076.0
    eclipser 82800 18.0 9312.222222 98.567717 9066.0 9283.00 9317.0 9355.5 9516.0
    aflsmart 82800 17.0 8323.647059 218.891794 7882.0 8183.00 8427.0 8489.0 8603.0
    mopt 82800 14.0 8386.428571 93.355989 8142.0 8352.25 8394.0 8459.0 8499.0
    afl 82800 17.0 8258.117647 210.946345 7860.0 8255.00 8313.0 8393.0 8532.0
    aflfast 82800 13.0 8162.615385 203.476673 7780.0 8005.00 8204.0 8329.0 8411.0
    fairfuzz 82800 17.0 7936.117647 236.373349 7764.0 7809.00 7863.0 7895.0 8582.0
    centipede 82800 19.0 7235.526316 211.190374 6852.0 7110.50 7190.0 7354.0 7653.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

harfbuzz_hb-shape-fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
error
The following fuzzers do not have enough samples: mopt, fairfuzz, libafl, honggfuzz.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libafl 82800 11.0 11004.454545 134.697709 10716.0 11019.00 11049.0 11079.00 11133.0
    aflplusplus 82800 17.0 10913.411765 154.392786 10415.0 10928.00 10958.0 10990.00 11038.0
    mopt 82800 15.0 10787.266667 79.410747 10535.0 10768.50 10803.0 10831.50 10876.0
    eclipser 82800 20.0 10769.400000 51.530676 10623.0 10744.25 10777.0 10789.50 10868.0
    afl 82800 18.0 10766.222222 44.163540 10669.0 10737.25 10773.5 10791.25 10845.0
    aflsmart 82800 18.0 10772.000000 45.695926 10703.0 10737.50 10763.5 10805.75 10859.0
    honggfuzz 82800 8.0 10715.750000 14.992855 10696.0 10704.25 10716.0 10726.25 10736.0
    aflfast 82800 18.0 10653.000000 67.534741 10498.0 10637.75 10667.5 10692.00 10756.0
    libfuzzer 82800 20.0 10556.700000 73.370079 10375.0 10509.25 10564.5 10605.00 10679.0
    fairfuzz 82800 14.0 9509.571429 347.646530 9071.0 9259.50 9399.0 9715.75 10155.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

lcms_cms_transform_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
error
The following fuzzers do not have enough samples: honggfuzz, libafl, gfuzz.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libafl 82800 9.0 2085.777778 80.161989 1934.0 2038.00 2087.0 2134.00 2186.0
    aflplusplus 82800 17.0 2008.411765 174.692250 1536.0 2024.00 2056.0 2101.00 2125.0
    libfuzzer 82800 18.0 1937.333333 107.209525 1788.0 1860.00 1905.0 2018.00 2135.0
    honggfuzz 82800 11.0 1578.818182 434.253110 741.0 1542.00 1807.0 1848.00 1923.0
    eclipser 82800 17.0 1656.823529 137.802955 1398.0 1537.00 1671.0 1753.00 1893.0
    gfuzz 82800 2.0 1657.500000 289.206674 1453.0 1555.25 1657.5 1759.75 1862.0
    mopt 82800 18.0 1334.222222 446.061399 586.0 873.25 1574.0 1668.50 1795.0
    aflsmart 82800 17.0 1318.294118 462.454696 642.0 859.00 1546.0 1667.00 1762.0
    fairfuzz 82800 18.0 1274.277778 397.407600 800.0 901.00 1221.0 1636.25 1940.0
    afl 82800 17.0 1135.764706 470.692247 583.0 651.00 906.0 1631.00 1728.0
    centipede 82800 18.0 949.388889 231.815378 756.0 782.50 846.5 1025.75 1362.0
    aflfast 82800 19.0 663.947368 178.273969 519.0 619.00 643.0 645.50 1383.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libjpeg-turbo_libjpeg_turbo_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
error
The following fuzzers do not have enough samples: aflplusplus, fairfuzz, libafl, gfuzz.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libfuzzer 82800 19.0 3087.157895 1.772104 3082.0 3086.00 3087.0 3088.50 3089.0
    aflplusplus 82800 15.0 3082.800000 2.274078 3078.0 3081.50 3082.0 3084.50 3086.0
    aflsmart 82800 17.0 3080.352941 5.024206 3069.0 3080.00 3082.0 3084.00 3086.0
    fairfuzz 82800 13.0 3072.615385 17.399897 3017.0 3072.00 3079.0 3080.00 3084.0
    afl 82800 18.0 3078.611111 5.203380 3070.0 3075.25 3078.5 3083.00 3086.0
    mopt 82800 17.0 3070.882353 21.109484 3014.0 3072.00 3078.0 3081.00 3086.0
    honggfuzz 82800 17.0 3063.411765 9.454224 3040.0 3060.00 3066.0 3068.00 3075.0
    aflfast 82800 17.0 3050.470588 30.820281 3007.0 3014.00 3065.0 3077.00 3084.0
    centipede 82800 19.0 2971.631579 31.225365 2915.0 2945.50 2986.0 2999.50 3008.0
    gfuzz 82800 2.0 2549.000000 0.000000 2549.0 2549.00 2549.0 2549.00 2549.0
    libafl 82800 10.0 2545.500000 1.779513 2543.0 2545.00 2545.0 2545.75 2550.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libpcap_fuzz_both summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
error
The following fuzzers do not have enough samples: aflfast, aflsmart, centipede, honggfuzz, mopt, libafl, gfuzz.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus 82800 16.0 2901.187500 118.532257 2721.0 2794.25 2897.5 2994.5 3129.0
    libafl 82800 3.0 2807.000000 64.210591 2736.0 2780.00 2824.0 2842.5 2861.0
    centipede 82800 15.0 1788.666667 1437.918022 100.0 104.50 2773.0 2910.5 3390.0
    honggfuzz 82800 15.0 2772.733333 147.428175 2564.0 2679.00 2738.0 2854.0 3129.0
    gfuzz 82800 1.0 2664.000000 NaN 2664.0 2664.00 2664.0 2664.0 2664.0
    libfuzzer 82800 20.0 2574.000000 93.538621 2460.0 2525.75 2544.0 2590.0 2877.0
    eclipser 82800 19.0 2461.052632 196.678947 1977.0 2403.00 2446.0 2609.0 2706.0
    aflfast 82800 15.0 39.066667 4.463609 34.0 34.00 43.0 43.0 43.0
    afl 82800 18.0 38.777778 4.492550 34.0 34.00 41.0 43.0 43.0
    aflsmart 82800 15.0 34.600000 2.323790 34.0 34.00 34.0 34.0 43.0
    fairfuzz 82800 16.0 37.375000 4.500000 34.0 34.00 34.0 43.0 43.0
    mopt 82800 15.0 144.866667 416.958180 34.0 34.00 34.0 43.0 1652.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libpng_libpng_read_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
error
The following fuzzers do not have enough samples: afl, aflsmart, aflfast, libafl, gfuzz.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libfuzzer 82800 20.0 2017.750000 0.966546 2015.0 2018.00 2018.0 2018.00 2019.0
    centipede 82800 18.0 2014.000000 3.613699 2006.0 2013.00 2015.0 2016.75 2018.0
    honggfuzz 82800 18.0 2011.777778 2.860595 2007.0 2010.00 2011.5 2013.50 2018.0
    aflplusplus 82800 16.0 2008.437500 21.734669 1993.0 2002.00 2004.0 2005.00 2089.0
    gfuzz 82800 2.0 2004.000000 1.414214 2003.0 2003.50 2004.0 2004.50 2005.0
    libafl 82800 6.0 1996.666667 8.640988 1980.0 1996.50 1999.0 2001.50 2004.0
    eclipser 82800 18.0 1983.000000 27.563830 1900.0 1988.25 1993.5 1996.25 1999.0
    aflsmart 82800 15.0 1968.066667 37.151171 1909.0 1926.00 1992.0 1996.00 1997.0
    afl 82800 15.0 1973.733333 35.293599 1902.0 1978.00 1991.0 1993.00 2004.0
    fairfuzz 82800 18.0 1972.166667 30.020091 1889.0 1955.75 1981.0 1996.00 1999.0
    mopt 82800 16.0 1970.500000 28.768038 1912.0 1949.00 1980.5 1994.25 2001.0
    aflfast 82800 13.0 1951.153846 47.435651 1822.0 1945.00 1969.0 1975.00 1991.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

openssl_x509 summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
error
The following fuzzers do not have enough samples: afl, aflsmart, centipede, eclipser, aflfast, aflplusplus, honggfuzz, libafl, gfuzz.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus 82800 14.0 5830.142857 4.258153 5821.0 5830.25 5831.5 5833.00 5834.0
    libfuzzer 82800 20.0 5826.550000 6.202504 5817.0 5820.00 5830.0 5832.00 5834.0
    aflsmart 82800 15.0 5827.066667 4.847189 5817.0 5825.50 5829.0 5830.50 5833.0
    afl 82800 15.0 5826.600000 3.960519 5820.0 5824.00 5828.0 5829.50 5832.0
    mopt 82800 18.0 5824.555556 6.536974 5811.0 5823.25 5828.0 5828.75 5831.0
    eclipser 82800 15.0 5825.266667 4.096456 5817.0 5823.50 5827.0 5828.00 5831.0
    libafl 82800 9.0 5826.222222 4.352522 5821.0 5823.00 5825.0 5830.00 5833.0
    centipede 82800 15.0 5822.200000 6.689010 5812.0 5818.00 5824.0 5826.50 5834.0
    fairfuzz 82800 16.0 5821.125000 2.753785 5816.0 5819.50 5822.0 5823.00 5825.0
    aflfast 82800 14.0 5817.000000 8.009610 5807.0 5810.25 5816.0 5824.00 5829.0
    gfuzz 82800 2.0 5816.000000 1.414214 5815.0 5815.50 5816.0 5816.50 5817.0
    honggfuzz 82800 14.0 5814.000000 6.480741 5802.0 5808.50 5814.5 5820.00 5821.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

vorbis_decode_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
error
The following fuzzers do not have enough samples: aflfast, centipede, eclipser, libafl, gfuzz.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus 82800 19.0 1265.842105 2.455451 1262.0 1264.00 1267.0 1268.00 1269.0
    gfuzz 82800 2.0 1267.000000 2.828427 1265.0 1266.00 1267.0 1268.00 1269.0
    libfuzzer 82800 20.0 1255.700000 54.323689 1025.0 1267.00 1267.0 1269.00 1271.0
    aflsmart 82800 17.0 1252.117647 15.591995 1194.0 1252.00 1255.0 1258.00 1262.0
    mopt 82800 16.0 1250.000000 19.572089 1178.0 1250.75 1254.0 1257.25 1261.0
    libafl 82800 11.0 1254.090909 2.981763 1251.0 1252.00 1253.0 1254.50 1261.0
    afl 82800 17.0 1245.941176 15.666328 1201.0 1247.00 1252.0 1254.00 1258.0
    aflfast 82800 15.0 1246.400000 14.695966 1195.0 1246.00 1250.0 1252.50 1256.0
    honggfuzz 82800 16.0 1248.500000 6.673330 1235.0 1245.75 1248.5 1254.00 1259.0
    eclipser 82800 15.0 1248.133333 5.514483 1236.0 1245.00 1248.0 1252.50 1255.0
    fairfuzz 82800 18.0 1228.111111 28.060625 1175.0 1208.25 1238.0 1250.75 1257.0
    centipede 82800 15.0 1143.733333 9.917277 1127.0 1136.50 1146.0 1150.00 1162.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

woff2_convert_woff2ttf_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
error
The following fuzzers do not have enough samples: mopt, libafl, aflsmart, gfuzz.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libafl 82800 14.0 1183.357143 14.323768 1155.0 1170.25 1190.5 1192.00 1208.0
    aflplusplus 82800 17.0 1179.882353 11.362671 1154.0 1172.00 1180.0 1189.00 1197.0
    gfuzz 82800 2.0 1177.500000 7.778175 1172.0 1174.75 1177.5 1180.25 1183.0
    libfuzzer 82800 20.0 1156.150000 46.898855 1058.0 1129.50 1174.5 1194.25 1214.0
    honggfuzz 82800 17.0 1155.882353 27.126745 1113.0 1130.00 1158.0 1177.00 1198.0
    aflsmart 82800 13.0 1105.461538 29.809438 1047.0 1090.00 1123.0 1127.00 1129.0
    mopt 82800 15.0 1109.600000 21.503488 1071.0 1095.00 1116.0 1127.00 1136.0
    afl 82800 16.0 1106.125000 21.171915 1066.0 1098.25 1113.0 1121.75 1133.0
    aflfast 82800 17.0 1090.352941 24.882075 1043.0 1071.00 1099.0 1108.00 1118.0
    centipede 82800 18.0 1075.166667 13.057565 1060.0 1064.25 1072.0 1088.75 1099.0
    fairfuzz 82800 17.0 994.941176 30.150602 959.0 982.00 991.0 1001.00 1098.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

zlib_zlib_uncompress_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
error
The following fuzzers do not have enough samples: aflplusplus, mopt, aflsmart, libafl, gfuzz.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libfuzzer 82800 20.0 469.650000 3.897030 462.0 469.50 471.5 472.00 473.0
    gfuzz 82800 2.0 462.500000 0.707107 462.0 462.25 462.5 462.75 463.0
    aflplusplus 82800 15.0 461.666667 4.685337 456.0 458.00 461.0 462.50 471.0
    mopt 82800 15.0 455.066667 12.290918 423.0 455.00 460.0 461.00 469.0
    honggfuzz 82800 18.0 458.777778 3.858612 452.0 458.00 459.0 460.75 468.0
    fairfuzz 82800 16.0 457.875000 3.685557 455.0 455.75 457.0 459.00 470.0
    aflsmart 82800 12.0 455.750000 13.948770 416.0 455.75 456.0 462.00 470.0
    afl 82800 16.0 449.625000 17.450406 406.0 454.75 455.0 459.25 466.0
    eclipser 82800 16.0 451.687500 13.189484 423.0 450.25 455.0 458.00 471.0
    centipede 82800 18.0 453.166667 2.935583 451.0 451.00 452.0 455.00 462.0
    aflfast 82800 17.0 428.352941 37.729202 345.0 401.00 448.0 454.00 458.0
    libafl 82800 7.0 448.714286 6.156684 442.0 444.50 447.0 451.50 460.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

experiment data

You can download the raw data for this report here.

Check out the documentation on how to create customized reports using this data. Also see some example Colab notebooks for doing custom analysis on the data here.

Experiment Description:

(None,)