FuzzBench: 2023-01-18-latest-cov report

experiment summary

We show two different aggregate (cross-benchmark) rankings of fuzzers. The first is based on the average of per-benchmark scores, where the score represents the percentage of the highest reached median code coverage on a given benchmark (higher value is better). The second ranking shows the average rank of fuzzers, after we rank them on each benchmark according to their median reached code coverage (lower value is better).
By avg. score
fuzzer average normalized score
honggfuzz 95.60
libafl 94.59
libfuzzer 94.51
aflsmart 89.19
mopt 89.05
afl 87.46
aflfast 86.24
fairfuzz 83.35
centipede 70.65
eclipser 31.73
By avg. rank
fuzzer average rank
libafl 2.95
libfuzzer 3.52
honggfuzz 3.76
aflsmart 3.86
mopt 5.00
afl 5.05
fairfuzz 6.76
aflfast 7.00
centipede 7.57
eclipser 8.05
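
As an illustration of the two rankings above, the following minimal sketch computes both from a table of per-benchmark median coverages. The benchmark names, fuzzer names, values, and function names are hypothetical. Giving tied fuzzers their average rank is an assumption, and fuzzers that skip a benchmark are not handled here.

```python
from statistics import mean

# Hypothetical median coverage per benchmark and fuzzer (not taken from the report).
medians = {
    "bench_a": {"fuzz_x": 100.0, "fuzz_y": 80.0, "fuzz_z": 50.0},
    "bench_b": {"fuzz_x": 90.0, "fuzz_y": 120.0, "fuzz_z": 60.0},
}


def average_scores(medians):
    """Average over benchmarks of 100 * median / best median on that benchmark."""
    per_fuzzer = {}
    for coverage in medians.values():
        best = max(coverage.values())
        for fuzzer, value in coverage.items():
            per_fuzzer.setdefault(fuzzer, []).append(100.0 * value / best)
    return {fuzzer: mean(scores) for fuzzer, scores in per_fuzzer.items()}


def average_ranks(medians):
    """Average over benchmarks of the rank by median coverage (1 = best).

    Assumption: tied fuzzers share the average of the ranks they occupy.
    """
    per_fuzzer = {}
    for coverage in medians.values():
        for fuzzer, value in coverage.items():
            higher = sum(1 for v in coverage.values() if v > value)
            equal = sum(1 for v in coverage.values() if v == value)
            per_fuzzer.setdefault(fuzzer, []).append(higher + (equal + 1) / 2)
    return {fuzzer: mean(ranks) for fuzzer, ranks in per_fuzzer.items()}


print(average_scores(medians))
print(average_ranks(medians))
```

With these made-up numbers, fuzz_x and fuzz_y tie on average rank (1.5 each) while fuzz_y leads on average score (90.0 vs. 87.5), which shows how the two rankings can disagree.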
  • Critical difference diagram
    The diagram visualizes the average rank of fuzzers (second ranking above) while showing the significance of the differences as well. What is considered a "critical difference" (CD) is based on the Friedman/Nemenyi post-hoc test. See more in the documentation.
    Note: If a fuzzer does not support all benchmarks, its ranking as shown in this diagram can be lower than it should be. Please check the list of supported benchmarks for the fuzzers you are interested in. The list can be specified in the fuzzer's README.md.
  • Median relative code-coverages on each benchmark

    Note: The relative coverage summary table shows the median relative performance of each fuzzer to the experiment maximum. Thus the highest relative performance may not be 100%.
    trial_relative_coverage = trial_coverage / experiment_max_coverage

      libafl eclipser honggfuzz libfuzzer aflsmart mopt afl aflfast centipede fairfuzz
    FuzzerMedian 98.00 96.00 97.00 94.00 96.00 95.00 96.00 94.00 90.00 87.00
    FuzzerMean 95.35 92.29 91.81 90.71 86.33 86.14 84.71 83.48 83.18 80.67
    bloaty_fuzz_target 98.00 nan 95.00 90.00 95.00 96.00 94.00 93.00 nan 79.00
    curl_curl_fuzzer_http 98.00 nan 98.00 91.00 94.00 94.00 94.00 93.00 nan 85.00
    freetype2_ftfuzzer 91.00 nan 87.00 75.00 66.00 65.00 65.00 64.00 56.00 61.00
    harfbuzz_hb-shape-fuzzer 99.00 nan 96.00 94.00 96.00 96.00 96.00 95.00 nan 84.00
    jsoncpp_jsoncpp_fuzzer 98.00 nan 99.00 100.00 98.00 98.00 98.00 98.00 98.00 98.00
    lcms_cms_transform_fuzzer 96.00 nan 83.00 89.00 71.00 72.00 41.00 29.00 39.00 56.00
    libjpeg-turbo_libjpeg_turbo_fuzzer 99.00 nan 99.00 99.00 99.00 99.00 99.00 99.00 96.00 99.00
    libpcap_fuzz_both 81.00 nan 80.00 75.00 1.00 1.00 1.00 1.00 81.00 1.00
    libpng_libpng_read_fuzzer 98.00 98.00 99.00 99.00 98.00 98.00 98.00 97.00 99.00 98.00
    libxml2_xml 99.00 nan 99.00 98.00 97.00 97.00 97.00 97.00 93.00 89.00
    libxslt_xpath 98.00 nan 99.00 94.00 97.00 95.00 96.00 95.00 95.00 97.00
    mbedtls_fuzz_dtlsclient 85.00 70.00 70.00 70.00 70.00 70.00 70.00 68.00 69.00 73.00
    openssl_x509 99.00 99.00 99.00 99.00 99.00 99.00 99.00 99.00 99.00 99.00
    openthread_ot-ip6-send-fuzzer 86.00 nan 74.00 75.00 71.00 71.00 71.00 70.00 70.00 67.00
    re2_fuzzer 99.00 99.00 98.00 99.00 99.00 98.00 99.00 99.00 95.00 99.00
    sqlite3_ossfuzz 95.00 nan 72.00 84.00 96.00 95.00 96.00 95.00 65.00 61.00
    stb_stbi_read_fuzzer 96.00 92.00 93.00 88.00 88.00 87.00 88.00 87.00 86.00 87.00
    systemd_fuzz-link-parser 99.00 92.00 98.00 96.00 92.00 92.00 92.00 92.00 nan 87.00
    vorbis_decode_fuzzer 98.00 nan 98.00 99.00 98.00 98.00 98.00 98.00 90.00 97.00
    woff2_convert_woff2ttf_fuzzer nan nan 95.00 92.00 92.00 91.00 91.00 90.00 88.00 81.00
    zlib_zlib_uncompress_fuzzer 95.00 96.00 97.00 99.00 96.00 97.00 96.00 94.00 95.00 96.00
    • Fuzzers are sorted by "FuzzerMean" (average median relative coverage), highest on the left.
    • Green background = highest relative median coverage.
    • Blue gradient background = greater than 95% relative median coverage.
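
The relative coverage formula in the note above can be sketched as follows. This is a minimal illustration with hypothetical fuzzer names and trial values; it assumes the experiment maximum is taken over all trials of all fuzzers on one benchmark.

```python
from statistics import median

# Hypothetical final coverage of each trial, grouped by fuzzer (illustrative only).
trials = {
    "fuzz_x": [950, 1000, 900],
    "fuzz_y": [800, 700, 900],
}


def median_relative_coverage(trials):
    """Median of trial_coverage / experiment_max_coverage per fuzzer, in percent."""
    experiment_max = max(max(values) for values in trials.values())
    return {
        fuzzer: median(100.0 * value / experiment_max for value in values)
        for fuzzer, values in trials.items()
    }


print(median_relative_coverage(trials))
```

Here fuzz_x holds the experiment maximum (1000) in one trial, yet its median relative coverage is 95.0, which is why the highest value in a row of the table may fall short of 100%.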

bloaty_fuzz_target summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: libfuzzer, mopt.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 82800 16.0 6388.937500 46.971578 6322.0 6342.75 6389.0 6427.00 6464.0
    mopt 82800 14.0 6242.357143 84.158831 6046.0 6225.75 6267.5 6291.00 6350.0
    aflsmart 82800 15.0 6143.400000 156.758139 5816.0 6031.00 6149.0 6241.50 6401.0
    honggfuzz 82800 18.0 6094.111111 174.869662 5884.0 5918.25 6142.0 6248.00 6329.0
    afl 82800 18.0 6101.944444 119.949240 5889.0 6070.75 6098.5 6141.75 6338.0
    aflfast 82800 17.0 6028.411765 77.961576 5893.0 5960.00 6035.0 6065.00 6185.0
    libfuzzer 82800 14.0 5863.928571 124.347196 5687.0 5776.50 5844.5 5987.50 6047.0
    fairfuzz 82800 16.0 5189.937500 67.159977 5111.0 5135.25 5170.0 5226.00 5323.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row
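
For readers unfamiliar with the Vargha-Delaney A measure mentioned above, here is a minimal sketch of the usual definition: the probability that a value drawn from the row fuzzer's trials exceeds one drawn from the column fuzzer's trials, with ties counted as one half. The function name and sample values are hypothetical, and the report's own implementation may differ.

```python
def a12(row_samples, column_samples):
    """Vargha-Delaney A: P(row > column) + 0.5 * P(row == column)."""
    greater = sum(1 for r in row_samples for c in column_samples if r > c)
    equal = sum(1 for r in row_samples for c in column_samples if r == c)
    return (greater + 0.5 * equal) / (len(row_samples) * len(column_samples))


# Hypothetical coverage values for two fuzzers.
print(a12([3, 4, 5], [1, 2, 3]))
print(a12([1, 2], [1, 2]))
```

A value of 0.5 means neither fuzzer tends to outperform the other; values near 1 favor the row fuzzer and values near 0 favor the column fuzzer.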

curl_curl_fuzzer_http summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: libfuzzer, mopt, honggfuzz.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    honggfuzz 82800 12.0 10852.333333 75.781184 10735.0 10803.75 10842.0 10911.75 10967.0
    libafl 82800 17.0 10742.470588 38.195415 10654.0 10724.00 10750.0 10761.00 10803.0
    aflsmart 82800 17.0 10390.764706 113.590014 10102.0 10374.00 10412.0 10455.00 10512.0
    afl 82800 17.0 10349.000000 110.809521 10104.0 10303.00 10355.0 10449.00 10493.0
    mopt 82800 14.0 10343.285714 52.470400 10244.0 10310.75 10353.0 10377.00 10441.0
    aflfast 82800 17.0 10254.352941 107.364997 9962.0 10228.00 10291.0 10314.00 10399.0
    libfuzzer 82800 14.0 10023.714286 353.387873 9531.0 9760.00 10058.0 10296.25 10557.0
    fairfuzz 82800 19.0 9222.578947 391.462545 8215.0 8926.50 9379.0 9512.00 9714.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row
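
To make the Mann-Whitney U test above more concrete, the following standard-library sketch computes the U statistic and a two-sided p value from the normal approximation with continuity correction. It is an illustration only: the function name and data are hypothetical, the variance omits the tie correction, and the report may rely on a statistics library with an exact method instead.

```python
import math


def mann_whitney_u(x, y):
    """Return (U for x, two-sided p value) using the normal approximation.

    Assumption: no tie correction is applied to the variance.
    """
    # U counts the pairs in which x beats y; ties count as one half.
    u_x = sum(1.0 if a > b else 0.5 if a == b else 0.0 for a in x for b in y)
    n1, n2 = len(x), len(y)
    mu = n1 * n2 / 2.0
    sigma = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12.0)
    # Continuity correction of 0.5, clamped so z never goes negative.
    z = max((abs(u_x - mu) - 0.5) / sigma, 0.0)
    p_value = math.erfc(z / math.sqrt(2.0))  # equals 2 * (1 - Phi(z))
    return u_x, p_value


# Hypothetical coverage samples from two fuzzers, fully separated.
u, p = mann_whitney_u(list(range(11, 21)), list(range(1, 11)))
print(u, p)
```

Dividing U by the number of pairs (n1 * n2) gives the Vargha-Delaney A value for the same two samples, so the two tables describe the same comparison from the significance and effect-size angles.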

freetype2_ftfuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: honggfuzz, libfuzzer, libafl, mopt, aflfast.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 82800 14.0 11621.285714 717.617471 10538.0 11153.75 11614.5 12277.25 12756.0
    honggfuzz 82800 15.0 11146.933333 587.165767 9966.0 10638.00 11211.0 11643.50 11874.0
    libfuzzer 82800 15.0 9564.933333 535.501029 8543.0 9186.50 9658.0 9919.50 10387.0
    aflsmart 82800 17.0 8323.647059 218.891794 7882.0 8183.00 8427.0 8489.00 8603.0
    mopt 82800 14.0 8386.428571 93.355989 8142.0 8352.25 8394.0 8459.00 8499.0
    afl 82800 17.0 8258.117647 210.946345 7860.0 8255.00 8313.0 8393.00 8532.0
    aflfast 82800 13.0 8162.615385 203.476673 7780.0 8005.00 8204.0 8329.00 8411.0
    fairfuzz 82800 17.0 7936.117647 236.373349 7764.0 7809.00 7863.0 7895.00 8582.0
    centipede 82800 19.0 7235.526316 211.190374 6852.0 7110.50 7190.0 7354.00 7653.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row
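
The two unique-coverage views described above reduce to set differences. The sketch below assumes each fuzzer's coverage is available as a set of branch identifiers; the fuzzer names, identifiers, and function names are hypothetical.

```python
# Hypothetical sets of covered branch identifiers per fuzzer.
covered = {
    "fuzz_x": {1, 2, 3, 4},
    "fuzz_y": {3, 4, 5},
    "fuzz_z": {4, 6},
}


def pairwise_unique(covered):
    """cell[row][column] = branches covered by the column fuzzer but not the row fuzzer."""
    return {
        row: {column: len(covered[column] - covered[row]) for column in covered}
        for row in covered
    }


def unique_to_each(covered):
    """Number of branches that no other fuzzer covered, per fuzzer."""
    result = {}
    for fuzzer, branches in covered.items():
        others = set().union(*(b for f, b in covered.items() if f != fuzzer))
        result[fuzzer] = len(branches - others)
    return result


print(pairwise_unique(covered)["fuzz_x"]["fuzz_y"])
print(unique_to_each(covered))
```

Note that the pairwise table is not symmetric: in this example fuzz_y covers one branch that fuzz_x misses, while fuzz_x covers two that fuzz_y misses.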

harfbuzz_hb-shape-fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: fairfuzz, libfuzzer, honggfuzz.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 82800 17.0 11075.352941 38.006482 11015.0 11047.00 11065.0 11105.00 11146.0
    mopt 82800 15.0 10787.266667 79.410747 10535.0 10768.50 10803.0 10831.50 10876.0
    afl 82800 18.0 10766.222222 44.163540 10669.0 10737.25 10773.5 10791.25 10845.0
    aflsmart 82800 18.0 10772.000000 45.695926 10703.0 10737.50 10763.5 10805.75 10859.0
    honggfuzz 82800 8.0 10715.750000 14.992855 10696.0 10704.25 10716.0 10726.25 10736.0
    aflfast 82800 18.0 10653.000000 67.534741 10498.0 10637.75 10667.5 10692.00 10756.0
    libfuzzer 82800 13.0 10544.538462 56.529661 10454.0 10514.00 10554.0 10572.00 10652.0
    fairfuzz 82800 14.0 9509.571429 347.646530 9071.0 9259.50 9399.0 9715.75 10155.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row
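
The columns of the sample statistics table above can be reproduced for one fuzzer's trials with the Python standard library. This sketch assumes the sample standard deviation (n - 1 in the denominator) and linear interpolation between order statistics for the quartiles; the report does not state which conventions it uses, and the data and function name are hypothetical.

```python
from statistics import mean, median, quantiles, stdev


def describe(samples):
    """Summary statistics with the same column names as the table."""
    # method="inclusive" interpolates linearly between the sorted samples.
    q1, _, q3 = quantiles(samples, n=4, method="inclusive")
    return {
        "count": float(len(samples)),
        "mean": mean(samples),
        "std": stdev(samples),
        "min": float(min(samples)),
        "25%": q1,
        "median": median(samples),
        "75%": q3,
        "max": float(max(samples)),
    }


# Hypothetical coverage values from five trials.
print(describe([10, 20, 30, 40, 50]))
```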

jsoncpp_jsoncpp_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: aflfast.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libfuzzer 82800 17.0 524.941176 0.242536 524.0 525.00 525.0 525.00 525.0
    honggfuzz 82800 16.0 522.500000 1.154701 521.0 522.00 522.0 524.00 524.0
    centipede 82800 16.0 520.250000 1.949359 518.0 519.00 519.0 522.00 523.0
    mopt 82800 16.0 518.250000 1.183216 516.0 517.75 518.0 519.00 520.0
    afl 82800 17.0 514.588235 6.195349 502.0 516.00 517.0 519.00 520.0
    aflfast 82800 12.0 514.916667 6.126816 501.0 516.75 517.0 517.50 519.0
    aflsmart 82800 14.0 516.428571 3.715131 504.0 517.00 517.0 517.75 519.0
    fairfuzz 82800 16.0 516.437500 2.988171 509.0 516.75 517.0 518.25 520.0
    libafl 82800 15.0 516.933333 0.258199 516.0 517.00 517.0 517.00 517.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

lcms_cms_transform_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: honggfuzz.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 82800 18.0 2056.500000 109.130925 1685.0 2015.00 2097.0 2122.75 2166.0
    libfuzzer 82800 16.0 1861.500000 302.680029 765.0 1867.75 1929.0 1987.50 2060.0
    honggfuzz 82800 11.0 1578.818182 434.253110 741.0 1542.00 1807.0 1848.00 1923.0
    mopt 82800 18.0 1334.222222 446.061399 586.0 873.25 1574.0 1668.50 1795.0
    aflsmart 82800 17.0 1318.294118 462.454696 642.0 859.00 1546.0 1667.00 1762.0
    fairfuzz 82800 18.0 1274.277778 397.407600 800.0 901.00 1221.0 1636.25 1940.0
    afl 82800 17.0 1135.764706 470.692247 583.0 651.00 906.0 1631.00 1728.0
    centipede 82800 18.0 949.388889 231.815378 756.0 782.50 846.5 1025.75 1362.0
    aflfast 82800 19.0 663.947368 178.273969 519.0 619.00 643.0 645.50 1383.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libjpeg-turbo_libjpeg_turbo_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: fairfuzz.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libfuzzer 82800 17.0 3085.941176 2.435944 3081.0 3086.00 3087.0 3087.0 3089.0
    aflsmart 82800 17.0 3080.352941 5.024206 3069.0 3080.00 3082.0 3084.0 3086.0
    fairfuzz 82800 13.0 3072.615385 17.399897 3017.0 3072.00 3079.0 3080.0 3084.0
    libafl 82800 16.0 3079.250000 3.000000 3075.0 3076.75 3079.0 3081.0 3085.0
    afl 82800 18.0 3078.611111 5.203380 3070.0 3075.25 3078.5 3083.0 3086.0
    mopt 82800 17.0 3070.882353 21.109484 3014.0 3072.00 3078.0 3081.0 3086.0
    honggfuzz 82800 17.0 3063.411765 9.454224 3040.0 3060.00 3066.0 3068.0 3075.0
    aflfast 82800 17.0 3050.470588 30.820281 3007.0 3014.00 3065.0 3077.0 3084.0
    centipede 82800 19.0 2971.631579 31.225365 2915.0 2945.50 2986.0 2999.5 3008.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libpcap_fuzz_both summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: aflfast, aflsmart, centipede, honggfuzz, mopt.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    centipede 82800 15.0 1788.666667 1437.918022 100.0 104.5 2773.0 2910.50 3390.0
    libafl 82800 19.0 2745.842105 145.707570 2470.0 2645.0 2746.0 2822.00 2982.0
    honggfuzz 82800 15.0 2772.733333 147.428175 2564.0 2679.0 2738.0 2854.00 3129.0
    libfuzzer 82800 16.0 2530.625000 141.622915 2046.0 2498.0 2568.5 2605.75 2660.0
    aflfast 82800 15.0 39.066667 4.463609 34.0 34.0 43.0 43.00 43.0
    afl 82800 18.0 38.777778 4.492550 34.0 34.0 41.0 43.00 43.0
    aflsmart 82800 15.0 34.600000 2.323790 34.0 34.0 34.0 34.00 43.0
    fairfuzz 82800 16.0 37.375000 4.500000 34.0 34.0 34.0 43.00 43.0
    mopt 82800 15.0 144.866667 416.958180 34.0 34.0 34.0 43.00 1652.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libpng_libpng_read_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: afl, aflsmart, libfuzzer, aflfast.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libfuzzer 82800 15.0 2018.066667 0.961150 2015.0 2018.00 2018.0 2018.50 2019.0
    centipede 82800 18.0 2014.000000 3.613699 2006.0 2013.00 2015.0 2016.75 2018.0
    honggfuzz 82800 18.0 2011.777778 2.860595 2007.0 2010.00 2011.5 2013.50 2018.0
    libafl 82800 19.0 1991.210526 10.014317 1976.0 1981.50 1997.0 1998.00 2005.0
    eclipser 82800 18.0 1983.000000 27.563830 1900.0 1988.25 1993.5 1996.25 1999.0
    aflsmart 82800 15.0 1968.066667 37.151171 1909.0 1926.00 1992.0 1996.00 1997.0
    afl 82800 15.0 1973.733333 35.293599 1902.0 1978.00 1991.0 1993.00 2004.0
    fairfuzz 82800 18.0 1972.166667 30.020091 1889.0 1955.75 1981.0 1996.00 1999.0
    mopt 82800 16.0 1970.500000 28.768038 1912.0 1949.00 1980.5 1994.25 2001.0
    aflfast 82800 13.0 1951.153846 47.435651 1822.0 1945.00 1969.0 1975.00 1991.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libxml2_xml summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: afl, libafl, aflsmart.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 82800 15.0 15624.733333 31.194017 15563.0 15607.00 15624.0 15642.00 15674.0
    honggfuzz 82800 17.0 15606.294118 40.253516 15536.0 15585.00 15602.0 15627.00 15693.0
    libfuzzer 82800 17.0 15421.705882 59.578483 15315.0 15377.00 15425.0 15456.00 15550.0
    aflsmart 82800 14.0 15337.785714 63.447348 15209.0 15310.50 15340.0 15380.25 15431.0
    mopt 82800 18.0 15322.888889 58.018140 15184.0 15302.25 15330.0 15346.75 15425.0
    afl 82800 15.0 15335.066667 49.574571 15270.0 15311.50 15328.0 15357.50 15439.0
    aflfast 82800 20.0 15313.550000 68.346235 15189.0 15271.50 15322.0 15363.75 15441.0
    centipede 82800 17.0 14686.235294 163.148525 14302.0 14555.00 14751.0 14794.00 14881.0
    fairfuzz 82800 19.0 14034.842105 437.228934 12545.0 13923.00 14065.0 14210.00 14843.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libxslt_xpath summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
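
The report does not say how the 95% confidence interval around the mean is computed. One common choice, shown here purely as an assumption, is a percentile bootstrap over the trials at a given time point. The function name, parameters, and coverage values below are hypothetical.

```python
import random
from statistics import mean


def bootstrap_mean_ci(samples, confidence=0.95, resamples=2000, seed=0):
    """Percentile bootstrap interval for the mean of the samples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    n = len(samples)
    # Resample with replacement and record the mean of each resample.
    means = sorted(mean(rng.choices(samples, k=n)) for _ in range(resamples))
    alpha = (1.0 - confidence) / 2.0
    low = means[int(alpha * resamples)]
    high = means[int((1.0 - alpha) * resamples) - 1]
    return low, high


# Hypothetical coverage values of eight trials at one time point.
coverage = [1010, 1040, 990, 1025, 1005, 1060, 980, 1030]
print(bootstrap_mean_ci(coverage))
```

With few trials per fuzzer the interval is wide, which is one reason bands of different fuzzers often overlap even when their means differ.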
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    honggfuzz 82800 16.0 11040.375000 89.626540 10864.0 10994.50 11076.0 11107.00 11137.0
    libafl 82800 17.0 10979.941176 63.757814 10859.0 10948.00 10990.0 11022.00 11076.0
    fairfuzz 82800 17.0 10804.470588 200.035346 10162.0 10772.00 10855.0 10906.00 11044.0
    aflsmart 82800 16.0 10830.500000 104.788676 10507.0 10812.00 10842.0 10886.00 10968.0
    afl 82800 16.0 10714.937500 122.364193 10433.0 10665.50 10741.5 10794.25 10846.0
    centipede 82800 16.0 10682.000000 104.691929 10510.0 10624.75 10670.5 10746.25 10908.0
    aflfast 82800 17.0 10648.647059 90.347898 10408.0 10638.00 10666.0 10700.00 10732.0
    mopt 82800 16.0 10562.250000 164.019308 10134.0 10474.75 10608.0 10641.50 10781.0
    libfuzzer 82800 16.0 10376.812500 426.188725 9248.0 10307.50 10502.0 10630.75 10788.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

mbedtls_fuzz_dtlsclient summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: fairfuzz, mopt.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 82800 18.0 3139.944444 300.045971 2631.0 2843.50 3255.0 3333.00 3630.0
    fairfuzz 82800 15.0 2849.466667 159.218119 2754.0 2779.00 2801.0 2807.50 3296.0
    honggfuzz 82800 17.0 2701.235294 23.236634 2664.0 2687.00 2704.0 2713.00 2757.0
    aflsmart 82800 19.0 2657.421053 87.858039 2506.0 2568.00 2699.0 2723.00 2773.0
    eclipser 82800 16.0 2726.562500 280.074030 2496.0 2674.00 2692.5 2715.00 3730.0
    mopt 82800 14.0 2665.428571 65.113764 2510.0 2673.75 2688.5 2702.50 2721.0
    afl 82800 16.0 2733.937500 299.446037 2502.0 2664.00 2682.0 2698.00 3827.0
    libfuzzer 82800 16.0 2683.125000 16.516154 2661.0 2669.75 2680.5 2694.25 2711.0
    centipede 82800 17.0 2710.117647 263.068167 2619.0 2630.00 2650.0 2666.00 3728.0
    aflfast 82800 17.0 2557.411765 121.454754 2309.0 2599.00 2611.0 2624.00 2667.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

openssl_x509 summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: aflfast, honggfuzz.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libfuzzer 82800 16.0 5826.687500 6.690478 5816.0 5820.00 5830.0 5832.00 5835.0
    aflsmart 82800 15.0 5827.066667 4.847189 5817.0 5825.50 5829.0 5830.50 5833.0
    libafl 82800 17.0 5827.470588 4.288322 5820.0 5823.00 5829.0 5831.00 5833.0
    afl 82800 15.0 5826.600000 3.960519 5820.0 5824.00 5828.0 5829.50 5832.0
    mopt 82800 18.0 5824.555556 6.536974 5811.0 5823.25 5828.0 5828.75 5831.0
    eclipser 82800 15.0 5825.266667 4.096456 5817.0 5823.50 5827.0 5828.00 5831.0
    centipede 82800 15.0 5822.200000 6.689010 5812.0 5818.00 5824.0 5826.50 5834.0
    fairfuzz 82800 16.0 5821.125000 2.753785 5816.0 5819.50 5822.0 5823.00 5825.0
    aflfast 82800 14.0 5817.000000 8.009610 5807.0 5810.25 5816.0 5824.00 5829.0
    honggfuzz 82800 14.0 5814.000000 6.480741 5802.0 5808.50 5814.5 5820.00 5821.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

openthread_ot-ip6-send-fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: afl, aflsmart, mopt.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 82800 16.0 3461.687500 391.352649 2975.0 3047.0 3524.0 3789.50 4097.0
    libfuzzer 82800 17.0 3082.470588 10.118780 3070.0 3074.0 3080.0 3092.00 3099.0
    honggfuzz 82800 18.0 3070.388889 168.342836 2900.0 2954.0 3045.0 3068.75 3537.0
    mopt 82800 15.0 2930.933333 69.999456 2824.0 2908.5 2916.0 2925.00 3065.0
    aflsmart 82800 15.0 2906.333333 69.832112 2795.0 2869.0 2915.0 2917.00 3052.0
    afl 82800 15.0 2911.333333 60.366342 2817.0 2904.5 2910.0 2917.00 3035.0
    aflfast 82800 17.0 2896.294118 42.210432 2808.0 2902.0 2907.0 2912.00 2980.0
    centipede 82800 17.0 2856.352941 57.396582 2763.0 2790.0 2884.0 2900.00 2917.0
    fairfuzz 82800 19.0 2779.105263 65.278799 2676.0 2745.5 2764.0 2801.50 2912.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row.
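For readers unfamiliar with the Vargha-Delaney A12 measure used in the significance tables, it estimates the probability that a trial of one fuzzer reaches higher coverage than a trial of another, counting ties as half. A minimal sketch; the coverage values below are hypothetical, not taken from the experiment data:

```python
def vargha_delaney_a12(xs, ys):
    """Estimate P(X > Y) + 0.5 * P(X == Y) over all (x, y) pairs."""
    greater = sum(1 for x in xs for y in ys if x > y)
    equal = sum(1 for x in xs for y in ys if x == y)
    return (greater + 0.5 * equal) / (len(xs) * len(ys))


# Hypothetical per-trial coverage for a row fuzzer and a column fuzzer.
row_fuzzer = [3524, 3047, 3789, 3100]
col_fuzzer = [3080, 3074, 3092, 3070]

a12 = vargha_delaney_a12(row_fuzzer, col_fuzzer)
print(a12)  # 0.75
```

A value of 0.5 means neither fuzzer tends to beat the other; values near 1 favor the row fuzzer and values near 0 favor the column fuzzer.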

re2_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: libafl, libfuzzer.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libfuzzer 82800 14.0 2883.142857 1.561909 2881.0 2882.00 2883.0 2884.00 2886.0
    aflsmart 82800 17.0 2862.705882 22.769949 2776.0 2865.00 2868.0 2870.00 2876.0
    afl 82800 17.0 2854.352941 30.250912 2784.0 2863.00 2867.0 2871.00 2872.0
    eclipser 82800 17.0 2849.176471 37.480387 2746.0 2854.00 2867.0 2872.00 2877.0
    fairfuzz 82800 16.0 2849.875000 34.960692 2757.0 2858.75 2864.0 2866.00 2872.0
    libafl 82800 14.0 2859.500000 6.085923 2845.0 2857.25 2862.5 2863.00 2866.0
    aflfast 82800 16.0 2849.187500 32.711300 2761.0 2853.00 2862.0 2866.00 2869.0
    honggfuzz 82800 18.0 2854.055556 6.347507 2839.0 2849.75 2853.0 2859.50 2862.0
    mopt 82800 17.0 2832.235294 42.418347 2741.0 2795.00 2851.0 2863.00 2871.0
    centipede 82800 16.0 2767.187500 18.999013 2740.0 2753.75 2768.0 2780.25 2801.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row.
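The pairwise Mann-Whitney U test mentioned above can be illustrated with a stdlib-only sketch. It uses the normal approximation with tie and continuity corrections, which is an assumption made here for brevity; a statistics library would normally be used, and its p-values can differ for small samples. The sample values are hypothetical:

```python
import math
from collections import Counter


def mann_whitney_u(xs, ys):
    """Two-sided Mann-Whitney U test via the normal approximation.

    Returns (U statistic for xs, approximate p-value).
    """
    n1, n2 = len(xs), len(ys)
    # U counts the pairs where x beats y; ties count as half a win.
    u1 = sum((x > y) + 0.5 * (x == y) for x in xs for y in ys)
    mean_u = n1 * n2 / 2
    n = n1 + n2
    # Tie correction: each group of t tied values reduces the variance.
    ties = sum(t**3 - t for t in Counter(list(xs) + list(ys)).values())
    var_u = n1 * n2 / 12 * ((n + 1) - ties / (n * (n - 1)))
    if var_u == 0:
        return u1, 1.0
    # Continuity correction, clamped so that z is never negative.
    z = max(abs(u1 - mean_u) - 0.5, 0.0) / math.sqrt(var_u)
    p_value = math.erfc(z / math.sqrt(2))  # equals 2 * (1 - Phi(z))
    return u1, p_value


# Hypothetical per-trial coverage for two fuzzers.
fuzzer_a = [2883, 2882, 2884, 2886, 2881, 2885, 2887]
fuzzer_b = [2768, 2753, 2780, 2740, 2801, 2765, 2771]

u, p = mann_whitney_u(fuzzer_a, fuzzer_b)
print(u, p < 0.05)
```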

sqlite3_ossfuzz summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: centipede, fairfuzz.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    afl 82800 19.0 19120.631579 271.473185 18750.0 18964.50 19069.0 19300.00 19764.0
    aflsmart 82800 16.0 18967.187500 332.880002 18405.0 18748.75 19007.0 19215.75 19455.0
    mopt 82800 18.0 18989.611111 257.525359 18358.0 18874.00 18944.0 19173.00 19401.0
    aflfast 82800 17.0 18843.058824 250.648526 18143.0 18773.00 18861.0 18991.00 19214.0
    libafl 82800 16.0 18786.062500 74.333903 18639.0 18743.00 18798.5 18834.50 18898.0
    libfuzzer 82800 16.0 16586.875000 471.360849 15225.0 16466.50 16633.5 16880.00 17108.0
    honggfuzz 82800 16.0 14378.875000 498.141329 13529.0 14138.50 14347.0 14592.00 15289.0
    centipede 82800 14.0 13009.714286 460.146537 12336.0 12762.25 12918.5 13128.25 13853.0
    fairfuzz 82800 12.0 12516.333333 1555.113169 10873.0 11305.75 12095.0 13230.75 15546.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row.
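The columns of the sample statistics tables (count, mean, std, min, 25%, median, 75%, max) can be reproduced for a single fuzzer's trials with the standard library. This sketch assumes the sample standard deviation and linearly interpolated quartiles, which matches the usual data-frame summary layout but is not confirmed by the report; the trial values are hypothetical:

```python
import statistics


def describe(samples):
    """Summarize per-trial coverage with the same columns as the report tables."""
    # method="inclusive" interpolates linearly between the observed values.
    q1, median, q3 = statistics.quantiles(samples, n=4, method="inclusive")
    return {
        "count": len(samples),
        "mean": statistics.mean(samples),
        "std": statistics.stdev(samples),  # sample standard deviation (n - 1)
        "min": min(samples),
        "25%": q1,
        "median": median,
        "75%": q3,
        "max": max(samples),
    }


# Hypothetical final coverage values from five trials of one fuzzer.
trials = [2881, 2882, 2883, 2884, 2886]
stats = describe(trials)
for column, value in stats.items():
    print(column, value)
```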

stb_stbi_read_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: libfuzzer.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 82800 16.0 2171.687500 49.049252 2104.0 2113.00 2190.0 2195.25 2269.0
    honggfuzz 82800 16.0 2118.062500 21.971098 2108.0 2111.00 2113.0 2115.00 2200.0
    eclipser 82800 19.0 2102.210526 10.141110 2081.0 2099.50 2106.0 2108.00 2115.0
    libfuzzer 82800 13.0 2031.615385 37.355362 1982.0 2011.00 2016.0 2056.00 2102.0
    aflsmart 82800 16.0 2028.937500 45.036976 1979.0 2001.75 2005.5 2087.25 2109.0
    afl 82800 16.0 2009.000000 49.946638 1936.0 1985.00 2003.5 2007.25 2108.0
    fairfuzz 82800 16.0 1999.312500 43.870596 1942.0 1976.50 1993.5 1999.00 2084.0
    aflfast 82800 17.0 1998.058824 34.450092 1963.0 1982.00 1987.0 1997.00 2087.0
    mopt 82800 18.0 1988.333333 28.033593 1952.0 1976.25 1984.0 2000.00 2080.0
    centipede 82800 17.0 1957.647059 5.049024 1952.0 1954.00 1957.0 1959.00 1973.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row.
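The growth plots in each benchmark section draw a 95% confidence interval around the mean coverage. The report does not say how that interval is computed; one common approach is a percentile bootstrap over the trials at a given time point, sketched here with hypothetical values and a hypothetical helper name `bootstrap_ci`:

```python
import random
import statistics


def bootstrap_ci(samples, confidence=0.95, resamples=2000, seed=0):
    """Percentile-bootstrap confidence interval for the mean of samples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    means = sorted(
        statistics.fmean(rng.choices(samples, k=len(samples)))
        for _ in range(resamples)
    )
    lower_index = int((1 - confidence) / 2 * resamples)
    upper_index = int((1 + confidence) / 2 * resamples) - 1
    return means[lower_index], means[upper_index]


# Hypothetical coverage of one fuzzer's trials at a single time point.
samples = [2104, 2113, 2190, 2195, 2269, 2150, 2180, 2200]
lower, upper = bootstrap_ci(samples)
print(lower, statistics.fmean(samples), upper)
```

Repeating this at every time point and shading between the lower and upper bounds yields an error band of the kind the plots describe.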

vorbis_decode_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: aflfast, centipede, libfuzzer.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libfuzzer 82800 14.0 1268.000000 1.754116 1265.0 1267.00 1268.0 1269.00 1272.0
    aflsmart 82800 17.0 1252.117647 15.591995 1194.0 1252.00 1255.0 1258.00 1262.0
    mopt 82800 16.0 1250.000000 19.572089 1178.0 1250.75 1254.0 1257.25 1261.0
    afl 82800 17.0 1245.941176 15.666328 1201.0 1247.00 1252.0 1254.00 1258.0
    libafl 82800 19.0 1251.894737 3.264195 1245.0 1250.00 1252.0 1253.50 1257.0
    aflfast 82800 15.0 1246.400000 14.695966 1195.0 1246.00 1250.0 1252.50 1256.0
    honggfuzz 82800 16.0 1248.500000 6.673330 1235.0 1245.75 1248.5 1254.00 1259.0
    fairfuzz 82800 18.0 1228.111111 28.060625 1175.0 1208.25 1238.0 1250.75 1257.0
    centipede 82800 15.0 1143.733333 9.917277 1127.0 1136.50 1146.0 1150.00 1162.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row.
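The unique-coverage ranking can be read as two numbers per fuzzer: the total branches it covered (the bar) and the branches no other fuzzer covered (the colored area). A minimal sketch with hypothetical branch sets; `unique_branches` is an illustrative helper, not part of the report's tooling:

```python
def unique_branches(covered):
    """Map each fuzzer to the branches that no other fuzzer covered."""
    result = {}
    for name, branches in covered.items():
        # Union of everything the other fuzzers reached.
        others = set().union(
            *(b for other, b in covered.items() if other != name)
        )
        result[name] = branches - others
    return result


# Hypothetical branch identifiers covered by each fuzzer.
covered = {
    "libafl": {1, 2, 3, 5, 8},
    "afl": {1, 2, 4, 5},
    "honggfuzz": {2, 3, 5, 9},
}

unique = unique_branches(covered)
# Rank by total branches covered, as in the bar chart.
ranking = sorted(covered, key=lambda name: len(covered[name]), reverse=True)
for name in ranking:
    print(name, len(covered[name]), len(unique[name]))
```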

woff2_convert_woff2ttf_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: aflsmart.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    honggfuzz 82800 17.0 1155.882353 27.126745 1113.0 1130.00 1158.0 1177.00 1198.0
    libfuzzer 82800 17.0 1128.941176 56.133179 1032.0 1093.00 1128.0 1169.00 1216.0
    aflsmart 82800 13.0 1105.461538 29.809438 1047.0 1090.00 1123.0 1127.00 1129.0
    mopt 82800 15.0 1109.600000 21.503488 1071.0 1095.00 1116.0 1127.00 1136.0
    afl 82800 16.0 1106.125000 21.171915 1066.0 1098.25 1113.0 1121.75 1133.0
    aflfast 82800 17.0 1090.352941 24.882075 1043.0 1071.00 1099.0 1108.00 1118.0
    centipede 82800 18.0 1075.166667 13.057565 1060.0 1064.25 1072.0 1088.75 1099.0
    fairfuzz 82800 17.0 994.941176 30.150602 959.0 982.00 991.0 1001.00 1098.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row.

zlib_zlib_uncompress_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: libafl, aflsmart.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libfuzzer 82800 16.0 469.687500 3.876747 461.0 470.75 471.0 472.00 472.0
    mopt 82800 15.0 455.066667 12.290918 423.0 455.00 460.0 461.00 469.0
    honggfuzz 82800 18.0 458.777778 3.858612 452.0 458.00 459.0 460.75 468.0
    fairfuzz 82800 16.0 457.875000 3.685557 455.0 455.75 457.0 459.00 470.0
    aflsmart 82800 12.0 455.750000 13.948770 416.0 455.75 456.0 462.00 470.0
    afl 82800 16.0 449.625000 17.450406 406.0 454.75 455.0 459.25 466.0
    eclipser 82800 16.0 451.687500 13.189484 423.0 450.25 455.0 458.00 471.0
    centipede 82800 18.0 453.166667 2.935583 451.0 451.00 452.0 455.00 462.0
    libafl 82800 14.0 448.285714 3.667499 442.0 446.50 449.0 449.75 457.0
    aflfast 82800 17.0 428.352941 37.729202 345.0 401.00 448.0 454.00 458.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row.

experiment data

You can download the raw data for this report here.

Check out the documentation on how to create customized reports using this data. Also see some example Colab notebooks for doing custom analysis on the data here.

Experiment Description:

None