FuzzBench: 2023-10-05-fafuzz report

experiment summary

We show two different aggregate (cross-benchmark) rankings of fuzzers. The first is based on the average of per-benchmarks scores, where the score represents the percentage of the highest reached median code-coverage on a given benchmark (higher value is better). The second ranking shows the average rank of fuzzers, after we rank them on each benchmark according to their median reached code-covereges (lower value is better).
By avg. score
average normalized score
fuzzer
aflplusplus_407 95.23
libafl 95.12
aflplusplus 94.44
fafuzz 94.11
honggfuzz 92.09
mopt 90.77
afl 90.59
aflsmart 89.95
libfuzzer 89.88
aflfast 88.62
fairfuzz 83.83
eclipser 77.73
centipede 65.42
By avg. rank
average rank
fuzzer
aflplusplus_407 2.57
aflplusplus 3.07
libafl 4.36
aflsmart 5.86
libfuzzer 5.93
eclipser 6.43
mopt 7.07
honggfuzz 7.14
afl 7.43
fafuzz 7.93
aflfast 9.50
fairfuzz 9.93
centipede 10.64
  • Critical difference diagram
    The diagram visualizes the average rank of fuzzers (second ranking above) while showing the significance of the differences as well. What is considered a "critical difference" (CD) is based on the Friedman/Nemenyi post-hoc test. See more in the documentation.
    Note: If a fuzzer does not support all benchmarks, its ranking as shown in this diagram can be lower than it should be. So please check the list of supported benchmarks for the fuzzer(s) of your interest. The list could be specified in the fuzzer's README.md like this.
  • Median relative code-coverages on each benchmark

    Note: The relative coverage summary table shows the median relative performance of each fuzzer to the experiment maximum. Thus the highest relative performance may not be 100%.
    trial_relative_coverage = trial_coverage / experiment_max_coverage

      aflplusplus_407 libafl aflplusplus fafuzz honggfuzz mopt afl eclipser aflsmart libfuzzer aflfast fairfuzz centipede
    FuzzerMedian 98.00 97.50 98.00 94.00 95.50 93.00 93.00 93.50 93.00 92.00 92.00 82.50 85.00
    FuzzerMean 92.29 91.86 91.43 91.43 89.21 88.00 87.79 87.75 87.21 87.00 85.86 81.14 80.27
    bloaty_fuzz_target 97.00 98.00 98.00 95.00 94.00 96.00 94.00 94.00 96.00 91.00 93.00 80.00 nan
    curl_curl_fuzzer_http 98.00 97.00 98.00 92.00 98.00 93.00 93.00 94.00 94.00 91.00 93.00 83.00 nan
    freetype2_ftfuzzer 94.00 92.00 90.00 67.00 91.00 66.00 66.00 74.00 66.00 78.00 63.00 62.00 57.00
    harfbuzz_hb-shape-fuzzer 98.00 99.00 98.00 96.00 95.00 96.00 96.00 97.00 97.00 94.00 95.00 87.00 nan
    jsoncpp_jsoncpp_fuzzer 99.00 98.00 99.00 98.00 99.00 98.00 98.00 98.00 98.00 100.00 98.00 98.00 98.00
    libjpeg-turbo_libjpeg_turbo_fuzzer 99.00 99.00 99.00 99.00 99.00 99.00 99.00 nan 82.00 99.00 82.00 82.00 82.00
    mbedtls_fuzz_dtlsclient 74.00 83.00 75.00 72.00 72.00 72.00 72.00 73.00 73.00 72.00 71.00 75.00 71.00
    openssl_x509 99.00 99.00 99.00 99.00 99.00 99.00 99.00 99.00 99.00 99.00 99.00 99.00 99.00
    re2_fuzzer 99.00 98.00 99.00 98.00 98.00 98.00 98.00 99.00 99.00 99.00 99.00 99.00 95.00
    sqlite3_ossfuzz 98.00 90.00 89.00 90.00 71.00 92.00 92.00 93.00 93.00 80.00 90.00 65.00 64.00
    systemd_fuzz-link-parser 99.00 98.00 99.00 91.00 97.00 91.00 91.00 92.00 91.00 80.00 91.00 85.00 97.00
    vorbis_decode_fuzzer 94.00 93.00 94.00 93.00 93.00 93.00 93.00 93.00 93.00 94.00 93.00 93.00 85.00
    woff2_convert_woff2ttf_fuzzer 97.00 96.00 96.00 91.00 96.00 92.00 91.00 nan 93.00 93.00 89.00 81.00 88.00
    zlib_zlib_uncompress_fuzzer 47.00 46.00 47.00 99.00 47.00 47.00 47.00 47.00 47.00 48.00 46.00 47.00 47.00
    • Fuzzers are sorted by "FuzzerMean" (average median relative coverage), highest on the left.
    • Green background = highest relative median coverage.
    • Blue gradient background = greater than 95% relative median coverage.

bloaty_fuzz_target summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libafl 82800 18.0 6370.000000 76.211007 6139.0 6333.25 6382.0 6413.00 6460.0
    aflplusplus 82800 20.0 6356.000000 57.764495 6145.0 6345.50 6358.5 6393.00 6415.0
    aflplusplus_407 82800 20.0 6310.800000 97.262315 6115.0 6245.75 6320.5 6393.75 6449.0
    mopt 82800 20.0 6197.850000 59.793746 6022.0 6164.75 6211.0 6240.25 6277.0
    aflsmart 82800 20.0 6198.450000 127.984981 5937.0 6102.25 6205.0 6295.25 6394.0
    fafuzz 82800 18.0 6162.388889 82.742790 5985.0 6120.75 6146.0 6209.50 6344.0
    honggfuzz 82800 20.0 6100.500000 209.354522 5744.0 5948.25 6131.5 6295.50 6391.0
    eclipser 82800 20.0 6110.200000 114.487232 5900.0 6066.75 6107.0 6144.50 6360.0
    afl 82800 19.0 6072.631579 136.266981 5854.0 5987.00 6073.0 6128.00 6337.0
    aflfast 82800 20.0 6080.100000 119.230824 5858.0 6033.50 6059.0 6136.50 6350.0
    libfuzzer 82800 20.0 5897.050000 169.163414 5559.0 5805.00 5890.5 5988.00 6182.0
    fairfuzz 82800 20.0 5217.750000 121.813868 5060.0 5145.00 5194.5 5272.75 5570.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

curl_curl_fuzzer_http summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus 82800 20.0 10864.950000 64.618292 10747.0 10829.75 10877.0 10894.75 11006.0
    honggfuzz 82800 20.0 10845.700000 108.334327 10689.0 10747.75 10861.0 10926.00 11036.0
    aflplusplus_407 82800 20.0 10853.500000 66.397131 10741.0 10814.00 10835.0 10880.00 11023.0
    libafl 82800 19.0 10758.421053 47.669366 10662.0 10728.50 10753.0 10777.00 10894.0
    eclipser 82800 20.0 10463.900000 37.160959 10415.0 10437.00 10454.0 10491.00 10551.0
    aflsmart 82800 20.0 10456.750000 63.366831 10389.0 10399.00 10436.5 10486.75 10575.0
    afl 82800 20.0 10279.600000 150.768697 9931.0 10235.75 10315.0 10387.25 10446.0
    mopt 82800 20.0 10284.300000 130.227776 9949.0 10260.75 10314.0 10339.00 10464.0
    aflfast 82800 20.0 10272.850000 106.610592 9985.0 10221.25 10305.5 10346.25 10406.0
    fafuzz 82800 19.0 10119.736842 167.875894 9792.0 9984.00 10199.0 10219.50 10307.0
    libfuzzer 82800 20.0 9985.200000 396.894206 9224.0 9836.25 10063.0 10172.00 10585.0
    fairfuzz 82800 20.0 9200.800000 420.958005 7990.0 9035.75 9266.0 9472.50 9906.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

freetype2_ftfuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus_407 82800 20.0 11755.650000 329.879448 10970.0 11634.00 11795.5 11899.00 12406.0
    libafl 82800 19.0 11432.157895 630.934867 10343.0 10881.00 11589.0 11840.50 12325.0
    honggfuzz 82800 20.0 11253.850000 623.984925 9786.0 10822.25 11416.5 11720.50 12085.0
    aflplusplus 82800 20.0 11391.200000 567.364229 10599.0 10947.00 11322.0 11623.50 12483.0
    libfuzzer 82800 20.0 9845.700000 650.336035 8730.0 9404.00 9806.0 10147.75 11277.0
    eclipser 82800 20.0 9242.100000 100.454703 9028.0 9198.75 9263.0 9316.75 9423.0
    fafuzz 82800 20.0 8317.100000 177.011121 7972.0 8252.50 8387.5 8433.25 8513.0
    afl 82800 17.0 8312.705882 200.942083 7789.0 8313.00 8349.0 8441.00 8506.0
    mopt 82800 16.0 8263.625000 178.152322 7927.0 8186.50 8322.5 8395.50 8470.0
    aflsmart 82800 20.0 8244.000000 202.383690 7800.0 8230.50 8304.5 8374.50 8524.0
    aflfast 82800 20.0 7976.350000 222.903159 7640.0 7820.50 7916.0 8204.75 8311.0
    fairfuzz 82800 20.0 7832.950000 112.700570 7683.0 7768.00 7816.0 7859.75 8213.0
    centipede 82800 20.0 7160.400000 177.924112 6873.0 7047.00 7140.5 7259.00 7535.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

harfbuzz_hb-shape-fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
error
The following fuzzers do not have enough samples: honggfuzz.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libafl 82800 16.0 11077.000000 40.331956 11022.0 11045.00 11072.5 11109.25 11142.0
    aflplusplus 82800 20.0 10907.050000 305.379570 9630.0 10940.25 10973.0 11022.25 11067.0
    aflplusplus_407 82800 20.0 10959.000000 40.072303 10893.0 10927.50 10958.5 10992.50 11038.0
    aflsmart 82800 20.0 10815.450000 39.488139 10703.0 10800.25 10821.0 10841.25 10879.0
    eclipser 82800 20.0 10820.150000 43.008292 10754.0 10778.50 10819.5 10861.50 10885.0
    mopt 82800 20.0 10763.250000 67.699705 10607.0 10710.50 10793.5 10818.00 10847.0
    afl 82800 17.0 10744.588235 46.857575 10641.0 10721.00 10755.0 10764.00 10847.0
    fafuzz 82800 20.0 10762.200000 63.343342 10623.0 10727.75 10748.0 10806.00 10869.0
    honggfuzz 82800 5.0 10670.600000 74.214554 10560.0 10642.00 10682.0 10716.00 10753.0
    aflfast 82800 20.0 10670.700000 51.265152 10567.0 10642.75 10669.0 10702.75 10770.0
    libfuzzer 82800 20.0 10539.650000 52.400557 10425.0 10503.50 10541.5 10578.25 10630.0
    fairfuzz 82800 20.0 9728.250000 323.424257 9130.0 9481.50 9724.0 9986.00 10218.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

jsoncpp_jsoncpp_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libfuzzer 82800 20.0 524.950000 0.223607 524.0 525.00 525.0 525.00 525.0
    honggfuzz 82800 20.0 522.450000 1.356272 519.0 522.00 522.0 524.00 524.0
    aflplusplus 82800 20.0 520.000000 0.000000 520.0 520.00 520.0 520.00 520.0
    aflplusplus_407 82800 20.0 519.800000 0.410391 519.0 520.00 520.0 520.00 520.0
    eclipser 82800 20.0 518.900000 1.252366 516.0 518.00 519.5 520.00 520.0
    centipede 82800 20.0 519.650000 1.980829 517.0 518.75 519.0 520.00 524.0
    fafuzz 82800 19.0 516.052632 4.707838 506.0 517.00 518.0 519.00 520.0
    mopt 82800 19.0 516.526316 4.869045 505.0 516.50 518.0 520.00 520.0
    aflsmart 82800 20.0 517.900000 1.165287 516.0 517.00 517.5 519.00 520.0
    afl 82800 18.0 516.388889 3.806170 506.0 517.00 517.0 518.75 520.0
    aflfast 82800 20.0 513.900000 10.622221 478.0 517.00 517.0 519.00 519.0
    fairfuzz 82800 20.0 517.450000 0.998683 516.0 517.00 517.0 517.25 520.0
    libafl 82800 17.0 517.294118 0.919559 516.0 517.00 517.0 517.00 519.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libjpeg-turbo_libjpeg_turbo_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libfuzzer 82800 20.0 3086.200000 1.507874 3083.0 3085.00 3087.0 3087.00 3088.0
    aflplusplus 82800 20.0 3078.200000 15.763382 3013.0 3080.00 3081.5 3084.00 3087.0
    aflplusplus_407 82800 20.0 3081.750000 2.633289 3077.0 3080.00 3081.0 3082.25 3087.0
    libafl 82800 17.0 3077.882353 9.733206 3044.0 3077.00 3081.0 3083.00 3085.0
    afl 82800 20.0 3078.600000 6.218817 3063.0 3076.00 3079.0 3084.00 3086.0
    mopt 82800 19.0 3071.473684 17.658389 3029.0 3072.00 3079.0 3081.00 3085.0
    fafuzz 82800 18.0 3055.222222 28.120655 3004.0 3033.25 3068.0 3078.75 3084.0
    honggfuzz 82800 20.0 3057.050000 19.465151 3007.0 3055.25 3064.0 3067.00 3074.0
    centipede 82800 20.0 2546.050000 0.686333 2545.0 2546.00 2546.0 2546.25 2547.0
    aflsmart 82800 20.0 2545.350000 1.980829 2543.0 2544.00 2545.0 2546.00 2551.0
    aflfast 82800 20.0 2543.900000 1.483240 2541.0 2543.00 2543.5 2545.00 2548.0
    fairfuzz 82800 20.0 2539.950000 4.795557 2533.0 2536.00 2542.0 2542.25 2550.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

mbedtls_fuzz_dtlsclient summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libafl 82800 17.0 3067.588235 347.938403 2692.0 2746.00 3104.0 3343.00 3663.0
    aflplusplus 82800 20.0 2793.150000 33.280744 2723.0 2769.25 2792.0 2819.50 2856.0
    fairfuzz 82800 20.0 2820.700000 203.472771 2730.0 2753.25 2779.0 2794.75 3676.0
    aflplusplus_407 82800 17.0 2767.529412 43.336067 2668.0 2750.00 2767.0 2797.00 2843.0
    eclipser 82800 20.0 2774.450000 217.678052 2702.0 2715.25 2726.0 2733.25 3697.0
    aflsmart 82800 20.0 2722.050000 16.750412 2697.0 2712.50 2720.0 2728.25 2771.0
    libfuzzer 82800 20.0 2725.500000 145.014518 2654.0 2675.00 2692.5 2709.50 3331.0
    honggfuzz 82800 20.0 2695.500000 41.359146 2647.0 2670.00 2681.0 2704.50 2802.0
    afl 82800 19.0 2638.736842 83.088201 2505.0 2547.00 2678.0 2701.50 2725.0
    mopt 82800 19.0 2708.789474 161.161609 2520.0 2668.50 2677.0 2702.50 3352.0
    fafuzz 82800 19.0 2655.263158 86.866335 2373.0 2651.50 2670.0 2683.50 2801.0
    centipede 82800 20.0 2639.900000 12.531644 2621.0 2630.50 2639.0 2644.50 2673.0
    aflfast 82800 20.0 2638.300000 14.026666 2607.0 2630.75 2638.0 2643.50 2675.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

openssl_x509 summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libafl 82800 19.0 5828.368421 4.058058 5821.0 5825.00 5831.0 5831.00 5833.0
    aflsmart 82800 20.0 5830.350000 2.539685 5827.0 5829.00 5830.0 5831.00 5840.0
    eclipser 82800 20.0 5829.450000 3.119970 5817.0 5829.00 5830.0 5831.00 5832.0
    libfuzzer 82800 20.0 5827.200000 5.615767 5817.0 5822.50 5829.0 5832.00 5833.0
    afl 82800 19.0 5824.578947 5.956470 5805.0 5823.50 5827.0 5827.50 5831.0
    aflplusplus 82800 20.0 5823.550000 6.863136 5807.0 5821.00 5825.0 5827.50 5834.0
    centipede 82800 20.0 5823.800000 5.863356 5810.0 5822.00 5825.0 5828.00 5833.0
    aflplusplus_407 82800 20.0 5822.350000 6.714282 5808.0 5815.00 5824.5 5827.00 5831.0
    mopt 82800 17.0 5824.529412 3.793377 5815.0 5823.00 5824.0 5828.00 5830.0
    fafuzz 82800 18.0 5822.388889 5.852490 5811.0 5818.75 5822.5 5825.75 5831.0
    fairfuzz 82800 20.0 5821.500000 2.874571 5817.0 5819.00 5822.0 5823.00 5827.0
    aflfast 82800 20.0 5819.200000 6.304551 5804.0 5816.75 5820.5 5824.00 5827.0
    honggfuzz 82800 20.0 5817.450000 7.917170 5806.0 5810.00 5819.5 5822.00 5834.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

re2_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libfuzzer 82800 20.0 2884.000000 2.492093 2880.0 2882.00 2883.5 2885.00 2891.0
    aflplusplus 82800 20.0 2877.800000 4.926833 2866.0 2876.00 2878.0 2880.25 2885.0
    aflplusplus_407 82800 17.0 2876.882353 4.594594 2861.0 2877.00 2878.0 2879.00 2882.0
    aflsmart 82800 20.0 2868.550000 4.936172 2860.0 2865.75 2868.5 2872.50 2876.0
    eclipser 82800 20.0 2860.850000 23.079439 2785.0 2861.50 2867.5 2872.25 2878.0
    aflfast 82800 20.0 2865.450000 6.270021 2849.0 2864.50 2866.5 2870.00 2873.0
    fairfuzz 82800 20.0 2834.650000 73.055332 2631.0 2856.25 2864.0 2869.00 2875.0
    afl 82800 20.0 2850.450000 31.580265 2778.0 2851.00 2862.0 2870.50 2876.0
    libafl 82800 19.0 2857.052632 7.706229 2845.0 2851.00 2859.0 2863.50 2868.0
    mopt 82800 20.0 2856.300000 7.947856 2843.0 2849.75 2858.5 2862.25 2869.0
    fafuzz 82800 19.0 2849.421053 18.334290 2780.0 2847.00 2850.0 2860.00 2864.0
    honggfuzz 82800 20.0 2851.850000 8.963523 2836.0 2846.50 2850.0 2856.25 2869.0
    centipede 82800 20.0 2772.600000 17.005881 2736.0 2764.75 2772.5 2785.00 2812.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

sqlite3_ossfuzz summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus_407 82800 20.0 19886.800000 911.541124 16646.0 19834.00 20175.0 20384.75 20551.0
    aflsmart 82800 20.0 19264.550000 284.391590 18652.0 19128.25 19284.0 19399.00 19883.0
    eclipser 82800 20.0 19215.250000 317.380707 18729.0 18953.50 19191.5 19428.00 19939.0
    afl 82800 18.0 19000.833333 222.878297 18608.0 18870.00 19045.5 19088.75 19500.0
    mopt 82800 17.0 19018.941176 359.153253 18431.0 18764.00 18988.0 19274.00 19690.0
    aflfast 82800 20.0 18711.000000 281.495069 18104.0 18609.25 18713.5 18918.25 19183.0
    libafl 82800 19.0 18715.368421 203.787256 18360.0 18607.00 18661.0 18836.00 19107.0
    fafuzz 82800 17.0 18537.294118 251.168958 18030.0 18428.00 18547.0 18711.00 18995.0
    aflplusplus 82800 20.0 18666.050000 1523.443795 15066.0 17937.50 18494.0 20214.25 20568.0
    libfuzzer 82800 20.0 16541.150000 377.282426 15825.0 16287.00 16548.5 16731.00 17535.0
    honggfuzz 82800 20.0 14800.300000 476.602019 13799.0 14555.75 14765.0 15077.25 15724.0
    fairfuzz 82800 20.0 13246.650000 1692.282646 10991.0 11657.25 13522.5 14776.25 16178.0
    centipede 82800 18.0 13391.333333 312.381554 12856.0 13199.75 13329.0 13560.00 13974.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

vorbis_decode_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus_407 82800 20.0 1265.950000 4.524495 1260.0 1261.75 1267.0 1270.25 1272.0
    libfuzzer 82800 20.0 1272.000000 16.270736 1264.0 1266.00 1267.0 1269.00 1336.0
    aflplusplus 82800 20.0 1266.350000 2.996050 1259.0 1265.00 1266.0 1268.25 1271.0
    aflsmart 82800 20.0 1255.000000 2.919986 1250.0 1253.00 1254.5 1257.25 1262.0
    mopt 82800 19.0 1247.736842 14.749170 1215.0 1247.50 1254.0 1256.00 1260.0
    libafl 82800 19.0 1251.947368 3.597432 1244.0 1250.50 1253.0 1254.00 1258.0
    afl 82800 18.0 1252.555556 3.791976 1244.0 1251.00 1252.0 1255.75 1258.0
    aflfast 82800 20.0 1246.800000 15.340349 1196.0 1248.50 1252.0 1254.00 1256.0
    eclipser 82800 20.0 1251.950000 4.358597 1244.0 1249.75 1252.0 1255.25 1259.0
    fafuzz 82800 19.0 1249.210526 3.809038 1240.0 1247.50 1250.0 1252.00 1254.0
    fairfuzz 82800 20.0 1234.700000 28.252806 1160.0 1222.50 1249.5 1253.50 1258.0
    honggfuzz 82800 20.0 1249.400000 4.805698 1235.0 1248.00 1249.0 1253.00 1258.0
    centipede 82800 20.0 1149.150000 17.263515 1122.0 1138.50 1145.5 1161.75 1181.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

woff2_convert_woff2ttf_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus_407 82800 17.0 1178.058824 11.850478 1158.0 1170.00 1180.0 1187.00 1194.0
    libafl 82800 20.0 1175.750000 12.276957 1149.0 1167.75 1177.5 1182.25 1198.0
    aflplusplus 82800 20.0 1176.650000 12.261514 1153.0 1170.25 1173.5 1184.50 1200.0
    honggfuzz 82800 20.0 1165.800000 25.071687 1106.0 1159.75 1172.0 1183.00 1196.0
    libfuzzer 82800 20.0 1135.500000 65.774335 1002.0 1097.00 1139.0 1195.00 1214.0
    aflsmart 82800 20.0 1133.600000 7.036746 1116.0 1130.75 1134.5 1138.25 1144.0
    mopt 82800 19.0 1110.894737 15.405247 1082.0 1100.00 1117.0 1123.00 1132.0
    fafuzz 82800 18.0 1096.388889 33.290414 1043.0 1065.75 1113.5 1124.50 1129.0
    afl 82800 19.0 1103.105263 19.570087 1061.0 1088.50 1112.0 1118.00 1123.0
    aflfast 82800 20.0 1083.300000 26.647998 1025.0 1076.00 1089.5 1101.50 1114.0
    centipede 82800 20.0 1076.800000 12.033287 1056.0 1070.25 1073.0 1085.75 1099.0
    fairfuzz 82800 20.0 996.000000 26.789629 966.0 983.00 988.5 998.50 1080.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

zlib_zlib_uncompress_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    fafuzz 82800 20.0 960.950000 4.904080 942.0 960.00 960.5 964.00 965.0
    libfuzzer 82800 20.0 468.650000 4.487116 462.0 463.00 472.0 472.00 472.0
    aflplusplus_407 82800 20.0 461.000000 4.304221 453.0 458.50 460.5 463.00 471.0
    aflplusplus 82800 20.0 462.750000 5.580747 456.0 459.75 460.0 468.50 472.0
    fairfuzz 82800 20.0 460.200000 4.162489 455.0 457.50 460.0 461.25 469.0
    honggfuzz 82800 20.0 460.150000 3.513508 456.0 458.00 459.0 461.25 471.0
    aflsmart 82800 20.0 458.350000 3.082634 455.0 456.00 458.5 459.25 468.0
    eclipser 82800 20.0 456.400000 7.556106 427.0 455.75 458.0 460.25 462.0
    mopt 82800 18.0 454.333333 11.871865 420.0 455.25 457.0 459.00 468.0
    afl 82800 19.0 457.736842 4.066695 449.0 455.00 456.0 460.50 468.0
    centipede 82800 20.0 454.500000 3.804430 451.0 451.00 454.0 457.00 464.0
    aflfast 82800 20.0 451.050000 5.266328 437.0 448.75 451.0 454.25 459.0
    libafl 82800 17.0 450.294118 5.083856 439.0 449.00 450.0 451.00 461.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

experiment data

You can download the raw data for this report here.

Check out the documentation on how to create customized reports using this data. Also see some example Colab notebooks for doing custom analysis on the data here.

Experiment Description:

(None,)