FuzzBench: 2023-05-05-sample report

experiment summary

We show two aggregate (cross-benchmark) rankings of fuzzers. The first is based on the average of per-benchmark scores, where a score is the fuzzer's median reached code coverage expressed as a percentage of the highest median reached on that benchmark (higher is better). The second shows the average rank of each fuzzer, after ranking the fuzzers on each benchmark by their median reached code coverage (lower is better).
By avg. score
fuzzer     average normalized score
libafl     99.71
mopt       86.25
eclipser   85.87
aflsmart   85.35
aflfast    83.70
fairfuzz   80.55
centipede  4.72
By avg. rank
fuzzer     average rank
libafl     2.05
eclipser   2.71
aflsmart   2.76
mopt       3.33
aflfast    4.57
fairfuzz   5.05
centipede  6.67
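
The two aggregate rankings above can be sketched as follows. This is a minimal illustration and not FuzzBench's own implementation: the `medians` table holds hypothetical values, the function names are made up for this example, and sharing the mean position among tied fuzzers is an assumption.

```python
from statistics import mean

# Hypothetical median reached coverage per benchmark and fuzzer
# (illustrative values, not taken from this report).
medians = {
    "bench_a": {"fuzz_x": 100.0, "fuzz_y": 80.0, "fuzz_z": 50.0},
    "bench_b": {"fuzz_x": 90.0, "fuzz_y": 100.0, "fuzz_z": 40.0},
}


def average_normalized_score(medians):
    """Average over benchmarks of 100 * median / best median on that benchmark."""
    scores = {}
    for per_fuzzer in medians.values():
        best = max(per_fuzzer.values())
        for fuzzer, cov in per_fuzzer.items():
            scores.setdefault(fuzzer, []).append(100.0 * cov / best)
    return {fuzzer: mean(vals) for fuzzer, vals in scores.items()}


def average_rank(medians):
    """Average over benchmarks of each fuzzer's rank (1 = highest median).

    Tied fuzzers share the mean of the positions they occupy (an assumption).
    """
    ranks = {}
    for per_fuzzer in medians.values():
        ordered = sorted(per_fuzzer.values(), reverse=True)
        for fuzzer, cov in per_fuzzer.items():
            positions = [i + 1 for i, v in enumerate(ordered) if v == cov]
            ranks.setdefault(fuzzer, []).append(mean(positions))
    return {fuzzer: mean(vals) for fuzzer, vals in ranks.items()}


print(average_normalized_score(medians))
print(average_rank(medians))
```

In this toy input `fuzz_x` and `fuzz_y` end up with the same average rank but different average scores, which shows why the two rankings can order fuzzers differently.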
  • Critical difference diagram
    The diagram visualizes the average rank of fuzzers (second ranking above) while showing the significance of the differences as well. What is considered a "critical difference" (CD) is based on the Friedman/Nemenyi post-hoc test. See more in the documentation.
    Note: If a fuzzer does not support all benchmarks, its ranking in this diagram can be lower than it should be. Please check the list of supported benchmarks for the fuzzers you are interested in. The list can be specified in the fuzzer's README.md.
  • Median relative code-coverages on each benchmark

    Note: The relative coverage summary table shows each fuzzer's median performance relative to the experiment maximum. Because the median is taken over trials, the highest relative performance may be below 100%.
    trial_relative_coverage = trial_coverage / experiment_max_coverage

      centipede libafl eclipser mopt aflsmart aflfast fairfuzz
    FuzzerMedian 99.00 98.00 96.00 95.00 96.00 94.00 87.00
    FuzzerMean 99.00 95.86 91.16 83.29 82.62 81.05 77.86
    bloaty_fuzz_target nan 98.00 94.00 96.00 96.00 94.00 80.00
    curl_curl_fuzzer_http nan 98.00 96.00 96.00 96.00 95.00 85.00
    freetype2_ftfuzzer nan 90.00 74.00 67.00 67.00 64.00 63.00
    harfbuzz_hb-shape-fuzzer nan 99.00 97.00 97.00 97.00 96.00 87.00
    jsoncpp_jsoncpp_fuzzer nan 98.00 98.00 98.00 98.00 98.00 98.00
    lcms_cms_transform_fuzzer nan 90.00 78.00 62.00 40.00 28.00 50.00
    libjpeg-turbo_libjpeg_turbo_fuzzer nan 99.00 nan 99.00 99.00 99.00 99.00
    libpcap_fuzz_both nan 90.00 85.00 1.00 1.00 1.00 1.00
    libpng_libpng_read_fuzzer nan 95.00 95.00 95.00 95.00 94.00 95.00
    libxml2_xml nan 99.00 98.00 97.00 98.00 97.00 90.00
    libxslt_xpath nan 98.00 98.00 95.00 98.00 96.00 99.00
    openh264_decoder_fuzzer nan 99.00 99.00 99.00 99.00 99.00 90.00
    openssl_x509 nan 99.00 99.00 99.00 99.00 99.00 99.00
    openthread_ot-ip6-send-fuzzer nan 82.00 76.00 73.00 73.00 72.00 69.00
    proj4_proj_crs_to_crs_fuzzer nan 96.00 66.00 10.00 10.00 10.00 10.00
    re2_fuzzer nan 99.00 99.00 99.00 99.00 99.00 99.00
    sqlite3_ossfuzz nan 92.00 96.00 95.00 96.00 93.00 67.00
    stb_stbi_read_fuzzer nan 97.00 93.00 88.00 88.00 87.00 87.00
    systemd_fuzz-link-parser 99.00 100.00 93.00 92.00 92.00 92.00 86.00
    vorbis_decode_fuzzer nan 98.00 98.00 98.00 99.00 98.00 98.00
    woff2_convert_woff2ttf_fuzzer nan 97.00 nan 93.00 95.00 91.00 83.00
    • Fuzzers are sorted by "FuzzerMean" (average median relative coverage), highest on the left.
    • Green background = highest relative median coverage.
    • Blue gradient background = greater than 95% relative median coverage.
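
The relative coverage formula in the note above can be illustrated with a short sketch. It assumes that `experiment_max_coverage` is the highest coverage reached by any trial of any fuzzer on the benchmark; the function name and the input values are hypothetical.

```python
from statistics import median


def median_relative_coverage(trials_by_fuzzer):
    """Median of trial_coverage / experiment_max_coverage per fuzzer, in percent."""
    # Assumed definition: the maximum over every trial of every fuzzer.
    experiment_max = max(max(trials) for trials in trials_by_fuzzer.values())
    return {
        fuzzer: 100.0 * median(cov / experiment_max for cov in trials)
        for fuzzer, trials in trials_by_fuzzer.items()
    }


# Hypothetical per-trial coverage values for one benchmark.
trials = {"fuzz_x": [90, 95, 100], "fuzz_y": [40, 50, 60]}
print(median_relative_coverage(trials))
```

Even the best fuzzer in this toy input scores below 100%, because only one of its trials reaches the experiment maximum while its median trial does not.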

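For the critical difference diagram described above, the width of a "critical difference" under the Nemenyi post-hoc test can be sketched as below. This is an illustration under stated assumptions: a significance level of 0.05 (the report does not state one), every fuzzer ranked on every benchmark (not the case here, since some cells are nan), and a hypothetical function name. The tool that generated the report may compute the value differently.

```python
import math

# Critical values q_alpha for the Nemenyi test at alpha = 0.05, indexed by the
# number of compared methods k (as tabulated by Demsar, 2006).
Q_ALPHA_005 = {
    2: 1.960, 3: 2.343, 4: 2.569, 5: 2.728, 6: 2.850,
    7: 2.949, 8: 3.031, 9: 3.102, 10: 3.164,
}


def nemenyi_critical_difference(num_fuzzers, num_benchmarks):
    """CD = q_alpha * sqrt(k * (k + 1) / (6 * N)) for k fuzzers and N benchmarks."""
    q_alpha = Q_ALPHA_005[num_fuzzers]
    return q_alpha * math.sqrt(
        num_fuzzers * (num_fuzzers + 1) / (6.0 * num_benchmarks)
    )


# Seven fuzzers and 21 benchmarks, matching the dimensions of the summary table.
cd = nemenyi_critical_difference(7, 21)
print(round(cd, 3))
```

Two fuzzers whose average ranks differ by less than this value would not be considered significantly different under these assumptions.
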
bloaty_fuzz_target summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 82800 20.0 6353.80 71.997515 6140.0 6344.75 6371.5 6388.00 6442.0
    mopt 82800 20.0 6236.05 71.516266 6071.0 6226.50 6243.5 6268.50 6354.0
    aflsmart 82800 20.0 6198.45 127.984981 5937.0 6102.25 6205.0 6295.25 6394.0
    eclipser 82800 20.0 6110.20 114.487232 5900.0 6066.75 6107.0 6144.50 6360.0
    aflfast 82800 20.0 6080.10 119.230824 5858.0 6033.50 6059.0 6136.50 6350.0
    fairfuzz 82800 20.0 5217.75 121.813868 5060.0 5145.00 5194.5 5272.75 5570.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row
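
The Vargha-Delaney A12 value mentioned above is the probability that a randomly chosen trial of one fuzzer reaches more coverage than a randomly chosen trial of another, with ties counted as half. A minimal sketch (the function name and sample values are hypothetical):

```python
def vargha_delaney_a12(xs, ys):
    """Estimate P(x > y) + 0.5 * P(x == y) for x drawn from xs and y from ys."""
    greater = sum(1 for x in xs for y in ys if x > y)
    ties = sum(1 for x in xs for y in ys if x == y)
    return (greater + 0.5 * ties) / (len(xs) * len(ys))


# Hypothetical final coverage values of two fuzzers (row vs. column).
row_fuzzer = [3, 4, 5]
col_fuzzer = [1, 2, 3]
print(vargha_delaney_a12(row_fuzzer, col_fuzzer))
```

A value of 0.5 means neither fuzzer tends to outperform the other; values near 1 favor the row fuzzer and values near 0 favor the column fuzzer.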

curl_curl_fuzzer_http summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 82800 20.0 10690.00 54.699177 10599.0 10655.25 10684.5 10723.75 10802.0
    eclipser 82800 20.0 10463.90 37.160959 10415.0 10437.00 10454.0 10491.00 10551.0
    aflsmart 82800 20.0 10456.75 63.366831 10389.0 10399.00 10436.5 10486.75 10575.0
    mopt 82800 20.0 10338.90 115.764234 10036.0 10331.75 10376.0 10408.50 10441.0
    aflfast 82800 20.0 10272.85 106.610592 9985.0 10221.25 10305.5 10346.25 10406.0
    fairfuzz 82800 20.0 9200.80 420.958005 7990.0 9035.75 9266.0 9472.50 9906.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row
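
To make the pairwise Mann-Whitney U test above concrete, here is a standard-library sketch that uses the normal approximation without tie or continuity corrections. It is a simplification for illustration; in practice a library routine such as `scipy.stats.mannwhitneyu` is the usual choice, and its p values can differ slightly from this approximation. The function name and sample values are hypothetical.

```python
import math


def mann_whitney_u(xs, ys):
    """Two-sided Mann-Whitney U test via the normal approximation.

    Returns (U, p). No tie or continuity correction is applied.
    """
    n1, n2 = len(xs), len(ys)
    # Count pairs in which x beats y; ties contribute one half.
    u1 = sum((x > y) + 0.5 * (x == y) for x in xs for y in ys)
    u = min(u1, n1 * n2 - u1)
    mu = n1 * n2 / 2.0
    sigma = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12.0)
    z = (u - mu) / sigma
    # Two-sided tail probability of a standard normal variable.
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return u, p


# Hypothetical coverage samples: 20 trials each, fully separated.
high = list(range(100, 120))
low = list(range(0, 20))
u, p = mann_whitney_u(high, low)
print(u, p < 0.05)
```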

freetype2_ftfuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 82800 20.0 11258.70 576.622023 10133.0 10872.00 11183.5 11721.75 12353.0
    eclipser 82800 20.0 9242.10 100.454703 9028.0 9198.75 9263.0 9316.75 9423.0
    mopt 82800 20.0 8332.40 177.225815 7888.0 8320.75 8377.5 8442.00 8504.0
    aflsmart 82800 20.0 8244.00 202.383690 7800.0 8230.50 8304.5 8374.50 8524.0
    aflfast 82800 20.0 7976.35 222.903159 7640.0 7820.50 7916.0 8204.75 8311.0
    fairfuzz 82800 20.0 7832.95 112.700570 7683.0 7768.00 7816.0 7859.75 8213.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row
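
The two unique coverage views described above reduce to set differences. A minimal sketch, assuming each fuzzer's coverage is available as a set of branch identifiers (the function names and the data are hypothetical):

```python
def unique_branches(covered):
    """For each fuzzer, the branches that no other fuzzer covered."""
    result = {}
    for fuzzer, branches in covered.items():
        others = set().union(
            *(b for name, b in covered.items() if name != fuzzer)
        )
        result[fuzzer] = branches - others
    return result


def pairwise_unique(covered):
    """cell[row][col] = number of branches covered by `col` but not by `row`."""
    return {
        row: {col: len(covered[col] - covered[row]) for col in covered}
        for row in covered
    }


# Hypothetical branch sets per fuzzer.
covered = {"fuzz_x": {1, 2, 3}, "fuzz_y": {2, 3, 4}, "fuzz_z": {3}}
print(unique_branches(covered))
print(pairwise_unique(covered)["fuzz_z"]["fuzz_x"])
```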

harfbuzz_hb-shape-fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 82800 20.0 11027.85 34.580608 10961.0 11006.50 11026.5 11049.75 11087.0
    aflsmart 82800 20.0 10815.45 39.488139 10703.0 10800.25 10821.0 10841.25 10879.0
    eclipser 82800 20.0 10820.15 43.008292 10754.0 10778.50 10819.5 10861.50 10885.0
    mopt 82800 20.0 10796.70 40.874777 10699.0 10777.25 10801.5 10821.00 10864.0
    aflfast 82800 20.0 10670.70 51.265152 10567.0 10642.75 10669.0 10702.75 10770.0
    fairfuzz 82800 20.0 9728.25 323.424257 9130.0 9481.50 9724.0 9986.00 10218.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row
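
The 95% confidence interval around the mean, shown as error bands in the growth plots, can be approximated with a Student-t interval. This is a sketch under assumptions: the report does not say how the bands are computed (plotting libraries often use bootstrapping instead), the critical value is hard-coded for 20 trials, and the function name and sample values are hypothetical.

```python
import math
from statistics import mean, stdev

# Two-sided 95% Student-t critical value for 19 degrees of freedom (20 trials).
T_975_DF19 = 2.093


def mean_confidence_interval(samples, t_crit=T_975_DF19):
    """Return (low, high) bounds of a t-based confidence interval for the mean."""
    center = mean(samples)
    half_width = t_crit * stdev(samples) / math.sqrt(len(samples))
    return center - half_width, center + half_width


# Hypothetical coverage values of 20 trials at one point in time.
samples = list(range(1, 21))
low, high = mean_confidence_interval(samples)
print(low, high)
```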

jsoncpp_jsoncpp_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    eclipser 82800 20.0 518.90 1.252366 516.0 518.0 519.5 520.00 520.0
    mopt 82800 20.0 517.20 3.955010 505.0 518.0 518.0 519.00 520.0
    aflsmart 82800 20.0 517.90 1.165287 516.0 517.0 517.5 519.00 520.0
    aflfast 82800 20.0 513.90 10.622221 478.0 517.0 517.0 519.00 519.0
    fairfuzz 82800 20.0 517.45 0.998683 516.0 517.0 517.0 517.25 520.0
    libafl 82800 20.0 517.25 0.850696 516.0 517.0 517.0 517.00 519.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row
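
The columns of the sample statistics table above (count, mean, std, min, quartiles, max) can be reproduced for one fuzzer's trials with the standard library. This sketch assumes a sample standard deviation and linearly interpolated quartiles, which is the common convention for such summaries; the function name and input values are hypothetical.

```python
from statistics import mean, quantiles, stdev


def describe(samples):
    """Summary statistics in the same column order as the table."""
    # "inclusive" interpolates linearly between the sorted data points.
    q1, med, q3 = quantiles(samples, n=4, method="inclusive")
    return {
        "count": len(samples),
        "mean": mean(samples),
        "std": stdev(samples),  # sample standard deviation (n - 1 denominator)
        "min": min(samples),
        "25%": q1,
        "median": med,
        "75%": q3,
        "max": max(samples),
    }


# Hypothetical coverage values from five trials.
print(describe([1, 2, 3, 4, 5]))
```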

lcms_cms_transform_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 82800 20.0 2076.20 108.050476 1894.0 1999.00 2071.0 2161.25 2287.0
    eclipser 82800 20.0 1736.00 135.909025 1522.0 1573.00 1789.0 1838.50 1904.0
    mopt 82800 20.0 1230.00 430.050425 651.0 845.25 1439.5 1604.00 1765.0
    fairfuzz 82800 20.0 1181.85 428.681488 652.0 790.25 1148.5 1575.75 1919.0
    aflsmart 82800 20.0 1123.25 441.957235 649.0 673.75 918.0 1575.50 1795.0
    aflfast 82800 20.0 635.60 19.277475 569.0 637.75 642.5 646.00 657.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libjpeg-turbo_libjpeg_turbo_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    aflsmart 82800 20.0 2545.35 1.980829 2543.0 2544.0 2545.0 2546.00 2551.0
    mopt 82800 20.0 2544.45 1.099043 2543.0 2544.0 2544.0 2545.00 2547.0
    aflfast 82800 20.0 2543.90 1.483240 2541.0 2543.0 2543.5 2545.00 2548.0
    libafl 82800 20.0 2543.45 0.604805 2543.0 2543.0 2543.0 2544.00 2545.0
    fairfuzz 82800 20.0 2539.95 4.795557 2533.0 2536.0 2542.0 2542.25 2550.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libpcap_fuzz_both summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 82800 20.0 2734.20 108.331655 2527.0 2678.00 2715.0 2773.50 2989.0
    eclipser 82800 20.0 2506.05 205.275547 1990.0 2411.75 2553.5 2590.75 2872.0
    aflfast 82800 20.0 40.30 4.231461 34.0 34.00 43.0 43.00 43.0
    fairfuzz 82800 20.0 38.85 4.404244 33.0 33.00 42.0 42.00 42.0
    mopt 82800 20.0 37.40 4.357691 34.0 34.00 34.0 43.00 43.0
    aflsmart 82800 20.0 34.80 3.693522 33.0 33.00 33.0 33.00 42.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libpng_libpng_read_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 82800 20.0 1992.85 27.054088 1949.0 1976.75 1997.5 2002.00 2088.0
    aflsmart 82800 20.0 1995.40 5.490662 1974.0 1994.00 1996.0 1998.25 2000.0
    eclipser 82800 20.0 1990.45 11.250614 1972.0 1978.00 1994.0 1998.25 2003.0
    mopt 82800 20.0 1978.85 26.776413 1911.0 1975.00 1990.0 1993.50 2008.0
    fairfuzz 82800 20.0 1984.10 13.814180 1945.0 1974.75 1988.5 1995.00 2000.0
    aflfast 82800 20.0 1967.60 20.301413 1943.0 1948.25 1964.0 1989.50 1997.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libxml2_xml summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 82800 20.0 15569.30 37.629915 15515.0 15539.00 15570.5 15584.50 15652.0
    eclipser 82800 20.0 15423.45 79.669366 15257.0 15387.50 15423.5 15463.75 15601.0
    aflsmart 82800 20.0 15366.80 68.816767 15248.0 15322.50 15366.5 15410.75 15497.0
    mopt 82800 20.0 15318.60 62.704319 15155.0 15285.50 15326.0 15366.25 15397.0
    aflfast 82800 20.0 15320.65 78.755635 15159.0 15271.75 15318.0 15356.00 15509.0
    fairfuzz 82800 20.0 14146.65 259.273738 13616.0 14060.75 14144.0 14227.25 14834.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libxslt_xpath summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    fairfuzz 82800 20.0 10883.25 144.964197 10516.0 10827.75 10942.5 10959.25 11047.0
    aflsmart 82800 20.0 10894.55 51.799690 10814.0 10859.25 10901.0 10915.00 11026.0
    eclipser 82800 20.0 10895.65 65.938787 10729.0 10862.50 10897.5 10938.25 11010.0
    libafl 82800 20.0 10848.90 87.106650 10675.0 10784.00 10868.0 10895.00 11026.0
    aflfast 82800 20.0 10636.05 92.919022 10363.0 10591.25 10649.5 10692.75 10786.0
    mopt 82800 20.0 10571.50 126.561198 10325.0 10484.75 10580.0 10646.00 10781.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

openh264_decoder_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    mopt 82800 20.0 9539.10 13.540971 9517.0 9525.75 9540.5 9548.25 9563.0
    libafl 82800 20.0 9536.95 11.315918 9498.0 9531.00 9539.0 9543.00 9550.0
    aflsmart 82800 20.0 9518.75 42.213586 9433.0 9511.75 9533.5 9543.00 9582.0
    eclipser 82800 20.0 9518.45 26.296838 9441.0 9515.25 9526.5 9533.75 9541.0
    aflfast 82800 20.0 9509.10 52.625689 9380.0 9493.00 9525.5 9551.00 9561.0
    fairfuzz 82800 20.0 8690.75 181.798001 8400.0 8579.25 8713.0 8780.50 9055.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

openssl_x509 summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    aflsmart 82800 20.0 5830.35 2.539685 5827.0 5829.00 5830.0 5831.0 5840.0
    eclipser 82800 20.0 5829.45 3.119970 5817.0 5829.00 5830.0 5831.0 5832.0
    mopt 82800 20.0 5824.90 6.290343 5810.0 5823.00 5828.0 5829.0 5829.0
    libafl 82800 20.0 5825.80 4.274773 5821.0 5822.50 5824.0 5829.5 5833.0
    fairfuzz 82800 20.0 5821.50 2.874571 5817.0 5819.00 5822.0 5823.0 5827.0
    aflfast 82800 20.0 5819.20 6.304551 5804.0 5816.75 5820.5 5824.0 5827.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

openthread_ot-ip6-send-fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 82800 20.0 3358.10 345.626342 3018.0 3047.50 3270.0 3570.25 3967.0
    eclipser 82800 20.0 3022.20 46.183387 2911.0 3010.75 3038.5 3046.75 3087.0
    mopt 82800 20.0 2945.55 143.939854 2807.0 2902.25 2914.0 2923.75 3490.0
    aflsmart 82800 20.0 2889.60 38.982317 2826.0 2836.50 2909.5 2913.25 2934.0
    aflfast 82800 20.0 2877.30 39.174508 2824.0 2830.75 2892.5 2908.75 2927.0
    fairfuzz 82800 20.0 2756.10 75.288847 2628.0 2724.25 2751.5 2788.50 2907.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

proj4_proj_crs_to_crs_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 82800 20.0 7216.70 122.786550 6946.0 7142.25 7209.5 7274.25 7435.0
    eclipser 82800 20.0 4890.40 462.103247 3876.0 4703.25 4924.5 5103.75 5696.0
    fairfuzz 82800 20.0 780.45 74.632768 497.0 747.75 807.5 818.25 826.0
    aflsmart 82800 20.0 784.00 73.153767 497.0 805.00 806.5 818.00 819.0
    mopt 82800 20.0 770.30 74.056948 497.0 740.75 806.5 815.00 820.0
    aflfast 82800 20.0 738.30 115.000275 475.0 731.75 794.5 806.50 817.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

re2_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    aflsmart 82800 20.0 2868.55 4.936172 2860.0 2865.75 2868.5 2872.50 2876.0
    eclipser 82800 20.0 2860.85 23.079439 2785.0 2861.50 2867.5 2872.25 2878.0
    aflfast 82800 20.0 2865.45 6.270021 2849.0 2864.50 2866.5 2870.00 2873.0
    fairfuzz 82800 20.0 2834.65 73.055332 2631.0 2856.25 2864.0 2869.00 2875.0
    mopt 82800 20.0 2856.30 7.947856 2843.0 2849.75 2858.5 2862.25 2869.0
    libafl 82800 20.0 2848.90 9.883000 2820.0 2844.50 2849.5 2854.00 2864.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

sqlite3_ossfuzz summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    aflsmart 82800 20.0 19264.55 284.391590 18652.0 19128.25 19284.0 19399.00 19883.0
    eclipser 82800 20.0 19215.25 317.380707 18729.0 18953.50 19191.5 19428.00 19939.0
    mopt 82800 20.0 19038.10 288.714191 18474.0 18934.50 19060.5 19248.75 19483.0
    aflfast 82800 20.0 18711.00 281.495069 18104.0 18609.25 18713.5 18918.25 19183.0
    libafl 82800 20.0 18522.75 101.281720 18354.0 18454.50 18517.5 18569.25 18727.0
    fairfuzz 82800 20.0 13246.65 1692.282646 10991.0 11657.25 13522.5 14776.25 16178.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell shows the number of code branches covered by the fuzzer in the column but not by the fuzzer in the row.
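
The pairwise Mann-Whitney U test mentioned above compares two samples of reached coverage without assuming a normal distribution. The sketch below is a standard-library approximation, not the report's implementation: it uses the normal approximation with a tie correction and no continuity correction. The function name `mann_whitney_u` and the toy samples are assumptions for illustration. In practice a library routine such as `scipy.stats.mannwhitneyu` would normally be preferred.

```python
import math


def mann_whitney_u(xs, ys):
    """Two-sided Mann-Whitney U test using the normal approximation.

    Returns (U statistic for xs, approximate p-value). Tied values get
    average ranks and the variance is tie-corrected.
    """
    n1, n2 = len(xs), len(ys)
    if n1 == 0 or n2 == 0:
        raise ValueError("both samples must be non-empty")
    # Pool both samples, remembering which group each value came from.
    pooled = sorted(
        (value, group)
        for group, sample in enumerate((xs, ys))
        for value in sample
    )
    n = n1 + n2
    rank_sum_x = 0.0
    tie_term = 0.0
    i = 0
    while i < n:
        j = i
        while j < n and pooled[j][0] == pooled[i][0]:
            j += 1
        avg_rank = (i + 1 + j) / 2.0  # mean of the 1-based ranks i+1 .. j
        t = j - i
        tie_term += t ** 3 - t
        rank_sum_x += avg_rank * sum(1 for k in range(i, j) if pooled[k][1] == 0)
        i = j
    u_x = rank_sum_x - n1 * (n1 + 1) / 2.0
    mean_u = n1 * n2 / 2.0
    var_u = n1 * n2 / 12.0 * ((n + 1) - tie_term / (n * (n - 1)))
    if var_u == 0:
        return u_x, 1.0  # every value is identical
    z = (u_x - mean_u) / math.sqrt(var_u)
    return u_x, math.erfc(abs(z) / math.sqrt(2))


# Toy samples of 20 trials each (illustrative only).
high = list(range(100, 120))
low = list(range(80, 100))
u_stat, p_value = mann_whitney_u(high, low)
```

A pair would then be flagged as significantly different when `p_value` falls below a chosen threshold; 0.05 is a common choice, assumed here rather than taken from the report.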

stb_stbi_read_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer    time (s)  count  mean     std        min     25%     median  75%      max
    libafl    82800     20.0   2178.45  44.681887  2111.0  2135.5  2192.0  2197.25  2255.0
    eclipser  82800     20.0   2109.45  30.165290  2080.0  2098.5  2106.0  2109.25  2192.0
    aflsmart  82800     20.0   2016.00  29.568831  1983.0  2004.0  2005.5  2008.00  2090.0
    mopt      82800     20.0   2003.60  38.368573  1953.0  1985.5  1994.0  2004.25  2088.0
    aflfast   82800     20.0   1983.90  30.450823  1903.0  1974.0  1984.0  2000.25  2065.0
    fairfuzz  82800     20.0   1967.25  49.496278  1854.0  1938.5  1983.0  1996.50  2086.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell shows the number of code branches covered by the fuzzer in the column but not by the fuzzer in the row.
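
Both unique-coverage views described above reduce to set differences over the branches each fuzzer covered. The sketch below illustrates this on toy data; the fuzzer names, branch IDs, and function names are hypothetical and do not come from the report.

```python
def pairwise_unique_coverage(covered):
    """Map (row, column) to the number of branches the column fuzzer
    covers but the row fuzzer does not."""
    return {
        (row, col): len(covered[col] - covered[row])
        for row in covered
        for col in covered
        if row != col
    }


def globally_unique(covered):
    """Map each fuzzer to the branches that no other fuzzer covered."""
    unique = {}
    for name, branches in covered.items():
        others = set().union(*(b for n, b in covered.items() if n != name))
        unique[name] = branches - others
    return unique


# Toy branch IDs (hypothetical).
covered = {
    "fuzzer_a": {1, 2, 3, 4},
    "fuzzer_b": {2, 3, 5},
    "fuzzer_c": {3, 4, 5, 6},
}
matrix = pairwise_unique_coverage(covered)
unique = globally_unique(covered)
```

The pairwise matrix is not symmetric: `matrix[("fuzzer_a", "fuzzer_b")]` counts what `fuzzer_b` adds over `fuzzer_a`, while the reversed key counts the opposite direction.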

vorbis_decode_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer    time (s)  count  mean     std        min     25%      median  75%      max
    aflsmart  82800     20.0   1255.00  2.919986   1250.0  1253.00  1254.5  1257.25  1262.0
    mopt      82800     20.0   1250.20  12.722132  1199.0  1250.00  1254.0  1255.25  1258.0
    aflfast   82800     20.0   1246.80  15.340349  1196.0  1248.50  1252.0  1254.00  1256.0
    eclipser  82800     20.0   1251.95  4.358597   1244.0  1249.75  1252.0  1255.25  1259.0
    libafl    82800     20.0   1253.50  4.968321   1246.0  1251.50  1252.0  1257.00  1267.0
    fairfuzz  82800     20.0   1234.70  28.252806  1160.0  1222.50  1249.5  1253.50  1258.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell shows the number of code branches covered by the fuzzer in the column but not by the fuzzer in the row.
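
The columns of the sample statistics table above (count, mean, std, min, quartiles, max) can be reproduced for a single fuzzer from its per-trial coverage values. The sketch below uses only the Python standard library. It assumes the sample standard deviation and linearly interpolated quartiles, which is an assumption about how the table was produced. The helper name `describe` and the trial values are illustrative.

```python
import statistics


def describe(values):
    """Return count, mean, sample std, min, quartiles and max for one sample."""
    # "inclusive" interpolates linearly between order statistics.
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return {
        "count": float(len(values)),
        "mean": statistics.fmean(values),
        "std": statistics.stdev(values),
        "min": float(min(values)),
        "25%": q1,
        "median": median,
        "75%": q3,
        "max": float(max(values)),
    }


toy_trials = [1250, 1253, 1254, 1255, 1262]  # illustrative values only
stats = describe(toy_trials)
```

Comparing `std` against the spread between `25%` and `75%` is a quick way to spot fuzzers whose results are dominated by a few outlier trials.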

woff2_convert_woff2ttf_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
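
The report does not state how the error bands are computed. As a rough check, a 95% interval around a mean can be approximated from the mean, standard deviation, and trial count listed in the sample statistics table. The sketch below assumes independent trials and a normal approximation; both are assumptions, and the function name `mean_confidence_interval` is illustrative. The inputs are the end-of-experiment libafl values for this benchmark.

```python
import math
from statistics import NormalDist


def mean_confidence_interval(mean, std, count, level=0.95):
    """Normal-approximation confidence interval for a sample mean."""
    if count < 2:
        raise ValueError("need at least two trials")
    z = NormalDist().inv_cdf(0.5 + level / 2.0)
    half_width = z * std / math.sqrt(count)
    return mean - half_width, mean + half_width


# libafl on woff2_convert_woff2ttf_fuzzer: mean 1166.55, std 13.743802, 20 trials.
low, high = mean_confidence_interval(1166.55, 13.743802, 20)
```

With only 20 trials, a t-distribution or a bootstrap would give a somewhat wider interval than this approximation.
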
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer    time (s)  count  mean     std        min     25%      median  75%      max
    libafl    82800     20.0   1166.55  13.743802  1146.0  1157.75  1163.5  1178.25  1190.0
    aflsmart  82800     20.0   1133.60  7.036746   1116.0  1130.75  1134.5  1138.25  1144.0
    mopt      82800     20.0   1114.45  16.040984  1089.0  1102.50  1116.0  1128.25  1136.0
    aflfast   82800     20.0   1083.30  26.647998  1025.0  1076.00  1089.5  1101.50  1114.0
    fairfuzz  82800     20.0   996.00   26.789629  966.0   983.00   988.5   998.50   1080.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell shows the number of code branches covered by the fuzzer in the column but not by the fuzzer in the row.

experiment data

You can download the raw data for this report here.

Check out the documentation on how to create customized reports using this data. Also see some example Colab notebooks for doing custom analysis on the data here.
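
As an illustration of the kind of custom analysis the raw data allows, the sketch below ranks fuzzers on one benchmark by median coverage at the end of the experiment. The CSV layout, including the column names `fuzzer`, `benchmark`, `trial_id`, `time`, and `edges_covered`, is an assumption and should be checked against the downloaded file. All rows are toy values, and `rank_by_median` is a hypothetical helper.

```python
import csv
import io
from collections import defaultdict
from statistics import median

# Hypothetical CSV layout: the column names and rows below are assumptions.
RAW = """fuzzer,benchmark,trial_id,time,edges_covered
fuzzer_a,demo_benchmark,1,82800,1200
fuzzer_a,demo_benchmark,2,82800,1210
fuzzer_a,demo_benchmark,3,82800,1190
fuzzer_b,demo_benchmark,4,82800,1100
fuzzer_b,demo_benchmark,5,82800,1150
fuzzer_b,demo_benchmark,6,82800,1120
fuzzer_a,demo_benchmark,1,900,400
"""


def rank_by_median(rows, benchmark, end_time):
    """Rank fuzzers on one benchmark by median coverage at end_time, best first."""
    samples = defaultdict(list)
    for row in rows:
        if row["benchmark"] == benchmark and int(row["time"]) == end_time:
            samples[row["fuzzer"]].append(int(row["edges_covered"]))
    medians = {fuzzer: median(values) for fuzzer, values in samples.items()}
    return sorted(medians.items(), key=lambda item: item[1], reverse=True)


rows = list(csv.DictReader(io.StringIO(RAW)))
ranking = rank_by_median(rows, "demo_benchmark", 82800)
```

Filtering on the final snapshot time keeps earlier snapshots of the same trial from being counted as separate samples.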

Experiment Description:

None