FuzzBench: 2024-08-17-2028-bases-1 report

experiment summary

We show two aggregate (cross-benchmark) rankings of fuzzers. The first is based on the average of per-benchmark scores, where a score is the fuzzer's median reached code coverage expressed as a percentage of the highest median reached on that benchmark (higher is better). The second is the average rank of each fuzzer after ranking the fuzzers on each benchmark by their median reached code coverage (lower is better).
By avg. score
fuzzer average normalized score
aflplusplus 89.83
libafl 88.67
libfuzzer 88.17
afl 80.32
By avg. rank
fuzzer average rank
aflplusplus 1.67
libafl 1.95
libfuzzer 2.81
afl 3.43
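
The two aggregate rankings can be sketched in a few lines. This is a minimal illustration, not FuzzBench's own implementation: it uses the median coverage of only three benchmarks taken from the per-benchmark tables in this report, so its output will not match the full tables above, and the helper names `normalized_scores`, `ranks`, and `aggregate` are made up for this example.

```python
from statistics import mean

# Median reached coverage, copied from three per-benchmark tables in this
# report. This is a subset chosen for illustration only.
MEDIANS = {
    "bloaty_fuzz_target": {
        "aflplusplus": 5555.0, "libafl": 5377.0, "afl": 5200.0, "libfuzzer": 5066.0,
    },
    "jsoncpp_jsoncpp_fuzzer": {
        "libfuzzer": 525.0, "aflplusplus": 520.0, "libafl": 517.0, "afl": 516.5,
    },
    "zlib_zlib_uncompress_fuzzer": {
        "aflplusplus": 459.0, "libfuzzer": 450.0, "afl": 445.5, "libafl": 428.0,
    },
}


def normalized_scores(medians):
    """Score = percentage of the best median on this benchmark."""
    best = max(medians.values())
    return {fuzzer: 100.0 * value / best for fuzzer, value in medians.items()}


def ranks(medians):
    """Rank 1 = highest median; tied fuzzers share the average of their ranks."""
    ordered = sorted(medians, key=medians.get, reverse=True)
    result = {}
    i = 0
    while i < len(ordered):
        j = i
        while j + 1 < len(ordered) and medians[ordered[j + 1]] == medians[ordered[i]]:
            j += 1
        shared = ((i + 1) + (j + 1)) / 2
        for fuzzer in ordered[i:j + 1]:
            result[fuzzer] = shared
        i = j + 1
    return result


def aggregate(table, per_benchmark_fn):
    """Average a per-benchmark quantity over all benchmarks, per fuzzer."""
    collected = {}
    for medians in table.values():
        for fuzzer, value in per_benchmark_fn(medians).items():
            collected.setdefault(fuzzer, []).append(value)
    return {fuzzer: mean(values) for fuzzer, values in collected.items()}


avg_score = aggregate(MEDIANS, normalized_scores)  # higher is better
avg_rank = aggregate(MEDIANS, ranks)               # lower is better

for fuzzer in sorted(avg_rank, key=avg_rank.get):
    print(f"{fuzzer:12s} score={avg_score[fuzzer]:6.2f} rank={avg_rank[fuzzer]:.2f}")
```

A fuzzer that is missing from a benchmark simply contributes no value for it in this sketch; the real report may treat unsupported benchmarks differently.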
  • Critical difference diagram
    The diagram visualizes the average rank of fuzzers (second ranking above) while showing the significance of the differences as well. What is considered a "critical difference" (CD) is based on the Friedman/Nemenyi post-hoc test. See more in the documentation.
    Note: If a fuzzer does not support all benchmarks, its ranking as shown in this diagram can be lower than it should be. Please check the list of supported benchmarks for the fuzzer(s) you are interested in. The list can be specified in the fuzzer's README.md.
  • Median relative code-coverages on each benchmark

    Note: The relative coverage summary table shows the median relative performance of each fuzzer to the experiment maximum. Thus the highest relative performance may not be 100%.
    trial_relative_coverage = trial_coverage / experiment_max_coverage

    benchmark libafl aflplusplus libfuzzer afl
    FuzzerMedian 97.00 98.50 89.00 91.00
    FuzzerMean 95.68 92.40 86.38 78.81
    bloaty_fuzz_target 96.00 99.00 90.00 93.00
    curl_curl_fuzzer_http 99.00 98.00 86.00 94.00
    freetype2_ftfuzzer 97.00 97.00 75.00 80.00
    harfbuzz_hb-shape-fuzzer 99.00 98.00 89.00 92.00
    jsoncpp_jsoncpp_fuzzer 98.00 99.00 100.00 98.00
    lcms_cms_transform_fuzzer 92.00 46.00 40.00 33.00
    libjpeg-turbo_libjpeg_turbo_fuzzer 99.00 99.00 99.00 99.00
    libpcap_fuzz_both 86.00 95.00 78.00 1.00
    libpng_libpng_read_fuzzer nan nan 99.00 97.00
    libxml2_xml nan 99.00 93.00 89.00
    libxslt_xpath 95.00 99.00 78.00 82.00
    mbedtls_fuzz_dtlsclient 87.00 76.00 72.00 74.00
    openssl_x509 100.00 99.00 99.00 99.00
    openthread_ot-ip6-send-fuzzer 99.00 91.00 99.00 91.00
    proj4_proj_crs_to_crs_fuzzer 90.00 99.00 88.00 10.00
    re2_fuzzer 99.00 99.00 99.00 95.00
    sqlite3_ossfuzz 99.00 65.00 65.00 61.00
    systemd_fuzz-link-parser 97.00 95.00 80.00 92.00
    vorbis_decode_fuzzer 99.00 97.00 99.00 91.00
    woff2_convert_woff2ttf_fuzzer 95.00 99.00 89.00 88.00
    zlib_zlib_uncompress_fuzzer 92.00 99.00 97.00 96.00
    • Fuzzers are sorted by "FuzzerMean" (average median relative coverage), highest on the left.
    • Green background = highest relative median coverage.
    • Blue gradient background = greater than 95% relative median coverage.
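
The relative coverage formula in the note above can be checked against the bloaty_fuzz_target row. The sketch below is an illustration under two assumptions: `experiment_max_coverage` is taken to be the highest coverage reached by any trial of any fuzzer on the benchmark, and the table is assumed to truncate to whole percent. The trial values come from the min and max columns of the bloaty_fuzz_target sample statistics (with two trials per fuzzer, those are the two trials); the function name is hypothetical.

```python
import math
from statistics import median

# Final coverage of the two trials of each fuzzer on bloaty_fuzz_target,
# taken from the min and max columns of that benchmark's statistics table.
TRIALS = {
    "aflplusplus": [5539, 5571],
    "libafl": [5377, 5377],
    "afl": [5084, 5316],
    "libfuzzer": [5000, 5132],
}


def median_relative_coverage(trials):
    """Median over trials of trial_coverage / experiment_max_coverage."""
    # Assumption: the experiment maximum is the best single trial on this benchmark.
    experiment_max = max(max(values) for values in trials.values())
    return {
        fuzzer: median(value / experiment_max for value in values)
        for fuzzer, values in trials.items()
    }


relative = median_relative_coverage(TRIALS)
# Assumption: the report truncates to whole percent rather than rounding.
percent = {fuzzer: math.floor(100 * value) for fuzzer, value in relative.items()}
print(percent)
```

Because the maximum is taken over individual trials while each cell is a median, even the best fuzzer on a benchmark can land below 100%, as aflplusplus does here.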

bloaty_fuzz_target summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    aflplusplus 1800 2.0 5555.0 22.627417 5539.0 5547.0 5555.0 5563.0 5571.0
    libafl 1800 2.0 5377.0 0.000000 5377.0 5377.0 5377.0 5377.0 5377.0
    afl 1800 2.0 5200.0 164.048773 5084.0 5142.0 5200.0 5258.0 5316.0
    libfuzzer 1800 2.0 5066.0 93.338095 5000.0 5033.0 5066.0 5099.0 5132.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row
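
For readers unfamiliar with the Vargha-Delaney measure, here is a minimal sketch of how an A12 value can be computed from two samples. It is a textbook formulation, not necessarily the code that produced this report; the trial values are the min and max entries of the bloaty_fuzz_target table above, and the function name `a12` is illustrative.

```python
def a12(xs, ys):
    """Estimate P(x > y) + 0.5 * P(x == y) over all pairs drawn from xs and ys."""
    wins = sum(1.0 if x > y else 0.5 if x == y else 0.0 for x in xs for y in ys)
    return wins / (len(xs) * len(ys))


# Two trials per fuzzer on bloaty_fuzz_target (min and max columns).
aflplusplus = [5539, 5571]
libafl = [5377, 5377]
afl = [5084, 5316]
libfuzzer = [5000, 5132]

print(a12(aflplusplus, libafl))  # every aflplusplus trial beats every libafl trial
print(a12(afl, libfuzzer))       # afl wins three of the four pairings
```

A value of 0.5 means neither fuzzer tends to outperform the other; values near 1 or 0 indicate a strong effect in one direction.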

curl_curl_fuzzer_http summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 1800 2.0 10132.0 25.455844 10114.0 10123.00 10132.0 10141.00 10150.0
    aflplusplus 1800 2.0 9966.5 68.589358 9918.0 9942.25 9966.5 9990.75 10015.0
    afl 1800 2.0 9561.5 57.275649 9521.0 9541.25 9561.5 9581.75 9602.0
    libfuzzer 1800 2.0 8757.0 1013.991124 8040.0 8398.50 8757.0 9115.50 9474.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row
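
To make the Mann-Whitney U comparison concrete, the following sketch computes the U statistic and an exact two-sided p value by enumerating every way of relabeling the pooled trials. It is an illustration only: the report's own p values may come from a library routine that uses a different method (for example a normal approximation), and the function names are hypothetical. The trial values are the min and max entries for libafl and aflplusplus in the curl_curl_fuzzer_http table.

```python
from itertools import combinations


def u_statistic(xs, ys):
    """Number of (x, y) pairs with x > y, counting ties as one half."""
    return sum(1.0 if x > y else 0.5 if x == y else 0.0 for x in xs for y in ys)


def exact_two_sided_p(xs, ys):
    """Exact permutation p value: share of relabelings at least as extreme."""
    pooled = list(xs) + list(ys)
    m, n = len(xs), len(ys)
    centre = m * n / 2  # expected U when both samples share one distribution
    observed = abs(u_statistic(xs, ys) - centre)
    hits = total = 0
    for indices in combinations(range(m + n), m):
        chosen = set(indices)
        first = [pooled[i] for i in chosen]
        second = [pooled[i] for i in range(m + n) if i not in chosen]
        total += 1
        if abs(u_statistic(first, second) - centre) >= observed:
            hits += 1
    return hits / total


libafl = [10114, 10150]
aflplusplus = [9918, 10015]

print(u_statistic(libafl, aflplusplus))
print(exact_two_sided_p(libafl, aflplusplus))
```

With two trials per fuzzer there are only six possible relabelings, so under this exact test the smallest attainable two-sided p value is 1/3, even when every trial of one fuzzer beats every trial of the other.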

freetype2_ftfuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    aflplusplus 1800 2.0 8397.0 39.597980 8369.0 8383.0 8397.0 8411.0 8425.0
    libafl 1800 2.0 8377.0 298.399062 8166.0 8271.5 8377.0 8482.5 8588.0
    afl 1800 2.0 6947.0 53.740115 6909.0 6928.0 6947.0 6966.0 6985.0
    libfuzzer 1800 2.0 6509.0 200.818326 6367.0 6438.0 6509.0 6580.0 6651.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row
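
The unique and pairwise unique coverage figures described above are set differences over the branches each fuzzer covered. The sketch below shows the idea on hypothetical branch IDs, since this report does not list the actual branches; the data and the function names are invented for illustration.

```python
# Hypothetical branch IDs per fuzzer; NOT taken from this experiment.
COVERED = {
    "aflplusplus": {1, 2, 3, 4, 5},
    "libafl": {1, 2, 3, 6},
    "afl": {1, 2, 7},
}


def pairwise_unique(covered):
    """cell[row][col] = branches covered by the column fuzzer but not the row fuzzer."""
    return {
        row: {col: len(covered[col] - covered[row]) for col in covered if col != row}
        for row in covered
    }


def unique_to_each(covered):
    """Branches covered by a fuzzer and by no other fuzzer."""
    result = {}
    for fuzzer, branches in covered.items():
        others = set().union(*(b for f, b in covered.items() if f != fuzzer))
        result[fuzzer] = len(branches - others)
    return result


print(pairwise_unique(COVERED))
print(unique_to_each(COVERED))
```

Note that the pairwise table is not symmetric: the cell for (row libafl, column aflplusplus) counts what aflplusplus found and libafl missed, and the mirrored cell counts the opposite.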

harfbuzz_hb-shape-fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 1800 2.0 10102.5 53.033009 10065.0 10083.75 10102.5 10121.25 10140.0
    aflplusplus 1800 2.0 9975.5 239.709199 9806.0 9890.75 9975.5 10060.25 10145.0
    afl 1800 2.0 9366.0 270.114790 9175.0 9270.50 9366.0 9461.50 9557.0
    libfuzzer 1800 2.0 9125.0 82.024387 9067.0 9096.00 9125.0 9154.00 9183.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

jsoncpp_jsoncpp_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libfuzzer 1800 2.0 525.0 0.000000 525.0 525.00 525.0 525.00 525.0
    aflplusplus 1800 2.0 520.0 0.000000 520.0 520.00 520.0 520.00 520.0
    libafl 1800 2.0 517.0 0.000000 517.0 517.00 517.0 517.00 517.0
    afl 1800 2.0 516.5 0.707107 516.0 516.25 516.5 516.75 517.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row
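
The columns of the sample statistics tables can be reproduced with the Python standard library. This sketch assumes that `std` is the sample standard deviation (n - 1 in the denominator) and that the quartiles use linear interpolation between data points; both assumptions reproduce the afl row of the jsoncpp_jsoncpp_fuzzer table, whose two trials are its min and max values. The function name `describe` is chosen for this example.

```python
from statistics import mean, median, quantiles, stdev


def describe(values):
    """Summary statistics in the same column order as the report's tables."""
    # "inclusive" interpolates linearly between the sorted data points.
    q25, _, q75 = quantiles(values, n=4, method="inclusive")
    return {
        "count": len(values),
        "mean": mean(values),
        "std": stdev(values),  # sample standard deviation (n - 1)
        "min": min(values),
        "25%": q25,
        "median": median(values),
        "75%": q75,
        "max": max(values),
    }


afl_jsoncpp = [516, 517]  # the two afl trials on jsoncpp_jsoncpp_fuzzer
stats = describe(afl_jsoncpp)
print(stats)
```

With only two trials, the mean and median coincide and the quartiles fall a quarter of the way in from each trial, which is why every row in these tables shows that pattern.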

lcms_cms_transform_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 1800 2.0 1557.0 188.090404 1424.0 1490.50 1557.0 1623.50 1690.0
    aflplusplus 1800 2.0 782.5 71.417785 732.0 757.25 782.5 807.75 833.0
    libfuzzer 1800 2.0 680.5 21.920310 665.0 672.75 680.5 688.25 696.0
    afl 1800 2.0 573.0 101.823376 501.0 537.00 573.0 609.00 645.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libjpeg-turbo_libjpeg_turbo_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    aflplusplus 1800 2.0 2544.5 0.707107 2544.0 2544.25 2544.5 2544.75 2545.0
    libafl 1800 2.0 2543.5 0.707107 2543.0 2543.25 2543.5 2543.75 2544.0
    libfuzzer 1800 2.0 2543.0 4.242641 2540.0 2541.50 2543.0 2544.50 2546.0
    afl 1800 2.0 2542.5 2.121320 2541.0 2541.75 2542.5 2543.25 2544.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libpcap_fuzz_both summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    aflplusplus 1800 2.0 2480.0 172.534055 2358.0 2419.00 2480.0 2541.00 2602.0
    libafl 1800 2.0 2262.0 87.681241 2200.0 2231.00 2262.0 2293.00 2324.0
    libfuzzer 1800 2.0 2038.5 449.012806 1721.0 1879.75 2038.5 2197.25 2356.0
    afl 1800 2.0 33.0 0.000000 33.0 33.00 33.0 33.00 33.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libpng_libpng_read_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libfuzzer 1800 2.0 1973.0 12.727922 1964.0 1968.50 1973.0 1977.50 1982.0
    afl 1800 2.0 1932.5 10.606602 1925.0 1928.75 1932.5 1936.25 1940.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libxml2_xml summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    aflplusplus 1800 2.0 14250.5 183.140656 14121.0 14185.75 14250.5 14315.25 14380.0
    libfuzzer 1800 2.0 13468.5 174.655375 13345.0 13406.75 13468.5 13530.25 13592.0
    afl 1800 2.0 12902.0 630.739249 12456.0 12679.00 12902.0 13125.00 13348.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libxslt_xpath summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    aflplusplus 1800 2.0 10644.0 89.095454 10581.0 10612.50 10644.0 10675.50 10707.0
    libafl 1800 2.0 10242.5 72.831998 10191.0 10216.75 10242.5 10268.25 10294.0
    afl 1800 2.0 8837.0 2.828427 8835.0 8836.00 8837.0 8838.00 8839.0
    libfuzzer 1800 2.0 8384.5 54.447222 8346.0 8365.25 8384.5 8403.75 8423.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

mbedtls_fuzz_dtlsclient summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 1800 2.0 2991.0 612.354473 2558.0 2774.50 2991.0 3207.50 3424.0
    aflplusplus 1800 2.0 2619.0 1.414214 2618.0 2618.50 2619.0 2619.50 2620.0
    afl 1800 2.0 2540.5 16.263456 2529.0 2534.75 2540.5 2546.25 2552.0
    libfuzzer 1800 2.0 2471.0 5.656854 2467.0 2469.00 2471.0 2473.00 2475.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

openssl_x509 summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 1800 2.0 5820.0 0.000000 5820.0 5820.00 5820.0 5820.00 5820.0
    aflplusplus 1800 2.0 5797.0 4.242641 5794.0 5795.50 5797.0 5798.50 5800.0
    libfuzzer 1800 2.0 5790.5 10.606602 5783.0 5786.75 5790.5 5794.25 5798.0
    afl 1800 2.0 5790.0 0.000000 5790.0 5790.00 5790.0 5790.00 5790.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

openthread_ot-ip6-send-fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libfuzzer 1800 2.0 2962.5 27.577164 2943.0 2952.75 2962.5 2972.25 2982.0
    libafl 1800 2.0 2952.5 24.748737 2935.0 2943.75 2952.5 2961.25 2970.0
    aflplusplus 1800 2.0 2727.0 342.239682 2485.0 2606.00 2727.0 2848.00 2969.0
    afl 1800 2.0 2715.5 91.216775 2651.0 2683.25 2715.5 2747.75 2780.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

proj4_proj_crs_to_crs_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    aflplusplus 1800 2.0 4586.5 62.932504 4542.0 4564.25 4586.5 4608.75 4631.0
    libafl 1800 2.0 4193.5 142.128463 4093.0 4143.25 4193.5 4243.75 4294.0
    libfuzzer 1800 2.0 4089.5 85.559921 4029.0 4059.25 4089.5 4119.75 4150.0
    afl 1800 2.0 484.0 0.000000 484.0 484.00 484.0 484.00 484.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

re2_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    aflplusplus 1800 2.0 2826.0 9.899495 2819.0 2822.50 2826.0 2829.50 2833.0
    libafl 1800 2.0 2819.0 18.384776 2806.0 2812.50 2819.0 2825.50 2832.0
    libfuzzer 1800 2.0 2813.5 24.748737 2796.0 2804.75 2813.5 2822.25 2831.0
    afl 1800 2.0 2700.5 153.442172 2592.0 2646.25 2700.5 2754.75 2809.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

sqlite3_ossfuzz summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 1800 2.0 18121.0 103.237590 18048.0 18084.5 18121.0 18157.5 18194.0
    aflplusplus 1800 2.0 11925.0 56.568542 11885.0 11905.0 11925.0 11945.0 11965.0
    libfuzzer 1800 2.0 11895.0 162.634560 11780.0 11837.5 11895.0 11952.5 12010.0
    afl 1800 2.0 11124.0 38.183766 11097.0 11110.5 11124.0 11137.5 11151.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

vorbis_decode_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libfuzzer 1800 2.0 1229.0 1.414214 1228.0 1228.50 1229.0 1229.50 1230.0
    libafl 1800 2.0 1222.0 4.242641 1219.0 1220.50 1222.0 1223.50 1225.0
    aflplusplus 1800 2.0 1196.5 7.778175 1191.0 1193.75 1196.5 1199.25 1202.0
    afl 1800 2.0 1130.5 57.275649 1090.0 1110.25 1130.5 1150.75 1171.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

woff2_convert_woff2ttf_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    aflplusplus 1800 2.0 1147.5 10.606602 1140.0 1143.75 1147.5 1151.25 1155.0
    libafl 1800 2.0 1102.5 47.376154 1069.0 1085.75 1102.5 1119.25 1136.0
    libfuzzer 1800 2.0 1029.0 14.142136 1019.0 1024.00 1029.0 1034.00 1039.0
    afl 1800 2.0 1019.5 20.506097 1005.0 1012.25 1019.5 1026.75 1034.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

zlib_zlib_uncompress_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    aflplusplus 1800 2.0 459.0 5.656854 455.0 457.00 459.0 461.00 463.0
    libfuzzer 1800 2.0 450.0 1.414214 449.0 449.50 450.0 450.50 451.0
    afl 1800 2.0 445.5 0.707107 445.0 445.25 445.5 445.75 446.0
    libafl 1800 2.0 428.0 5.656854 424.0 426.00 428.0 430.00 432.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

experiment data

You can download the raw data for this report here.

Check out the documentation on how to create customized reports using this data. Also see some example Colab notebooks for doing custom analysis on the data here.

Experiment Description:

(None,)