FuzzBench: 2024-08-17-empty-seeds-2 report

experiment summary

We show two different aggregate (cross-benchmark) rankings of fuzzers. The first is based on the average of per-benchmarks scores, where the score represents the percentage of the highest reached median code-coverage on a given benchmark (higher value is better). The second ranking shows the average rank of fuzzers, after we rank them on each benchmark according to their median reached code-covereges (lower value is better).
By avg. score
average normalized score
fuzzer
libfuzzer_empty_seeds 95.31
aflplusplus_empty_seeds 93.07
afl_empty_seeds 77.90
honggfuzz_empty_seeds 76.43
By avg. rank
average rank
fuzzer
aflplusplus_empty_seeds 1.43
libfuzzer_empty_seeds 2.10
honggfuzz_empty_seeds 2.52
afl_empty_seeds 3.33
  • Critical difference diagram
    The diagram visualizes the average rank of fuzzers (second ranking above) while showing the significance of the differences as well. What is considered a "critical difference" (CD) is based on the Friedman/Nemenyi post-hoc test. See more in the documentation.
    Note: If a fuzzer does not support all benchmarks, its ranking as shown in this diagram can be lower than it should be. So please check the list of supported benchmarks for the fuzzer(s) of your interest. The list could be specified in the fuzzer's README.md like this.
  • Median relative code-coverages on each benchmark

    Note: The relative coverage summary table shows the median relative performance of each fuzzer to the experiment maximum. Thus the highest relative performance may not be 100%.
    trial_relative_coverage = trial_coverage / experiment_max_coverage

      aflplusplus_empty_seeds libfuzzer_empty_seeds honggfuzz_empty_seeds afl_empty_seeds
    FuzzerMedian 98.50 96.00 97.00 90.00
    FuzzerMean 95.80 93.19 93.00 76.33
    bloaty_fuzz_target 99.00 98.00 97.00 96.00
    curl_curl_fuzzer_http 97.00 96.00 97.00 94.00
    freetype2_ftfuzzer 97.00 74.00 82.00 72.00
    harfbuzz_hb-shape-fuzzer 99.00 92.00 94.00 90.00
    jsoncpp_jsoncpp_fuzzer 99.00 99.00 97.00 98.00
    lcms_cms_transform_fuzzer 99.00 91.00 98.00 5.00
    libjpeg-turbo_libjpeg_turbo_fuzzer 100.00 100.00 100.00 100.00
    libpcap_fuzz_both 78.00 91.00 nan 1.00
    libpng_libpng_read_fuzzer nan 97.00 83.00 72.00
    libxml2_xml 99.00 95.00 nan 85.00
    libxslt_xpath 99.00 94.00 nan 89.00
    mbedtls_fuzz_dtlsclient 98.00 97.00 99.00 96.00
    openssl_x509 100.00 100.00 100.00 100.00
    openthread_ot-ip6-send-fuzzer 98.00 97.00 91.00 91.00
    proj4_proj_crs_to_crs_fuzzer 84.00 78.00 97.00 14.00
    re2_fuzzer 99.00 98.00 91.00 98.00
    sqlite3_ossfuzz 98.00 75.00 76.00 76.00
    systemd_fuzz-link-parser 98.00 96.00 96.00 95.00
    vorbis_decode_fuzzer 84.00 92.00 nan 76.00
    woff2_convert_woff2ttf_fuzzer 91.00 98.00 84.00 65.00
    zlib_zlib_uncompress_fuzzer 100.00 99.00 99.00 90.00
    • Fuzzers are sorted by "FuzzerMean" (average median relative coverage), highest on the left.
    • Green background = highest relative median coverage.
    • Blue gradient background = greater than 95% relative median coverage.

bloaty_fuzz_target summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus_empty_seeds 1800 2.0 4515.5 2.121320 4514.0 4514.75 4515.5 4516.25 4517.0
    libfuzzer_empty_seeds 1800 2.0 4461.5 6.363961 4457.0 4459.25 4461.5 4463.75 4466.0
    honggfuzz_empty_seeds 1800 2.0 4411.5 0.707107 4411.0 4411.25 4411.5 4411.75 4412.0
    afl_empty_seeds 1800 2.0 4364.0 0.000000 4364.0 4364.00 4364.0 4364.00 4364.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

curl_curl_fuzzer_http summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus_empty_seeds 1800 2.0 7970.0 240.416306 7800.0 7885.00 7970.0 8055.00 8140.0
    honggfuzz_empty_seeds 1800 2.0 7896.5 78.488853 7841.0 7868.75 7896.5 7924.25 7952.0
    libfuzzer_empty_seeds 1800 2.0 7876.5 7.778175 7871.0 7873.75 7876.5 7879.25 7882.0
    afl_empty_seeds 1800 2.0 7661.5 4.949747 7658.0 7659.75 7661.5 7663.25 7665.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

freetype2_ftfuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus_empty_seeds 1800 2.0 6354.0 267.286363 6165.0 6259.50 6354.0 6448.50 6543.0
    honggfuzz_empty_seeds 1800 2.0 5377.0 214.960461 5225.0 5301.00 5377.0 5453.00 5529.0
    libfuzzer_empty_seeds 1800 2.0 4851.5 0.707107 4851.0 4851.25 4851.5 4851.75 4852.0
    afl_empty_seeds 1800 2.0 4750.5 12.020815 4742.0 4746.25 4750.5 4754.75 4759.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

harfbuzz_hb-shape-fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus_empty_seeds 1800 2.0 9195.5 101.116270 9124.0 9159.75 9195.5 9231.25 9267.0
    honggfuzz_empty_seeds 1800 2.0 8713.5 12.020815 8705.0 8709.25 8713.5 8717.75 8722.0
    libfuzzer_empty_seeds 1800 2.0 8572.0 15.556349 8561.0 8566.50 8572.0 8577.50 8583.0
    afl_empty_seeds 1800 2.0 8402.0 220.617316 8246.0 8324.00 8402.0 8480.00 8558.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

jsoncpp_jsoncpp_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libfuzzer_empty_seeds 1800 2.0 524.5 0.707107 524.0 524.25 524.5 524.75 525.0
    aflplusplus_empty_seeds 1800 2.0 520.0 0.000000 520.0 520.00 520.0 520.00 520.0
    afl_empty_seeds 1800 2.0 518.0 1.414214 517.0 517.50 518.0 518.50 519.0
    honggfuzz_empty_seeds 1800 2.0 511.0 2.828427 509.0 510.00 511.0 512.00 513.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

lcms_cms_transform_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus_empty_seeds 1800 2.0 682.5 0.707107 682.0 682.25 682.5 682.75 683.0
    honggfuzz_empty_seeds 1800 2.0 678.5 14.849242 668.0 673.25 678.5 683.75 689.0
    libfuzzer_empty_seeds 1800 2.0 627.0 55.154329 588.0 607.50 627.0 646.50 666.0
    afl_empty_seeds 1800 2.0 41.0 0.000000 41.0 41.00 41.0 41.00 41.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libjpeg-turbo_libjpeg_turbo_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    afl_empty_seeds 1800 2.0 2520.0 0.0 2520.0 2520.0 2520.0 2520.0 2520.0
    aflplusplus_empty_seeds 1800 2.0 2520.0 0.0 2520.0 2520.0 2520.0 2520.0 2520.0
    honggfuzz_empty_seeds 1800 2.0 2520.0 0.0 2520.0 2520.0 2520.0 2520.0 2520.0
    libfuzzer_empty_seeds 1800 2.0 2520.0 0.0 2520.0 2520.0 2520.0 2520.0 2520.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libpcap_fuzz_both summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libfuzzer_empty_seeds 1800 2.0 1978.5 246.780267 1804.0 1891.25 1978.5 2065.75 2153.0
    aflplusplus_empty_seeds 1800 2.0 1694.0 76.367532 1640.0 1667.00 1694.0 1721.00 1748.0
    afl_empty_seeds 1800 2.0 33.5 0.707107 33.0 33.25 33.5 33.75 34.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libpng_libpng_read_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libfuzzer_empty_seeds 1800 2.0 1883.5 72.831998 1832.0 1857.75 1883.5 1909.25 1935.0
    honggfuzz_empty_seeds 1800 2.0 1613.0 21.213203 1598.0 1605.50 1613.0 1620.50 1628.0
    afl_empty_seeds 1800 2.0 1402.0 0.000000 1402.0 1402.00 1402.0 1402.00 1402.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libxml2_xml summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus_empty_seeds 1800 2.0 11535.0 84.852814 11475.0 11505.00 11535.0 11565.00 11595.0
    libfuzzer_empty_seeds 1800 2.0 11032.5 36.062446 11007.0 11019.75 11032.5 11045.25 11058.0
    afl_empty_seeds 1800 2.0 9970.0 1541.492783 8880.0 9425.00 9970.0 10515.00 11060.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libxslt_xpath summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus_empty_seeds 1800 2.0 8204.0 69.296465 8155.0 8179.50 8204.0 8228.50 8253.0
    libfuzzer_empty_seeds 1800 2.0 7833.0 50.911688 7797.0 7815.00 7833.0 7851.00 7869.0
    afl_empty_seeds 1800 2.0 7408.5 27.577164 7389.0 7398.75 7408.5 7418.25 7428.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

mbedtls_fuzz_dtlsclient summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    honggfuzz_empty_seeds 1800 2.0 2078.0 24.041631 2061.0 2069.50 2078.0 2086.50 2095.0
    aflplusplus_empty_seeds 1800 2.0 2055.5 12.020815 2047.0 2051.25 2055.5 2059.75 2064.0
    libfuzzer_empty_seeds 1800 2.0 2033.0 1.414214 2032.0 2032.50 2033.0 2033.50 2034.0
    afl_empty_seeds 1800 2.0 2031.0 1.414214 2030.0 2030.50 2031.0 2031.50 2032.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

openssl_x509 summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    afl_empty_seeds 1800 2.0 5774.0 0.0 5774.0 5774.0 5774.0 5774.0 5774.0
    aflplusplus_empty_seeds 1800 2.0 5774.0 0.0 5774.0 5774.0 5774.0 5774.0 5774.0
    honggfuzz_empty_seeds 1800 2.0 5774.0 0.0 5774.0 5774.0 5774.0 5774.0 5774.0
    libfuzzer_empty_seeds 1800 2.0 5774.0 0.0 5774.0 5774.0 5774.0 5774.0 5774.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

openthread_ot-ip6-send-fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus_empty_seeds 1800 2.0 2969.0 74.953319 2916.0 2942.50 2969.0 2995.50 3022.0
    libfuzzer_empty_seeds 1800 2.0 2956.5 19.091883 2943.0 2949.75 2956.5 2963.25 2970.0
    honggfuzz_empty_seeds 1800 2.0 2776.0 12.727922 2767.0 2771.50 2776.0 2780.50 2785.0
    afl_empty_seeds 1800 2.0 2770.5 20.506097 2756.0 2763.25 2770.5 2777.75 2785.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

proj4_proj_crs_to_crs_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    honggfuzz_empty_seeds 1800 2.0 4870.0 168.291414 4751.0 4810.5 4870.0 4929.5 4989.0
    aflplusplus_empty_seeds 1800 2.0 4232.0 524.673232 3861.0 4046.5 4232.0 4417.5 4603.0
    libfuzzer_empty_seeds 1800 2.0 3910.0 209.303607 3762.0 3836.0 3910.0 3984.0 4058.0
    afl_empty_seeds 1800 2.0 726.0 2.828427 724.0 725.0 726.0 727.0 728.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

re2_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus_empty_seeds 1800 2.0 2831.0 4.242641 2828.0 2829.50 2831.0 2832.50 2834.0
    libfuzzer_empty_seeds 1800 2.0 2802.5 20.506097 2788.0 2795.25 2802.5 2809.75 2817.0
    afl_empty_seeds 1800 2.0 2777.5 10.606602 2770.0 2773.75 2777.5 2781.25 2785.0
    honggfuzz_empty_seeds 1800 2.0 2582.0 77.781746 2527.0 2554.50 2582.0 2609.50 2637.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

sqlite3_ossfuzz summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus_empty_seeds 1800 2.0 10244.0 172.534055 10122.0 10183.00 10244.0 10305.00 10366.0
    honggfuzz_empty_seeds 1800 2.0 7917.5 60.104076 7875.0 7896.25 7917.5 7938.75 7960.0
    afl_empty_seeds 1800 2.0 7887.0 14.142136 7877.0 7882.00 7887.0 7892.00 7897.0
    libfuzzer_empty_seeds 1800 2.0 7877.5 0.707107 7877.0 7877.25 7877.5 7877.75 7878.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

vorbis_decode_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libfuzzer_empty_seeds 1800 2.0 941.5 109.601551 864.0 902.75 941.5 980.25 1019.0
    aflplusplus_empty_seeds 1800 2.0 857.0 14.142136 847.0 852.00 857.0 862.00 867.0
    afl_empty_seeds 1800 2.0 782.0 0.000000 782.0 782.00 782.0 782.00 782.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

woff2_convert_woff2ttf_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libfuzzer_empty_seeds 1800 2.0 1152.5 28.991378 1132.0 1142.25 1152.5 1162.75 1173.0
    aflplusplus_empty_seeds 1800 2.0 1068.5 26.162951 1050.0 1059.25 1068.5 1077.75 1087.0
    honggfuzz_empty_seeds 1800 2.0 992.0 93.338095 926.0 959.00 992.0 1025.00 1058.0
    afl_empty_seeds 1800 2.0 768.0 0.000000 768.0 768.00 768.0 768.00 768.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

zlib_zlib_uncompress_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus_empty_seeds 1800 2.0 456.0 0.000000 456.0 456.00 456.0 456.00 456.0
    honggfuzz_empty_seeds 1800 2.0 455.5 0.707107 455.0 455.25 455.5 455.75 456.0
    libfuzzer_empty_seeds 1800 2.0 452.0 0.000000 452.0 452.00 452.0 452.00 452.0
    afl_empty_seeds 1800 2.0 414.0 56.568542 374.0 394.00 414.0 434.00 454.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

experiment data

You can download the raw data for this report here.

Check out the documentation on how to create customized reports using this data. Also see some example Colab notebooks for doing custom analysis on the data here.

Experiment Description:

(None,)