FuzzBench: 2023-05-06-sample report

experiment summary

We show two different aggregate (cross-benchmark) rankings of fuzzers. The first is based on the average of per-benchmarks scores, where the score represents the percentage of the highest reached median code-coverage on a given benchmark (higher value is better). The second ranking shows the average rank of fuzzers, after we rank them on each benchmark according to their median reached code-covereges (lower value is better).
By avg. score
average normalized score
fuzzer
aflplusplus 96.45
honggfuzz 84.32
libfuzzer 77.75
afl 76.01
centipede 65.86
By avg. rank
average rank
fuzzer
aflplusplus 1.62
honggfuzz 2.76
afl 3.19
libfuzzer 3.48
centipede 3.86
  • Critical difference diagram
    The diagram visualizes the average rank of fuzzers (second ranking above) while showing the significance of the differences as well. What is considered a "critical difference" (CD) is based on the Friedman/Nemenyi post-hoc test. See more in the documentation.
    Note: If a fuzzer does not support all benchmarks, its ranking as shown in this diagram can be lower than it should be. So please check the list of supported benchmarks for the fuzzer(s) of your interest. The list could be specified in the fuzzer's README.md like this.
  • Median relative code-coverages on each benchmark

    Note: The relative coverage summary table shows the median relative performance of each fuzzer to the experiment maximum. Thus the highest relative performance may not be 100%.
    trial_relative_coverage = trial_coverage / experiment_max_coverage

      aflplusplus honggfuzz libfuzzer afl centipede
    FuzzerMedian 94.00 91.00 77.00 86.50 82.00
    FuzzerMean 85.38 82.00 74.37 70.55 68.26
    bloaty_fuzz_target 60.00 65.00 56.00 89.00 81.00
    curl_curl_fuzzer_http 80.00 79.00 70.00 68.00 nan
    freetype2_ftfuzzer 87.00 67.00 47.00 42.00 13.00
    harfbuzz_hb-shape-fuzzer 94.00 84.00 72.00 84.00 77.00
    jsoncpp_jsoncpp_fuzzer 99.00 99.00 100.00 98.00 98.00
    lcms_cms_transform_fuzzer 41.00 41.00 41.00 38.00 2.00
    libjpeg-turbo_libjpeg_turbo_fuzzer 99.00 97.00 95.00 98.00 96.00
    libpcap_fuzz_both 96.00 nan 79.00 1.00 43.00
    libpng_libpng_read_fuzzer 96.00 97.00 nan 92.00 95.00
    libxml2_xml 99.00 93.00 77.00 94.00 85.00
    libxslt_xpath 94.00 nan 79.00 nan 89.00
    openh264_decoder_fuzzer 96.00 99.00 nan 92.00 86.00
    openssl_x509 95.00 93.00 92.00 94.00 95.00
    openthread_ot-ip6-send-fuzzer 98.00 93.00 98.00 91.00 90.00
    proj4_proj_crs_to_crs_fuzzer 78.00 96.00 81.00 12.00 12.00
    re2_fuzzer 99.00 98.00 98.00 98.00 91.00
    sqlite3_ossfuzz 80.00 69.00 68.00 82.00 82.00
    stb_stbi_read_fuzzer 96.00 91.00 69.00 87.00 72.00
    systemd_fuzz-link-parser 86.00 79.00 71.00 86.00 nan
    vorbis_decode_fuzzer 29.00 29.00 34.00 19.00 8.00
    woff2_convert_woff2ttf_fuzzer 91.00 89.00 86.00 46.00 82.00
    • Fuzzers are sorted by "FuzzerMean" (average median relative coverage), highest on the left.
    • Green background = highest relative median coverage.
    • Blue gradient background = greater than 95% relative median coverage.

bloaty_fuzz_target summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    afl 2700 4.0 3878.00 1812.636386 1162.0 3788.50 4751.0 4840.50 4848.0
    centipede 2700 4.0 3544.50 1678.523458 1037.0 3384.50 4318.0 4478.00 4505.0
    honggfuzz 2700 4.0 3393.25 1912.590630 1295.0 2034.50 3489.5 4848.25 5299.0
    aflplusplus 2700 4.0 3195.50 1586.063156 1764.0 1852.50 3200.5 4543.50 4617.0
    libfuzzer 2700 4.0 2999.00 2015.461403 1128.0 1322.25 3012.0 4688.75 4844.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

curl_curl_fuzzer_http summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus 2700 4.0 6758.00 836.139143 6224.0 6236.75 6410.0 6931.25 7988.0
    honggfuzz 2700 4.0 6701.50 772.332182 6194.0 6288.50 6381.0 6794.00 7850.0
    libfuzzer 2700 4.0 5635.25 144.509227 5447.0 5576.75 5650.5 5709.00 5793.0
    afl 2700 4.0 5723.50 614.727311 5362.0 5381.50 5445.5 5787.50 6641.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

freetype2_ftfuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus 2700 4.0 6870.5 531.080973 6508.0 6533.50 6664.5 7001.50 7645.0
    honggfuzz 2700 4.0 5261.5 1183.876260 3936.0 4635.75 5175.0 5800.75 6760.0
    libfuzzer 2700 4.0 3579.5 636.252308 2932.0 3085.00 3610.0 4104.50 4166.0
    afl 2700 4.0 3313.5 293.921191 3050.0 3122.75 3244.5 3435.25 3715.0
    centipede 2700 4.0 1008.5 159.039827 825.0 900.75 1032.0 1139.75 1145.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

harfbuzz_hb-shape-fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus 2700 4.0 9471.50 510.048690 8993.0 9056.00 9449.5 9865.00 9994.0
    afl 2700 4.0 8608.00 445.016854 8240.0 8384.00 8469.0 8693.00 9254.0
    honggfuzz 2700 4.0 8409.75 598.222018 7673.0 8158.25 8421.0 8672.50 9124.0
    centipede 2700 4.0 7699.00 282.669418 7347.0 7554.75 7719.5 7863.75 8010.0
    libfuzzer 2700 4.0 7145.00 242.117052 6785.0 7125.50 7242.5 7262.00 7310.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

jsoncpp_jsoncpp_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libfuzzer 2700 4.0 525.00 0.000000 525.0 525.00 525.0 525.00 525.0
    honggfuzz 2700 4.0 522.00 0.000000 522.0 522.00 522.0 522.00 522.0
    aflplusplus 2700 4.0 519.75 0.500000 519.0 519.75 520.0 520.00 520.0
    centipede 2700 4.0 518.00 0.000000 518.0 518.00 518.0 518.00 518.0
    afl 2700 4.0 513.25 8.180261 501.0 513.00 517.0 517.25 518.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

lcms_cms_transform_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus 2700 4.0 944.5 492.482487 682.0 697.00 706.5 954.00 1683.0
    libfuzzer 2700 4.0 882.5 396.051764 666.0 676.50 694.0 900.00 1476.0
    honggfuzz 2700 4.0 777.0 178.417114 677.0 680.75 693.5 789.75 1044.0
    afl 2700 4.0 637.5 12.793227 619.0 634.00 642.0 645.50 647.0
    centipede 2700 4.0 36.0 0.000000 36.0 36.00 36.0 36.00 36.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libjpeg-turbo_libjpeg_turbo_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus 2700 4.0 2531.75 19.189841 2503.0 2530.75 2541.0 2542.00 2542.0
    afl 2700 4.0 2506.50 13.379088 2493.0 2496.75 2505.5 2515.25 2522.0
    honggfuzz 2700 4.0 2478.25 23.893863 2450.0 2465.00 2478.5 2491.75 2506.0
    centipede 2700 4.0 2450.75 24.864634 2425.0 2431.75 2452.0 2471.00 2474.0
    libfuzzer 2700 4.0 2437.25 29.567719 2410.0 2416.75 2431.5 2452.00 2476.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libpcap_fuzz_both summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus 2700 4.0 2281.75 74.526841 2195.0 2235.50 2284.5 2330.75 2363.0
    libfuzzer 2700 4.0 1921.75 147.106254 1812.0 1834.50 1869.5 1956.75 2136.0
    centipede 2700 4.0 1033.75 1077.655287 100.0 100.75 1029.0 1962.00 1977.0
    afl 2700 4.0 33.00 0.000000 33.0 33.00 33.0 33.00 33.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libpng_libpng_read_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    honggfuzz 2700 4.0 1936.00 37.832966 1902.0 1905.75 1931.0 1961.25 1980.0
    aflplusplus 2700 4.0 1907.25 58.283074 1840.0 1874.50 1906.0 1938.75 1977.0
    centipede 2700 4.0 1868.25 47.898330 1797.0 1863.00 1888.0 1893.25 1900.0
    afl 2700 4.0 1846.25 44.304815 1804.0 1820.50 1837.0 1862.75 1907.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libxml2_xml summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus 2700 4.0 13832.25 390.996483 13263.0 13732.5 13986.5 14086.25 14093.0
    afl 2700 4.0 13312.25 273.746081 12981.0 13141.5 13356.0 13526.75 13556.0
    honggfuzz 2700 4.0 13225.75 551.419003 12610.0 13013.5 13171.0 13383.25 13951.0
    centipede 2700 4.0 12099.75 322.668431 11718.0 11902.5 12119.0 12316.25 12443.0
    libfuzzer 2700 4.0 10625.75 863.406577 9364.0 10420.0 10967.5 11173.25 11204.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libxslt_xpath summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus 2700 4.0 8014.75 357.942616 7609.0 7813.75 7998.5 8199.50 8453.0
    centipede 2700 4.0 7425.25 867.427759 6331.0 6934.75 7577.5 8068.00 8215.0
    libfuzzer 2700 4.0 6681.50 280.267134 6293.0 6569.75 6762.0 6873.75 6909.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

openh264_decoder_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    honggfuzz 2700 4.0 9150.25 113.878224 8984.0 9123.50 9192.5 9219.25 9232.0
    aflplusplus 2700 4.0 8416.50 1070.113545 6817.0 8321.50 8913.5 9008.50 9022.0
    afl 2700 4.0 8044.50 1050.160781 6477.0 7986.75 8493.5 8551.25 8714.0
    centipede 2700 4.0 7597.25 941.522659 6206.0 7482.50 7949.0 8063.75 8285.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

openssl_x509 summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus 2700 4.0 5409.25 195.864537 5215.0 5269.00 5388.0 5528.25 5646.0
    centipede 2700 4.0 5409.75 160.879613 5276.0 5287.25 5372.0 5494.50 5619.0
    afl 2700 4.0 5359.75 127.896768 5244.0 5252.25 5355.5 5463.00 5484.0
    honggfuzz 2700 4.0 5319.25 218.673844 5120.0 5147.00 5290.5 5462.75 5576.0
    libfuzzer 2700 4.0 5265.50 220.778169 5041.0 5106.25 5249.5 5408.75 5522.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

openthread_ot-ip6-send-fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libfuzzer 2700 4.0 2967.50 7.047458 2960.0 2962.25 2968.0 2973.25 2974.0
    aflplusplus 2700 4.0 2957.75 56.993421 2904.0 2910.75 2959.5 3006.50 3008.0
    honggfuzz 2700 4.0 2808.50 19.347696 2794.0 2795.50 2802.0 2815.00 2836.0
    afl 2700 4.0 2739.75 29.318652 2707.0 2719.00 2743.5 2764.25 2765.0
    centipede 2700 4.0 2617.25 203.444300 2313.0 2601.00 2711.0 2727.25 2734.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

proj4_proj_crs_to_crs_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    honggfuzz 2700 4.0 5506.25 208.008614 5221.0 5449.75 5542.5 5599.00 5719.0
    libfuzzer 2700 4.0 4718.75 302.894233 4430.0 4484.75 4689.5 4923.50 5066.0
    aflplusplus 2700 4.0 4498.50 320.150485 4117.0 4312.75 4512.0 4697.75 4853.0
    centipede 2700 4.0 683.00 123.512483 498.0 678.75 740.0 744.25 754.0
    afl 2700 4.0 651.00 110.538078 489.0 627.00 693.0 717.00 729.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

re2_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus 2700 4.0 2840.00 10.295630 2830.0 2832.25 2839.0 2846.75 2852.0
    afl 2700 4.0 2757.00 124.292665 2571.0 2751.75 2813.0 2818.25 2831.0
    libfuzzer 2700 4.0 2808.25 6.396614 2799.0 2806.50 2810.5 2812.25 2813.0
    honggfuzz 2700 4.0 2790.75 12.311918 2773.0 2787.25 2795.0 2798.50 2800.0
    centipede 2700 4.0 2621.00 32.526912 2594.0 2594.75 2614.5 2640.75 2661.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

sqlite3_ossfuzz summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    afl 2700 4.0 10445.75 1403.385520 9236.0 9383.00 10151.0 11213.75 12245.0
    centipede 2700 4.0 10424.75 901.741047 9740.0 9930.50 10107.0 10601.25 11745.0
    aflplusplus 2700 4.0 10183.00 803.408157 9680.0 9784.25 9834.5 10233.25 11383.0
    honggfuzz 2700 4.0 8793.50 789.902315 8235.0 8290.50 8499.5 9002.50 9940.0
    libfuzzer 2700 4.0 8289.50 661.605371 7458.0 7981.50 8332.5 8640.50 9035.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

stb_stbi_read_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus 2700 4.0 1930.75 74.692146 1863.0 1868.25 1927.5 1990.00 2005.0
    honggfuzz 2700 4.0 1774.25 175.687934 1517.0 1741.25 1835.5 1868.50 1909.0
    afl 2700 4.0 1687.25 186.703642 1416.0 1654.50 1745.5 1778.25 1842.0
    centipede 2700 4.0 1416.00 241.212216 1090.0 1351.75 1451.0 1515.25 1672.0
    libfuzzer 2700 4.0 1386.00 246.560337 1068.0 1317.00 1403.0 1472.00 1670.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

vorbis_decode_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    libfuzzer 2700 4.0 436.50 190.235819 305.0 317.0 363.5 483.00 714.0
    aflplusplus 2700 4.0 483.00 372.454919 271.0 299.5 310.0 493.50 1041.0
    honggfuzz 2700 4.0 303.50 0.577350 303.0 303.0 303.5 304.00 304.0
    afl 2700 4.0 204.75 1.258306 203.0 204.5 205.0 205.25 206.0
    centipede 2700 4.0 85.00 0.000000 85.0 85.0 85.0 85.00 85.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

woff2_convert_woff2ttf_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    count mean std min 25% median 75% max
    fuzzer time
    aflplusplus 2700 4.0 996.75 101.634558 874.0 934.75 1008.0 1070.00 1097.0
    honggfuzz 2700 4.0 983.25 68.417225 920.0 927.50 978.5 1034.25 1056.0
    libfuzzer 2700 4.0 955.25 59.595162 888.0 931.50 950.0 973.75 1033.0
    centipede 2700 4.0 918.50 75.102153 856.0 862.00 900.5 957.00 1017.0
    afl 2700 4.0 507.25 576.491327 8.0 8.00 505.5 1004.75 1010.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

experiment data

You can download the raw data for this report here.

Check out the documentation on how to create customized reports using this data. Also see some example Colab notebooks for doing custom analysis on the data here.

Experiment Description:

(None,)