FuzzBench: 2024-08-17-empty-seeds-1 report

experiment summary

We show two different aggregate (cross-benchmark) rankings of fuzzers. The first is based on the average of per-benchmark scores, where the score represents the percentage of the highest reached median code coverage on a given benchmark (higher value is better). The second ranking shows the average rank of fuzzers, after we rank them on each benchmark according to their median reached code coverage (lower value is better).
By avg. score
fuzzer average normalized score
aflplusplus 93.49
libafl 93.22
libfuzzer 90.16
honggfuzz 88.98
afl 76.56
By avg. rank
fuzzer average rank
aflplusplus 2.05
libafl 2.29
libfuzzer 3.00
honggfuzz 3.52
afl 4.05
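
The two aggregate rankings described above can be sketched as follows. This is a minimal illustration, not the report generator's own code: the function names are hypothetical, ties are assumed to receive the average of the tied ranks, and only three benchmarks are used (their medians are copied from the per-benchmark tables of this report), so the results will not match the 21-benchmark figures shown above.

```python
from statistics import mean

# Median reached coverage per fuzzer, copied from three per-benchmark tables.
medians = {
    "bloaty_fuzz_target": {
        "aflplusplus": 5494.5, "libafl": 5354.0, "honggfuzz": 5109.5,
        "libfuzzer": 5077.5, "afl": 4874.5,
    },
    "curl_curl_fuzzer_http": {
        "libafl": 10108.5, "honggfuzz": 9873.0, "aflplusplus": 9681.0,
        "afl": 9675.5, "libfuzzer": 8575.0,
    },
    "lcms_cms_transform_fuzzer": {
        "libafl": 1761.0, "libfuzzer": 1097.0, "honggfuzz": 1036.0,
        "aflplusplus": 781.5, "afl": 554.0,
    },
}


def average_scores(medians):
    """Average, over benchmarks, of each median as a percentage of the best median."""
    scores = {}
    for per_fuzzer in medians.values():
        best = max(per_fuzzer.values())
        for fuzzer, value in per_fuzzer.items():
            scores.setdefault(fuzzer, []).append(100.0 * value / best)
    return {fuzzer: mean(values) for fuzzer, values in scores.items()}


def average_ranks(medians):
    """Average, over benchmarks, of each fuzzer's rank (1 = highest median)."""
    ranks = {}
    for per_fuzzer in medians.values():
        values = list(per_fuzzer.values())
        for fuzzer, value in per_fuzzer.items():
            higher = sum(v > value for v in values)
            equal = sum(v == value for v in values)  # includes the fuzzer itself
            # Assumption: tied fuzzers share the average of the ranks they span.
            ranks.setdefault(fuzzer, []).append(higher + (equal + 1) / 2)
    return {fuzzer: mean(values) for fuzzer, values in ranks.items()}


scores = average_scores(medians)
ranks = average_ranks(medians)
for fuzzer in sorted(scores, key=scores.get, reverse=True):
    print(f"{fuzzer:12s} score={scores[fuzzer]:6.2f} rank={ranks[fuzzer]:.2f}")
```

The two rankings can disagree: the score rewards how close a fuzzer gets to the best median, while the rank only records the ordering on each benchmark.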
  • Critical difference diagram
    The diagram visualizes the average rank of fuzzers (second ranking above) while showing the significance of the differences as well. What is considered a "critical difference" (CD) is based on the Friedman/Nemenyi post-hoc test. See more in the documentation.
    Note: If a fuzzer does not support all benchmarks, its ranking as shown in this diagram can be lower than it should be. Please check the list of supported benchmarks for the fuzzers you are interested in. The list can be specified in the fuzzer's README.md.
  • Median relative code-coverages on each benchmark

    Note: The relative coverage summary table shows the median relative performance of each fuzzer to the experiment maximum. Thus the highest relative performance may not be 100%.
    trial_relative_coverage = trial_coverage / experiment_max_coverage

      libafl aflplusplus libfuzzer honggfuzz afl
    FuzzerMedian 97.50 98.00 91.00 92.00 92.50
    FuzzerMean 96.05 91.71 88.38 87.24 78.85
    bloaty_fuzz_target 96.00 98.00 91.00 91.00 87.00
    curl_curl_fuzzer_http 99.00 95.00 84.00 97.00 95.00
    freetype2_ftfuzzer 94.00 92.00 78.00 78.00 81.00
    harfbuzz_hb-shape-fuzzer 100.00 97.00 90.00 80.00 95.00
    jsoncpp_jsoncpp_fuzzer 98.00 99.00 99.00 99.00 98.00
    lcms_cms_transform_fuzzer 95.00 42.00 59.00 55.00 29.00
    libjpeg-turbo_libjpeg_turbo_fuzzer 99.00 99.00 99.00 99.00 99.00
    libpcap_fuzz_both 94.00 96.00 86.00 75.00 1.00
    libpng_libpng_read_fuzzer 98.00 97.00 98.00 92.00 92.00
    libxml2_xml nan 98.00 94.00 79.00 nan
    libxslt_xpath 96.00 99.00 79.00 97.00 83.00
    mbedtls_fuzz_dtlsclient 98.00 99.00 95.00 97.00 96.00
    openssl_x509 99.00 99.00 99.00 99.00 99.00
    openthread_ot-ip6-send-fuzzer 99.00 80.00 99.00 93.00 93.00
    proj4_proj_crs_to_crs_fuzzer 80.00 82.00 76.00 97.00 13.00
    re2_fuzzer 98.00 99.00 98.00 93.00 98.00
    sqlite3_ossfuzz 97.00 63.00 68.00 58.00 61.00
    systemd_fuzz-link-parser 93.00 98.00 79.00 89.00 87.00
    vorbis_decode_fuzzer 99.00 99.00 99.00 90.00 99.00
    woff2_convert_woff2ttf_fuzzer 96.00 98.00 88.00 79.00 76.00
    zlib_zlib_uncompress_fuzzer 93.00 97.00 98.00 95.00 95.00
    • Fuzzers are sorted by "FuzzerMean" (average median relative coverage), highest on the left.
    • Green background = highest relative median coverage.
    • Blue gradient background = greater than 95% relative median coverage.
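
The relative coverage formula above can be illustrated with the bloaty_fuzz_target trials. This is a sketch under two assumptions: that `experiment_max_coverage` is the highest coverage reached by any single trial on the benchmark, and that, with two trials per fuzzer, the min and max columns of that benchmark's table are the two trial values. The function name is hypothetical.

```python
from statistics import median

# Final coverage of each trial on bloaty_fuzz_target (min and max columns
# of that benchmark's sample statistics table, two trials per fuzzer).
trials = {
    "aflplusplus": [5430.0, 5559.0],
    "libafl": [5276.0, 5432.0],
    "honggfuzz": [5099.0, 5120.0],
    "libfuzzer": [5061.0, 5094.0],
    "afl": [4363.0, 5386.0],
}


def median_relative_coverage(trials):
    """Median of trial_coverage / experiment_max_coverage per fuzzer, in percent."""
    experiment_max = max(v for values in trials.values() for v in values)
    return {
        fuzzer: 100.0 * median(v / experiment_max for v in values)
        for fuzzer, values in trials.items()
    }


relative = median_relative_coverage(trials)
for fuzzer, value in relative.items():
    print(f"{fuzzer:12s} {value:.2f}")
```

Because the maximum comes from a single trial while each fuzzer is summarized by its median, even the best fuzzer stays below 100% here, which is the effect the note describes. Truncated to whole percentages, these values agree with the bloaty_fuzz_target row of the table.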

bloaty_fuzz_target summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    aflplusplus 1800 2.0 5494.5 91.216775 5430.0 5462.25 5494.5 5526.75 5559.0
    libafl 1800 2.0 5354.0 110.308658 5276.0 5315.00 5354.0 5393.00 5432.0
    honggfuzz 1800 2.0 5109.5 14.849242 5099.0 5104.25 5109.5 5114.75 5120.0
    libfuzzer 1800 2.0 5077.5 23.334524 5061.0 5069.25 5077.5 5085.75 5094.0
    afl 1800 2.0 4874.5 723.370237 4363.0 4618.75 4874.5 5130.25 5386.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row
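
The Vargha-Delaney A12 measure mentioned above can be sketched as follows. The function name is hypothetical, and the trial values are taken from the min and max columns of the bloaty_fuzz_target table, assuming these are the two trials of each fuzzer.

```python
def a12(xs, ys):
    """Estimate the probability that a value from xs exceeds one from ys.

    Ties count as half a win, so 0.5 means no difference between the groups.
    """
    greater = sum(x > y for x in xs for y in ys)
    equal = sum(x == y for x in xs for y in ys)
    return (greater + 0.5 * equal) / (len(xs) * len(ys))


aflplusplus = [5430.0, 5559.0]
libafl = [5276.0, 5432.0]
honggfuzz = [5099.0, 5120.0]

print(a12(aflplusplus, libafl))     # row fuzzer vs column fuzzer
print(a12(aflplusplus, honggfuzz))
```

With two trials per fuzzer only four pairs are compared, so A12 can only take a handful of distinct values.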

curl_curl_fuzzer_http summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 1800 2.0 10108.5 86.974134 10047.0 10077.75 10108.5 10139.25 10170.0
    honggfuzz 1800 2.0 9873.0 90.509668 9809.0 9841.00 9873.0 9905.00 9937.0
    aflplusplus 1800 2.0 9681.0 205.060967 9536.0 9608.50 9681.0 9753.50 9826.0
    afl 1800 2.0 9675.5 62.932504 9631.0 9653.25 9675.5 9697.75 9720.0
    libfuzzer 1800 2.0 8575.0 526.087445 8203.0 8389.00 8575.0 8761.00 8947.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row
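
To make the Mann-Whitney U test above concrete, here is a small exact (enumeration-based) two-sided version. It is a sketch only: the report's own p values may come from a different variant of the test (for example, a normal approximation), and the function names are hypothetical. The trial values are the min and max columns of the curl_curl_fuzzer_http table, assuming these are the two trials of each fuzzer.

```python
from itertools import combinations


def u_statistic(xs, ys):
    """Number of (x, y) pairs with x > y, counting ties as one half."""
    return sum((x > y) + 0.5 * (x == y) for x in xs for y in ys)


def exact_two_sided_p(xs, ys):
    """Exact two-sided p value: enumerate every split of the pooled values."""
    pooled = list(xs) + list(ys)
    center = len(xs) * len(ys) / 2  # expected U when the groups do not differ
    observed = abs(u_statistic(xs, ys) - center)
    total = 0
    extreme = 0
    for chosen in combinations(range(len(pooled)), len(xs)):
        picked = set(chosen)
        group_x = [pooled[i] for i in picked]
        group_y = [pooled[i] for i in range(len(pooled)) if i not in picked]
        total += 1
        if abs(u_statistic(group_x, group_y) - center) >= observed:
            extreme += 1
    return extreme / total


libafl = [10047.0, 10170.0]
libfuzzer = [8203.0, 8947.0]
print(exact_two_sided_p(libafl, libfuzzer))
```

With two trials per fuzzer there are only six possible splits, so the exact two-sided p value cannot fall below 2/6, even when every trial of one fuzzer beats every trial of the other, as it does for libafl and libfuzzer here.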

freetype2_ftfuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 1800 2.0 8106.5 12.020815 8098.0 8102.25 8106.5 8110.75 8115.0
    aflplusplus 1800 2.0 7983.0 873.983982 7365.0 7674.00 7983.0 8292.00 8601.0
    afl 1800 2.0 7014.0 52.325902 6977.0 6995.50 7014.0 7032.50 7051.0
    libfuzzer 1800 2.0 6763.5 760.139790 6226.0 6494.75 6763.5 7032.25 7301.0
    honggfuzz 1800 2.0 6750.5 34.648232 6726.0 6738.25 6750.5 6762.75 6775.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row
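
The unique and pairwise unique coverage definitions above amount to set differences. The sketch below uses made-up fuzzer names and branch IDs, since the report text does not include branch-level data; the function names are likewise hypothetical.

```python
# Hypothetical data: the set of branch IDs each fuzzer covered.
covered = {
    "fuzzer_a": {1, 2, 3, 4},
    "fuzzer_b": {2, 3, 5},
    "fuzzer_c": {3, 4},
}


def unique_branches(covered):
    """Branches covered by one fuzzer and by no other fuzzer."""
    result = {}
    for name, branches in covered.items():
        others = set().union(*(b for n, b in covered.items() if n != name))
        result[name] = branches - others
    return result


def pairwise_unique(covered):
    """cell[row][col] = number of branches covered by col but not by row."""
    return {
        row: {col: len(covered[col] - covered[row]) for col in covered}
        for row in covered
    }


print(unique_branches(covered))
print(pairwise_unique(covered)["fuzzer_b"]["fuzzer_a"])
```

The pairwise table is not symmetric: the cell for (row b, column a) counts what a found that b missed, while (row a, column b) counts the opposite.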

harfbuzz_hb-shape-fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
Error: The following fuzzers do not have enough samples: libafl.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 1800 1.0 10070.0 NaN 10070.0 10070.00 10070.0 10070.00 10070.0
    aflplusplus 1800 2.0 9847.5 94.045202 9781.0 9814.25 9847.5 9880.75 9914.0
    afl 1800 2.0 9614.5 88.388348 9552.0 9583.25 9614.5 9645.75 9677.0
    libfuzzer 1800 2.0 9086.0 73.539105 9034.0 9060.00 9086.0 9112.00 9138.0
    honggfuzz 1800 2.0 8140.5 23.334524 8124.0 8132.25 8140.5 8148.75 8157.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row
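
The sample statistics columns in this section can be reproduced with a few lines of standard-library code, which also shows why libafl's std is NaN: the sample standard deviation is undefined for a single trial. This is a sketch that assumes linearly interpolated quartiles and the n - 1 (sample) standard deviation, both of which are consistent with the figures in the table; `describe` is a hypothetical helper name. The aflplusplus values are the min and max columns above, taken to be its two trials.

```python
import math
from statistics import mean, median, stdev


def describe(values):
    """Summary statistics for one fuzzer's trials at a given time."""
    ordered = sorted(values)

    def quantile(q):
        # Linear interpolation between the two closest order statistics.
        pos = q * (len(ordered) - 1)
        lo, hi = math.floor(pos), math.ceil(pos)
        return ordered[lo] + (ordered[hi] - ordered[lo]) * (pos - lo)

    return {
        "count": float(len(ordered)),
        "mean": mean(ordered),
        # The sample standard deviation needs at least two values.
        "std": stdev(ordered) if len(ordered) > 1 else math.nan,
        "min": ordered[0],
        "25%": quantile(0.25),
        "median": median(ordered),
        "75%": quantile(0.75),
        "max": ordered[-1],
    }


print(describe([9781.0, 9914.0]))  # aflplusplus, two trials
print(describe([10070.0]))         # libafl, a single trial
```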

jsoncpp_jsoncpp_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libfuzzer 1800 2.0 524.5 0.707107 524.0 524.25 524.5 524.75 525.0
    honggfuzz 1800 2.0 521.0 1.414214 520.0 520.50 521.0 521.50 522.0
    aflplusplus 1800 2.0 520.0 0.000000 520.0 520.00 520.0 520.00 520.0
    libafl 1800 2.0 518.0 1.414214 517.0 517.50 518.0 518.50 519.0
    afl 1800 2.0 517.0 0.000000 517.0 517.00 517.0 517.00 517.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

lcms_cms_transform_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 1800 2.0 1761.0 127.279221 1671.0 1716.00 1761.0 1806.00 1851.0
    libfuzzer 1800 2.0 1097.0 332.340187 862.0 979.50 1097.0 1214.50 1332.0
    honggfuzz 1800 2.0 1036.0 503.460028 680.0 858.00 1036.0 1214.00 1392.0
    aflplusplus 1800 2.0 781.5 30.405592 760.0 770.75 781.5 792.25 803.0
    afl 1800 2.0 554.0 19.798990 540.0 547.00 554.0 561.00 568.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libjpeg-turbo_libjpeg_turbo_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    aflplusplus 1800 2.0 2544.5 0.707107 2544.0 2544.25 2544.5 2544.75 2545.0
    libafl 1800 2.0 2543.0 0.000000 2543.0 2543.00 2543.0 2543.00 2543.0
    libfuzzer 1800 2.0 2542.5 2.121320 2541.0 2541.75 2542.5 2543.25 2544.0
    afl 1800 2.0 2541.0 0.000000 2541.0 2541.00 2541.0 2541.00 2541.0
    honggfuzz 1800 2.0 2528.5 2.121320 2527.0 2527.75 2528.5 2529.25 2530.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libpcap_fuzz_both summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    aflplusplus 1800 2.0 2263.5 106.773124 2188.0 2225.75 2263.5 2301.25 2339.0
    libafl 1800 2.0 2215.0 50.911688 2179.0 2197.00 2215.0 2233.00 2251.0
    libfuzzer 1800 2.0 2031.5 154.856385 1922.0 1976.75 2031.5 2086.25 2141.0
    honggfuzz 1800 2.0 1776.5 74.246212 1724.0 1750.25 1776.5 1802.75 1829.0
    afl 1800 2.0 33.5 0.707107 33.0 33.25 33.5 33.75 34.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libpng_libpng_read_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libfuzzer 1800 2.0 1989.5 33.234019 1966.0 1977.75 1989.5 2001.25 2013.0
    libafl 1800 2.0 1974.5 3.535534 1972.0 1973.25 1974.5 1975.75 1977.0
    aflplusplus 1800 2.0 1954.5 0.707107 1954.0 1954.25 1954.5 1954.75 1955.0
    honggfuzz 1800 2.0 1863.0 33.941125 1839.0 1851.00 1863.0 1875.00 1887.0
    afl 1800 2.0 1860.0 121.622366 1774.0 1817.00 1860.0 1903.00 1946.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libxml2_xml summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    aflplusplus 1800 2.0 13983.0 141.421356 13883.0 13933.0 13983.0 14033.0 14083.0
    libfuzzer 1800 2.0 13471.0 155.563492 13361.0 13416.0 13471.0 13526.0 13581.0
    honggfuzz 1800 2.0 11255.0 272.943218 11062.0 11158.5 11255.0 11351.5 11448.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

libxslt_xpath summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    aflplusplus 1800 2.0 10599.5 43.133514 10569.0 10584.25 10599.5 10614.75 10630.0
    honggfuzz 1800 2.0 10324.0 186.676190 10192.0 10258.00 10324.0 10390.00 10456.0
    libafl 1800 2.0 10226.0 7.071068 10221.0 10223.50 10226.0 10228.50 10231.0
    afl 1800 2.0 8921.5 1498.359269 7862.0 8391.75 8921.5 9451.25 9981.0
    libfuzzer 1800 2.0 8410.0 41.012193 8381.0 8395.50 8410.0 8424.50 8439.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

mbedtls_fuzz_dtlsclient summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    aflplusplus 1800 2.0 2564.0 9.899495 2557.0 2560.50 2564.0 2567.50 2571.0
    libafl 1800 2.0 2548.0 48.083261 2514.0 2531.00 2548.0 2565.00 2582.0
    honggfuzz 1800 2.0 2527.0 11.313708 2519.0 2523.00 2527.0 2531.00 2535.0
    afl 1800 2.0 2483.5 23.334524 2467.0 2475.25 2483.5 2491.75 2500.0
    libfuzzer 1800 2.0 2455.0 32.526912 2432.0 2443.50 2455.0 2466.50 2478.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

openssl_x509 summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 1800 2.0 5817.5 0.707107 5817.0 5817.25 5817.5 5817.75 5818.0
    aflplusplus 1800 2.0 5801.5 3.535534 5799.0 5800.25 5801.5 5802.75 5804.0
    honggfuzz 1800 2.0 5794.5 3.535534 5792.0 5793.25 5794.5 5795.75 5797.0
    afl 1800 2.0 5792.5 12.020815 5784.0 5788.25 5792.5 5796.75 5801.0
    libfuzzer 1800 2.0 5791.5 6.363961 5787.0 5789.25 5791.5 5793.75 5796.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

openthread_ot-ip6-send-fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libfuzzer 1800 2.0 2947.0 19.798990 2933.0 2940.00 2947.0 2954.00 2961.0
    libafl 1800 2.0 2937.5 4.949747 2934.0 2935.75 2937.5 2939.25 2941.0
    afl 1800 2.0 2779.5 23.334524 2763.0 2771.25 2779.5 2787.75 2796.0
    honggfuzz 1800 2.0 2766.0 16.970563 2754.0 2760.00 2766.0 2772.00 2778.0
    aflplusplus 1800 2.0 2385.0 149.906638 2279.0 2332.00 2385.0 2438.00 2491.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

proj4_proj_crs_to_crs_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    honggfuzz 1800 2.0 5139.5 177.483802 5014.0 5076.75 5139.5 5202.25 5265.0
    aflplusplus 1800 2.0 4348.5 243.951840 4176.0 4262.25 4348.5 4434.75 4521.0
    libafl 1800 2.0 4212.5 12.020815 4204.0 4208.25 4212.5 4216.75 4221.0
    libfuzzer 1800 2.0 4031.5 133.643182 3937.0 3984.25 4031.5 4078.75 4126.0
    afl 1800 2.0 712.0 22.627417 696.0 704.00 712.0 720.00 728.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

re2_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    aflplusplus 1800 2.0 2826.5 37.476659 2800.0 2813.25 2826.5 2839.75 2853.0
    libfuzzer 1800 2.0 2823.0 1.414214 2822.0 2822.50 2823.0 2823.50 2824.0
    afl 1800 2.0 2812.0 2.828427 2810.0 2811.00 2812.0 2813.00 2814.0
    libafl 1800 2.0 2811.5 4.949747 2808.0 2809.75 2811.5 2813.25 2815.0
    honggfuzz 1800 2.0 2672.0 14.142136 2662.0 2667.00 2672.0 2677.00 2682.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

sqlite3_ossfuzz summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libafl 1800 2.0 17382.5 750.240295 16852.0 17117.25 17382.5 17647.75 17913.0
    libfuzzer 1800 2.0 12237.5 574.877813 11831.0 12034.25 12237.5 12440.75 12644.0
    aflplusplus 1800 2.0 11411.0 332.340187 11176.0 11293.50 11411.0 11528.50 11646.0
    afl 1800 2.0 11055.0 79.195959 10999.0 11027.00 11055.0 11083.00 11111.0
    honggfuzz 1800 2.0 10457.0 247.487373 10282.0 10369.50 10457.0 10544.50 10632.0

    Vargha-Delaney A12 measure
    The table summarizes the A12 values from the pairwise Vargha-Delaney A measure of effect size. Green cells indicate the probability the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distribution of a given fuzzer pair is significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzers).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row

vorbis_decode_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer time count mean std min 25% median 75% max
    libfuzzer 1800 2.0 1231.5 3.535534 1229.0 1230.25 1231.5 1232.75 1234.0
    afl 1800 2.0 1228.0 2.828427 1226.0 1227.00 1228.0 1229.00 1230.0
    aflplusplus 1800 2.0 1223.5 14.849242 1213.0 1218.25 1223.5 1228.75 1234.0
    libafl 1800 2.0 1223.5 4.949747 1220.0 1221.75 1223.5 1225.25 1227.0
    honggfuzz 1800 2.0 1113.5 2.121320 1112.0 1112.75 1113.5 1114.25 1115.0

    Vargha-Delaney A12 measure
    The table summarizes the pairwise Vargha-Delaney A12 effect sizes. Green cells indicate the probability that the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p-values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distributions of a given fuzzer pair are significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzer).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row.
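
To make the pairwise Mann-Whitney U test more concrete, here is a minimal sketch that computes an exact two-sided p-value by enumerating every way of relabelling the pooled trials. It is an illustrative permutation formulation, not the report's own pipeline, which may rely on a library routine or an approximation, so its p-values need not match. The function names are hypothetical, and the trial values assume that min and max in the vorbis_decode_fuzzer table are the two trials of each fuzzer.

```python
from itertools import combinations


def u_statistic(xs, ys):
    """Number of (x, y) pairs with x > y, counting ties as one half."""
    return sum(1.0 if x > y else 0.5 if x == y else 0.0
               for x in xs for y in ys)


def exact_mann_whitney_p(xs, ys):
    """Two-sided exact p-value over all relabellings of the pooled trials."""
    pooled = list(xs) + list(ys)
    n1, n2 = len(xs), len(ys)
    center = n1 * n2 / 2.0  # expected U when both fuzzers behave alike
    observed = abs(u_statistic(xs, ys) - center)
    extreme = total = 0
    for idx in combinations(range(len(pooled)), n1):
        chosen = set(idx)
        a = [pooled[i] for i in idx]
        b = [pooled[i] for i in range(len(pooled)) if i not in chosen]
        total += 1
        if abs(u_statistic(a, b) - center) >= observed:
            extreme += 1
    return extreme / total


# Assumption: with count = 2, min and max are the two trial values.
libfuzzer = [1229.0, 1234.0]
honggfuzz = [1112.0, 1115.0]
afl = [1226.0, 1230.0]

print(exact_mann_whitney_p(libfuzzer, honggfuzz))
print(exact_mann_whitney_p(libfuzzer, afl))
```

In this sketch, two trials per fuzzer allow only six relabellings, so the smallest attainable two-sided p-value is 2/6, even when the two coverage ranges do not overlap at all.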

woff2_convert_woff2ttf_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer       time  count    mean         std     min      25%  median      75%     max
    aflplusplus  1800    2.0  1135.5    4.949747  1132.0  1133.75  1135.5  1137.25  1139.0
    libafl       1800    2.0  1110.5   55.861436  1071.0  1090.75  1110.5  1130.25  1150.0
    libfuzzer    1800    2.0  1015.5   36.062446   990.0  1002.75  1015.5  1028.25  1041.0
    honggfuzz    1800    2.0   909.0   26.870058   890.0   899.50   909.0   918.50   928.0
    afl          1800    2.0   883.5  163.341666   768.0   825.75   883.5   941.25   999.0

    Vargha-Delaney A12 measure
    The table summarizes the pairwise Vargha-Delaney A12 effect sizes. Green cells indicate the probability that the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p-values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distributions of a given fuzzer pair are significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzer).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row.
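
The two unique-coverage views described above amount to set differences over the branches each fuzzer covered. The sketch below shows one way this could be computed. The function names and the branch IDs are hypothetical and are not taken from the experiment data or the FuzzBench code base.

```python
def pairwise_unique(covered):
    """Return {row: {col: n}}, where n is the number of branches covered
    by the column fuzzer but not by the row fuzzer."""
    return {
        row: {col: len(covered[col] - covered[row]) for col in covered}
        for row in covered
    }


def unique_to_each(covered):
    """Branches each fuzzer covered that no other fuzzer covered."""
    result = {}
    for name, branches in covered.items():
        others = set().union(*(b for n, b in covered.items() if n != name))
        result[name] = branches - others
    return result


# Hypothetical branch IDs, chosen only for illustration.
covered = {
    "aflplusplus": {1, 2, 3, 4, 5},
    "libafl": {1, 2, 3, 6},
    "afl": {1, 2},
}

table = pairwise_unique(covered)
print(table["afl"]["aflplusplus"])  # branches aflplusplus has that afl lacks
print(unique_to_each(covered))
```

The pairwise table is not symmetric: the cell for row afl and column aflplusplus counts different branches than the cell for row aflplusplus and column afl.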

zlib_zlib_uncompress_fuzzer summary

Ranking by median reached code coverage
Reached code coverage distribution
Mean code coverage growth over time
* The error bands show the 95% confidence interval around the mean code coverage.
  • Sample statistics and statistical significance (code coverage)
    Code coverage sample statistics
    fuzzer       time  count   mean        std    min     25%  median     75%    max
    libfuzzer    1800    2.0  462.5  12.020815  454.0  458.25   462.5  466.75  471.0
    aflplusplus  1800    2.0  458.5   3.535534  456.0  457.25   458.5  459.75  461.0
    honggfuzz    1800    2.0  452.0   4.242641  449.0  450.50   452.0  453.50  455.0
    afl          1800    2.0  451.0   7.071068  446.0  448.50   451.0  453.50  456.0
    libafl       1800    2.0  439.0  12.727922  430.0  434.50   439.0  443.50  448.0

    Vargha-Delaney A12 measure
    The table summarizes the pairwise Vargha-Delaney A12 effect sizes. Green cells indicate the probability that the fuzzer in the row will outperform the fuzzer in the column.
    Mann-Whitney U test
    The table summarizes the p-values of pairwise Mann-Whitney U tests. Green cells indicate that the reached coverage distributions of a given fuzzer pair are significantly different.
  • Unique code coverage plots
    Ranking by unique code branches covered
    Each bar shows the total number of code branches found by a given fuzzer. The colored area shows the number of unique code branches (i.e., branches that were not covered by any other fuzzer).
    Pairwise unique code coverage
    Each cell represents the number of code branches covered by the fuzzer of the column but not by the fuzzer of the row.
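
The columns of the sample statistics table can be reproduced from the per-trial coverage values. The sketch below does this for the libfuzzer row of the zlib_zlib_uncompress_fuzzer table, under two assumptions: that min and max are the two trial values (the count is 2), and that quantiles use linear interpolation between the sorted values. The helper names `quantile` and `describe` are hypothetical.

```python
import statistics


def quantile(values, q):
    """Quantile with linear interpolation between the sorted values."""
    ordered = sorted(values)
    pos = q * (len(ordered) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(ordered) - 1)
    return ordered[lo] + (pos - lo) * (ordered[hi] - ordered[lo])


def describe(values):
    """Summary statistics in the same column order as the table."""
    return {
        "count": float(len(values)),
        "mean": statistics.mean(values),
        "std": statistics.stdev(values),  # sample std, divides by n - 1
        "min": min(values),
        "25%": quantile(values, 0.25),
        "median": quantile(values, 0.5),
        "75%": quantile(values, 0.75),
        "max": max(values),
    }


# Assumption: the two libfuzzer trials are the min and max from the table.
libfuzzer = [454.0, 471.0]
stats = describe(libfuzzer)
for name, value in stats.items():
    print(f"{name:>6}: {value:.6f}")
```

With only two trials, the mean and the median coincide and the standard deviation is simply the gap between the two trials divided by the square root of 2, which is why the std column varies so much from fuzzer to fuzzer.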

experiment data

You can download the raw data for this report here.

Check out the documentation on how to create customized reports using this data. Also see some example Colab notebooks for doing custom analysis on the data here.

Experiment Description:

None provided.