Fuzz introspector: fuzz_detokenize
For issues and ideas: https://github.com/ossf/fuzz-introspector/issues

Fuzz blockers

The following nodes are call sites in the fuzzer calltree where fuzz blockers occur, listed in descending order of the number of call sites each one blocks.

Amount of callsites blocked  Calltree index  Parent function                                     Callsite          Largest blocked function
58                           49              sacremoses.tokenize.MosesTokenizer.__init__         call site: 00049  sacremoses.cli.truecase_file
10                           10              sacremoses.normalize.MosesPunctNormalizer.__init__  call site: 00010  sacremoses.cli.tokenize_file
5                            29              sacremoses.corpus.NonbreakingPrefixes.words         call site: 00029  sacremoses.tokenize.MosesTokenizer.has_numeric_only
3                            21              sacremoses.tokenize.MosesTokenizer.__init__         call site: 00021  sacremoses.corpus.NonbreakingPrefixes.words
2                            6               ...fuzz_detokenize.TestOneInput                     call site: 00006  sacremoses.cli.normalize_file
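
The blocker rows above can be split into their five fields for sorting or further analysis. The following is a minimal sketch, not part of Fuzz Introspector itself: the names ROW_RE, ROWS and parse_blockers are hypothetical, and the regular expression assumes that function names contain no whitespace and that every row carries the literal text "call site:".

```python
import re

# Hypothetical helper: one blocker row is
# "<blocked> <calltree index> <parent function> call site: <index> <largest blocked function>".
ROW_RE = re.compile(r"^(\d+)\s+(\d+)\s+(\S+)\s+call site:\s+(\d+)\s+(\S+)$")

# Rows copied from the fuzz blockers table in this report.
ROWS = """\
58 49 sacremoses.tokenize.MosesTokenizer.__init__ call site: 00049 sacremoses.cli.truecase_file
10 10 sacremoses.normalize.MosesPunctNormalizer.__init__ call site: 00010 sacremoses.cli.tokenize_file
5 29 sacremoses.corpus.NonbreakingPrefixes.words call site: 00029 sacremoses.tokenize.MosesTokenizer.has_numeric_only
3 21 sacremoses.tokenize.MosesTokenizer.__init__ call site: 00021 sacremoses.corpus.NonbreakingPrefixes.words
2 6 ...fuzz_detokenize.TestOneInput call site: 00006 sacremoses.cli.normalize_file
"""


def parse_blockers(text):
    """Return one dict per table row; lines that do not match are skipped."""
    blockers = []
    for line in text.splitlines():
        m = ROW_RE.match(line.strip())
        if m:
            blockers.append({
                "blocked": int(m.group(1)),
                "calltree_index": int(m.group(2)),
                "parent": m.group(3),
                "call_site": m.group(4),
                "largest_blocked": m.group(5),
            })
    return blockers


blockers = parse_blockers(ROWS)
# The blocker that hides the most call sites is the first candidate to investigate.
worst = max(blockers, key=lambda b: b["blocked"])
print(worst["parent"], worst["call_site"], worst["blocked"])
```

In this report the largest blocker sits at call site 00049 inside sacremoses.tokenize.MosesTokenizer.__init__, so that is where changes to the harness are likely to unlock the most code.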

Fuzzer calltree

Each entry lists the call depth, the called function, and the index of its call site in the calltree.

0 ...fuzz_detokenize.TestOneInput [function] [call site] 00000
1 atheris.FuzzedDataProvider [function] [call site] 00001
1 fdp.ConsumeIntInRange [function] [call site] 00002
1 fdp.ConsumeUnicodeNoSurrogates [function] [call site] 00003
1 sacremoses.tokenize.MosesDetokenizer.__init__ [function] [call site] 00004
2 <builtin>.super [function] [call site] 00005
1 sacremoses.cli.processor.new_func.processor [function] [call site] 00006
2 sacremoses.cli.normalize_file [function] [call site] 00007
3 sacremoses.normalize.MosesPunctNormalizer.__init__ [function] [call site] 00008
4 itertools.chain [function] [call site] 00009
4 <builtin>.list [function] [call site] 00010
3 functools.partial [function] [call site] 00011
3 sacremoses.cli.parallel_or_not [function] [call site] 00012
4 functools.partial [function] [call site] 00013
4 sacremoses.util.parallelize_preprocess [function] [call site] 00014
5 tqdm.tqdm [function] [call site] 00015
5 <builtin>.map [function] [call site] 00016
5 joblib.delayed [function] [call site] 00017
5 joblib.Parallel [function] [call site] 00018
2 sacremoses.cli.tokenize_file [function] [call site] 00019
3 sacremoses.tokenize.MosesTokenizer.__init__ [function] [call site] 00020
4 <builtin>.super [function] [call site] 00021
4 _nbp.strip [function] [call site] 00022
4 sacremoses.corpus.NonbreakingPrefixes.words [function] [call site] 00023
5 <builtin>.set [function] [call site] 00024
5 os.path.join [function] [call site] 00025
5 pkgutil.get_data [function] [call site] 00026
5 binary_data.decode [function] [call site] 00027
5 line.strip [function] [call site] 00028
5 line.startswith [function] [call site] 00029
4 <builtin>.open [function] [call site] 00030
4 line.strip [function] [call site] 00031
4 line.startswith [function] [call site] 00032
4 w.rpartition [function] [call site] 00033
4 sacremoses.tokenize.MosesTokenizer.has_numeric_only [function] [call site] 00034
5 re.search [function] [call site] 00035
5 <builtin>.bool [function] [call site] 00036
4 sacremoses.corpus.Perluniprops.chars [function] [call site] 00037
5 os.path.join [function] [call site] 00038
5 pkgutil.get_data [function] [call site] 00039
5 binary_data.decode [function] [call site] 00040
4 <builtin>.str [function] [call site] 00041
4 sacremoses.corpus.Perluniprops.chars [function] [call site] 00042
4 <builtin>.str [function] [call site] 00043
4 sacremoses.corpus.Perluniprops.chars [function] [call site] 00044
4 <builtin>.str [function] [call site] 00045
4 sacremoses.corpus.Perluniprops.chars [function] [call site] 00046
4 <builtin>.str [function] [call site] 00047
4 sacremoses.corpus.Perluniprops.chars [function] [call site] 00048
4 <builtin>.str [function] [call site] 00049
3 pattern.strip [function] [call site] 00051
3 fin.readlines [function] [call site] 00052
3 functools.partial [function] [call site] 00053
3 sacremoses.cli.parallel_or_not [function] [call site] 00054
2 sacremoses.cli.train_truecaser [function] [call site] 00055
3 sacremoses.truecase.MosesTruecaser.__init__ [function] [call site] 00056
4 <builtin>.super [function] [call site] 00057
4 re.compile [function] [call site] 00058
4 re.compile [function] [call site] 00059
4 sacremoses.truecase.MosesTruecaser._load_model [function] [call site] 00060
5 collections.defaultdict [function] [call site] 00061
5 line.strip [function] [call site] 00063
5 sacremoses.util.grouper [function] [call site] 00064
6 <builtin>.iter [function] [call site] 00065
6 itertools.zip_longest [function] [call site] 00066
5 count.split [function] [call site] 00067
5 token.lower [function] [call site] 00068
5 <builtin>.int [function] [call site] 00069
5 sacremoses.truecase.MosesTruecaser._casing_to_model [function] [call site] 00070
6 sacremoses.truecase.Counter [function] [call site] 00071
6 collections.Counter [function] [call site] 00072
3 sacremoses.truecase.MosesTruecaser.train [function] [call site] 00073
4 sacremoses.truecase.MosesTruecaser._train [function] [call site] 00074
5 collections.defaultdict [function] [call site] 00075
5 functools.partial [function] [call site] 00076
5 itertools.chain [function] [call site] 00077
5 sacremoses.util.parallelize_preprocess [function] [call site] 00078
5 sacremoses.truecase.MosesTruecaser._save_model_from_casing [function] [call site] 00079
6 <builtin>.sum [function] [call site] 00081
6 <builtin>.enumerate [function] [call site] 00082
6 tokens_counts.append [function] [call site] 00083
6 <builtin>.print [function] [call site] 00084
5 sacremoses.truecase.MosesTruecaser._casing_to_model [function] [call site] 00085
3 sacremoses.truecase.MosesTruecaser.save_model [function] [call site] 00086
4 sacremoses.truecase.MosesTruecaser._save_model_from_casing [function] [call site] 00087
2 sacremoses.cli.detruecase_file [function] [call site] 00088
3 sacremoses.truecase.MosesDetruecaser.__init__ [function] [call site] 00089
4 <builtin>.super [function] [call site] 00090
3 functools.partial [function] [call site] 00091
3 sacremoses.cli.parallel_or_not [function] [call site] 00092
2 sacremoses.cli.detokenize_file [function] [call site] 00093
3 sacremoses.tokenize.MosesDetokenizer.__init__ [function] [call site] 00094
3 functools.partial [function] [call site] 00095
3 sacremoses.cli.parallel_or_not [function] [call site] 00096
3 <builtin>.map [function] [call site] 00097
3 <builtin>.list [function] [call site] 00098
2 sacremoses.cli.truecase_file [function] [call site] 00099
3 os.path.isfile [function] [call site] 00100
3 copy.deepcopy [function] [call site] 00101
3 sacremoses.truecase.MosesTruecaser.__init__ [function] [call site] 00102
3 sacremoses.truecase.MosesTruecaser.train [function] [call site] 00103
3 sacremoses.truecase.MosesTruecaser.save_model [function] [call site] 00104
3 sacremoses.truecase.MosesTruecaser.__init__ [function] [call site] 00105
3 functools.partial [function] [call site] 00106
3 sacremoses.cli.parallel_or_not [function] [call site] 00107
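
The depth column is enough to recover which function each call site belongs to, which is how a blocker's "Parent function" can be cross-checked against the calltree. The sketch below is an illustration under assumptions rather than the tool's own logic: LINE_RE and parse_calltree are hypothetical names, and the code assumes that a node's parent is the nearest preceding entry with a smaller depth.

```python
import re

# Excerpt copied from the calltree above:
# "<depth> <callee> [function] [call site] <index>".
CALLTREE_EXCERPT = """\
0 ...fuzz_detokenize.TestOneInput [function] [call site] 00000
1 atheris.FuzzedDataProvider [function] [call site] 00001
1 fdp.ConsumeIntInRange [function] [call site] 00002
1 fdp.ConsumeUnicodeNoSurrogates [function] [call site] 00003
1 sacremoses.tokenize.MosesDetokenizer.__init__ [function] [call site] 00004
2 <builtin>.super [function] [call site] 00005
1 sacremoses.cli.processor.new_func.processor [function] [call site] 00006
2 sacremoses.cli.normalize_file [function] [call site] 00007
3 sacremoses.normalize.MosesPunctNormalizer.__init__ [function] [call site] 00008
4 itertools.chain [function] [call site] 00009
4 <builtin>.list [function] [call site] 00010
"""

LINE_RE = re.compile(r"^(\d+)\s+(\S+)\s+\[function\]\s+\[call site\]\s+(\d+)$")


def parse_calltree(text):
    """Return {call_site_index: (depth, callee, parent_index_or_None)}."""
    nodes = {}
    stack = []  # (depth, index) pairs for the current chain of ancestors
    for line in text.splitlines():
        m = LINE_RE.match(line.strip())
        if not m:
            continue
        depth, callee, index = int(m.group(1)), m.group(2), m.group(3)
        # Drop entries at the same or a deeper level; what remains on top is the parent.
        while stack and stack[-1][0] >= depth:
            stack.pop()
        parent = stack[-1][1] if stack else None
        nodes[index] = (depth, callee, parent)
        stack.append((depth, index))
    return nodes


nodes = parse_calltree(CALLTREE_EXCERPT)
depth, callee, parent = nodes["00010"]
print(callee, "is called from", nodes[parent][1])
```

For call site 00010 this resolves to sacremoses.normalize.MosesPunctNormalizer.__init__, which matches the parent function the blocker table reports for that call site.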