Prompt model inference configuration
Maximum length of output
List of stop sequences
Controls randomness, higher values increase diversity
Sample from the k most likely next tokens
Cumulative probability cutoff for token selection