Blog
Replicate
STABLE

Vicuna 13B

A large language model that's been fine-tuned on ChatGPT interactions

Inputs
Prompt
Prompt to send to Llama.
Debug
provide debugging output in logs
Max Length
Maximum number of tokens to generate. A word is generally 2-3 tokens
Output Index
If output is a list, which element to use
Repetition Penalty
Penalty for repeated words in generated text; 1 is no penalty, values greater than 1 discourage repetition, less than 1 encourage it.
Seed
Seed for random number generator, for reproducibility
Temperature
Adjusts randomness of outputs, greater than 1 is random and 0 is deterministic, 0.75 is a good starting value.
Top P
When decoding text, samples from the top p percentage of most likely tokens; lower to ignore less likely tokens
Outputs
Value
The output string