Ability to set `maxTokens` when calling `prompt()` #36

duci9y · 2024-08-29T21:34:40Z

I'd like to be able to constrain the output to N tokens for my use case (tab autocompletion). Otherwise the model generates strings that are too long to be useful. It also takes more time.

etiennenoel · 2025-05-27T23:08:44Z

Do you have a use case for N tokens or number of words? I am tempted to think that developers will probably want a number of words and if that's the case,

@domenic Should we ask devs to do this using Regex with Structured Output?

domenic · 2025-05-28T01:07:58Z

Although I agree there is some overlap with response constraints, I think there's sufficient precedent in server-side APIs for exactly this constraint such that we might want to include it. Additionally, we already expose token limits in the API (via inputUsage / inputQuota).

It's possible most people using this on server-side APIs are concerned about costs, which are not as applicable for built-in AI. But here we have at least one user suggesting it's about usefulness and time taken.

What I can't easily find out from public docs is how this is implemented in existing server-side APIs. Does it get piped to the model, so that the model tries to stay within the limit? Or is it just a hard cutoff on the frontend, whenever the limit is reached?

Some experimentation in the OpenAI playground implies it's the latter:

domenic mentioned this issue Sep 18, 2024

Sampling hyperparameters are not universal among models #42

Open

domenic added the enhancement New feature or request label Oct 9, 2024

domenic added the ecosystem parity A feature that other popular language model APIs offer label Jun 5, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Ability to set `maxTokens` when calling `prompt()` #36

Ability to set `maxTokens` when calling `prompt()` #36

duci9y commented Aug 29, 2024

etiennenoel commented May 27, 2025

Uh oh!

domenic commented May 28, 2025

Uh oh!

Ability to set maxTokens when calling prompt() #36

Ability to set maxTokens when calling prompt() #36

Comments

duci9y commented Aug 29, 2024

etiennenoel commented May 27, 2025

Uh oh!

domenic commented May 28, 2025

Uh oh!

Ability to set `maxTokens` when calling `prompt()` #36

Ability to set `maxTokens` when calling `prompt()` #36