1) Summarization (enterprise)¶
Goal¶
Produce concise, faithful summaries with low variance.
Typical risks¶
- hallucinated facts
- missing critical details
- overlong output
Recommended baseline settings¶
temperature: 0.0โ0.3top_p: 1.0 (tune only if you choose nucleus sampling)presence_penalty: 0frequency_penalty: 0โ0.3 (optional)max_tokens: set pretty low (e.g., 128โ512) depending on policy
Strong pattern: structured summary schema¶
Use response_format with json_schema so your summary always includes the fields leadership cares about.
Example schema:
{
"name": "Summary",
"strict": true,
"schema": {
"type": "object",
"additionalProperties": false,
"properties": {
"title": {"type": "string"},
"key_points": {"type": "array", "items": {"type": "string"}},
"risks": {"type": "array", "items": {"type": "string"}},
"next_steps": {"type": "array", "items": {"type": "string"}}
},
"required": ["title", "key_points", "risks", "next_steps"]
}
}
How to test¶
Sweep temperature and max_tokens first; measure:
- summary length compliance
- JSON validity (if structured)
- human faithfulness score
Related docs: Parameters โ Decoding, Structured Outputs; Experiments โ Presets.