Smithery Logo
MCPsSkillsDocsPricing
Login
Smithery Logo

Accelerating the Agent Economy

Resources

DocumentationPrivacy PolicySystem Status

Company

PricingAboutBlog

Connect

© 2026 Smithery. All rights reserved.

    parcadei

    llm-tuning-patterns

    parcadei/llm-tuning-patterns
    AI & ML
    3,502
    2 installs

    About

    SKILL.md

    Install

    Install via Skills CLI

    or add to your agent
    • Claude Code
      Claude Code
    • Codex
      Codex
    • OpenClaw
      OpenClaw
    • Cursor
      Cursor
    • Amp
      Amp
    • GitHub Copilot
      GitHub Copilot
    • Gemini CLI
      Gemini CLI
    • Kilo Code
      Kilo Code
    • Junie
      Junie
    • Replit
      Replit
    • Windsurf
      Windsurf
    • Cline
      Cline
    • Continue
      Continue
    • OpenCode
      OpenCode
    • OpenHands
      OpenHands
    • Roo Code
      Roo Code
    • Augment
      Augment
    • Goose
      Goose
    • Trae
      Trae
    • Zencoder
      Zencoder
    • Antigravity
      Antigravity
    ├─
    ├─
    └─

    About

    LLM Tuning Patterns

    SKILL.md

    LLM Tuning Patterns

    Evidence-based patterns for configuring LLM parameters, based on APOLLO and Godel-Prover research.

    Pattern

    Different tasks require different LLM configurations. Use these evidence-based settings.

    Theorem Proving / Formal Reasoning

    Based on APOLLO parity analysis:

    Parameter Value Rationale
    max_tokens 4096 Proofs need space for chain-of-thought
    temperature 0.6 Higher creativity for tactic exploration
    top_p 0.95 Allow diverse proof paths

    Proof Plan Prompt

    Always request a proof plan before tactics:

    Given the theorem to prove:
    [theorem statement]
    
    First, write a high-level proof plan explaining your approach.
    Then, suggest Lean 4 tactics to implement each step.
    

    The proof plan (chain-of-thought) significantly improves tactic quality.

    Parallel Sampling

    For hard proofs, use parallel sampling:

    • Generate N=8-32 candidate proof attempts
    • Use best-of-N selection
    • Each sample at temperature 0.6-0.8

    Code Generation

    Parameter Value Rationale
    max_tokens 2048 Sufficient for most functions
    temperature 0.2-0.4 Prefer deterministic output

    Creative / Exploration Tasks

    Parameter Value Rationale
    max_tokens 4096 Space for exploration
    temperature 0.8-1.0 Maximum creativity

    Anti-Patterns

    • Too low tokens for proofs: 512 tokens truncates chain-of-thought
    • Too low temperature for proofs: 0.2 misses creative tactic paths
    • No proof plan: Jumping to tactics without planning reduces success rate

    Source Sessions

    • This session: APOLLO parity - increased max_tokens 512->4096, temp 0.2->0.6
    • This session: Added proof plan prompt for chain-of-thought before tactics
    Recommended Servers
    Vercel Grep
    Vercel Grep
    Parallel Web Search
    Parallel Web Search
    Nimble MCP Server
    Nimble MCP Server
    Repository
    parcadei/continuous-claude-v3
    Files