Skip to content

Rate Limits

Rate limits are enforced around account tier, API key settings, input tokens per minute, output tokens per minute, and request count per minute.

TierInput tokens/minOutput tokens/minRequests/min
beginner500,00080,00060
steady2,000,000200,000120
persistent5,000,000400,000300
tryharder10,000,000800,0001,200
unlimited override100,000,00050,000,000100,000

These defaults can be overridden by environment configuration.

API keys can be account-pool keys or custom-limited keys. Custom keys may reduce, but not exceed, the account tier limits for:

  • input tokens per minute
  • output tokens per minute
  • requests per minute
  • maximum input tokens per request
  • maximum output tokens per request
  • allowed model list

Quota is reserved before request processing and final usage is settled after the response completes. For generation endpoints, SHINE SHOP DEV estimates input/output usage from the request and updates settlement from final usage metadata when available.

Account usage history is available in the web panel and from /api/account/usage.