Skip to content
Models/glm/GLM 4.7 Flash

GLM 4.7 Flash

glm-4.7-flashText·

Zhipu GLM 4.7 Flash cost-efficient, fast response

Context Window
1.0M
Input Price /M in
Free/M in
Output Price /M out
Free/M out
Cached Input Price /M
Free/M
Max Completion
4K
Input Modalities
text
Output Modalities
text
Function callingChatStreamingFreecode

Description

Zhipu GLM 4.7 Flash cost-efficient, fast response

Available Providers

AllToken can route requests to the providers below based on route priority and policy.

ProviderContextInputOutputCached / MLatencyThroughput

Best For

Zhipu GLM 4.7 Flash cost-efficient, fast response

How To Use This Model

Use the exact model ID shown below. This is the safest way to avoid call failures, variant mismatches, or incorrect route assumptions.

curl https://api.alltoken.ai/v1/chat/completions \
  -H "Authorization: Bearer sk-your-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "glm-4.7-flash",
    "messages": [
      {"role": "user", "content": "Hello!"}
    ]
  }'
Supported Parameters
temperaturetop_pmax_tokenstools
API Key Setup
Smart Routing

Let the platform choose the best provider path automatically.

Default Model

If a request does not specify a model, default the key to glm-4.7-flash.

Forced Model

Always override incoming requests and lock the key to glm-4.7-flash.