How to Access the ChatGPT API Efficiently: A Hands-On Developer Guide and Pitfall Handbook


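The ChatGPT API is built on OpenAI's GPT models and generates text in response to HTTP requests. Its core mechanisms are:

Authentication: every request must carry an API key as a Bearer token in the Authorization header. Keys are generated in the OpenAI account dashboard.
Billing: usage is billed by token count, covering every input and output token (1 token ≈ 4 English characters).
Interaction modes: the API supports single-turn prompts (completions) and multi-turn conversations (chat). The latter requires the caller to maintain the list of context messages.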
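
The three mechanisms above can be sketched with the standard library alone. The snippet below builds a chat request without sending it, so the Authorization header and the message list are easy to inspect. The OPENAI_API_KEY environment variable and the helper names build_chat_request and estimate_tokens are assumptions made for this illustration, and the token estimate is only the 4-characters rule of thumb, not a real tokenizer.

```python
import json
import os
import urllib.request

API_URL = "https://api.openai.com/v1/chat/completions"


def build_chat_request(messages, model="gpt-3.5-turbo", api_key=None):
    """Build (but do not send) an HTTP request for a chat completion."""
    # Assumption: the key lives in the OPENAI_API_KEY environment variable.
    api_key = api_key or os.environ.get("OPENAI_API_KEY", "")
    if not api_key:
        raise ValueError("no API key provided")
    body = json.dumps({"model": model, "messages": messages}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",  # Bearer token authentication
            "Content-Type": "application/json",
        },
        method="POST",
    )


def estimate_tokens(text):
    """Very rough estimate: about 4 English characters per token."""
    return max(1, len(text) // 4)


# Multi-turn chat: the caller resends the whole context on every request.
messages = [
    {"role": "user", "content": "What is a token?"},
    {"role": "assistant", "content": "A token is a chunk of text."},
    {"role": "user", "content": "How many characters is that?"},
]
request = build_chat_request(messages, api_key="sk-example")
```

Passing the request to urllib.request.urlopen would actually send it. Keeping construction separate from sending makes the headers and payload easy to unit test.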

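In practice, developers commonly run into the following problems:

Authentication complexity: keys can leak, and managing separate keys across multiple environments is difficult.
Rate limits: the free tier defaults to 3 requests per minute, and paid accounts can also be throttled (for example by TPM/RPM limits, that is, tokens per minute and requests per minute).
Long text: the per-request token limit (for example 4096 tokens for gpt-3.5-turbo), which counts the prompt and the completion together, is easy to exceed.
Error handling: the API returns a variety of error types (such as 429 and 503), and each calls for its own retry strategy.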
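
One way to avoid hitting a requests-per-minute limit in the first place is to throttle on the client side. The following is a minimal sketch of a sliding-window limiter, not part of any OpenAI SDK. The class name SlidingWindowLimiter and its parameters are hypothetical, and the 3 calls per 60 seconds in the usage line simply mirrors the free-tier figure quoted above.

```python
import time
from collections import deque


class SlidingWindowLimiter:
    """Allow at most max_calls calls within any window of `period` seconds."""

    def __init__(self, max_calls, period=60.0, clock=time.monotonic, sleep=time.sleep):
        self.max_calls = max_calls
        self.period = period
        self._clock = clock    # injectable so the limiter can be tested without waiting
        self._sleep = sleep
        self._calls = deque()  # timestamps of recent calls, oldest first

    def acquire(self):
        """Block until a call is allowed, record it, and return the seconds waited."""
        waited = 0.0
        while True:
            now = self._clock()
            # Forget calls that have already left the window.
            while self._calls and now - self._calls[0] >= self.period:
                self._calls.popleft()
            if len(self._calls) < self.max_calls:
                self._calls.append(now)
                return waited
            # Window is full: wait until the oldest call expires.
            delay = self.period - (now - self._calls[0])
            self._sleep(delay)
            waited += delay


# Usage: call limiter.acquire() immediately before each API request.
limiter = SlidingWindowLimiter(max_calls=3, period=60.0)
```

A limiter like this only covers a single process. Several workers sharing one key would need a shared counter, and server-side 429 responses still have to be handled.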

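import openai  # uses the legacy openai<1.0 interface (openai.ChatCompletion)
from tenacity import retry, stop_after_attempt, wait_exponential

# Try up to 3 times in total, backing off exponentially with waits clamped to 4-10 seconds
@retry(stop=stop_after_attempt(3), wait=wait_exponential(multiplier=1, min=4, max=10))
def chat_completion_with_retry(messages):
    try:
        response = openai.ChatCompletion.create(
            model="gpt-3.5-turbo",
            messages=messages,
            stream=False  # non-streaming response
        )
        return response.choices[0].message.content
    except openai.error.APIError as e:
        print(f"API error: {e}")
        raise  # re-raise so tenacity can schedule the next attempt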
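
The decorator above retries on any exception. To retry only errors that are likely to be transient, the decision can be made from the HTTP status code. This is a sketch under assumptions: the set of retryable codes and the full-jitter backoff formula are common conventions rather than anything prescribed by the API, and the names is_retryable and backoff_delay are hypothetical.

```python
import random

# Assumption: rate limiting (429) and server-side failures are worth retrying;
# client errors such as 400 or 401 are not, because repeating them cannot succeed.
RETRYABLE_STATUS = {429, 500, 502, 503, 504}


def is_retryable(status_code):
    """Return True if a request that failed with this status may succeed later."""
    return status_code in RETRYABLE_STATUS


def backoff_delay(attempt, base=1.0, cap=10.0, rng=random.random):
    """Full-jitter exponential backoff.

    attempt is 0 for the first retry. The delay is a random fraction of
    min(cap, base * 2**attempt), which spreads out clients that fail together.
    """
    ceiling = min(cap, base * (2 ** attempt))
    return rng() * ceiling
```

A caller would check is_retryable on the failed response, sleep for backoff_delay(attempt) seconds, and give up after a fixed number of attempts.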

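import openai  # legacy openai<1.0 interface

response = openai.ChatCompletion.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "Tell me a story"}],
    stream=True  # enable streaming
)

# Each chunk carries an incremental delta; some chunks have no "content" key
for chunk in response:
    content = chunk.choices[0].delta.get("content", "")
    print(content, end="", flush=True)
print()  # finish with a newline once the stream ends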
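
Printing each delta is enough for a console demo, but most applications also need the complete reply, for instance to append it to the conversation history. The sketch below collects the pieces while still forwarding them to a callback. It assumes each chunk is dict-like with the same choices/delta/content layout as in the loop above, and the function name collect_stream is hypothetical. The fake chunks stand in for a real response so the example runs offline.

```python
def collect_stream(chunks, on_text=None):
    """Join the content deltas of a streamed reply into one string.

    chunks  -- iterable of dict-like chunks (assumed layout: choices[0].delta.content)
    on_text -- optional callback invoked with each non-empty piece as it arrives
    """
    parts = []
    for chunk in chunks:
        delta = chunk["choices"][0].get("delta", {})
        text = delta.get("content") or ""  # role-only and final chunks carry no content
        if text:
            parts.append(text)
            if on_text is not None:
                on_text(text)
    return "".join(parts)


# Offline stand-in for a streamed response.
fake_chunks = [
    {"choices": [{"delta": {"role": "assistant"}}]},
    {"choices": [{"delta": {"content": "Once upon "}}]},
    {"choices": [{"delta": {"content": "a time."}}]},
    {"choices": [{"delta": {}, "finish_reason": "stop"}]},
]
seen = []
full_text = collect_stream(fake_chunks, on_text=seen.append)
```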

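def calculate_tokens(messages):
    # Rough estimate based on the ~4 characters per token rule of thumb;
    # swap in a real tokenizer (for example tiktoken) for exact counts
    return sum(len(m["content"]) // 4 for m in messages)

class Conversation:
    def __init__(self):
        self.history = []

    def add_message(self, role, content):
        self.history.append({"role": role, "content": content})
        # Automatically trim the oldest messages once the history exceeds
        # the token budget, always keeping at least the newest message
        while len(self.history) > 1 and calculate_tokens(self.history) > 3000:
            self.history.pop(0)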
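
Dropping messages strictly from the front also discards a system prompt if the conversation starts with one. A possible variant, sketched here with hypothetical names (estimate_tokens, trim_history), keeps system messages and trims only the oldest user and assistant turns. The token count is again the 4-characters approximation rather than a real tokenizer, so the budget should leave some headroom.

```python
def estimate_tokens(messages):
    """Approximate token count: ~4 characters per token plus 1 per message."""
    return sum(len(m["content"]) // 4 + 1 for m in messages)


def trim_history(messages, max_tokens=3000):
    """Return a copy of messages that fits the budget, preserving system messages.

    System messages are placed first, followed by the most recent turns that fit.
    """
    system = [m for m in messages if m["role"] == "system"]
    rest = [m for m in messages if m["role"] != "system"]
    # Drop the oldest non-system message until the estimate fits.
    while rest and estimate_tokens(system + rest) > max_tokens:
        rest.pop(0)
    return system + rest


history = [
    {"role": "system", "content": "You are helpful."},
    {"role": "user", "content": "a" * 40},
    {"role": "assistant", "content": "b" * 40},
    {"role": "user", "content": "c" * 40},
]
trimmed = trim_history(history, max_tokens=30)
```

In this example the oldest user turn is dropped, while the system prompt and the two most recent messages survive.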