免费ChatGPT在线使用：新手入门指南与API实战

10次阅读

没有评论

共计 2985 个字符，预计需要花费 8 分钟才能阅读完成。

ChatGPT 是 OpenAI 开发的对话式 AI 模型，通过分析输入的文本（称为 ”prompt”）生成连贯的响应。免费 API 版本通常基于 GPT-3.5 模型，其工作原理可以简单理解为：

将输入文本拆分为 token（约等于单词片段）
基于海量训练数据预测最可能的响应序列
通过 temperature 等参数控制输出的随机性

免费 API 的主要限制包括：

每分钟 3 - 5 次的请求频率限制
每次对话的 token 数量上限（约 4096 tokens）
不支持部分高级模型功能如微调
响应速度可能比付费版慢

在开始前，请确保准备好以下工具：

Python 3.8 或更高版本
requests 库（用于 HTTP 请求）
代码编辑器（VS Code/PyCharm 等）

安装依赖的命令：

pip install requests

访问 OpenAI 官网并注册账号
进入 API 密钥管理页面（Dashboard > API Keys）
点击 ”Create new secret key” 生成密钥
妥善保存密钥（页面关闭后将无法再次查看完整密钥）

重要安全提示：

不要将密钥直接硬编码在代码中
不要将包含密钥的代码上传到公开仓库
建议使用环境变量存储密钥

import requests
import os

# 从环境变量获取 API 密钥
API_KEY = os.getenv('OPENAI_API_KEY')

def simple_chat(prompt):
    headers = {"Authorization": f"Bearer {API_KEY}",
        "Content-Type": "application/json"
    }

    data = {
        "model": "gpt-3.5-turbo",
        "messages": [{"role": "user", "content": prompt}]
    }

    response = requests.post(
        "https://api.openai.com/v1/chat/completions",
        headers=headers,
        json=data
    )

    if response.status_code == 200:
        return response.json()["choices"][0]["message"]["content"]
    else:
        raise Exception(f"API 请求失败: {response.text}")

# 使用示例
print(simple_chat("Python 如何实现快速排序？"))

def advanced_chat(prompt, temp=0.7, max_tokens=1000):
    headers = {"Authorization": f"Bearer {API_KEY}",
        "Content-Type": "application/json"
    }

    data = {
        "model": "gpt-3.5-turbo",
        "messages": [{"role": "user", "content": prompt}],
        "temperature": temp,  # 控制创造性（0-2）"max_tokens": max_tokens,  # 限制响应长度
        "top_p": 0.9  # 另一种控制多样性的方式
    }

    response = requests.post(
        "https://api.openai.com/v1/chat/completions",
        headers=headers,
        json=data
    )

    return response.json()

from time import sleep

def robust_chat(prompt, max_retries=3):
    for attempt in range(max_retries):
        try:
            response = simple_chat(prompt)
            return response
        except Exception as e:
            if "rate limit" in str(e).lower():
                wait_time = (attempt + 1) * 10  # 指数退避
                print(f"达到频率限制，等待 {wait_time} 秒后重试...")
                sleep(wait_time)
            else:
                raise
    raise Exception("超出最大重试次数")

请求合并：将多个问题合并到一个 prompt 中
响应缓存：对相同问题的响应进行本地缓存
流式响应：对于长文本使用 stream=True 参数
合理设置 max_tokens：根据实际需要调整
异步请求：使用 aiohttp 库提升并发能力

示例缓存实现：

from functools import lru_cache

@lru_cache(maxsize=100)
def cached_chat(prompt):
    return simple_chat(prompt)

使用环境变量存储 API 密钥
为不同应用创建不同的 API 密钥
定期轮换密钥（每月至少一次）
监控 API 使用情况，设置使用告警
避免在前端代码中直接调用 API

密钥泄露：将密钥提交到 GitHub 等公开平台
解决方案：使用.gitignore 和环境变量
忽略频率限制：连续快速发送请求导致被封禁
解决方案：实现指数退避重试机制
超长 prompt：超过 token 限制导致请求失败
解决方案：先计算 token 数量（可使用 tiktoken 库）
误解 temperature 参数：设置过高导致输出随机性太大
建议值：创造性任务 0.7-1.0，确定性任务 0.2-0.5
无错误处理：API 失败导致程序崩溃
解决方案：添加 try-catch 和重试逻辑

保持对话上下文的关键是在每次请求中包含历史消息：

dialog_history = []

def contextual_chat(new_message):
    global dialog_history
    dialog_history.append({"role": "user", "content": new_message})

    response = requests.post(
        "https://api.openai.com/v1/chat/completions",
        headers=headers,
        json={
            "model": "gpt-3.5-turbo",
            "messages": dialog_history
        }
    ).json()

    assistant_reply = response["choices"][0]["message"]
    dialog_history.append(assistant_reply)

    # 防止对话历史过长
    if len(dialog_history) > 10:
        dialog_history = dialog_history[-10:]

    return assistant_reply["content"]

进阶建议：