大模型Agent Skill开发实战：从零构建你的第一个智能体技能

18次阅读

共计 2363 个字符，预计需要花费 6 分钟才能阅读完成。

Agent Skill 是大模型生态中的功能模块，类似于手机上的应用。它让大模型具备执行特定任务的能力，比如查询天气、订餐、翻译等。通过组合不同技能，大模型可以完成更复杂的任务，就像一个真正的智能助手。

技能边界模糊 ：不清楚一个技能应该包含哪些功能，容易把多个功能塞进一个技能里，导致逻辑混乱。
状态管理困难 ：在多轮对话中，难以跟踪用户意图和上下文，导致对话中断或错误。
错误处理不足 ：对 API 调用失败、用户输入错误等情况缺乏有效处理，影响用户体验。

技能描述文件是技能的 ” 说明书 ”，告诉大模型这个技能能做什么、需要什么输入。通常使用 JSON 格式，包含以下关键字段：

{
  "name": "weather_query",
  "description": "查询指定城市的实时天气情况",
  "parameters": {
    "city": {
      "type": "string",
      "description": "要查询的城市名称"
    },
    "date": {
      "type": "string",
      "description": "查询日期，格式 YYYY-MM-DD，默认为当天"
    }
  }
}

使用 FastAPI 实现天气查询接口，包含错误处理和限流机制：

from fastapi import FastAPI, HTTPException, Request
from fastapi.middleware.cors import CORSMiddleware
from pydantic import BaseModel
from datetime import datetime
import requests

app = FastAPI()

# 允许跨域请求
app.add_middleware(
    CORSMiddleware,
    allow_origins=["*"],
    allow_methods=["*"],
    allow_headers=["*"],
)

# 请求模型
class WeatherRequest(BaseModel):
    city: str
    date: str = datetime.now().strftime('%Y-%m-%d')

# 模拟天气数据源
WEATHER_DATA = {
    "北京": {"2023-10-01": {"temperature": "20°C", "condition": "晴"},
        "2023-10-02": {"temperature": "18°C", "condition": "多云"}
    },
    "上海": {"2023-10-01": {"temperature": "25°C", "condition": "小雨"},
        "2023-10-02": {"temperature": "23°C", "condition": "阴"}
    }
}

# 限流中间件
@app.middleware("http")
async def rate_limit_middleware(request: Request, call_next):
    # 这里可以添加更复杂的限流逻辑
    return await call_next(request)

# 天气查询接口
@app.post("/weather")
async def query_weather(data: WeatherRequest):
    try:
        city_data = WEATHER_DATA.get(data.city, {})
        weather = city_data.get(data.date)

        if not weather:
            raise HTTPException(status_code=404, detail="天气数据未找到")

        return {
            "city": data.city,
            "date": data.date,
            "weather": weather
        }
    except Exception as e:
        raise HTTPException(status_code=500, detail=str(e))

if __name__ == "__main__":
    import uvicorn
    uvicorn.run(app, host="0.0.0.0", port=8000)

在多轮对话中，我们需要跟踪用户的意图和上下文。可以通过以下方式实现：

class ConversationState:
    def __init__(self):
        self.current_skill = None
        self.skill_params = {}
        self.history = []

    def update(self, user_input):
        # 解析用户输入，更新状态
        if "天气" in user_input:
            self.current_skill = "weather_query"
            # 提取城市和日期参数
            # ...

        # 记录对话历史
        self.history.append({"user": user_input})

        return self