LLM Agent Skill 开发实战：从零构建你的第一个智能代理

1次阅读

共计 2284 个字符，预计需要花费 6 分钟才能阅读完成。

最近两年，AI 对话能力突飞猛进，但很多开发者发现直接调用大模型 API 做出来的产品总差那么点意思——要么对话逻辑太死板，要么业务场景适配差。这就是 LLM Agent Skill 要解决的问题：通过编程方式赋予大模型「业务技能」，让 AI 不仅会聊天，更能完成具体任务。

智能客服：自动处理 80% 的常见问题，复杂 case 转人工
个人助理：整合日历、邮件、待办事项的智能管家
游戏 NPC：动态生成有记忆的虚拟角色对话
数据分析：用自然语言查询数据库并可视化结果

目前主流的 Agent 开发框架主要有两类：

LangChain
优势：生态丰富，支持多种大模型混用
不足：学习曲线陡峭，调试复杂
Semantic Kernel
优势：微软系工具集成好，C# 友好
不足：Python 支持较新，社区案例少

对于初学者，我推荐先用纯 Python+OpenAI API 起步，理解了核心机制再上框架。

from typing import Dict, Any
import openai

class BasicAgent:
    def __init__(self, api_key: str):
        openai.api_key = api_key
        self.context = []  # 对话上下文缓存

    async def chat(self, user_input: str) -> str:
        """处理用户输入并返回 AI 响应"""
        try:
            self.context.append({"role": "user", "content": user_input})

            response = await openai.ChatCompletion.acreate(
                model="gpt-3.5-turbo",
                messages=self.context,
                temperature=0.7
            )

            ai_reply = response.choices[0].message.content
            self.context.append({"role": "assistant", "content": ai_reply})
            return ai_reply

        except Exception as e:
            return f"出错了: {str(e)}"

关键设计点：

使用 async/await 避免阻塞主线程
context 列表维护完整对话历史
基础异常处理保证服务可用性

让我们给 Agent 添加实用功能：

import aiohttp

class WeatherAgent(BasicAgent):
    def __init__(self, api_key: str, weather_api_key: str):
        super().__init__(api_key)
        self.weather_api_key = weather_api_key
        self.weather_cache: Dict[str, Any] = {}  # 简单缓存

    async def get_weather(self, city: str) -> str:
        """调用外部天气 API"""
        if city in self.weather_cache:
            return self.weather_cache[city]

        url = f"https://api.weatherapi.com/v1/current.json?key={self.weather_api_key}&q={city}"
        async with aiohttp.ClientSession() as session:
            async with session.get(url) as resp:
                data = await resp.json()
                self.weather_cache[city] = data  # 缓存结果
                return f"{city}天气: {data['current']['condition']['text']}, 温度{data['current']['temp_c']}℃"

预加载技术：在用户输入前预先加载可能用到的 API 数据
流式响应：使用 OpenAI 的 stream=True 参数实现逐字返回
本地轻量化模型：先用小模型快速响应，大模型后台修正

摘要压缩法：每 5 轮对话生成一个摘要替换原始记录
关键信息提取：用 NER 识别并单独存储实体信息
自动遗忘机制：设置 TTL 自动清理老旧对话

实现指数退避重试机制
为不同接口设置独立配额桶
监控响应头中的 RateLimit 信息

打印完整的 prompt 结构
用 temperature=0 确保可复现
对长对话分段验证

尝试为你的 Agent 添加以下能力：

记忆用户偏好（比如喜欢的音乐类型）
当用户问「今天适合穿什么」时，结合天气数据给出建议
用 @功能名 的语法触发特定技能

示例实现思路：

# 在 WeatherAgent 类中添加
async def clothing_suggestion(self, city: str) -> str:
    weather = await self.get_weather(city)
    if "雨" in weather:
        return "建议携带雨具"
    elif "30" in weather:  # 高温判断
        return "推荐穿短袖"
    else:
        return "适合穿长袖"

开发 LLM Agent 最有趣的地方在于，你既是在编程，又像是在培养一个数字实习生。建议从简单功能开始，逐步增加复杂度，记得多测试边界 case。当看到自己设计的 Agent 能流畅处理真实业务场景时，那种成就感绝对值得付出！

正文完