从零开始开发一个Skill：新手入门指南与实战避坑

13次阅读

共计 2892 个字符，预计需要花费 8 分钟才能阅读完成。

Skill（语音技能）是运行在智能语音助手（如 Alexa、Google Assistant 等）上的应用程序，它通过自然语言处理（NLP）技术理解用户的语音指令，并做出相应的响应。对于新手开发者来说，开发 Skill 时经常会遇到以下几个典型问题：

自然语言理解不准：用户可能用多种方式表达同一个意图，如何准确识别？
上下文管理困难：在多轮对话中，如何记住之前的对话状态？
交互设计复杂：如何设计自然流畅的对话流，避免用户感到困惑？

目前主流的 Skill 开发平台包括：

Alexa Skills Kit (ASK)：亚马逊的 Skill 开发平台，支持 Node.js 和 Python，文档丰富，社区活跃。
Google Actions：谷歌的 Skill 开发平台，支持 Node.js 和 Java，集成 Google Assistant，适合 Android 生态。
其他平台：如三星的 Bixby、微软的 Cortana 等，市场份额较小。

选型建议：

如果你的目标用户主要在亚马逊设备（如 Echo）上使用，选择 ASK。
如果目标用户是 Android 用户，选择 Google Actions。
对于初学者，建议从 ASK 开始，因为其文档和社区支持更完善。

以 ASK 为例，创建一个 Skill 的步骤如下：

登录Alexa 开发者控制台。
点击“创建 Skill”，输入名称和默认语言。
选择“自定义”模型，模板选择“从头开始”。

设计对话流时，遵循以下原则：

明确意图：每个 Skill 应聚焦一个核心功能，避免过于复杂。
自然引导：通过提示语引导用户，例如“你可以问我天气或新闻”。
错误处理：预料用户可能说出的意外输入，如“我不明白，请再说一遍”。

以下是一个用 Node.js 实现的基本意图处理代码示例：

const Alexa = require('ask-sdk-core');

// 处理欢迎意图
const LaunchRequestHandler = {canHandle(handlerInput) {return Alexa.getRequestType(handlerInput.requestEnvelope) === 'LaunchRequest';
    },
    handle(handlerInput) {
        const speechText = '欢迎使用我的 Skill，请问你需要什么帮助？';
        return handlerInput.responseBuilder
            .speak(speechText)
            .reprompt(speechText)
            .getResponse();}
};

// 处理自定义意图
const HelloWorldIntentHandler = {canHandle(handlerInput) {return Alexa.getRequestType(handlerInput.requestEnvelope) === 'IntentRequest'
            && Alexa.getIntentName(handlerInput.requestEnvelope) === 'HelloWorldIntent';
    },
    handle(handlerInput) {
        const speechText = '你好，世界！';
        return handlerInput.responseBuilder
            .speak(speechText)
            .getResponse();}
};

// 错误处理
const ErrorHandler = {canHandle() {return true;},
    handle(handlerInput, error) {console.log(` 错误: ${error.message}`);
        return handlerInput.responseBuilder
            .speak('抱歉，我遇到了问题，请稍后再试。')
            .getResponse();}
};

// 注册处理器
exports.handler = Alexa.SkillBuilders.custom()
    .addRequestHandlers(
        LaunchRequestHandler,
        HelloWorldIntentHandler
    )
    .addErrorHandlers(ErrorHandler)
    .lambda();

在多轮对话中，可以使用 handlerInput.attributesManager 来保存和读取会话状态：

const GetNameIntentHandler = {canHandle(handlerInput) {return Alexa.getRequestType(handlerInput.requestEnvelope) === 'IntentRequest'
            && Alexa.getIntentName(handlerInput.requestEnvelope) === 'GetNameIntent';
    },
    handle(handlerInput) {const name = Alexa.getSlotValue(handlerInput.requestEnvelope, 'name');
        const attributes = handlerInput.attributesManager.getSessionAttributes();
        attributes.name = name; // 保存名字到会话状态
        handlerInput.attributesManager.setSessionAttributes(attributes);

        const speechText = ` 你好，${name}！`;
        return handlerInput.responseBuilder
            .speak(speechText)
            .getResponse();}
};

在 Alexa 开发者控制台中，可以使用“测试”选项卡进行模拟测试：