@agentor/dashscope

v0.0.6

Published

5 days ago

AI SDK provider for Alibaba Cloud DashScope (Bailian) API

0High
0Medium
0Low

demomacro

ai ai-sdk alibaba aliyun bailian dashscope qwen

@agentor/dashscope

English | 中文

npm version npm downloads npm license

AI SDK provider for Alibaba Cloud DashScope (Bailian) API.

功能特性

Chat Completions - /chat/completions，支持函数调用、流式输出、推理和视觉
Completions (FIM) - /completions，Qwen Coder 模型的代码补全
Responses - /responses 端点，内置工具（联网搜索、代码解释器、MCP 等）
Embedding - /embeddings 端点的文本向量化
Reranking - /reranks 端点的文档重排序
Image Generation - 多模态生成端点的文生图
Video Generation - 文生视频和图生视频，异步轮询
Speech Synthesis - CosyVoice 和 Qwen-TTS 模型的文本转语音
Transcription - 短音频和长音频的语音转文字
Built-in Tools - 联网搜索、代码解释器、网页提取、文件搜索、以图搜图、MCP 集成
Thinking Mode - 可配置预算的推理/思考模式
Multi-region - 北京、新加坡、美国、德国区域
TypeScript-First - 完整的类型安全支持

安装

# Install with npm
$ npm install @agentor/dashscope

# Install with yarn
$ yarn add @agentor/dashscope

# Install with pnpm
$ pnpm add @agentor/dashscope

快速开始

初始化

import { createDashScope } from "@agentor/dashscope";

const dashscope = createDashScope({
  apiKey: process.env.DASHSCOPE_API_KEY,
});

或使用默认实例（自动读取 DASHSCOPE_API_KEY 环境变量）：

import { dashscope } from "@agentor/dashscope";

基础对话

import { dashscope } from "@agentor/dashscope";
import { generateText } from "ai";

const result = await generateText({
  model: dashscope("qwen3.5-flash"),
  prompt: "Introduce yourself in one sentence.",
});

console.log(result.text);

流式输出

import { streamText } from "ai";

const result = streamText({
  model: dashscope("qwen3.5-flash"),
  prompt: "Explain the Vercel AI SDK in three sentences.",
});

for await (const part of result.textStream) {
  process.stdout.write(part);
}

函数调用

import { generateText, hasToolCall, tool } from "ai";
import { z } from "zod/v4";

const result = await generateText({
  model: dashscope("qwen3.5-flash"),
  tools: {
    weather: tool({
      description: "Get weather information for a city",
      inputSchema: z.object({
        city: z.string().describe("City name"),
      }),
      execute: async ({ city }) => {
        return `${city}: Sunny, 25°C`;
      },
    }),
  },
  prompt: "What's the weather in Beijing?",
  stopWhen: hasToolCall("weather"),
});

Chat Completions API

联网搜索

通过 providerOptions 启用联网搜索：

await generateText({
  model: dashscope("qwen3.5-flash"),
  providerOptions: {
    dashscope: {
      enableSearch: true,
    },
  },
  prompt: "What are the latest tech news today?",
});

选项：enableSearch、searchStrategy（"enable" | "enable_with_history" | "agent_max"）。

代码解释器

启用代码解释器（需要开启思考模式）：

await generateText({
  model: dashscope("qwen3.5-flash"),
  providerOptions: {
    dashscope: {
      enableCodeInterpreter: true,
      enableThinking: true,
    },
  },
  prompt: "Calculate the sum of the first 20 Fibonacci numbers.",
});

思考模式

启用推理并配置 token 预算：

await generateText({
  model: dashscope("qwen3.5-flash"),
  providerOptions: {
    dashscope: {
      enableThinking: true,
      thinkingBudget: 5000,
    },
  },
  prompt: "Which is larger, 9.11 or 9.9?",
});

上下文缓存

DashScope 支持显式缓存以减少重复前缀的成本和延迟。通过 providerOptions 在消息或内容块上添加 cacheControl：

import { generateText } from "ai";

// 缓存长系统提示词（最少 1024 tokens）
const first = await generateText({
  model: dashscope("qwen3.5-flash"),
  messages: [
    {
      role: "system",
      content: longText, // must be >= 1024 tokens
      providerOptions: {
        dashscope: { cacheControl: { type: "ephemeral" } },
      },
    },
    { role: "user", content: "What does this code do?" },
  ],
});

// 第二次请求使用相同系统提示词会命中缓存
const second = await generateText({
  model: dashscope("qwen3.5-flash"),
  messages: [
    {
      role: "system",
      content: longText,
      providerOptions: {
        dashscope: { cacheControl: { type: "ephemeral" } },
      },
    },
    { role: "user", content: "How can it be optimized?" },
  ],
});

在用户消息的内容块上设置缓存：

await generateText({
  model: dashscope("qwen3.5-flash"),
  messages: [
    {
      role: "user",
      content: [
        {
          type: "text",
          text: longCode,
          providerOptions: {
            dashscope: { cacheControl: { type: "ephemeral" } },
          },
        },
        { type: "text", text: "Explain this code." },
      ],
    },
  ],
});

支持的模型会自动启用隐式缓存，无需配置。

JSON 输出

带 Schema 的结构化输出

使用 generateText 配合 Output.object() 生成类型化的 JSON：

import { generateText, Output } from "ai";
import { z } from "zod/v4";

const result = await generateText({
  model: dashscope("qwen3.5-flash"),
  prompt: "List 3 programming languages with their creators.",
  output: Output.object({
    schema: z.object({
      languages: z.array(
        z.object({
          name: z.string(),
          creator: z.string(),
          year: z.number(),
        }),
      ),
    }),
  }),
});

console.log(result.output);

Completions (FIM)

使用 completionModel() 通过 /completions 端点进行文本/代码补全（Fill-In-the-Middle）：

const result = await generateText({
  model: dashscope.completionModel("qwen2.5-coder-32b-instruct"),
  prompt:
    '<|fim_prefix|>def quick_sort(arr):\n    """Sort array using quicksort."""\n<|fim_suffix|>\n    return arr\n<|fim_middle|>',
});

console.log(result.text);

Responses API

使用 responses 命名空间访问带内置工具的 Responses API：

import { generateText } from "ai";

const result = await generateText({
  model: dashscope.responses("qwen3.5-flash"),
  prompt: "Search the web for the latest news.",
});

内置工具

联网搜索

import { dashscope } from "@agentor/dashscope";
import { generateText } from "ai";

const result = await generateText({
  model: dashscope.responses("qwen3.5-flash"),
  tools: [dashscope.responses.tools.webSearch()],
  prompt: "What are the latest tech news today?",
});

选项：forcedSearch、searchStrategy（"enable" | "enable_with_history"）。

代码解释器

const result = await generateText({
  model: dashscope.responses("qwen3.5-flash"),
  tools: [dashscope.responses.tools.codeInterpreter()],
  prompt: "Calculate the sum of the first 20 Fibonacci numbers.",
});

网页提取

const result = await generateText({
  model: dashscope.responses("qwen3.5-flash"),
  tools: [dashscope.responses.tools.webExtractor()],
  prompt: "Extract the main content from https://example.com",
});

可与 webSearch 搭配使用以获得更好的结果。

文件搜索

const result = await generateText({
  model: dashscope.responses("qwen3.5-flash"),
  tools: [
    dashscope.responses.tools.fileSearch({
      vectorStoreIds: ["vs-xxx"],
    }),
  ],
  prompt: "Find documents about machine learning.",
});

MCP 集成

const result = await generateText({
  model: dashscope.responses("qwen3.5-flash"),
  tools: [
    dashscope.responses.tools.mcp({
      serverProtocol: "sse",
      serverLabel: "my-mcp-server",
      serverUrl: "https://example.com/mcp/sse",
    }),
  ],
  prompt: "Use the MCP tool to get data.",
});

选项：serverProtocol、serverLabel、serverUrl、serverDescription、headers。

以文搜图

基于文本描述搜索图片。

以图搜图

基于输入图片搜索相似图片。

多轮对话

通过 providerOptions.dashscope 配置：previousResponseId、enableThinking、reasoning（推理力度）、conversation、instructions、includeUsage。

使用 previousResponseId 进行多轮对话：

const first = await generateText({
  model: dashscope.responses("qwen3.5-flash"),
  prompt: "Tell me about TypeScript.",
});

const second = await generateText({
  model: dashscope.responses("qwen3.5-flash"),
  providerOptions: {
    dashscope: {
      previousResponseId: first.response.id,
    },
  },
  prompt: "Follow up question...",
});

Embedding

import { embed, embedMany } from "ai";

// 单文本嵌入
const { embedding, usage } = await embed({
  model: dashscope.embeddingModel("text-embedding-v4"),
  value: "The clothes quality is excellent",
});

console.log(embedding.length); // 1024 (default dimensions)

// 批量嵌入
const { embeddings } = await embedMany({
  model: dashscope.embeddingModel("text-embedding-v4"),
  values: ["Hello world", "Machine learning is fascinating"],
});

自定义维度

const { embedding } = await embed({
  model: dashscope.embeddingModel("text-embedding-v4"),
  value: "Custom dimension embedding",
  providerOptions: {
    openaiCompatible: {
      dimensions: 256,
    },
  },
});

console.log(embedding.length); // 256

Reranking

import { rerank } from "ai";

const { ranking } = await rerank({
  model: dashscope.rerankingModel("qwen3-rerank"),
  query: "What is a reranking model?",
  documents: [
    "Reranking models sort candidate texts by relevance",
    "Quantum computing is a frontier field",
    "Pre-trained models brought advances to reranking",
  ],
});

for (const item of ranking) {
  console.log(`Index: ${item.originalIndex}, Score: ${item.score}`);
}

Top N 结果

const { ranking } = await rerank({
  model: dashscope.rerankingModel("qwen3-rerank"),
  query: "How to reset password?",
  documents: [
    "Go to Settings > Security > Change Password",
    "Forgot your password?",
    "Two-factor authentication is supported",
  ],
  topN: 2,
});

Image Generation

import { generateImage } from "ai";

const { images } = await generateImage({
  model: dashscope.imageModel("qwen-image-plus"),
  prompt: "A cute cat sitting on a windowsill with sunlight streaming in",
  providerOptions: {
    dashscope: {
      size: "1024*1024",
    },
  },
});

// images[0].uint8Array — raw image data
// images[0].base64 — base64 encoded image

Video Generation

import { experimental_generateVideo as generateVideo } from "ai";

// Text-to-video
const { videos } = await generateVideo({
  model: dashscope.videoModel("wan2.6-t2v"),
  prompt: "A golden retriever running through a field of sunflowers",
  providerOptions: {
    dashscope: {
      size: "1280*720",
      duration: 5,
    },
  },
});

Image-to-Video

使用包含 -i2v 的模型 ID 进入图生视频模式：

const { videos } = await generateVideo({
  model: dashscope.videoModel("wan2.7-i2v"),
  prompt: "The cat stretches and walks away",
  providerOptions: {
    dashscope: {
      resolution: "720P",
    },
  },
  image: "data:image/png;base64,...", // or a URL string
});

Speech Synthesis (TTS)

import { experimental_generateSpeech as generateSpeech } from "ai";
import { writeFileSync } from "fs";

const { audio } = await generateSpeech({
  model: dashscope.speechModel("cosyvoice-v3-flash"),
  text: "Hello, welcome to Agentor.",
  providerOptions: {
    dashscope: {
      voice: "longanyang",
      format: "wav",
      sampleRate: 24000,
    },
  },
});

writeFileSync("output.wav", audio.uint8Array);

Transcription (Speech-to-Text)

短音频（同步）

import { experimental_transcribe as transcribe } from "ai";

const { text } = await transcribe({
  model: dashscope.transcriptionModel("qwen3-asr-flash"),
  audio: new URL("https://example.com/audio.mp3"),
});

console.log(text);

长音频（异步）

对于异步模型，通过 providerOptions 提供音频 URL：

const { text, segments } = await transcribe({
  model: dashscope.transcriptionModel("qwen3-asr-flash-filetrans"),
  audio: new Uint8Array(0), // placeholder
  providerOptions: {
    dashscope: {
      fileUrl: "https://example.com/long-audio.mp3",
      enableWords: true,
    },
  },
});

Provider 配置

import { createDashScope } from "@agentor/dashscope";

const dashscope = createDashScope({
  apiKey: "sk-xxx", // or set DASHSCOPE_API_KEY env var
  region: "beijing", // beijing | singapore | us | germany
  workspaceId: "ws-xxx", // required for germany region
  baseURL: "https://custom-endpoint.com", // override default base URL
  headers: { "X-Custom-Header": "value" }, // custom headers
});

Available Models

For the complete and up-to-date model list, see Alibaba Cloud Model Studio.

Chat Completions (`/chat/completions`)

| Series | Models | | ----------- | ------------------------------------------------------------------------------ | | Qwen Max | qwen3.6-max-preview, qwen3-max, qwen-max, qwen-max-latest | | Qwen Plus | qwen3.6-plus, qwen3.5-plus, qwen-plus, qwen-plus-latest | | Qwen Flash | qwen3.6-flash, qwen3.5-flash, qwen-flash | | Qwen Turbo | qwen-turbo, qwen-turbo-latest | | Qwen Coder | qwen3-coder-plus, qwen3-coder-flash, qwen-coder-plus, qwen-coder-turbo | | Qwen Long | qwen-long, qwen-long-latest | | QwQ | qwq-plus, qwq-plus-latest | | Qwen Math | qwen-math-plus, qwen-math-turbo | | Vision (VL) | qwen3-vl-plus, qwen3-vl-flash, qwen-vl-max, qwen-vl-plus | | QVQ | qvq-max, qvq-plus |

Completions (`/completions`)

| Model | Description | | ---------------------------- | ----------------------- | | qwen2.5-coder-32b-instruct | Qwen2.5 Coder 32B | | qwen2.5-coder-14b-instruct | Qwen2.5 Coder 14B | | qwen2.5-coder-7b-instruct | Qwen2.5 Coder 7B | | qwen-coder-turbo-latest | Qwen Coder Turbo | | qwen-coder-turbo | Qwen Coder Turbo (base) |

Responses (`/responses`)

qwen3-max, qwen3.6-plus, qwen3.6-flash, qwen3.5-plus, qwen3.5-flash, qwen-plus, qwen-flash, qwen3-coder-plus, qwen3-coder-flash

Embedding (`/embeddings`)

| Model | Dimensions | Languages | | ------------------- | ---------------------- | ---------------------- | | text-embedding-v4 | 64-2048 (default 1024) | 100+ languages | | text-embedding-v3 | 64-1024 (default 1024) | 50+ languages | | text-embedding-v2 | 1536 | Chinese, English, etc. | | text-embedding-v1 | 1536 | Chinese, English, etc. |

Multimodal embedding models (qwen3-vl-embedding, tongyi-embedding-vision-*) do not support the OpenAI-compatible interface.

Reranking (`/reranks`)

| Model | Description | | ----------------- | --------------------------------------- | | qwen3-rerank | Text reranking, 100+ languages | | qwen3-vl-rerank | Multimodal reranking (text/image/video) | | gte-rerank-v2 | Semantic text reranking |

Image Generation

| Model | Description | | -------------------- | -------------------------------------------- | | wan2.7-image-pro | Latest Wan image generation, up to 4096x4096 | | wan2.7-image | Wan image generation, up to 2048x2048 | | qwen-image-2.0-pro | Qwen image generation and editing | | qwen-image-max | High quality image generation | | qwen-image-plus | Enhanced image generation | | z-image-turbo | Fast image generation |

Video Generation

| Model | Mode | Description | | ------------------ | ---- | ------------------------------------- | | wan2.7-t2v | T2V | Recommended text-to-video with audio | | wan2.6-t2v | T2V | Text-to-video with audio | | wan2.2-t2v-plus | T2V | Text-to-video (silent) | | wan2.7-i2v | I2V | Recommended image-to-video with audio | | wan2.6-i2v | I2V | Image-to-video with audio | | wan2.6-i2v-flash | I2V | Fast image-to-video |

Speech Synthesis (TTS)

| Model | Description | | -------------------------- | ---------------------------------- | | cosyvoice-v3.5-plus | Latest flagship, best quality | | cosyvoice-v3.5-flash | Latest lightweight | | cosyvoice-v3-plus | V3 enhanced | | cosyvoice-v3-flash | V3 fast synthesis | | qwen3-tts-flash-realtime | Qwen TTS with 17 human-like voices |

Transcription (STT)

| Model | Mode | Description | | --------------------------- | ----- | ------------------------------ | | qwen3-asr-flash | Sync | Short audio (up to 5 min) | | qwen3-asr-flash-filetrans | Async | Long audio (up to 12 hours) | | fun-asr | Async | Speaker diarization, hot words | | paraformer-v2 | Async | Legacy async transcription |

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

@agentor/dashscope

功能特性

安装

快速开始

初始化

基础对话

流式输出

函数调用

Chat Completions API

联网搜索

代码解释器

思考模式

上下文缓存

JSON 输出

带 Schema 的结构化输出

Completions (FIM)

Responses API

内置工具

联网搜索

代码解释器

网页提取

文件搜索

MCP 集成

以文搜图

以图搜图

多轮对话

Embedding

自定义维度

Reranking

Top N 结果

Image Generation

Video Generation

Image-to-Video

Speech Synthesis (TTS)

Transcription (Speech-to-Text)

短音频（同步）

长音频（异步）

Provider 配置

Available Models

Chat Completions (/chat/completions)

Completions (/completions)

Responses (/responses)

Embedding (/embeddings)

Reranking (/reranks)

Image Generation

Video Generation

Speech Synthesis (TTS)

Transcription (STT)

License

Chat Completions (`/chat/completions`)

Completions (`/completions`)

Responses (`/responses`)

Embedding (`/embeddings`)

Reranking (`/reranks`)