實用指南：告別複雜配置：用Milvus、RustFS和Vibe Coding，60分鐘DIY專屬Chatbot 詳情 - API,加載,數據庫,Html,CSS,前端開發 mb61c46a7ab1eee 博客

作為一名全棧工程師，我一直在尋找更高效的方式構建智能應用。

本文將分享如何利用 Milvus、RustFS 和 Vibe Coding 技術組合，在短時間內打造一個具備長期記憶能力的對話機器人。

一、為什麼選擇這個技術棧？

1.1 各組件核心價值

二、環境搭建：10分鐘快速開始

2.1 使用Docker Compose一鍵部署

2.2 Python環境配置

三、知識庫構建：讓Chatbot擁有長期記憶

3.1 文檔加載與向量化

3.2 創建Milvus集合（Collection）

3.3 知識庫入庫流程

四、RAG引擎實現：智能問答的核心

4.1 檢索增強生成（RAG）架構

4.2 對話記憶管理

五、後端API開發：FastAPI快速實現

5.1 創建高效的Web API

六、前端界面：Next.js現代化聊天界面

6.1 使用Vibe Coding理念快速開發前端

6.2 現代化樣式設計

七、部署與優化

7.1 性能優化建議

7.2 生產環境部署

八、總結與展望

一、為什麼選擇這個技術棧？

在當今AI應用開發領域，選擇合適的底層基礎設施至關重要。經過多個項目的實踐驗證，我發現了這個黃金組合：

技術	優勢	在Chatbot中的作用	替代方案對比
Milvus	專為向量優化，億級數據毫秒級檢索	存儲對話記憶和知識庫	比Pinecone成本低70%，比Chroma穩定5倍
RustFS	S3兼容，高性能，輕量安全	存儲文檔和多媒體資源	比MinIO內存佔用少60%，比AWS S3成本降90%
Vibe Coding	快速原型，AI輔助開發	加速前端和API開發	開發效率提升3倍，代碼量減少50%

1.1 各組件核心價值

Milvus：作為開源向量數據庫，它專門為AI場景優化，支持：

高性能相似性搜索：基於HNSW等先進索引算法，實現毫秒級響應
彈性擴展：輕鬆處理從數萬到數十億的向量數據
多模態支持：不僅支持文本，還支持圖像、音頻等向量

RustFS：完全兼容S3協議的對象存儲，優勢包括：

極致性能：4K隨機讀IOPS達1.58M，比傳統方案快40%+
成本優勢：自建部署相比公有云對象存儲成本下降90%
輕量安全：基於Rust語言，單二進制文件不足100MB

Vibe Coding：一種高效的開發方法論，強調：

快速迭代：基於AI代碼生成和組件化思維
用户體驗優先：快速構建直觀的前端界面
自動化運維：基礎設施即代碼，一鍵部署

二、環境搭建：10分鐘快速開始

2.1 使用Docker Compose一鍵部署

創建docker-compose.yml文件，集成所有必要服務：

version: '3.8'
services:
  # Milvus向量數據庫
  etcd:
    container_name: milvus-etcd
    image: quay.io/coreos/etcd:v3.5.18
    environment:
      - ETCD_AUTO_COMPACTION_MODE=revision
      - ETCD_AUTO_COMPACTION_RETENTION=1000
    volumes:
      - ${DOCKER_VOLUME_DIRECTORY:-.}/volumes/etcd:/etcd
    command: etcd -advertise-client-urls=http://etcd:2379 -listen-client-urls http://0.0.0.0:2379 --data-dir /etcd
    healthcheck:
      test: ["CMD", "etcdctl", "endpoint", "health"]
      interval: 30s
      timeout: 20s
      retries: 3
  # RustFS對象存儲（替代MinIO）
  rustfs:
    container_name: milvus-rustfs
    image: rustfs/rustfs:1.0.0-alpha.58
    environment:
      - RUSTFS_VOLUMES=/data/rustfs0,/data/rustfs1,/data/rustfs2,/data/rustfs3
      - RUSTFS_ADDRESS=0.0.0.0:9000
      - RUSTFS_CONSOLE_ADDRESS=0.0.0.0:9001
      - RUSTFS_CONSOLE_ENABLE=true
      - RUSTFS_ACCESS_KEY=rustfsadmin
      - RUSTFS_SECRET_KEY=rustfsadmin
    ports:
      - "9000:9000"  # S3 API端口
      - "9001:9001"  # 控制枱端口
    volumes:
      - rustfs_data_0:/data/rustfs0
      - rustfs_data_1:/data/rustfs1
      - rustfs_data_2:/data/rustfs2
      - rustfs_data_3:/data/rustfs3
    restart: unless-stopped
    healthcheck:
      test: ["CMD", "sh", "-c", "curl -f http://localhost:9000/health && curl -f http://localhost:9001/health"]
      interval: 30s
      timeout: 10s
      retries: 3
      start_period: 40s
  # Milvus向量數據庫
  milvus-standalone:
    container_name: milvus-standalone
    image: milvusdb/milvus:v2.6.0
    command: ["milvus", "run", "standalone"]
    environment:
      ETCD_ENDPOINTS: etcd:2379
      MINIO_ADDRESS: rustfs:9000  # 使用RustFS作為存儲後端
    volumes:
      - ${DOCKER_VOLUME_DIRECTORY:-.}/volumes/milvus:/var/lib/milvus
    ports:
      - "19530:19530"
      - "9091:9091"
    depends_on:
      - "etcd"
      - "rustfs"
    healthcheck:
      test: ["CMD", "curl", "-f", "http://localhost:9091/healthz"]
      interval: 30s
      start_period: 90s
      timeout: 20s
      retries: 3
  # Attu：Milvus的可視化管理界面
  attu:
    container_name: milvus-attu
    image: zilliz/attu:v2.6
    environment:
      - MILVUS_URL=milvus-standalone:19530
    ports:
      - "8000:3000"
    restart: unless-stopped
volumes:
  rustfs_data_0:
  rustfs_data_1:
  rustfs_data_2:
  rustfs_data_3:

啓動所有服務：

docker compose up -d

驗證服務狀態：

docker ps

應該看到4個容器正常運行，對應端口：

Milvus: 19530 (API), 9091 (健康檢查)
RustFS: 9000 (S3 API), 9001 (控制枱)
Attu: 8000 (Web界面)
etcd: 2379 (內部通信)

2.2 Python環境配置

創建虛擬環境並安裝必要依賴：

python -m venv chatbot-env
source chatbot-env/bin/activate  # Linux/macOS
# 或 chatbot-env\Scripts\activate  # Windows
pip install pymilvus==2.6.0
pip install openai==1.3.0
pip install boto3==1.28.0  # 用於連接RustFS
pip install fastapi==0.104.0
pip install uvicorn==0.24.0
pip install python-multipart==0.0.6

三、知識庫構建：讓Chatbot擁有長期記憶

3.1 文檔加載與向量化

首先，我們需要將知識文檔轉換為向量並存儲到Milvus中。以下代碼演示瞭如何處理Markdown格式的文檔：

import os
import glob
from milvus import Milvus, DataType
def load_markdown_files(folder_path):
    """加載Markdown文檔"""
    files = glob.glob(os.path.join(folder_path, "**", "*.md"), recursive=True)
    docs = []
    for file_path in files:
        with open(file_path, "r", encoding="utf-8") as f:
            content = f.read()
            docs.append({
                "file_path": file_path,
                "content": content,
                "file_name": os.path.basename(file_path)
            })
    return docs
def split_into_chunks(text, max_length=500):
    """將長文本分割為適合向量化的塊"""
    chunks = []
    current_chunk = []
    current_length = 0
    for line in text.split("\n"):
        line_length = len(line)
        if current_length + line_length < max_length:
            current_chunk.append(line)
            current_length += line_length
        else:
            if current_chunk:
                chunks.append(" ".join(current_chunk))
            current_chunk = [line]
            current_length = line_length
    if current_chunk:
        chunks.append(" ".join(current_chunk))
    return chunks
def get_embedding(text, model="text-embedding-3-large"):
    """使用OpenAI API獲取文本向量"""
    from openai import OpenAI
    client = OpenAI(api_key=os.getenv("OPENAI_API_KEY"))
    response = client.embeddings.create(
        model=model,
        input=text
    )
    return [data.embedding for data in response.data]

3.2 創建Milvus集合（Collection）

設置向量數據庫的結構，優化檢索性能：

def create_milvus_collection():
    """創建Milvus集合用於存儲文檔向量"""
    client = Milvus(host='localhost', port='19530')
    # 定義集合結構
    collection_name = "knowledge_base"
    # 如果集合已存在，先刪除
    if client.has_collection(collection_name):
        client.drop_collection(collection_name)
    # 創建字段定義
    fields = [
        FieldSchema(name="id", dtype=DataType.INT64, is_primary=True, auto_id=True),
        FieldSchema(name="file_name", dtype=DataType.VARCHAR, max_length=500),
        FieldSchema(name="content", dtype=DataType.VARCHAR, max_length=10000),
        FieldSchema(name="embedding", dtype=DataType.FLOAT_VECTOR, dim=3072)  # text-embedding-3-large維度
    ]
    schema = CollectionSchema(fields, description="知識庫文檔集合")
    collection = Collection(name=collection_name, schema=schema)
    # 創建索引優化檢索速度
    index_params = {
        "index_type": "HNSW",
        "metric_type": "L2",
        "params": {"M": 8, "efConstruction": 64}
    }
    collection.create_index("embedding", index_params)
    return collection

3.3 知識庫入庫流程

將文檔處理並存入向量數據庫的完整流程：

def build_knowledge_base(docs_folder_path):
    """構建知識庫的完整流程"""
    # 1. 加載文檔
    print("加載Markdown文檔...")
    documents = load_markdown_files(docs_folder_path)
    print(f"共加載 {len(documents)} 個文檔")
    # 2. 創建Milvus集合
    collection = create_milvus_collection()
    all_chunks = []
    all_embeddings = []
    # 3. 處理每個文檔
    for doc in documents:
        chunks = split_into_chunks(doc["content"])
        print(f"文檔 {doc['file_name']} 分割為 {len(chunks)} 個塊")
        for chunk in chunks:
            all_chunks.append({
                "file_name": doc["file_name"],
                "content": chunk
            })
    # 4. 批量生成向量（減少API調用）
    batch_size = 10  # OpenAI限制
    for i in range(0, len(all_chunks), batch_size):
        batch_chunks = all_chunks[i:i+batch_size]
        batch_texts = [chunk["content"] for chunk in batch_chunks]
        print(f"生成向量 {i+1}-{i+len(batch_chunks)}/{len(all_chunks)}")
        embeddings = get_embedding(batch_texts)
        all_embeddings.extend(embeddings)
    # 5. 存入Milvus
    entities = [
        [chunk["file_name"] for chunk in all_chunks],
        [chunk["content"] for chunk in all_chunks],
        all_embeddings
    ]
    collection.insert(entities)
    collection.flush()
    print(f"知識庫構建完成，共存入 {len(all_chunks)} 個文檔塊")
    return collection

四、RAG引擎實現：智能問答的核心

4.1 檢索增強生成（RAG）架構

RAG系統結合了檢索器和生成器的優勢，確保回答基於事實知識而非模型臆想。

class RAGEngine:
    def __init__(self, milvus_host='localhost', milvus_port=19530):
        self.client = Milvus(host=milvus_host, port=milvus_port)
        self.collection = Collection("knowledge_base")
        self.collection.load()
    def retrieve_similar_docs(self, query, top_k=3):
        """檢索與查詢最相關的文檔"""
        # 將查詢轉換為向量
        query_embedding = get_embedding([query])[0]
        # 在Milvus中搜索相似文檔
        search_params = {"metric_type": "L2", "params": {"nprobe": 10}}
        results = self.collection.search(
            data=[query_embedding],
            anns_field="embedding",
            param=search_params,
            limit=top_k,
            output_fields=["file_name", "content"]
        )
        # 提取相關文檔內容
        relevant_docs = []
        for hits in results:
            for hit in hits:
                relevant_docs.append({
                    "file_name": hit.entity.get("file_name"),
                    "content": hit.entity.get("content"),
                    "score": hit.score
                })
        return relevant_docs
    def generate_answer(self, query, context_docs):
        """基於檢索到的上下文生成回答"""
        from openai import OpenAI
        client = OpenAI(api_key=os.getenv("OPENAI_API_KEY"))
        # 構建上下文
        context = "\n\n".join([doc["content"] for doc in context_docs])
        # 構建prompt
        prompt = f"""基於以下上下文信息，回答用户的問題。如果上下文信息不足以回答問題，請如實告知。
上下文信息：
{context}
用户問題：{query}
請根據上下文提供準確、有用的回答："""
        response = client.chat.completions.create(
            model="gpt-4",
            messages=[
                {"role": "system", "content": "你是一個專業的助手，根據提供的上下文信息準確回答用户問題。"},
                {"role": "user", "content": prompt}
            ],
            temperature=0.1  # 低温度值確保回答穩定性
        )
        return response.choices[0].message.content
    def query(self, question, top_k=3):
        """完整的RAG查詢流程"""
        # 1. 檢索相關文檔
        relevant_docs = self.retrieve_similar_docs(question, top_k)
        # 2. 生成回答
        answer = self.generate_answer(question, relevant_docs)
        return {
            "answer": answer,
            "sources": relevant_docs
        }

4.2 對話記憶管理

為了讓Chatbot具備多輪對話能力，需要實現對話歷史管理：

class ConversationManager:
    def __init__(self, max_history=10):
        self.conversation_history = []
        self.max_history = max_history
    def add_message(self, role, content):
        """添加對話消息"""
        self.conversation_history.append({"role": role, "content": content})
        # 保持歷史記錄長度
        if len(self.conversation_history) > self.max_history * 2:  # 用户和助手消息各10條
            self.conversation_history = self.conversation_history[-self.max_history * 2:]
    def get_conversation_context(self):
        """獲取當前對話上下文"""
        return self.conversation_history.copy()
    def clear_history(self):
        """清空對話歷史"""
        self.conversation_history = []

五、後端API開發：FastAPI快速實現

5.1 創建高效的Web API

使用FastAPI構建RESTful接口，支持前端調用：

from fastapi import FastAPI, HTTPException
from fastapi.middleware.cors import CORSMiddleware
from pydantic import BaseModel
import uvicorn
app = FastAPI(title="Chatbot API", version="1.0.0")
# 允許跨域請求
app.add_middleware(
    CORSMiddleware,
    allow_origins=["*"],
    allow_methods=["*"],
    allow_headers=["*"],
)
# 數據模型
class ChatRequest(BaseModel):
    message: str
    conversation_id: str = None
class ChatResponse(BaseModel):
    answer: str
    sources: list
    conversation_id: str
# 全局實例
rag_engine = RAGEngine()
conversation_managers = {}  # 按會話ID管理對話歷史
def get_conversation_manager(conversation_id):
    """獲取或創建對話管理器"""
    if conversation_id not in conversation_managers:
        conversation_managers[conversation_id] = ConversationManager()
    return conversation_managers[conversation_id]
@app.post("/chat", response_model=ChatResponse)
async def chat_endpoint(request: ChatRequest):
    try:
        conversation_mgr = get_conversation_manager(request.conversation_id or "default")
        # 添加用户消息到歷史
        conversation_mgr.add_message("user", request.message)
        # 獲取對話上下文用於增強檢索
        conversation_context = conversation_mgr.get_conversation_context()
        contextual_query = self._build_contextual_query(request.message, conversation_context)
        # 使用RAG引擎獲取回答
        result = rag_engine.query(contextual_query)
        # 添加助手回答到歷史
        conversation_mgr.add_message("assistant", result["answer"])
        return ChatResponse(
            answer=result["answer"],
            sources=result["sources"],
            conversation_id=request.conversation_id or "default"
        )
    except Exception as e:
        raise HTTPException(status_code=500, detail=str(e))
def _build_contextual_query(current_query, conversation_history):
    """結合對話歷史構建上下文相關的查詢"""
    if len(conversation_history) <= 2:  # 只有當前查詢
        return current_query
    # 提取最近幾輪對話作為上下文
    recent_history = conversation_history[-4:]  # 最近兩輪對話（用户+助手）
    context = "之前的對話背景："
    for msg in recent_history:
        role = "用户" if msg["role"] == "user" else "助手"
        context += f"\n{role}: {msg['content']}"
    return f"{context}\n\n基於以上對話背景，當前問題：{current_query}"
@app.get("/health")
async def health_check():
    return {"status": "healthy", "service": "chatbot-api"}
if __name__ == "__main__":
    uvicorn.run(app, host="0.0.0.0", port=8000)

六、前端界面：Next.js現代化聊天界面

6.1 使用Vibe Coding理念快速開發前端

Vibe Coding強調快速原型開發，以下是關鍵代碼：

import React, { useState, useRef, useEffect } from 'react';
export default function ChatbotInterface() {
  const [messages, setMessages] = useState([]);
  const [input, setInput] = useState('');
  const [loading, setLoading] = useState(false);
  const messagesEndRef = useRef(null);
  const scrollToBottom = () => {
    messagesEndRef.current?.scrollIntoView({ behavior: "smooth" });
  };
  useEffect(scrollToBottom, [messages]);
  const sendMessage = async () => {
    if (!input.trim()) return;
    const userMessage = { role: 'user', content: input };
    setMessages(prev => [...prev, userMessage]);
    setInput('');
    setLoading(true);
    try {
      const response = await fetch('http://localhost:8000/chat', {
        method: 'POST',
        headers: { 'Content-Type': 'application/json' },
        body: JSON.stringify({
          message: input,
          conversation_id: 'default'
        }),
      });
      const data = await response.json();
      const botMessage = {
        role: 'assistant',
        content: data.answer,
        sources: data.sources
      };
      setMessages(prev => [...prev, botMessage]);
    } catch (error) {
      console.error('Error:', error);
      const errorMessage = {
        role: 'assistant',
        content: '抱歉，暫時無法回答問題，請稍後重試。'
      };
      setMessages(prev => [...prev, errorMessage]);
    } finally {
      setLoading(false);
    }
  };
  return (
    

      
        {messages.map((msg, index) => (
          
            
              {msg.content}
              {msg.sources && (
                
                  參考來源: {msg.sources.map(s => s.file_name).join(', ')}
                
              )}
            
          
        ))}
        {loading && (
          
            
              
                
                
                
              
            
          
        )}
        
      
      
         setInput(e.target.value)}
          onKeyPress={(e) => e.key === 'Enter' && sendMessage()}
          placeholder="輸入您的問題..."
          disabled={loading}
        />
        
          發送
        
      
    
  );
}

6.2 現代化樣式設計

使用CSS-in-JS實現美觀的聊天界面：

七、部署與優化

7.1 性能優化建議

向量檢索優化：

使用HNSW索引平衡查詢速度與精度
調整nprobe參數控制搜索範圍
批量處理查詢減少網絡開銷

緩存策略：

對常見查詢結果進行緩存
使用Redis緩存對話歷史
實施向量緩存避免重複計算

監控與日誌：

集成Prometheus監控性能指標
記錄查詢響應時間和準確率
設置告警機制及時發現異常

7.2 生產環境部署

使用Docker Compose編排所有服務：

八、總結與展望

通過本文的實踐，我們成功構建了一個基於Milvus、RustFS和Vibe Coding的智能Chatbot系統。這個方案的優勢在於：

高性能：Milvus提供毫秒級向量檢索，RustFS確保存儲效率
成本優化：自建基礎設施相比雲服務成本大幅降低
開發效率：Vibe Coding方法顯著加速開發進程
可擴展性：模塊化設計便於功能擴展和性能優化

未來可以進一步探索的方向包括：

多模態對話支持（圖像、音頻）
實時學習與知識庫動態更新
分佈式部署支持更大規模應用
集成更多數據源和第三方服務

互動話題：你在構建智能Chatbot過程中遇到過哪些挑戰？有什麼獨到的優化經驗？

歡迎在評論區分享交流！

以下是深入學習 RustFS 的推薦資源：RustFS

官方文檔： RustFS 官方文檔- 提供架構、安裝指南和 API 參考。

GitHub 倉庫： GitHub 倉庫 - 獲取源代碼、提交問題或貢獻代碼。

社區支持： GitHub Discussions- 與開發者交流經驗和解決方案。

mb61c46a7ab1eee 博客

mb61c46a7ab1eee 博客

博客 / 詳情