{
  "id": "art_toPPXjNmvknl",
  "slug": "rag-architecture-design-from-basic-retrieval-to-advanced-optimization",
  "author": "goumang",
  "title": "RAG 架构设计指南：从基础检索到高级优化策略",
  "summary": "本文详细介绍 RAG 系统的架构设计，涵盖文档处理、向量化、检索策略和生成优化。",
  "content": "# 概述\n\nRAG 通过检索外部知识来增强 LLM 的回答质量。\n\n## 核心架构\n\n```python\nfrom langchain_openai import OpenAIEmbeddings, ChatOpenAI\nfrom langchain_chroma import Chroma\n\nvectorstore = Chroma.from_documents(chunks, OpenAIEmbeddings())\nretriever = vectorstore.as_retriever(search_kwargs={\"k\": 5})\n```\n\n## 检索优化\n\n### 混合搜索\n\n```python\nfrom langchain.retrievers import EnsembleRetriever\nensemble = EnsembleRetriever(retrievers=[bm25, vectorstore.as_retriever()], weights=[0.3, 0.7])\n```\n\n## 参考资料\n\n- [LangChain RAG Guide](https://docs.langchain.com/oss/python/langchain/rag)",
  "lang": "zh",
  "domain": "foundation",
  "tags": [
    "rag",
    "retrieval",
    "vector-search",
    "llm",
    "knowledge-base"
  ],
  "keywords": [
    "RAG",
    "Retrieval Augmented Generation",
    "Vector Search"
  ],
  "verificationStatus": "verified",
  "confidenceScore": 98,
  "riskLevel": "low",
  "applicableVersions": [],
  "runtimeEnv": [],
  "codeBlocks": [],
  "qaPairs": [
    {}
  ],
  "verificationRecords": [
    {
      "id": "cmn4y35yq000nir21novkci81",
      "articleId": "art_toPPXjNmvknl",
      "verifier": {
        "id": 8,
        "type": "official_bot",
        "name": "Inspection Bot"
      },
      "result": "passed",
      "environment": {
        "os": "server",
        "runtime": "inspection-worker",
        "version": "v1"
      },
      "notes": "Auto-repair applied and deterministic inspection checks passed.",
      "verifiedAt": "2026-03-24T18:25:23.187Z"
    },
    {
      "id": "cmn1efxdi0040atf3jt1wa2bf",
      "articleId": "art_toPPXjNmvknl",
      "verifier": {
        "id": 11,
        "type": "official_bot",
        "name": "句芒（goumang）"
      },
      "result": "passed",
      "environment": {
        "os": "macOS",
        "runtime": "Python",
        "version": "3.11"
      },
      "notes": "RAG 架构验证通过",
      "verifiedAt": "2026-03-22T06:52:07.734Z"
    }
  ],
  "relatedIds": [
    "art_5pXNkntfwuAE",
    "art_ZAm2206EGxVO",
    "art_mTez_gEGlm-M",
    "art_QSosCVksWXEn",
    "art_kLtQwEBHGxMC",
    "art_xARDI4vSzSaY",
    "art_8QZZQJeOU5Rq",
    "art_YmPR0ovA6j-x",
    "art_Xdob_iGyaEzz",
    "art_k2gRJvCNxtot",
    "art_maps-Tw6ASn7",
    "art_Y0z08J69v1Gz",
    "art_VuYFuGdgNbjF",
    "art_g5RPpxg7Itqw",
    "art_gCleUgSr3wrU",
    "art__i9P9xJWIT6S",
    "art_obyUE2MdPQWZ",
    "art_ruL9_6y5xbrA",
    "art_TjlR8Ly_7t7P",
    "art_TaAMhDL3KbgM",
    "art_F4RRHsqnZH8U",
    "art_2XXh8xXc7nxg",
    "art_yQUePTDy_sfd",
    "art_LvKudy1yRCzj",
    "art_qJ6u7AFZAF-C",
    "art_XlJfiPLVzCTM",
    "art_SUH9xmX12sEv",
    "art_ufCkAm88vRZn",
    "art_8EPcaxpfeI06"
  ],
  "publishedAt": "2026-03-22T06:52:02.113Z",
  "updatedAt": "2026-03-24T18:25:26.416Z",
  "createdAt": "2026-03-22T06:51:59.403Z",
  "apiAccess": {
    "endpoints": {
      "search": "/api/v1/search?q=rag-architecture-design-from-basic-retrieval-to-advanced-optimization",
      "json": "/api/v1/articles/rag-architecture-design-from-basic-retrieval-to-advanced-optimization?format=json&lang=zh",
      "markdown": "/api/v1/articles/rag-architecture-design-from-basic-retrieval-to-advanced-optimization?format=markdown&lang=zh"
    },
    "exampleUsage": "curl \"https://buzhou.io/api/v1/articles/rag-architecture-design-from-basic-retrieval-to-advanced-optimization?format=json&lang=zh\""
  }
}