<aside> 📢 有些Medium文章是只有開放給會員，目前Medium有個新功能可以產生連結給非會員，非Medium會員請點選「Friend Link」

</aside>

<aside> 💡

相關頁面: AI相關工具非常非常的多，RAG開發工具會整理在RAG開發工具，LLM以及相關工具會整理在Large Language Models，Graph RAG開發工具會整理在Graph RAG，其他工具整理在Generative AI開發工具中。目前還有一些資料在Generative AI及RAG中，將逐步整理這些內容。

</aside>

基本概念

目前的大型語言模型，由於訓練的資料相當龐大，所以，幾乎有問必答，然而，當我們詢問比較冷門的問題時，目前的大型語言模型會給我們一個不正確的答案，主要的原因是大型語言模型並沒有這方面的知識。Retrieval Augmented Generation (RAG)就是在回答問題前，先去檢索資料，檢索後，再透過大型語言模型統整檢索的內容並產生回答。可以看一下 Felo Search、Proplexity，大概就可以理解這樣的概念，然而， Felo Search、Proplexity是去檢索網頁，一般的RAG則是去檢索非公開資料集。

RAG 檢索增強生成— 讓大型語言模型更聰明的秘密武器
- RAG架構主要由兩個部分構成：檢索器和生成器。檢索器負責從外部知識庫（例如，文本數據庫或預先訓練的知識嵌入）中檢索相關的知識訊息。這些檢索到的知識將會被送到生成器進行處理。而生成器會利用檢索到的知識來生成回應。通常，檢索器和生成器會通過一個聯合訓練的過程來學習如何協同工作，以產生符合目標的輸出內容。
RAG 開發知識庫 (ihower整理) **
- 基本流程/模組
  - 流程
    - RAG Routing
  - 資料預處理
    - Parsing
    - Chunking
    - Embedding
    - Vector Stores
  - 檢索
  - 生成
    - Prompt Engineering for RAG
    - Query Optimization
- 不同的延伸
從零打造屬於自己的 RAG-based LLM Line Bot 系列
- 介紹與規格
  - LangChain + ChatGPT
- 系統架構與事前準備
  - Postgres + pgvector
- 資料準備與處理
- Line Bot 開發
- 將 Line Bot 發佈到 Heroku
替你的應用程式加上智慧! 談 RAG 的檢索與應用 (2024.3)
- “安德魯的部落格” GPTs - Demo
- 部落格檢索服務
  - Synthesis:
    - 我用 GPTs (內建 GPT4 LLM) 來負責這段, 主要是調整 instruction 與設定 custom actions
  - Retrieval:
    - 我用 Kernel Memory (Serevice) 來提供檢索的能力。雖然他也支援 Synthesis ( api: /ask )，但是這段我靠 GPTs 處理掉了，因此我只用到 Retrieval ( api: /search ) 的部分。這部分由於檢索的需要，必須依賴外部 text-embedding model
  - Ingestion:
    - 我用 Kernel Memory (Serviceless) 來替我所有的文章向量化，並且建立 Index (向量資料庫)，同樣是用 Kernel Memory，只是他是離線作業，並非線上運作的服務。
- AI 改變了內容搜尋方式
  - 從 “表格” 到 “空間” 的演進
  - 從 “條件” 到 “語意” 的查詢
  - 從 “APP” 到 “AGENT” 的操作
The Rise and Evolution of RAG in 2024: A Year in Review
Mastering AI — Advanced RAG Techniques Online Course (Friend Link)
- Week 1 — Mastering Retrieval Systems
- Week 2 — Embedding Techniques Demystified
- Week 3 — Optimizing Document Chunking
- Week 4 — Advanced Hybrid Search Strategies
- Week 5 — Beyond Text — Multi Modal RAG
A Taxonomy of Retrieval Augmented Generation (Friend Link)
- RAG Basics
- Core Components
- Evaluation
- Pipeline Design
- Operations Stack
- Emerging Patterns
- Technology Providers
- Applied RAG
RAG App Development: From Start to Finish
- Setup: A Recipe to start with
  - Jupyter Notebook: The Interactive Kitchen
  - Installation: Gathering Our Tools
    - LangChain dependencies
- Setting Up the Ingredients: Environment Variables
- Indexing: Loading Our Data
- Indexing: Splitting the Document
- Indexing: Storing the Chunks
- Retrieval and Generation: Retrieving Information
- Retrieval and Generation: Generating Responses
- Next Steps: Expanding Your Q&A Application
An introduction to RAG and simple/ complex RAG (WhyHow.AI)
Your First RAG
Local RAG From Scratch
Who says RAG is only about Vector Stores?!
What Nobody Tells You About RAGs (Friend Link)
The Hidden Cost of Automated AI: Why 73% of RAG Systems Need Human Oversight (Friend Link)
The not-so-hidden costs of building an AI enabled RAG based chatbot
RAG LLM Best Practices