-
-
Notifications
You must be signed in to change notification settings - Fork 2
[Docs] Update blog: In-Context-Search #9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes from all commits
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -1,7 +1,7 @@ | ||
| --- | ||
| title: "From Index-Centric Retrieval to In-Context Reasoning: The Technical Evolution of RAG Systems" | ||
| summary: "An in-depth analysis of the transition from traditional Graph-based RAG to next-generation In-Context Search (ICS) paradigms, comparing LightRAG, PageIndex, and Sirchmunk." | ||
| date: 2026-02-12 | ||
| date: Feb 17, 2026 | ||
| authors: | ||
| - me | ||
| tags: | ||
|
|
@@ -16,7 +16,7 @@ image: | |
| caption: 'The Technical Evolution of RAG Systems' | ||
| --- | ||
|
|
||
| Traditional Retrieval-Augmented Generation (RAG) frameworks have established a robust foundation for grounding Large Language Models (LLMs) in external knowledge through static indexing and vector similarity. However, as computational paradigms shift toward **LLM-native** architectures, a new frontier known as **In-Context Search (ICS)** is emerging. This post analyzes the transition from traditional Graph-based RAG to next-generation ICS paradigms, represented by **VectifyAI's PageIndex** and **ModelScope's Sirchmunk**. | ||
| With the evolution of RAG (Retrieval-Augmented Generation) technology, a new paradigm called **In-Context Search (ICS)** is redefining how LLMs interact with external knowledge. This post compares traditional Graph-based RAG with next-generation ICS approaches represented by **[PageIndex](https://github.com/VectifyAI/PageIndex)** and **[Sirchmunk](https://github.com/modelscope/sirchmunk)**. | ||
|
|
||
| <!--more--> | ||
|
|
||
|
|
@@ -28,7 +28,7 @@ Traditional Retrieval-Augmented Generation (RAG) frameworks have established a r | |
|
|
||
| ## 1. The Foundation and Frontiers of RAG | ||
|
|
||
| The first generation of RAG successfully addressed LLM hallucinations by introducing external knowledge bases. These systems typically rely on **Vector Databases** or **Static Knowledge Graphs** (e.g., **LightRAG**). | ||
| The first generation of RAG successfully addressed LLM hallucinations by introducing external knowledge bases. These systems typically rely on **Vector Databases** or **Static Knowledge Graphs** (e.g., **[LightRAG](https://github.com/HKUDS/LightRAG)**). | ||
|
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. |
||
|
|
||
| ### The Capabilities and Constraints of Traditional RAG | ||
|
|
||
|
|
@@ -219,3 +219,15 @@ The next research frontier lies in the convergence of structural reasoning and a | |
|
|
||
|
|
||
| The era of treating RAG as a static database lookup is ending. By embracing **In-Context Search**, frameworks like PageIndex and Sirchmunk are turning the context window into a dynamic, evolving laboratory for intelligence. | ||
|
|
||
| --- | ||
|
|
||
| ## References | ||
|
|
||
| 1. Lewis, P., Perez, E., Piktus, A., et al. (2020). *Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.* NeurIPS 2020. [arXiv:2005.11401](https://arxiv.org/abs/2005.11401) | ||
| 2. Guo, Z., Qian, C., et al. (2024). *LightRAG: Simple and Fast Retrieval-Augmented Generation.* [arXiv:2410.05779](https://arxiv.org/abs/2410.05779) | [GitHub](https://github.com/HKUDS/LightRAG) | ||
| 3. VectifyAI. (2025). *PageIndex: Extracting and Understanding Financial Reports with LLM.* [GitHub](https://github.com/VectifyAI/PageIndex) | ||
| 4. ModelScope. (2025). *Sirchmunk: An Embedding-Free, Agentic Search Engine for Raw Data.* [GitHub](https://github.com/modelscope/sirchmunk) | ||
| 5. Yao, S., Zhao, J., Yu, D., et al. (2023). *ReAct: Synergizing Reasoning and Acting in Language Models.* ICLR 2023. [arXiv:2210.03629](https://arxiv.org/abs/2210.03629) | ||
| 6. Anthropic. (2024). *Model Context Protocol (MCP) Specification.* [Documentation](https://modelcontextprotocol.io) | ||
| 7. Kaddour, J., Harris, J., Mozes, M., et al. (2023). *Challenges and Applications of Large Language Models.* [arXiv:2307.10169](https://arxiv.org/abs/2307.10169) | ||
|
Comment on lines
+222
to
+233
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. |
||
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -1,7 +1,7 @@ | ||
| --- | ||
| title: "从索引中心检索到上下文推理:RAG 系统的技术演进" | ||
| summary: "深入分析传统 Graph-based RAG 向下一代上下文搜索(ICS)范式的演进,对比 LightRAG、PageIndex 与 Sirchmunk 的技术路线。" | ||
| date: 2026-02-12 | ||
| date: 2026-02-17 | ||
| authors: | ||
| - me | ||
| tags: | ||
|
|
@@ -16,7 +16,8 @@ image: | |
| caption: 'RAG 系统的技术演进' | ||
| --- | ||
|
|
||
| 传统的检索增强生成(RAG)框架通过静态索引与向量相似度匹配,为大语言模型(LLM)构建了坚实的外部知识接入基础。然而,随着计算范式向 **LLM 原生** 架构演进,一种被称为 **上下文搜索(In-Context Search, ICS)** 的新前沿正在兴起。本文分析了从传统 Graph-based RAG 到下一代 ICS 范式的技术演进,以 **VectifyAI 的 PageIndex** 和 **ModelScope 的 Sirchmunk** 为代表。 | ||
| 随着 RAG (Retrieval-Augmented Generation) 技术的演进,一种名为 **上下文搜索(In-Context Search, ICS)** 的新范式正在重新定义 LLM 与外部知识的交互方式。本文对比了传统 Graph-based RAG 与以 **PageIndex** 和 **Sirchmunk** 为代表的下一代 ICS 方案。 | ||
|
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The Chinese version also benefits from the rephrased introduction, making it more concise and direct. Consider adding the GitHub links for PageIndex and Sirchmunk here as well, similar to the English version, for consistency and reader convenience. References
|
||
|
|
||
|
|
||
| <!--more--> | ||
|
|
||
|
|
@@ -168,7 +169,7 @@ ICS 实现路线的分化反映了 **自顶向下逻辑** 与 **自底向上发 | |
| --- | ||
|
|
||
|
|
||
| ## 5. 挑战与未来研究方向 | ||
| ## 4. 挑战与未来研究方向 | ||
|
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. |
||
|
|
||
| 这些框架之间的分化代表了应对 **"上下文瓶颈"** 的两种不同解决方案: | ||
|
|
||
|
|
@@ -216,3 +217,15 @@ ICS 范式以预处理时间换取查询时的智能,这使得性能瓶颈发 | |
| * **硬件加速 ICS:** 与 NPU/GPU 加速文件系统的集成将缓解 I/O 瓶颈,即使在超大规模下也能实现"零索引"推理。 | ||
|
|
||
| 将 RAG 视为静态数据库查找的时代正在终结。通过拥抱 **上下文搜索**,PageIndex 和 Sirchmunk 等框架正在将上下文窗口转变为一个动态、进化的智能实验场。 | ||
|
|
||
| --- | ||
|
|
||
| ## 参考文献 | ||
|
|
||
| 1. Lewis, P., Perez, E., Piktus, A., 等. (2020). *Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.* NeurIPS 2020. [arXiv:2005.11401](https://arxiv.org/abs/2005.11401) | ||
| 2. Guo, Z., Qian, C., 等. (2024). *LightRAG: Simple and Fast Retrieval-Augmented Generation.* [arXiv:2410.05779](https://arxiv.org/abs/2410.05779) | [GitHub](https://github.com/HKUDS/LightRAG) | ||
| 3. VectifyAI. (2025). *PageIndex: Extracting and Understanding Financial Reports with LLM.* [GitHub](https://github.com/VectifyAI/PageIndex) | ||
| 4. ModelScope. (2025). *Sirchmunk:一个无嵌入的、智能体驱动的原始数据搜索引擎。* [GitHub](https://github.com/modelscope/sirchmunk) | ||
| 5. Yao, S., Zhao, J., Yu, D., 等. (2023). *ReAct: Synergizing Reasoning and Acting in Language Models.* ICLR 2023. [arXiv:2210.03629](https://arxiv.org/abs/2210.03629) | ||
| 6. Anthropic. (2024). *模型上下文协议(MCP)规范。* [官方文档](https://modelcontextprotocol.io) | ||
| 7. Kaddour, J., Harris, J., Mozes, M., 等. (2023). *Challenges and Applications of Large Language Models.* [arXiv:2307.10169](https://arxiv.org/abs/2307.10169) | ||
|
Comment on lines
+221
to
+231
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. |
||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The introduction has been rephrased for better flow and clarity. Adding direct links to the GitHub repositories for PageIndex and Sirchmunk is a good enhancement, providing immediate access to the mentioned projects.