Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
18 changes: 15 additions & 3 deletions content/blog/in-context-search/index.md
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@
---
title: "From Index-Centric Retrieval to In-Context Reasoning: The Technical Evolution of RAG Systems"
summary: "An in-depth analysis of the transition from traditional Graph-based RAG to next-generation In-Context Search (ICS) paradigms, comparing LightRAG, PageIndex, and Sirchmunk."
date: 2026-02-12
date: Feb 17, 2026
authors:
- me
tags:
Expand All @@ -16,7 +16,7 @@ image:
caption: 'The Technical Evolution of RAG Systems'
---

Traditional Retrieval-Augmented Generation (RAG) frameworks have established a robust foundation for grounding Large Language Models (LLMs) in external knowledge through static indexing and vector similarity. However, as computational paradigms shift toward **LLM-native** architectures, a new frontier known as **In-Context Search (ICS)** is emerging. This post analyzes the transition from traditional Graph-based RAG to next-generation ICS paradigms, represented by **VectifyAI's PageIndex** and **ModelScope's Sirchmunk**.
With the evolution of RAG (Retrieval-Augmented Generation) technology, a new paradigm called **In-Context Search (ICS)** is redefining how LLMs interact with external knowledge. This post compares traditional Graph-based RAG with next-generation ICS approaches represented by **[PageIndex](https://github.com/VectifyAI/PageIndex)** and **[Sirchmunk](https://github.com/modelscope/sirchmunk)**.
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

The introduction has been rephrased for better flow and clarity. Adding direct links to the GitHub repositories for PageIndex and Sirchmunk is a good enhancement, providing immediate access to the mentioned projects.


<!--more-->

Expand All @@ -28,7 +28,7 @@ Traditional Retrieval-Augmented Generation (RAG) frameworks have established a r

## 1. The Foundation and Frontiers of RAG

The first generation of RAG successfully addressed LLM hallucinations by introducing external knowledge bases. These systems typically rely on **Vector Databases** or **Static Knowledge Graphs** (e.g., **LightRAG**).
The first generation of RAG successfully addressed LLM hallucinations by introducing external knowledge bases. These systems typically rely on **Vector Databases** or **Static Knowledge Graphs** (e.g., **[LightRAG](https://github.com/HKUDS/LightRAG)**).
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

Adding a GitHub link for LightRAG here is consistent with the previous change and provides useful context for readers interested in the project.


### The Capabilities and Constraints of Traditional RAG

Expand Down Expand Up @@ -219,3 +219,15 @@ The next research frontier lies in the convergence of structural reasoning and a


The era of treating RAG as a static database lookup is ending. By embracing **In-Context Search**, frameworks like PageIndex and Sirchmunk are turning the context window into a dynamic, evolving laboratory for intelligence.

---

## References

1. Lewis, P., Perez, E., Piktus, A., et al. (2020). *Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.* NeurIPS 2020. [arXiv:2005.11401](https://arxiv.org/abs/2005.11401)
2. Guo, Z., Qian, C., et al. (2024). *LightRAG: Simple and Fast Retrieval-Augmented Generation.* [arXiv:2410.05779](https://arxiv.org/abs/2410.05779) | [GitHub](https://github.com/HKUDS/LightRAG)
3. VectifyAI. (2025). *PageIndex: Extracting and Understanding Financial Reports with LLM.* [GitHub](https://github.com/VectifyAI/PageIndex)
4. ModelScope. (2025). *Sirchmunk: An Embedding-Free, Agentic Search Engine for Raw Data.* [GitHub](https://github.com/modelscope/sirchmunk)
5. Yao, S., Zhao, J., Yu, D., et al. (2023). *ReAct: Synergizing Reasoning and Acting in Language Models.* ICLR 2023. [arXiv:2210.03629](https://arxiv.org/abs/2210.03629)
6. Anthropic. (2024). *Model Context Protocol (MCP) Specification.* [Documentation](https://modelcontextprotocol.io)
7. Kaddour, J., Harris, J., Mozes, M., et al. (2023). *Challenges and Applications of Large Language Models.* [arXiv:2307.10169](https://arxiv.org/abs/2307.10169)
Comment on lines +222 to +233
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

The addition of a comprehensive 'References' section significantly improves the academic rigor and credibility of the blog post. It allows readers to delve deeper into the cited works.

19 changes: 16 additions & 3 deletions content/blog/in-context-search/index.zh.md
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@
---
title: "从索引中心检索到上下文推理:RAG 系统的技术演进"
summary: "深入分析传统 Graph-based RAG 向下一代上下文搜索(ICS)范式的演进,对比 LightRAG、PageIndex 与 Sirchmunk 的技术路线。"
date: 2026-02-12
date: 2026-02-17
authors:
- me
tags:
Expand All @@ -16,7 +16,8 @@ image:
caption: 'RAG 系统的技术演进'
---

传统的检索增强生成(RAG)框架通过静态索引与向量相似度匹配,为大语言模型(LLM)构建了坚实的外部知识接入基础。然而,随着计算范式向 **LLM 原生** 架构演进,一种被称为 **上下文搜索(In-Context Search, ICS)** 的新前沿正在兴起。本文分析了从传统 Graph-based RAG 到下一代 ICS 范式的技术演进,以 **VectifyAI 的 PageIndex** 和 **ModelScope 的 Sirchmunk** 为代表。
随着 RAG (Retrieval-Augmented Generation) 技术的演进,一种名为 **上下文搜索(In-Context Search, ICS)** 的新范式正在重新定义 LLM 与外部知识的交互方式。本文对比了传统 Graph-based RAG 与以 **PageIndex** 和 **Sirchmunk** 为代表的下一代 ICS 方案。
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

The Chinese version also benefits from the rephrased introduction, making it more concise and direct. Consider adding the GitHub links for PageIndex and Sirchmunk here as well, similar to the English version, for consistency and reader convenience.

References
  1. Ensure consistency across different language versions of the same content, especially for external links or key references.



<!--more-->

Expand Down Expand Up @@ -168,7 +169,7 @@ ICS 实现路线的分化反映了 **自顶向下逻辑** 与 **自底向上发
---


## 5. 挑战与未来研究方向
## 4. 挑战与未来研究方向
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

The section heading has been updated from '5. 挑战与未来研究方向' to '4. 挑战与未来研究方向'. Please ensure that the numbering of all sections throughout the document is consistent and logical after this change.


这些框架之间的分化代表了应对 **"上下文瓶颈"** 的两种不同解决方案:

Expand Down Expand Up @@ -216,3 +217,15 @@ ICS 范式以预处理时间换取查询时的智能,这使得性能瓶颈发
* **硬件加速 ICS:** 与 NPU/GPU 加速文件系统的集成将缓解 I/O 瓶颈,即使在超大规模下也能实现"零索引"推理。

将 RAG 视为静态数据库查找的时代正在终结。通过拥抱 **上下文搜索**,PageIndex 和 Sirchmunk 等框架正在将上下文窗口转变为一个动态、进化的智能实验场。

---

## 参考文献

1. Lewis, P., Perez, E., Piktus, A., 等. (2020). *Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.* NeurIPS 2020. [arXiv:2005.11401](https://arxiv.org/abs/2005.11401)
2. Guo, Z., Qian, C., 等. (2024). *LightRAG: Simple and Fast Retrieval-Augmented Generation.* [arXiv:2410.05779](https://arxiv.org/abs/2410.05779) | [GitHub](https://github.com/HKUDS/LightRAG)
3. VectifyAI. (2025). *PageIndex: Extracting and Understanding Financial Reports with LLM.* [GitHub](https://github.com/VectifyAI/PageIndex)
4. ModelScope. (2025). *Sirchmunk:一个无嵌入的、智能体驱动的原始数据搜索引擎。* [GitHub](https://github.com/modelscope/sirchmunk)
5. Yao, S., Zhao, J., Yu, D., 等. (2023). *ReAct: Synergizing Reasoning and Acting in Language Models.* ICLR 2023. [arXiv:2210.03629](https://arxiv.org/abs/2210.03629)
6. Anthropic. (2024). *模型上下文协议(MCP)规范。* [官方文档](https://modelcontextprotocol.io)
7. Kaddour, J., Harris, J., Mozes, M., 等. (2023). *Challenges and Applications of Large Language Models.* [arXiv:2307.10169](https://arxiv.org/abs/2307.10169)
Comment on lines +221 to +231
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

medium

The addition of the '参考文献' (References) section in the Chinese version is a valuable improvement, providing academic backing and further reading for the audience.

Loading