Method Drift›Retrieval-augmented generation
Self-RAG
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-ReflectionRetrieval-augmented generation · first seen Oct 17, 2023
heavily superseded — a standard baseline that newer methods routinely beat
26 papers critique it · 40 beat it on benchmarks
What papers say
Verbatim critique sentences, each from a paper that cites Self-RAG as a baseline.
“existing approaches---including Self-RAG~asai2024selfrag and CRAG~yan2024crag---primarily target retrieval relevance without explicitly detecting or resolving contradictions”
— ConflictRAG: Detecting and Resolving Knowledge Conflicts in Retrieval Augmented Generation“Adaptive methods such as FLARE~jiang2023active, Self-RAG~asai2024selfrag, and DRAGIN~su2024dragin dynamically trigger retrieval based on uncertainty signals, but do so reactively, first detecting uncertainty and then blocking generation to perform retrieval.”
— Predictive Prefetching for Retrieval-Augmented Generation“However, their reliance on the LLM itself or the training data makes their generalization susceptible to data biases.”
— Know3-RAG: A Knowledge-aware RAG Framework with Adaptive Retrieval, Generation, and Filtering“Prior attempts like Self-RAG introduce special tokens to control reasoning but require architectural modifications.”
— Retrieval is Not Enough: Enhancing RAG Reasoning through Test-Time Critique and Optimization“However, these methods still operate at the document level, failing to adequately filter individual text chunks.”
— ChunkRAG: Novel LLM-Chunk Filtering Method for RAG Systems“this solution requires the training of two external models, requiring tens of thousands of additional training samples”
— Eliciting Critical Reasoning in Retrieval-Augmented Language Models via Contrastive Explanations“these methods typically incur high inference latency due to multiple LLM calls”
— Beyond Semantic Relevance: Counterfactual Risk Minimization for Robust Retrieval-Augmented Generation“While effective in identifying valuable documents, multiple LLM calls introduce substantial computation overhead.”
— InfoGain-RAG: Boosting Retrieval-Augmented Generation via Document Information Gain-based Reranking and Filtering“Self-RAG requires threshold tuning to balance QA performance and retrieval efficiency, while vanilla prompting is insufficient in guiding LLMs to make reliable retrieval decisions”
— RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering“While these methods improve robustness against irrelevant context, they typically operate via Breadth-First Addition: they append new passages to the existing context.”
— Replace, Don't Expand: Mitigating Context Dilution in Multi-Hop RAG via Fixed-Budget Evidence Assembly“However, these methods generally require substantial computational resources and API costs, making model updates challenging.”
— Rationale-Guided Retrieval Augmented Generation for Medical Question Answering“While effective, these approaches often add supervision, special control tokens, auxiliary probers, or multi-stage loops that increase engineering complexity and latency.”
— TARG: Training-Free Adaptive Retrieval Gating for Efficient RAG
Beaten on benchmarks
Head-to-head results where a newer method reports beating Self-RAG. Values are copied from the source paper's tables — verify against the cited paper.
- DeepNote: Note-Centric Deep Retrieval-Augmented Generation
DeepNote beats Self-RAG · f1 [Adaptive RAG baseline Self-RAG]
51.1 vs 44.4
- Predict the Retrieval! Test time adaptation for Retrieval Augmented Generation
TTARAG beats Self-RAG · Overall [Llama-2-7b-chat]
30.5 vs 19.8
- Feedback Adaptation for Retrieval-Augmented Generation
PatchRAG beats Self-RAG · NQ (Exact Match) [t (post-feedback)]
49.8 vs 36.4
- Feedback Adaptation for Retrieval-Augmented Generation
PatchRAG beats Self-RAG · TriviaQA (Exact Match) [t (post-feedback)]
83.9 vs 38.2
- Feedback Adaptation for Retrieval-Augmented Generation
PatchRAG beats Self-RAG · HotpotQA (F1) [t (post-feedback)]
53.2 vs 29.6
- Feedback Adaptation for Retrieval-Augmented Generation
PatchRAG beats Self-RAG · Average [t (post-feedback)]
62.3 vs 34.7
- RankCoT: Refining Knowledge for Retrieval-Augmented Generation through Ranking Chain-of-Thoughts
RankCoT beats Self-RAG · Avg. [full evaluation]
54.93 vs 47.61
- DRAG: Distilling RAG for SLMs from LLMs to Transfer Knowledge and Mitigate Hallucination via Evidence and Graph-based Distillation
DRAG beats Self-RAG · ARC-C [LLaMA-2-7B backbone]
86.2 vs 67.3
- Predictive Prefetching for Retrieval-Augmented Generation
predictive prefetching framework beats Self-RAG · F1 [HotpotQA]
75.1 vs 73.6
- Predictive Prefetching for Retrieval-Augmented Generation
predictive prefetching framework beats Self-RAG · E2E [HotpotQA]
5.2 vs 7.8
- Vendi-RAG: Adaptively Trading-Off Diversity And Quality Significantly Improves Retrieval Augmented Generation With LLMs
Vendi-RAG($s_1=0.8$) beats Self-RAG · Acc [MuSiQue]
30.4 vs 11.8
- Vendi-RAG: Adaptively Trading-Off Diversity And Quality Significantly Improves Retrieval Augmented Generation With LLMs
Vendi-RAG($s_1=0.8$) beats Self-RAG · Acc [HotpotQA]
58.4 vs 30.6
Newer alternatives
Recent methods in the same sub-problem, not yet superseded in the knowledge base.
- May 26, 2026
- May 18, 2026
- ConflictRAGConflictRAG: Detecting and Resolving Knowledge Conflicts in Retrieval Augmented GenerationMay 17, 2026
- SEMA-RAGSEMA-RAG: A Self-Evolving Multi-Agent Retrieval-Augmented Generation Framework for Medical ReasoningMay 16, 2026
- PyRAGRetrieval is Cheap, Show Me the Code: Executable Multi-Hop Reasoning for Retrieval-Augmented GenerationMay 13, 2026
- CoRM-RAGBeyond Semantic Relevance: Counterfactual Risk Minimization for Robust Retrieval-Augmented GenerationMay 2, 2026
- STEMSTEM: Structure-Tracing Evidence Mining for Knowledge Graphs-Driven Retrieval-Augmented GenerationApr 24, 2026
- Apr 22, 2026
- Self-Correcting RAGSelf-Correcting RAG: Enhancing Faithfulness via MMKP Context Selection and NLI-Guided MCTSApr 12, 2026
- Mar 7, 2026
- Cooperative Retrieval-Augmented Generation (CoRAG)Rethinking Retrieval-Augmented Generation as a Cooperative Decision-Making ProblemFeb 21, 2026
- Jan 29, 2026