RAG Papers

Updated on 2025.06.28

Publish Date Title Authors PDF Code
2025-06-26 Towards Adaptive Memory-Based Optimization for Enhanced Retrieval-Augmented Generation Qitao Qin et.al. 2504.05312 null
2025-06-26 PsyLite Technical Report Fangjun Ding et.al. 2506.21536 null
2025-06-26 EraRAG: Efficient and Incremental Retrieval Augmented Generation for Growing Corpora Fangyuan Zhang et.al. 2506.20963 null
2025-06-24 Talking to GDELT Through Knowledge Graphs Audun Myers et.al. 2503.07584 null
2025-06-24 Conversational Intent-Driven GraphRAG: Enhancing Multi-Turn Dialogue Systems through Adaptive Dual-Retrieval of Flow Patterns and Context Semantics Ziqi Zhu et.al. 2506.19385 null
2025-06-24 Inference Scaled GraphRAG: Improving Multi Hop Question Answering on Knowledge Graphs Travis Thompson et.al. 2506.19967 null
2025-06-23 RAPID: Long-Context Inference with Retrieval-Augmented Speculative Decoding Guanzheng Chen et.al. 2502.20330 link
2025-06-23 Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks Hongyuan Tao et.al. 2505.16901 null
2025-06-23 T-CPDL: A Temporal Causal Probabilistic Description Logic for Developing Logic-RAG Agent Hong Qing Yu et.al. 2506.18559 null
2025-06-23 LLMs on a Budget? Say HOLA Zohaib Hasan Siddiqui et.al. 2506.18952 null
2025-06-22 PDF Retrieval Augmented Question Answering Thi Thu Uyen Hoang et.al. 2506.18027 null
2025-06-22 A Comprehensive Graph Framework for Question Answering with Mode-Seeking Preference Alignment Quanwei Tang et.al. 2506.17951 null
2025-06-21 ParamMute: Suppressing Knowledge-Critical FFNs for Faithful Retrieval-Augmented Generation Pengcheng Huang et.al. 2502.15543 link
2025-06-20 Memory-enhanced Retrieval Augmentation for Long Video Understanding Huaying Yuan et.al. 2503.09149 null
2025-06-19 From RAG to Memory: Non-Parametric Continual Learning for Large Language Models Bernal Jiménez Gutiérrez et.al. 2502.14802 link
2025-06-18 AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented Efficient Long Video Understanding Zhucun Xue et.al. 2506.13589 null
2025-06-18 Research on Graph-Retrieval Augmented Generation Based on Historical Text Knowledge Graphs Yang Fan et.al. 2506.15241 null
2025-06-17 Automated Decision-Making on Networks with LLMs through Knowledge-Guided Evolution Xiaohan Zheng et.al. 2506.14529 null
2025-06-17 Graph RAG for Legal Norms: A Hierarchical, Temporal and Deterministic Approach Hudson de Martim et.al. 2505.00039 null
2025-06-16 SimpleDoc: Multi-Modal Document Understanding with Dual-Cue Page Retrieval and Iterative Refinement Chelsi Jain et.al. 2506.14035 link
2025-06-16 Arctic Long Sequence Training: Scalable And Efficient Training For Multi-Million Token Sequences Stas Bekman et.al. 2506.13996 link
2025-06-15 Reasoning with RAGged events: RAG-Enhanced Event Knowledge Base Construction and reasoning with proof-assistants Stergios Chatzikyriakidis et.al. 2506.07042 null
2025-06-15 SlimRAG: Retrieval without Graphs via Entity-Aware Context Selection Jiale Zhang et.al. 2506.17288 null
2025-06-14 FlexRAG: A Flexible and Comprehensive Framework for Retrieval-Augmented Generation Zhuocheng Zhang et.al. 2506.12494 link
2025-06-14 MALM: A Multi-Information Adapter for Large Language Models to Mitigate Hallucination Ao Jia et.al. 2506.12483 null
2025-06-14 FastRAG: Retrieval Augmented Generation for Semi-structured Data Amar Abane et.al. 2411.13773 null
2025-06-13 Maximally-Informative Retrieval for State Space Model Generation Evan Becker et.al. 2506.12149 null
2025-06-12 FedRAG: A Framework for Fine-Tuning Retrieval-Augmented Generation Systems Val Andrei Fajardo et.al. 2506.09200 link
2025-06-11 KG-Infused RAG: Augmenting Corpus-Based RAG with External Knowledge Graphs Dingjun Wu et.al. 2506.09542 null
2025-06-11 Learning Efficient and Generalizable Graph Retriever for Knowledge-Graph Question Answering Tianjun Yao et.al. 2506.09645 link
2025-06-11 Traceable LLM-based validation of statements in knowledge graphs Daniel Adam et.al. 2409.07507 link
2025-06-11 CC-RAG: Structured Multi-Hop Reasoning via Theme-Based Causal Graphs Jash Rajesh Parekh et.al. 2506.08364 null
2025-06-10 XGraphRAG: Interactive Visual Analysis for Graph-based Retrieval-Augmented Generation Ke Wang et.al. 2506.13782 link
2025-06-09 Graph-KV: Breaking Sequence via Injecting Structural Biases into Large Language Models Haoyu Wang et.al. 2506.07334 null
2025-06-09 LEANN: A Low-Storage Vector Index Yichuan Wang et.al. 2506.08276 null
2025-06-09 Hierarchical Lexical Graph for Enhanced Multi-Hop Retrieval Abdellah Ghassel et.al. 2506.08074 link
2025-06-09 SceneRAG: Scene-level Retrieval-Augmented Generation for Video Understanding Nianbo Zeng et.al. 2506.07600 null
2025-06-09 LlamaRec-LKG-RAG: A Single-Pass, Learnable Knowledge Graph-RAG Framework for LLM-Based Ranking Vahid Azizi et.al. 2506.07449 link
2025-06-08 Question Answering under Temporal Conflict: Evaluating and Organizing Evolving Knowledge with LLMs Atahan Özer et.al. 2506.07270 null
2025-06-08 KG2QA: Knowledge Graph-enhanced Retrieval-Augmented Generation for Communication Standards Question Answering Zhongze Luo et.al. 2506.07037 null
2025-06-07 Graph-based RAG Enhancement via Global Query Disambiguation and Dependency-Aware Reranking Ningyuan Li et.al. 2506.11106 null
2025-06-06 Respecting Temporal-Causal Consistency: Entity-Event Knowledge Graphs for Retrieval-Augmented Generation Ze Yu Zhang et.al. 2506.05939 null
2025-06-06 BioMol-MQA: A Multi-Modal Question Answering Dataset For LLM Reasoning Over Bio-Molecular Interactions Saptarshi Sengupta et.al. 2506.05766 null
2025-06-06 DORAEMON: Decentralized Ontology-aware Reliable Agent with Enhanced Memory Oriented Navigation Tianjun Gu et.al. 2505.21969 link
2025-06-06 Building Models of Neurological Language Henry Watkins et.al. 2506.06208 null
2025-06-06 Large Language Models are Good Relational Learners Fang Wu et.al. 2506.05725 null
2025-06-06 When to use Graphs in RAG: A Comprehensive Analysis for Graph Retrieval-Augmented Generation Zhishang Xiang et.al. 2506.05690 link
2025-06-06 E^2GraphRAG: Streamlining Graph-based RAG for High Efficiency and Effectiveness Yibo Zhao et.al. 2505.24226 null
2025-06-05 Dynamic Context Tuning for Retrieval-Augmented Generation: Enhancing Multi-Turn Planning and Tool Adaptation Jubin Abhishek Soni et.al. 2506.11092 null
2025-06-05 From Standalone LLMs to Integrated Intelligence: A Survey of Compound Al Systems Jiayi Chen et.al. 2506.04565 null
2025-06-03 Beyond RAG: Reinforced Reasoning Augmented Generation for Clinical Notes Lo Pang-Yun Ting et.al. 2506.05386 null
2025-06-03 MotionRAG-Diff: A Retrieval-Augmented Diffusion Framework for Long-Term Music-to-Dance Generation Mingyang Huang et.al. 2506.02661 null
2025-06-02 SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation Aurick Qiao et.al. 2410.03960 null
2025-06-02 Improving Factuality with Explicit Working Memory Mingda Chen et.al. 2412.18069 null
2025-06-02 Guiding Generative Storytelling with Knowledge Graphs Zhijun Pan et.al. 2505.24803 null
2025-06-02 DRAG: Distilling RAG for SLMs from LLMs to Transfer Knowledge and Mitigate Hallucination via Evidence and Graph-based Distillation Jennifer Chen et.al. 2506.01954 null
2025-06-02 Retrieval-Augmented Generation of Ontologies from Relational Databases Mojtaba Nayyeri et.al. 2506.01232 null
2025-06-01 A Survey of LLM $\times$ DATA Xuanhe Zhou et.al. 2505.18458 link
2025-06-01 A Graph-Retrieval-Augmented Generation Framework Enhances Decision-Making in the Circular Economy Yang Zhao et.al. 2506.04252 null
2025-06-01 RARE: Retrieval-Aware Robustness Evaluation for Retrieval-Augmented Generation Systems Yixiao Zeng et.al. 2506.00789 link
2025-05-31 OntoRAG: Enhancing Question-Answering through Automated Ontology Derivation from Unstructured Knowledge Bases Yash Tiwari et.al. 2506.00664 null
2025-05-29 AgentNet: Decentralized Evolutionary Coordination for LLM-based Multi-Agent Systems Yingxuan Yang et.al. 2504.00587 null
2025-05-29 Diagnosing and Addressing Pitfalls in KG-RAG Datasets: Toward More Reliable Benchmarking Liangliang Zhang et.al. 2505.23495 null
2025-05-29 SimGRAG: Leveraging Similar Subgraphs for Knowledge Graphs Driven Retrieval-Augmented Generation Yuzheng Cai et.al. 2412.15272 link
2025-05-29 Retrieval Augmented Generation based Large Language Models for Causality Mining Thushara Manjari Naduvilakandy et.al. 2505.23944 link
2025-05-29 BugRepro: Enhancing Android Bug Reproduction with Domain-Specific Knowledge Integration Hongrong Yin et.al. 2505.14528 null
2025-05-28 Towards Efficient Key-Value Cache Management for Prefix Prefilling in LLM Inference Yue Zhu et.al. 2505.21919 null
2025-05-28 Contextual Memory Intelligence – A Foundational Paradigm for Human-AI Collaboration and Reflective Generative AI Systems Kristy Wedel et.al. 2506.05370 null
2025-05-28 MemOS: An Operating System for Memory-Augmented Generation (MAG) in Large Language Models Zhiyu Li et.al. 2505.22101 null
2025-05-28 AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenges Ranjan Sapkota et.al. 2505.10468 null
2025-05-28 SkewRoute: Training-Free LLM Routing for Knowledge Graph Retrieval-Augmented Generation via Score Skewness of Retrieved Context Hairu Wang et.al. 2505.23841 null
2025-05-28 Walk&Retrieve: Simple Yet Effective Zero-shot Retrieval-Augmented Generation via Knowledge Graph Walks Martin Böckling et.al. 2505.16849 link
2025-05-28 Knowledge Graph Retrieval-Augmented Generation for LLM-based Recommendation Shijie Wang et.al. 2501.02226 null
2025-05-28 Rethinking Hybrid Retrieval: When Small Embeddings and LLM Re-ranking Beat Bigger Models Arjun Rao et.al. 2506.00049 null
2025-05-27 DynamicKV: Task-Aware Adaptive KV Cache Compression for Long Context LLMs Xiabin Zhou et.al. 2412.14838 null
2025-05-27 Long Context Scaling: Divide and Conquer via Multi-Agent Question-driven Collaboration Sibo Xiao et.al. 2505.20625 null
2025-05-27 Complex System Diagnostics Using a Knowledge Graph-Informed and Large Language Model-Enhanced Framework Saman Marandi et.al. 2505.21291 null
2025-05-27 Path Pooling: Training-Free Structure Enhancement for Efficient Knowledge Graph Retrieval-Augmented Generation Hairu Wang et.al. 2503.05203 null
2025-05-26 DGRAG: Distributed Graph-based Retrieval-Augmented Generation in Edge-Cloud Systems Wenqing Zhou et.al. 2505.19847 null
2025-05-26 LLM-Agent-Controller: A Universal Multi-Agent Large Language Model System as a Control Engineer Rasoul Zahedifar et.al. 2505.19567 null
2025-05-26 Benchmarking Multimodal Knowledge Conflict for Large Multimodal Models Yifan Jia et.al. 2505.19509 link
2025-05-26 KnowTrace: Bootstrapping Iterative Retrieval-Augmented Generation with Structured Knowledge Tracing Rui Li et.al. 2505.20245 link
2025-05-26 GTR: Graph-Table-RAG for Cross-Table Question Answering Jiaru Zou et.al. 2504.01346 null
2025-05-25 Accelerating Adaptive Retrieval Augmented Generation via Instruction-Driven Representation Reduction of Retrieval Overlaps Jie Ou et.al. 2505.12731 null
2025-05-25 Simplifying Data Integration: SLM-Driven Systems for Unified Semantic Queries Across Heterogeneous Databases Teng Lin et.al. 2504.05634 null
2025-05-24 BRIT: Bidirectional Retrieval over Unified Image-Text Graph Ainulla Khan et.al. 2505.18450 null
2025-05-23 Hydra: Structured Cross-Source Enhanced Large Language Model Reasoning Xingyu Tan et.al. 2505.17464 null
2025-05-22 LSM-VEC: A Large-Scale Disk-Based System for Dynamic Vector Search Shurui Zhong et.al. 2505.17152 null
2025-05-22 CUB: Benchmarking Context Utilisation Techniques for Language Models Lovisa Hagström et.al. 2505.16518 null
2025-05-22 Cosmos: A CXL-Based Full In-Memory System for Approximate Nearest Neighbor Search Seoyoung Ko et.al. 2505.16096 null
2025-05-22 FiDeLiS: Faithful Reasoning in Large Language Model for Knowledge Graph Question Answering Yuan Sui et.al. 2405.13873 null
2025-05-22 Align-GRAG: Reasoning-Guided Dual Alignment for Graph Retrieval-Augmented Generation Derong Xu et.al. 2505.16237 null
2025-05-22 Hallucination Detection in LLMs with Topological Divergence on Attention Graphs Alexandra Bazarova et.al. 2504.10063 null
2025-05-21 Enhancing Large Language Models (LLMs) for Telecommunications using Knowledge Graphs and Retrieval-Augmented Generation Dun Yuan et.al. 2503.24245 null
2025-05-21 HDLxGraph: Bridging Large Language Models and HDL Repositories via HDL Graph Databases Pingqing Zheng et.al. 2505.15701 link
2025-05-21 Single LLM, Multiple Roles: A Unified Retrieval-Augmented Generation Framework Using Role-Specific Token Optimization Yutao Zhu et.al. 2505.15444 null
2025-05-20 Scaling Test-Time Inference with Policy-Optimized, Dynamic Retrieval-Augmented Generation via KV Caching and Decoding Sakhinana Sagar Srinivas et.al. 2504.01281 null
2025-05-20 MultiHal: Multilingual Dataset for Knowledge-Graph Grounded Evaluation of LLM Hallucinations Ernests Lavrinovics et.al. 2505.14101 link
2025-05-20 Beyond Chains: Bridging Large Language Models and Knowledge Bases in Complex Question Answering Yihua Zhu et.al. 2505.14099 null
2025-05-20 Divide by Question, Conquer by Agent: SPLIT-RAG with Question-Driven Graph Partitioning Ruiyi Yang et.al. 2505.13994 null
2025-05-19 SubGCache: Accelerating Graph-based RAG with Subgraph-level KV Cache Qiuyu Zhu et.al. 2505.10951 null
2025-05-19 Know3-RAG: A Knowledge-aware RAG Framework with Adaptive Retrieval, Generation, and Filtering Xukai Liu et.al. 2505.12662 link
2025-05-19 Beyond Single Pass, Looping Through Time: KG-IRAG with Iterative Knowledge Retrieval Ruiyi Yang et.al. 2503.14234 null
2025-05-19 Optimizing Retrieval Augmented Generation for Object Constraint Language Kevin Chenhao Li et.al. 2505.13129 null
2025-05-19 Evaluating the Performance of RAG Methods for Conversational AI in the Airport Domain Yuyang Li et.al. 2505.13006 null
2025-05-19 CoT-RAG: Integrating Chain of Thought and Retrieval-Augmented Generation to Enhance Reasoning in Large Language Models Feiyang Li et.al. 2504.13534 null
2025-05-18 GNN-ACLP: Graph Neural Networks based Analog Circuit Link Prediction Guanyuan Pan et.al. 2504.10240 null
2025-05-17 Telco-oRAG: Optimizing Retrieval-augmented Generation for Telecom Queries via Hybrid Retrieval and Neural Routing Andrei-Laurentiu Bornea et.al. 2505.11856 null
2025-05-17 A Pilot Empirical Study on When and How to Use Knowledge Graphs as Retrieval Augmented Generation Xujie Yuan et.al. 2502.20854 null
2025-05-17 Let’s have a chat with the EU AI Act Adam Kovari et.al. 2505.11946 null
2025-05-17 ELITE: Embedding-Less retrieval with Iterative Text Exploration Zhangyu Wang et.al. 2505.11908 link
2025-05-17 DO-RAG: A Domain-Specific QA Framework Using Knowledge Graph-Enhanced Retrieval-Augmented Generation David Osei Opoku et.al. 2505.17058 null
2025-05-16 mmRAG: A Modular Benchmark for Retrieval-Augmented Generation over Text, Tables, and Knowledge Graphs Chuan Xu et.al. 2505.11180 link
2025-05-16 Leveraging Graph Retrieval-Augmented Generation to Support Learners’ Understanding of Knowledge Concepts in MOOCs Mohamed Abdelmagied et.al. 2505.10074 null
2025-05-16 Empowering Agentic Video Analytics Systems with Video Language Models Yuxuan Yan et.al. 2505.00254 null
2025-05-15 Personalizing Large Language Models using Retrieval Augmented Generation and Knowledge Graph Deeksha Prahlad et.al. 2505.09945 link
2025-05-15 GE-Chat: A Graph Enhanced RAG Framework for Evidential Response Generation of LLMs Longchao Da et.al. 2505.10143 null
2025-05-13 Enhancing Cache-Augmented Generation (CAG) with Adaptive Contextual Compression for Scalable Knowledge Integration Rishabh Agrawal et.al. 2505.08261 null
2025-05-12 RealRAG: Retrieval-augmented Realistic Image Generation via Self-reflective Contrastive Learning Yuanhuiyi Lyu et.al. 2502.00848 null
2025-05-12 KAQG: A Knowledge-Graph-Enhanced RAG for Difficulty-Controlled Question Generation Ching Han Chen et.al. 2505.07618 null
2025-05-12 GRADA: Graph-based Reranker against Adversarial Documents Attack Jingjie Zheng et.al. 2505.07546 link
2025-05-09 Bridging Legal Knowledge and AI: Retrieval-Augmented Generation with Vector Stores, Knowledge Graphs, and Hierarchical Non-negative Matrix Factorization Ryan C. Barron et.al. 2502.20364 link
2025-05-09 ArtRAG: Retrieval-Augmented Generation with Structured Context for Visual Art Understanding Shuai Wang et.al. 2505.06020 null
2025-05-08 KG-HTC: Integrating Knowledge Graphs into LLMs for Effective Zero-shot Hierarchical Text Classification Qianbo Zang et.al. 2505.05583 link
2025-05-06 Lightweight Clinical Decision Support System using QLoRA-Fine-Tuned LLMs and Retrieval-Augmented Generation Mohammad Shoaib Ansari et.al. 2505.03406 null
2025-05-05 KG-Retriever: Efficient Knowledge Indexing for Retrieval-Augmented Large Language Models Weijie Chen et.al. 2412.05547 link
2025-05-04 Real-time Spatial Retrieval Augmented Generation for Urban Environments David Nazareno Campo et.al. 2505.02271 null
2025-05-04 Incorporating Legal Structure in Retrieval-Augmented Generation: A Case Study on Copyright Fair Use Justin Ho et.al. 2505.02164 link
2025-05-03 Harnessing the Power of LLMs, Informers and Decision Transformers for Intent-driven RAN Management in 6G Md Arafat Habib et.al. 2505.01841 null
2025-05-02 CaGR-RAG: Context-aware Query Grouping for Disk-based Vector Search in RAG Systems Yeonwoo Jeong et.al. 2505.01164 null
2025-04-30 Optimization of embeddings storage for RAG systems using quantization and dimensionality reduction techniques Naamán Huerga-Pérez et.al. 2505.00105 null
2025-04-30 LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household Robotics Marc Glocker et.al. 2504.21716 link
2025-04-28 Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory Prateek Chhikara et.al. 2504.19413 null
2025-04-26 Exploring the Role of Knowledge Graph-Based RAG in Japanese Medical Question Answering with Small-Scale LLMs Yingjian Chen et.al. 2504.10982 null
2025-04-26 Building Scalable AI-Powered Applications with Cloud Databases: Architectures, Best Practices and Performance Considerations Santosh Bhupathi et.al. 2504.18793 null
2025-04-25 PropRAG: Guiding Retrieval with Beam Search over Proposition Paths Jingjin Wang et.al. 2504.18070 null
2025-04-24 Synergizing RAG and Reasoning: A Systematic Review Yunfan Gao et.al. 2504.15909 null
2025-04-23 LLM-assisted Graph-RAG Information Extraction from IFC Data Sima Iranmanesh et.al. 2504.16813 null
2025-04-21 Efficient Document Retrieval with G-Retriever Manthankumar Solanki et.al. 2504.14955 link
2025-04-20 Understanding and Optimizing Multi-Stage AI Inference Pipelines Abhimanyu Rajeshkumar Bambhaniya et.al. 2504.09775 null
2025-04-20 Reasoning-Enhanced Healthcare Predictions with Knowledge Graph Community Retrieval Pengcheng Jiang et.al. 2410.04585 link
2025-04-19 Bio-Eng-LMM AI Assist chatbot: A Comprehensive Tool for Research and Education Ali Forootani et.al. 2409.07110 link
2025-04-17 RAGDoll: Efficient Offloading-based Online RAG System on a Single GPU Weiping Yu et.al. 2504.15302 null
2025-04-17 Accommodate Knowledge Conflicts in Retrieval-augmented LLMs: Towards Reliable Response Generation in the Wild Jiatai Wang et.al. 2504.12982 null
2025-04-17 InstructRAG: Leveraging Retrieval-Augmented Generation on Instruction Graphs for LLM-Based Task Planning Zheng Wang et.al. 2504.13032 null
2025-04-17 CDF-RAG: Causal Dynamic Feedback for Adaptive Retrieval-Augmented Generation Elahe Khatibi et.al. 2504.12560 link
2025-04-16 Shared Disk KV Cache Management for Efficient Multi-Instance Inference in RAG-Powered LLMs Hyungwoo Lee et.al. 2504.11765 null
2025-04-15 NodeRAG: Structuring Graph-based RAG with Heterogeneous Nodes Tianyang Xu et.al. 2504.11544 null
2025-04-15 Timing Analysis Agent: Autonomous Multi-Corner Multi-Mode (MCMM) Timing Debugging with Timing Debug Relation Graph Jatin Nainani et.al. 2504.11502 null
2025-04-14 CodeRAG: Supportive Code Retrieval on Bigraph for Real-World Code Generation Jia Li et.al. 2504.10046 null
2025-04-14 DataMosaic: Explainable and Verifiable Multi-Modal Data Analytics through Extract-Reason-Verify Zhengxuan Zhang et.al. 2504.10036 null
2025-04-14 Learning to Erase Private Knowledge from Multi-Documents for Retrieval-Augmented Large Language Models Yujing Wang et.al. 2504.09910 null
2025-04-14 RAKG:Document-level Retrieval Augmented Knowledge Graph Construction Hairong Zhang et.al. 2504.09823 link
2025-04-13 HM-RAG: Hierarchical Multi-Agent Multimodal Retrieval Augmented Generation Pei Liu et.al. 2504.12330 link
2025-04-12 Semantic Commit: Helping Users Update Intent Specifications for AI Memory at Scale Priyan Vaithilingam et.al. 2504.09283 null
2025-04-11 An Adaptive Vector Index Partitioning Scheme for Low-Latency RAG Pipeline Junkyum Kim et.al. 2504.08930 null
2025-04-11 PCA-RAG: Principal Component Analysis for Efficient Retrieval-Augmented Generation Arman Khaledian et.al. 2504.08386 null
2025-04-11 Knowledge Graph-extended Retrieval Augmented Generation for Question Answering Jasper Linders et.al. 2504.08893 null
2025-04-11 HyperCore: The Core Framework for Building Hyperbolic Foundation Models with Comprehensive Modules Neil He et.al. 2504.08912 link
2025-04-10 ConceptFormer: Towards Efficient Use of Knowledge-Graph Embeddings in Large Language Models Joel Barmettler et.al. 2504.07624 null
2025-04-09 MemoRAG: Boosting Long Context Processing with Global Memory-Enhanced Retrieval Augmentation Hongjin Qian et.al. 2409.05591 link
2025-04-08 Decentralizing AI Memory: SHIMI, a Semantic Hierarchical Memory Index for Scalable Agent Reasoning Tooraj Helmi et.al. 2504.06135 null
2025-04-08 MicroNN: An On-device Disk-resident Updatable Vector Database Jeffrey Pound et.al. 2504.05573 null
2025-04-08 Graph-based Approaches and Functionalities in Retrieval-Augmented Generation: A Comprehensive Survey Zulun Zhu et.al. 2504.10499 null
2025-04-07 LLM meets ML: Data-efficient Anomaly Detection on Unseen Unstable Logs Fatemeh Hadadi et.al. 2406.07467 null
2025-04-07 GraphRAFT: Retrieval Augmented Fine-Tuning for Knowledge Graphs on Graph Databases Alfred Clemedtson et.al. 2504.05478 link
2025-04-07 Evaluating Knowledge Graph Based Retrieval Augmented Generation Methods under Knowledge Incompleteness Dongzhuoran Zhou et.al. 2504.05163 null
2025-04-06 Hierarchical Planning for Complex Tasks with Knowledge Graph-RAG and Symbolic Verification Cristina Cornelio et.al. 2504.04578 null
2025-04-06 Driving-RAG: Driving Scenarios Embedding, Search, and RAG Applications Cheng Chang et.al. 2504.04419 null
2025-04-04 Rotation Invariance in Floor Plan Digitization using Zernike Moments Marius Graumann et.al. 2504.03241 null
2025-04-03 HyperRAG: Enhancing Quality-Efficiency Tradeoffs in Retrieval-Augmented Generation with Reranker KV-Cache Reuse Yuwei An et.al. 2504.02921 null
2025-04-03 CacheBlend: Fast Large Language Model Serving for RAG with Cached Knowledge Fusion Jiayi Yao et.al. 2405.16444 link
2025-04-03 Retrieval-Augmented Purifier for Robust LLM-Empowered Recommendation Liangbo Ning et.al. 2504.02458 null
2025-04-02 Biomedical Question Answering via Multi-Level Summarization on a Local Knowledge Graph Lingxiao Guan et.al. 2504.01309 null
2025-04-01 Beyond Quacking: Deep Integration of Language Models and RAG into DuckDB Anas Dorbani et.al. 2504.01157 null
2025-04-01 TOBUGraph: Knowledge Graph-Based Retrieval for Enhanced LLM Performance Beyond RAG Savini Kashmira et.al. 2412.05447 null
2025-03-29 Memory-Aware and Uncertainty-Guided Retrieval for Multi-Hop Question Answering Yuelyu Ji et.al. 2503.23095 null
2025-03-28 Knowledge graph enhanced retrieval-augmented generation for failure mode and effects analysis Lukas Bahr et.al. 2406.18114 link
2025-03-27 MemInsight: Autonomous Memory Augmentation for LLM Agents Rana Salama et.al. 2503.21760 null
2025-03-27 Tricking Retrievers with Influential Tokens: An Efficient Black-Box Corpus Poisoning Attack Cheng Wang et.al. 2503.21315 null
2025-03-26 Retro-li: Small-Scale Retrieval Augmented Generation Supporting Noisy Similarity Searches and Domain Shift Generalization Gentiana Rashiti et.al. 2410.00004 link
2025-03-21 Autonomous Radiotherapy Treatment Planning Using DOLA: A Privacy-Preserving, LLM-Based Optimization Agent Humza Nusrat et.al. 2503.17553 null
2025-03-20 Financial Analysis: Intelligent Financial Data Analysis System Based on LLM-RAG Jingru Wang et.al. 2504.06279 null
2025-03-20 Tuning LLMs by RAG Principles: Towards LLM-native Memory Jiale Wei et.al. 2503.16071 link
2025-03-20 Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models Baolong Bi et.al. 2503.15888 link
2025-03-14 AIstorian lets AI be a historian: A KG-powered multi-agent system for accurate biography generation Fengyu Li et.al. 2503.11346 link
2025-03-14 RAG-KG-IL: A Multi-Agent Hybrid Framework for Reducing Hallucinations and Enhancing LLM Reasoning through RAG and Incremental Knowledge Graph Learning Integration Hong Qing Yu et.al. 2503.13514 null
2025-03-09 Training Sparse Mixture Of Experts Text Embedding Models Zach Nussbaum et.al. 2502.07972 link
2025-03-07 Leveraging Approximate Caching for Faster Retrieval-Augmented Generation Shai Bergman et.al. 2503.05530 null
2025-03-07 Interpersonal Memory Matters: A New Task for Proactive Dialogue Utilizing Conversational History Bowen Wu et.al. 2503.05150 link
2025-03-06 Beyond RAG: Task-Aware KV Cache Compression for Comprehensive Knowledge Reasoning Giulio Corallo et.al. 2503.04973 null
2025-03-05 A Comprehensive Framework for Reliable Legal AI: Combining Specialized Expert Systems and Adaptive Refinement Sidra Nasir et.al. 2412.20468 null
2025-03-04 RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards Xinze Li et.al. 2410.13509 link
2025-02-28 TeleRAG: Efficient Retrieval-Augmented Generation Inference with Lookahead Retrieval Chien-Yu Lin et.al. 2502.20969 null
2025-02-26 CommGPT: A Graph and Retrieval-Augmented Multimodal Communication Foundation Model Feibo Jiang et.al. 2502.18763 null
2025-02-26 EMERGE: Enhancing Multimodal Electronic Health Records Predictive Modeling with Retrieval-Augmented Generation Yinghao Zhu et.al. 2406.00036 null
2025-02-25 Rank1: Test-Time Compute for Reranking in Information Retrieval Orion Weller et.al. 2502.18418 link
2025-02-25 OpenThaiGPT 1.5: A Thai-Centric Open Source Large Language Model Sumeth Yuenyong et.al. 2411.07238 null
2025-02-23 Don’t Do RAG: When Cache-Augmented Generation is All You Need for Knowledge Tasks Brian J Chan et.al. 2412.15605 link
2025-02-21 OmniQuery: Contextually Augmenting Captured Multimodal Memory to Enable Personal Question Answering Jiahao Nick Li et.al. 2409.08250 null
2025-02-21 From Documents to Dialogue: Building KG-RAG Enhanced AI Assistants Manisha Mukherjee et.al. 2502.15237 null
2025-02-21 Knowledge Pyramid Construction for Multi-Level Retrieval-Augmented Generation Rubing Chen et.al. 2407.21276 null
2025-02-19 DH-RAG: A Dynamic Historical Context-Powered Retrieval-Augmented Generation Method for Multi-Turn Dialogue Feiyuan Zhang et.al. 2502.13847 null
2025-02-18 RAPID: Retrieval Augmented Training of Differentially Private Diffusion Models Tanqiu Jiang et.al. 2502.12794 link
2025-02-17 Does RAG Really Perform Bad For Long-Context Processing? Kun Luo et.al. 2502.11444 null
2025-02-16 Streamlining the Collaborative Chain of Models into A Single Forward Pass in Generation-Based Tasks Yuanjie Lyu et.al. 2502.11083 link
2025-02-16 RuleRAG: Rule-Guided Retrieval-Augmented Generation with Language Models for Question Answering Zhongwu Chen et.al. 2410.22353 link
2025-02-13 GraNNite: Enabling High-Performance Execution of Graph Neural Networks on Resource-Constrained Neural Processing Units Arghadip Das et.al. 2502.06921 link
2025-02-13 Knowledge-Enhanced Program Repair for Data Science Code Shuyin Ouyang et.al. 2502.09771 null
2025-02-12 APE: Faster and Longer Context-Augmented Generation via Adaptive Parallel Encoding Xinyu Yang et.al. 2502.05431 link
2025-02-12 How to Build an Adaptive AI Tutor for Any Course Using Knowledge Graph-Enhanced Retrieval-Augmented Generation (KG-RAG) Chenxi Dong et.al. 2311.17696 null
2025-02-10 LLM-based SPARQL Query Generation from Natural Language over Federated Knowledge Graphs Vincent Emonet et.al. 2410.06062 link
2025-02-10 Think-on-Graph 2.0: Deep and Faithful Large Language Model Reasoning with Knowledge-guided Retrieval Augmented Generation Shengjie Ma et.al. 2407.10805 link
2025-02-08 Knowledge Graph-Guided Retrieval Augmented Generation Xiangrong Zhu et.al. 2502.06864 link
2025-02-07 Efficient Knowledge Feeding to Language Models: A Novel Integrated Encoder-Decoder Architecture S Santosh Kumar et.al. 2502.05233 null
2025-02-07 Agentic Reasoning: Reasoning LLMs with Tools for the Deep Research Junde Wu et.al. 2502.04644 link
2025-02-06 MedRAG: Enhancing Retrieval-augmented Generation with Knowledge Graph-Elicited Reasoning for Healthcare Copilot Xuejiao Zhao et.al. 2502.04413 link
2025-02-05 Cache-Craft: Managing Chunk-Caches for Efficient Retrieval-Augmented Generation Shubham Agarwal et.al. 2502.15734 null
2025-02-05 Simple Is Effective: The Roles of Graphs and Large Language Models in Knowledge-Graph-Based Retrieval-Augmented Generation Mufei Li et.al. 2410.20724 link
2025-02-04 Plan*RAG: Efficient Test-Time Planning for Retrieval Augmented Generation Prakhar Verma et.al. 2410.20753 null
2025-02-03 Augmented Knowledge Graph Querying leveraging LLMs Marco Arazzi et.al. 2502.01298 null
2025-02-03 Graphusion: A RAG Framework for Knowledge Graph Construction with a Global Perspective Rui Yang et.al. 2410.17600 link
2025-01-24 GraPPI: A Retrieve-Divide-Solve GraphRAG Framework for Large-scale Protein-protein Interaction Exploration Ziwen Li et.al. 2501.16382 link
2025-01-24 Fast Think-on-Graph: Wider, Deeper and Faster Reasoning of Large Language Model on Knowledge Graph Xujian Liang et.al. 2501.14300 link
2025-01-23 Parallel Key-Value Cache Fusion for Position Invariant RAG Philhoon Oh et.al. 2501.07523 null
2025-01-22 FRAG: A Flexible Modular Framework for Retrieval-Augmented Generation based on Knowledge Graphs Zengyi Gao et.al. 2501.09957 null
2025-01-21 Embodied-RAG: General Non-parametric Embodied Memory for Retrieval and Generation Quanting Xie et.al. 2409.18313 null
2025-01-20 Zep: A Temporal Knowledge Graph Architecture for Agent Memory Preston Rasmussen et.al. 2501.13956 link
2025-01-20 Explainable Lane Change Prediction for Near-Crash Scenarios Using Knowledge Graph Embeddings and Retrieval Augmented Generation M. Manzour et.al. 2501.11560 null
2025-01-17 4bit-Quantization in Vector-Embedding for RAG Taehee Jeong et.al. 2501.10534 link
2025-01-16 Adaptive Contextual Caching for Mobile Edge Large Language Model Service Guangyuan Liu et.al. 2501.09383 null
2025-01-14 Addressing the sustainable AI trilemma: a case study on LLM agents and RAG Hui Wu et.al. 2501.08262 link
2025-01-13 Research on the Online Update Method for Retrieval-Augmented Generation (RAG) Model with Incremental Learning Yuxin Fan et.al. 2501.07063 null
2024-12-31 EdgeRAG: Online-Indexed RAG for Edge Devices Korakit Seemakhupt et.al. 2412.21023 null
2024-12-31 CancerKG.ORG A Web-scale, Interactive, Verifiable Knowledge Graph-LLM Hybrid for Assisting with Optimal Cancer Treatment and Care Michael Gubanov et.al. 2501.00223 null
2024-12-27 Casevo: A Cognitive Agents and Social Evolution Simulator Zexun Jiang et.al. 2412.19498 link
2024-12-25 RAGONITE: Iterative Retrieval on Induced Databases and Verbalized RDF for Conversational QA over KGs with RAG Rishiraj Saha Roy et.al. 2412.17690 null
2024-12-19 LLaVA Needs More Knowledge: Retrieval Augmented Natural Language Generation with Knowledge Graph for Explaining Thoracic Pathologies Ameer Hamza et.al. 2410.04749 link
2024-12-16 Towards Understanding Systems Trade-offs in Retrieval-Augmented Generation Model Inference Michael Shen et.al. 2412.11854 null
2024-12-16 Let your LLM generate a few tokens and you will reduce the need for retrieval Hervé Déjean et.al. 2412.11536 null
2024-12-16 GraphInsight: Unlocking Insights in Large Language Models for Graph Structure Understanding Yukun Cao et.al. 2409.03258 null
2024-12-14 Accelerating Retrieval-Augmented Generation Derrick Quinn et.al. 2412.15246 null
2024-12-10 Generating Knowledge Graphs from Large Language Models: A Comparative Study of GPT-4, LLaMA 2, and BERT Ahan Bhatt et.al. 2412.07412 null
2024-12-10 Leveraging LLM for Automated Ontology Extraction and Knowledge Graph Generation Mohammad Sadeq Abolhasani et.al. 2412.00608 null
2024-12-06 SurgBox: Agent-Driven Operating Room Sandbox with Surgery Copilot Jinlin Wu et.al. 2412.05187 link
2024-12-05 Hybrid-SQuAD: Hybrid Scholarly Question Answering Dataset Tilahun Abedissa Taffa et.al. 2412.02788 null
2024-12-04 Advancing Conversational Psychotherapy: Integrating Privacy, Dual-Memory, and Domain Expertise with Large Language Models XiuYu Zhang et.al. 2412.02987 null
2024-11-29 Knowledge Management for Automobile Failure Analysis Using Graph RAG Yuta Ojima et.al. 2411.19539 null
2024-11-20 Multimodal large language model for wheat breeding: a new exploration of smart breeding Guofeng Yang et.al. 2411.15203 null
2024-11-18 Towards Evaluating Large Language Models for Graph Query Generation Siraj Munir et.al. 2411.08449 null
2024-11-17 On-Board Vision-Language Models for Personalized Autonomous Vehicle Motion Control: System Design and Real-World Validation Can Cui et.al. 2411.11913 null
2024-11-11 Toward Optimal Search and Retrieval for RAG Alexandria Leto et.al. 2411.07396 link
2024-11-11 AssistRAG: Boosting the Potential of Large Language Models with an Intelligent Information Assistant Yujia Zhou et.al. 2411.06805 link
2024-11-07 AMSnet-KG: A Netlist Dataset for LLM-based AMS Circuit Auto-Design Using Knowledge Graph RAG Yichen Shi et.al. 2411.13560 null
2024-11-04 RAGViz: Diagnose and Visualize Retrieval-Augmented Generation Tevin Wang et.al. 2411.01751 link
2024-11-01 CRAG – Comprehensive RAG Benchmark Xiao Yang et.al. 2406.04744 link
2024-10-30 Emotional RAG: Enhancing Role-Playing Agents through Emotional Retrieval Le Huang et.al. 2410.23041 link
2024-10-30 Semantic Enrichment of the Quantum Cascade Laser Properties in Text- A Knowledge Graph Generation Approach Deperias Kerre et.al. 2410.22996 link
2024-10-29 GraphAide: Advanced Graph-Assisted Query and Reasoning System Sumit Purohit et.al. 2411.08041 null
2024-10-28 CRAT: A Multi-Agent Framework for Causality-Enhanced Reflective and Retrieval-Augmented Translation with Large Language Models Meiqi Chen et.al. 2410.21067 null
2024-10-24 LoRANN: Low-Rank Matrix Factorization for Approximate Nearest Neighbor Search Elias Jääsaari et.al. 2410.18926 link
2024-10-23 ConfusedPilot: Confused Deputy Risks in RAG-based LLMs Ayush RoyChowdhury et.al. 2408.04870 null
2024-10-22 Distill-SynthKG: Distilling Knowledge Graph Synthesis Workflow for Improved Coverage and Efficiency Prafulla Kumar Choubey et.al. 2410.16597 null
2024-10-22 DepsRAG: Towards Agentic Reasoning and Planning for Software Dependency Management Mohannad Alhanahnah et.al. 2405.20455 link
2024-10-21 Who’s Who: Large Language Models Meet Knowledge Conflicts in Practice Quang Hieu Pham et.al. 2410.15737 link
2024-10-21 DRIM-ANN: An Approximate Nearest Neighbor Search Engine based on Commercial DRAM-PIMs Mingkai Chen et.al. 2410.15621 null
2024-10-21 Towards a Reliable Offline Personal AI Assistant for Long Duration Spaceflight Oliver Bensch et.al. 2410.16397 null
2024-10-19 “Ghost of the past”: identifying and resolving privacy leakage from LLM’s memory through proactive user interaction Shuning Zhang et.al. 2410.14931 null
2024-10-19 A New Perspective on ADHD Research: Knowledge Graph Construction with LLMs and Network Based Insights Hakan T. Otal et.al. 2409.12853 link
2024-10-17 GeoCoder: Solving Geometry Problems by Generating Modular Code through Vision-Language Models Aditya Sharma et.al. 2410.13510 null
2024-10-11 Enhancing Long Context Performance in LLMs Through Inner Loop Query Mechanism Yimin Tang et.al. 2410.12859 null
2024-10-10 TurboRAG: Accelerating Retrieval-Augmented Generation with Precomputed KV Caches for Chunked Text Songshuo Lu et.al. 2410.07590 link
2024-10-08 Less is More: Making Smaller Language Models Competent Subgraph Retrievers for Multi-hop KGQA Wenyu Huang et.al. 2410.06121 link
2024-10-07 Fast State Restoration in LLM Serving with HCache Shiwei Gao et.al. 2410.05004 null
2024-10-07 Deciphering the Interplay of Parametric and Non-parametric Memory in Retrieval-augmented Language Models Mehrdad Farahani et.al. 2410.05162 link
2024-10-07 PEAR: Position-Embedding-Agnostic Attention Re-weighting Enhances Retrieval-Augmented Generation with Zero Inference Overhead Tao Tan et.al. 2409.19745 null
2024-10-06 Mindful-RAG: A Study of Points of Failure in Retrieval Augmented Generation Garima Agrawal et.al. 2407.12216 null
2024-10-03 Domain-Specific Retrieval-Augmented Generation Using Vector Stores, Knowledge Graphs, and Tensor Factorization Ryan C. Barron et.al. 2410.02721 null
2024-10-01 Quantifying reliance on external information over parametric knowledge during Retrieval Augmented Generation (RAG) using mechanistic analysis Reshmi Ghosh et.al. 2410.00857 null
2024-10-01 AIPatient: Simulating Patients with EHRs and LLM Powered Agentic Workflow Huizi Yu et.al. 2409.18924 null
2024-09-28 Crafting Personalized Agents through Retrieval-Augmented Generation on Editable Memory Graphs Zheng Wang et.al. 2409.19401 null
2024-09-26 KAG: Boosting LLMs in Professional Domains via Knowledge Augmented Generation Lei Liang et.al. 2409.13731 link
2024-09-23 RepoGenReflex: Enhancing Repository-Level Code Completion with Verbal Reinforcement and Retrieval-Augmented Generation Jicheng Wang et.al. 2409.13122 null
2024-09-23 GEM-RAG: Graphical Eigen Memories For Retrieval Augmented Generation Brendan Hogan Rappazzo et.al. 2409.15566 null
2024-09-21 AI Assistants for Spaceflight Procedures: Combining Generative Pre-Trained Transformer and Retrieval-Augmented Generation on Knowledge Graphs With Augmented Reality Cues Oliver Bensch et.al. 2409.14206 null
2024-09-19 Should RAG Chatbots Forget Unimportant Conversations? Exploring Importance and Forgetting with Psychological Insights Ryuichi Sumida et.al. 2409.12524 link
2024-09-18 RAG-Modulo: Solving Sequential Tasks using Experience, Critics, and Language Models Abhinav Jain et.al. 2409.12294 null
2024-09-17 Investigating Context-Faithfulness in Large Language Models: The Roles of Memory Strength and Evidence Style Yuepei Li et.al. 2409.10955 null
2024-09-07 Cross-Data Knowledge Graph Construction for LLM-enabled Educational Question-Answering System: A Case Study at HCMUT Tuan Bui et.al. 2404.09296 null
2024-09-06 Column Vocabulary Association (CVA): semantic interpretation of dataless tables Margherita Martorana et.al. 2409.13709 null
2024-08-27 Writing in the Margins: Better Inference Pattern for Long Context Retrieval Melisa Russak et.al. 2408.14906 link
2024-08-16 CommunityKG-RAG: Leveraging Community Structures in Knowledge Graphs for Advanced Retrieval-Augmented Generation in Fact-Checking Rong-Ching Chang et.al. 2408.08535 null
2024-08-09 HybridRAG: Integrating Knowledge Graphs and Vector Retrieval Augmented Generation for Efficient Information Extraction Bhaskarjit Sarmah et.al. 2408.04948 null
2024-07-18 PRAGyan – Connecting the Dots in Tweets Rahul Ravi et.al. 2407.13909 null
2024-06-24 Relation Extraction with Fine-Tuned Large Language Models in Retrieval Augmented Generation Frameworks Sefika Efeoglu et.al. 2406.14745 null
2024-06-17 TRACE the Evidence: Constructing Knowledge-Grounded Reasoning Chains for Retrieval-Augmented Generation Jinyuan Fang et.al. 2406.11460 link
2024-05-30 GNN-RAG: Graph Neural Retrieval for Large Language Model Reasoning Costas Mavromatis et.al. 2405.20139 link
2024-05-25 Accelerating Inference of Retrieval-Augmented Generation via Sparse Context Selection Yun Zhu et.al. 2405.16178 null
2024-05-20 KG-RAG: Bridging the Gap Between Knowledge and Creativity Diego Sanmartin et.al. 2405.12035 null
2024-05-13 Biomedical knowledge graph-optimized prompt generation for large language models Karthik Soman et.al. 2311.17330 link
2024-05-06 Retrieval-Augmented Generation with Knowledge Graphs for Customer Service Question Answering Zhentao Xu et.al. 2404.17723 null
2024-05-03 Comparative Analysis of Retrieval Systems in the Real World Dmytro Mozolevskyi et.al. 2405.02048 null
2024-05-01 RAG-based Explainable Prediction of Road Users Behaviors for Automated Driving using Knowledge Graphs and Large Language Models Mohamed Manzour Hussien et.al. 2405.00449 null
2024-04-25 RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation Chao Jin et.al. 2404.12457 null
2024-04-19 HyKGE: A Hypothesis Knowledge Graph Enhanced Framework for Accurate and Reliable Medical LLMs Responses Xinke Jiang et.al. 2312.15883 null
2024-04-13 Introducing Super RAGs in Mistral 8x7B-v1 Ayush Thakur et.al. 2404.08940 null
2024-03-13 From human experts to machines: An LLM supported approach to ontology and knowledge graph construction Vamsi Krishna Kommineni et.al. 2403.08345 link
2024-02-22 MeTMaP: Metamorphic Testing for Detecting False Vector Matching Problems in LLM Augmented Generation Guanyu Wang et.al. 2402.14480 null
2024-02-10 REALM: RAG-Driven Enhancement of Multimodal Electronic Health Records Analysis via Large Language Models Yinghao Zhu et.al. 2402.07016 null
2021-04-17 Zero-shot Slot Filling with DPR and RAG Michael Glass et.al. 2104.08610 link