RAG

Advanced RAG Techniques: an Illustrated Overview

Advanced RAG Techniques: an Illustrated Overview

0
A comprehensive study of the advanced retrieval augmented generation techniques and algorithms, systemising various approaches. The article comes with a collection of links in my knowledge base referencing various implementations and studies mentioned. Link
Auto-Generated Agent Chat: Revolutionizing Group Conversations with RAG

Auto-Generated Agent Chat: Revolutionizing Group Conversations with RAG

0
In the ever-evolving landscape of artificial intelligence (AI) and natural language processing (NLP), researchers and developers continue to push the boundaries of what’s possible. One such groundbreaking development is the Auto Generated Agent Chat, a cutting-edge system that employs Retrieval Augmented Generation (RAG) to transform group conversations. This technology combines the strengths of both retrieval-based and generative models, offering a unique and efficient solution for enhancing communication in group settings.
Beyond Basic Chatbots: How Semantic Router is Changing the Game

Beyond Basic Chatbots: How Semantic Router is Changing the Game

0
A Semantic Router is an advanced layer in the realm of chatbots and natural language processing. Think of it as a fuzzy yet deterministic interface layered over your chatbots or any system that processes natural language. Its primary function? To serve as a super-fast decision-making layer for Large Language Models (LLMs). Link
Introducing Query Pipelines

Introducing Query Pipelines

0
Today we introduce Query Pipelines, a new declarative API within LlamaIndex that allows you to concisely orchestrate simple-to-advanced query workflows over your data for different use cases (RAG, structured data extraction, and more). Link Usage Pattern

LangChain cookbook

0
Example code for building applications with LangChain, with an emphasis on more applied and end-to-end examples than contained in the main documentation. Link
Optimizing RAG Systems with LlamaIndex: Strategies for Production Performance

Optimizing RAG Systems with LlamaIndex: Strategies for Production Performance

0
Prototyping a Retrieval-Augmented Generation (RAG) application is relatively straightforward, but the challenge lies in optimizing it for performance, robustness, and scalability across vast knowledge repositories. This guide aims to provide insights, strategies, and implementations leveraging LlamaIndex to enhance the efficiency of your RAG pipeline, catering to complex datasets and ensuring accurate query responses without hallucinations. Link
RAG Pipeline with Query Rewriting

RAG Pipeline with Query Rewriting

0
Here we chain together a full RAG pipeline consisting of query rewriting, retrieval, reranking, and response synthesis. Here we can’t use chain syntax because certain modules depend on multiple inputs (for instance, response synthesis expects both the retrieved nodes and the original question). Instead we’ll construct a DAG explicitly, through add_modules and then add_link. Link