Skip to content

Document Retrieval System with Hybrid Embeddings using LangChain, OpenAI embeddings, FastEmbedSparse, ChatGroq.

Notifications You must be signed in to change notification settings

faizanbhatti/rag_using_qdrant_and_chatgroq_langchain

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Document Retrieval System with Hybrid Embeddings

Overview

This project showcases a powerful document retrieval system leveraging hybrid embeddings. By combining dense and sparse embeddings, the system provides efficient and accurate search capabilities across a large collection of documents. The primary technologies used include LangChain, Qdrant, and Groq, among others.

Features

  • Hybrid Retrieval Mode: Utilizes both dense and sparse embeddings for comprehensive search results.

  • Document Processing: Efficiently loads, preprocesses, and splits documents into manageable chunks.

  • Contextual Question Answering: Integrates a chat model for generating concise and relevant answers based on retrieved documents.

  • Flexible Deployment: Easily adaptable to different datasets and deployment environments.

Technologies Used

  • LangChain: Framework for building chainable NLP applications.

  • Qdrant: Vector database for managing and querying embeddings.

  • Groq: Model serving and deployment platform.

  • OpenAI Embeddings: Dense embedding generation.

  • FastEmbedSparse: Sparse embedding generation for hybrid retrieval.

  • PyPDFLoader: Document loader for PDF files.

  • RecursiveCharacterTextSplitter: Utility for splitting text into chunks.

Contributing

Feel free to fork this repository and contribute by submitting pull requests. For major changes, please open an issue first to discuss what you would like to change.

Acknowledgments

Special thanks to the open-source community and contributors who make projects like this possible.

About

Document Retrieval System with Hybrid Embeddings using LangChain, OpenAI embeddings, FastEmbedSparse, ChatGroq.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages