This project showcases a document retrieval system built on hybrid embeddings. By combining dense (semantic) and sparse (lexical) embeddings, the system provides efficient, accurate search across a large collection of documents. The primary technologies used include LangChain, Qdrant, and Groq, among others.
- Hybrid Retrieval Mode: Utilizes both dense and sparse embeddings for comprehensive search results.
- Document Processing: Efficiently loads, preprocesses, and splits documents into manageable chunks.
- Contextual Question Answering: Integrates a chat model for generating concise and relevant answers based on retrieved documents.
- Flexible Deployment: Easily adaptable to different datasets and deployment environments.
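Under the hood, hybrid retrieval runs the query against both a dense and a sparse index and fuses the two ranked result lists. Qdrant and LangChain handle this internally; the sketch below illustrates one common fusion strategy, Reciprocal Rank Fusion (RRF), using hypothetical document IDs. It is a conceptual illustration, not the library's actual code.

```python
def rrf_fuse(dense_ranked, sparse_ranked, k=60):
    """Reciprocal Rank Fusion: merge two ranked lists of doc IDs.

    Each document scores 1 / (k + rank) per list it appears in,
    so documents ranked well by BOTH retrievers rise to the top.
    k=60 is the constant suggested in the original RRF paper.
    """
    scores = {}
    for ranked in (dense_ranked, sparse_ranked):
        for rank, doc_id in enumerate(ranked, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # Highest fused score first
    return sorted(scores, key=scores.get, reverse=True)

# Hypothetical result lists from a dense and a sparse index
dense = ["doc3", "doc1", "doc7"]
sparse = ["doc1", "doc9", "doc3"]
print(rrf_fuse(dense, sparse))
```

Note how `doc1` wins even though neither retriever ranked it first: it appears near the top of both lists, which is exactly the signal hybrid retrieval is designed to exploit.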
- LangChain: Framework for building chainable NLP applications.
- Qdrant: Vector database for managing and querying embeddings.
- Groq: High-speed LLM inference platform used to serve the chat model.
- OpenAI Embeddings: Dense embedding generation.
- FastEmbedSparse: Sparse embedding generation for hybrid retrieval.
- PyPDFLoader: Document loader for PDF files.
- RecursiveCharacterTextSplitter: Utility for splitting text into chunks.
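To give a feel for what the splitting step does, here is a minimal pure-Python sketch of recursive character splitting: try the coarsest separator first (paragraphs, then lines, then words), recurse into any piece that is still too large, and greedily merge small neighbors back together up to the chunk size. The function name `recursive_split` and its defaults are illustrative; this is not the library's actual implementation, which additionally supports chunk overlap and length functions.

```python
def recursive_split(text, chunk_size=80, separators=("\n\n", "\n", " ")):
    """Illustrative sketch of recursive character splitting:
    split on the coarsest separator, recurse into oversized
    pieces, then merge adjacent pieces up to chunk_size."""
    if len(text) <= chunk_size:
        return [text] if text else []
    if not separators:
        # No separator left: hard cut at chunk_size
        return [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)]
    sep, rest = separators[0], separators[1:]
    pieces = []
    for part in text.split(sep):
        if len(part) <= chunk_size:
            pieces.append(part)
        else:
            pieces.extend(recursive_split(part, chunk_size, rest))
    # Greedily merge adjacent pieces so chunks stay near chunk_size
    chunks, current = [], ""
    for piece in pieces:
        candidate = (current + sep + piece) if current else piece
        if len(candidate) <= chunk_size:
            current = candidate
        else:
            if current:
                chunks.append(current)
            current = piece
    if current:
        chunks.append(current)
    return chunks

doc = ("First paragraph about hybrid retrieval.\n\n"
       "Second paragraph, long enough that it would be "
       "split further on smaller separators if it exceeded the limit.")
for chunk in recursive_split(doc):
    print(repr(chunk))
```

Keeping splits aligned with paragraph and sentence boundaries, rather than cutting at a fixed byte offset, is what keeps each chunk semantically coherent enough to embed and retrieve on its own.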
Feel free to fork this repository and contribute by submitting pull requests. For major changes, please open an issue first to discuss what you would like to change.
Special thanks to the open-source community and contributors who make projects like this possible.