Skip to content

Simple rag implementation for any WordPress blog. Leverages the bootstrap, python, milvus vector dB, and configurable options for LLM providers (Open AI, Anthropic) and embeddings.

License

Notifications You must be signed in to change notification settings

speechwrecko/wordpress-rag

Repository files navigation

Basic RAG implementation for Wordpress blogs

A basic implementation of RAG for any worpress blog. Leverages the Milvus vector DB. Options for either using OpenAI or Anthropic as your LLM and Sentence BERT for embeddings.

On first run all blog posts will be scrapped. Embeddings are based on sentence parsing of each post.

alt text

Running Locally

Make sure you have Python installed

$ git clone https://github.com/speechwrecko/wordpress-rag.git # or clone your own fork
$ cd wordpress-rag
$ pip install -r requirements.txt
$ python app.py

Your app should now be running on localhost:5000.

Required third party services

This project requires API access to several 3rd party services:

note: the current scraper assumes you are using the Diffbot free version which only allows for 5 API calls per minute. If you have a paid account you can remove the 12 second sleep that exists in the code to rate limit calls to Diffbot.

Credits

Initial basis for flask / bootstrap implementation

Peter Simeth's basic flask pretty youtube downloader (v1.3) https://github.com/petersimeth/basic-flask-template © MIT licensed, 2018-2023

About

Simple rag implementation for any WordPress blog. Leverages the bootstrap, python, milvus vector dB, and configurable options for LLM providers (Open AI, Anthropic) and embeddings.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published