GitHub - keithmcnulty/genai_londonny_pa_meetup_jul2024: Materials for London-NY People Analytics Meetup July 2024

Materials for London-NY People Analytics Meetup July 2024

These materials allow you to test and build a simple Generative AI application from scratch using a Retrieval Augmented Generation (RAG) architecture. RAG architectures allow greater injection of useful information and context into the prompt sent to a language model.

The project converts a large set of comments from readers of the NY Times into vector embeddings and stores them in a local ChromaDB vector database. This is then used to provide context to a questions asked about the opinions of NY Times readers on specific issues.

Prerequisites

To fully complete this build you will need:

A high specification computer, ideally with high RAM and high CPU, and preferably with GPU.
Access to the OpenAI API
ollama installed on your machines (see https://ollama.com)
API access to Kaggle, with the credentials stored in the kaggle.json inside the root directory of this project
All environment variables stored in a .env file in the root of this project.

Steps

Use requirements.txt to set up your Python environment.
Create the vector database and test it using the scripts located in chromadb_prep
Test various versions of the RAG architecture with different language models using the notebooks in jupyter_test_rag_pipeline
Launch the streamlit app using streamlit run app/app.py from project root in the terminal

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
app		app
chromadb_prep		chromadb_prep
jupyter_test_rag_pipeline		jupyter_test_rag_pipeline
.DS_Store		.DS_Store
.gitignore		.gitignore
LICENSE		LICENSE
London Meetup GenAI 2024.pptx		London Meetup GenAI 2024.pptx
README.md		README.md
rag_architecture.jpg		rag_architecture.jpg
requirements.txt		requirements.txt
~$London Meetup GenAI 2024.pptx		~$London Meetup GenAI 2024.pptx

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Materials for London-NY People Analytics Meetup July 2024

Prerequisites

Steps

About

Releases

Packages

Languages

License

keithmcnulty/genai_londonny_pa_meetup_jul2024

Folders and files

Latest commit

History

Repository files navigation

Materials for London-NY People Analytics Meetup July 2024

Prerequisites

Steps

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages