data_helper

Code for the blog post: Democratize Data Access with RAGs

Set up

We will use LlamaIndex to build our RAG pipeline. The concepts shown here apply to RAG pipelines in general.

GitHub Repo: Data Helper
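To make the flow concrete, below is a minimal index-then-query sketch with LlamaIndex. This is not the repo's actual code: the imports assume a recent llama-index release (the llama_index.core namespace), and the ./index_storage directory name is an illustrative assumption.

# Minimal LlamaIndex sketch: build an index from ./data, persist it, then query it.
from llama_index.core import (
    SimpleDirectoryReader,
    StorageContext,
    VectorStoreIndex,
    load_index_from_storage,
)

# INDEX step: read the files under ./data and persist a vector index to disk.
documents = SimpleDirectoryReader("./data").load_data()
index = VectorStoreIndex.from_documents(documents)
index.storage_context.persist(persist_dir="./index_storage")  # assumed directory name

# QUERY step: reload the persisted index and ask a natural-language question.
# The OpenAI API key is read from the OPENAI_API_KEY environment variable.
storage_context = StorageContext.from_defaults(persist_dir="./index_storage")
index = load_index_from_storage(storage_context)
query_engine = index.as_query_engine()
response = query_engine.query("show me for each buyer what date they made their first purchase")
print(response)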

Prerequisites

  1. Python 3.10+
  2. git
  3. OpenAI API key
  4. Poetry

Demo

We will clone the repo, install dependencies, and activate the Poetry shell as shown below:

git clone https://github.com/josephmachado/data_helper.git
cd data_helper
poetry install
poetry shell # activate the virtual env

# To run the code, set your OpenAI API key as shown below
export OPENAI_API_KEY=your-key-here
python run_code.py INDEX # Create an index with data from ./data folder
python run_code.py QUERY --query "show me for each buyers what date they made their first purchase"
# The above command uses the already existing index to make a request to LLM API to get results
# The code will return a SQL query in DuckDB syntax

python run_code.py QUERY --query "for every seller, show me a monthly report of the number of unique products that they sold, avg cost per product, max/min value of product purchased that month"
# The code will return a SQL query in DuckDB syntax
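For reference, here is a hypothetical sketch of how a small CLI like run_code.py could dispatch the INDEX and QUERY actions used above. The action and --query arguments mirror the commands above, but the helper functions (create_index, run_query) are placeholders, not the repo's implementation.

# Hypothetical CLI dispatcher for the INDEX / QUERY actions (placeholder internals).
import argparse


def create_index(data_dir: str = "./data") -> None:
    """Placeholder: build and persist an index from the files in data_dir."""
    raise NotImplementedError


def run_query(query: str) -> str:
    """Placeholder: query the persisted index and return the generated DuckDB SQL."""
    raise NotImplementedError


def main() -> None:
    parser = argparse.ArgumentParser(
        description="Generate DuckDB SQL from natural-language questions."
    )
    parser.add_argument("action", choices=["INDEX", "QUERY"])
    parser.add_argument("--query", help="natural-language question to turn into SQL")
    args = parser.parse_args()

    if args.action == "INDEX":
        create_index()
    elif not args.query:
        parser.error("--query is required with QUERY")
    else:
        print(run_query(args.query))


if __name__ == "__main__":
    main()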

Next Steps

  1. Evaluate results and tune the pipeline
  2. Add an observability system
  3. Monitor API costs
  4. Add additional documentation as input
  5. Explore other use cases, such as RAG for onboarding, a DE training tool, etc.

Further reading

  1. Production RAG tips
  2. Advanced RAG tuning
  3. What is a data warehouse
  4. Conceptual data model

References

  1. LlamaIndex docs

About

Code to help stakeholders generate SQL. Companion code for the blog post at https://www.startdataengineering.com/post/data-democratize-llm/
