Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting

Lag-Llama is the first open-source foundation model for time series forecasting!

[Tweet Thread]

[Model Weights] [Colab Demo 1: Zero-Shot Forecasting] [Colab Demo 2: (Preliminary Finetuning)]

[Paper]

[Video]

Updates:

16-Apr-2024: Released pretraining and finetuning scripts to replicate the experiments in the paper. See Reproducing Experiments in the Paper for details.
9-Apr-2024: We have released a 15-minute video 🎥 on Lag-Llama on YouTube.
5-Apr-2024: Added a section in Colab Demo 1 on the importance of tuning the context length for zero-shot forecasting. Added a best practices section in the README; added recommendations for finetuning. These recommendations will be demonstrated with an example in Colab Demo 2 soon.
4-Apr-2024: We have updated our requirements file with new versions of certain packages. Please update/recreate your environments if you have previously used the code locally.
7-Mar-2024: We have released a preliminary Colab Demo 2 for finetuning. Please note this is a preliminary tutorial. We recommend taking a look at the best practices if you are finetuning the model or using it for benchmarking.
17-Feb-2024: We have released a new updated Colab Demo 1 for zero-shot forecasting that shows how one can load time series of different formats.
7-Feb-2024: We released Lag-Llama, with open-source model checkpoints and a Colab Demo for zero-shot forecasting.

Current Features:

💫 Zero-shot forecasting on a dataset of any frequency for any prediction length, using Colab Demo 1.

💫 Finetuning on a dataset using Colab Demo 2.

💫 Reproducing experiments in the paper using the released scripts. See Reproducing Experiments in the Paper for details.

Note: Please see the best practices section when using the model for zero-shot prediction and finetuning.

Reproducing Experiments in the Paper

To replicate the pretraining setup used in the paper, please see the pretraining script. Once a model is pretrained, instructions to finetune it with the setup in the paper can be found in the finetuning script.

Best Practices

Here are some general tips in using Lag-Llama.

General Information

Lag-Llama is a probabilistic forecasting model trained to output a probability distribution for each timestep to be predicted. For your own specific use-case, we would recommend benchmarking the zero-shot performance of the model on your data first, and then finetuning if necessary. As we show in our paper, Lag-Llama has strong zero-shot capabilities, but performs best when finetuned. The more data you finetune on, the better. For specific tips on applying on model zero-shot or on finetuning, please refer to the sections below.

Zero-Shot Forecasting

Importantly, we recommend trying different context lengths (starting from $32$ which it was trained on) and identifying what works best for your data. As we show in this section of the zero-shot forecasting demo, the model's zero-shot performance improves as the context length is increased, until a certain context length which may be specific to your data. Further, we recommend enabling RoPE scaling for the model to work well with context lengths larger than what it was trained on.

Fine-Tuning

If you are trying to benchmark the performance of the model under finetuning, or trying to obtain maximum performance from the model:

We recommend tuning two important hyperparameters for each dataset that you finetune on: the context length (suggested values: $32$, $64$, $128$, $256$, $512$, $1024$) and the learning rate (suggested values: $10^{-2}$, $5 * 10^{-3}$, $10^{-3}$, $5 * 10^{-3}$, $1 * 10^{-4}$, $5 * 10^{-4}$).
We also highly recommend using a validation split of your dataset to early stop your model, with an early stopping patience of 50 epochs.

Contact

We are dedicated to ensuring the reproducility of our results, and would be happy to help clarify questions about benchmarking our model or about the experiments in the paper. The quickest way to reach us would be by email. Please email both:

Arjun Ashok - arjun [dot] ashok [at] servicenow [dot] com
Kashif Rasul - kashif [dot] rasul [at] gmail [dot] com

If you have questions about the model usage (or) code (or) have specific errors (eg. using it with your own dataset), it would be best to create an issue in the GitHub repository.

Citing this work

Please use the following Bibtex entry to cite Lag-Llama.

@misc{rasul2024lagllama,
      title={Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting}, 
      author={Kashif Rasul and Arjun Ashok and Andrew Robert Williams and Hena Ghonia and Rishika Bhagwatkar and Arian Khorasani and Mohammad Javad Darvishi Bayazi and George Adamopoulos and Roland Riachi and Nadhir Hassen and Marin Biloš and Sahil Garg and Anderson Schneider and Nicolas Chapados and Alexandre Drouin and Valentina Zantedeschi and Yuriy Nevmyvaka and Irina Rish},
      year={2024},
      eprint={2310.08278},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}

Name		Name	Last commit message	Last commit date
Latest commit History 61 Commits
configs		configs
data		data
gluon_utils		gluon_utils
images		images
lag_llama		lag_llama
scripts		scripts
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
VERSION		VERSION
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
run.py		run.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting

Reproducing Experiments in the Paper

Best Practices

General Information

Zero-Shot Forecasting

Fine-Tuning

Contact

Citing this work

About

Releases

Packages

Languages

License

AgnostiqHQ/lag-llama

Folders and files

Latest commit

History

Repository files navigation

Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting

Reproducing Experiments in the Paper

Best Practices

General Information

Zero-Shot Forecasting

Fine-Tuning

Contact

Citing this work

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages