diff --git a/docs/source/index.rst b/docs/source/index.rst
index b8c72f8ee2..4386bf3840 100644
--- a/docs/source/index.rst
+++ b/docs/source/index.rst
@@ -60,13 +60,12 @@ Topics in this section will help you get started with ExecuTorch.
         ExecuTorch.
 
     .. grid-item-card:: :octicon:`file-code;1em`
-        ExecuTorch Intermediate Representation API
+        ExecuTorch Llama
         :img-top: _static/img/card-background.svg
-        :link: ir-exir.html
+        :link: llm/llama.html
         :link-type: url
 
-        Learn about EXIR, a graph-based intermediate
-        representation (IR) of PyTorch programs.
+        Learn about running Llama models via ExecuTorch
 
 .. toctree::
    :glob:
@@ -122,10 +121,11 @@ Topics in this section will help you get started with ExecuTorch.
    :caption: Working with LLMs
    :hidden:
 
-   llm/getting-started
-   llm/llama-demo-android
-   llm/build-run-llama3-qualcomm-ai-engine-direct-backend
-   llm/llama-demo-ios
+   Llama
+   Llama on Android
+   Llama on iOS
+   Llama on Android via Qualcomm backend
+   Intro to LLMs in ExecuTorch
 
 .. toctree::
    :glob:
diff --git a/docs/source/llm/getting-started.md b/docs/source/llm/getting-started.md
index cca62edaf8..05a3639e1b 100644
--- a/docs/source/llm/getting-started.md
+++ b/docs/source/llm/getting-started.md
@@ -1,4 +1,4 @@
-# Getting Started with LLMs via ExecuTorch
+# Intro to LLMs in ExecuTorch
 
 Welcome to LLM Manual! This manual is designed to provide a practical example to leverage
 ExecuTorch in onboarding your own Large Language Models (LLMs). Our primary goal is to offer
@@ -13,6 +13,8 @@ We encourage users to use this project as a starting point and adapt it to their
 which includes creating your own versions of the tokenizer, sampler, acceleration backends, and
 other components. We hope this project serves as a useful guide in your journey with LLMs and ExecuTorch.
 
+For deploying Llama with optimal performance, please see [Llama guide](./llama.md).
+
 ### Table Of Contents
 
 
diff --git a/docs/source/llm/llama.md b/docs/source/llm/llama.md
new file mode 100644
index 0000000000..2d266ba7ae
--- /dev/null
+++ b/docs/source/llm/llama.md
@@ -0,0 +1,5 @@
+# Llama on ExecuTorch
+
+See
+[Llama readme](https://github.com/pytorch/executorch/blob/main/examples/models/llama2/README.md)
+for detailed information about running Llama on ExecuTorch.