    Repositories list

    • Configuration for generating SDKs and Documentation.
      MDX
      4307 Updated Oct 7, 2024
    • Homebrew Tap of OctoML products and tools.
      Ruby
      Apache License 2.0
      0000 Updated Sep 26, 2024
    • EAGLE

      Public
      OctoML Implementation of EAGLE-1 and EAGLE-2
      Python
      Apache License 2.0
      91100 Updated Sep 12, 2024
    • A collection of reference solutions built on top of OctoAI SaaS
      Python
      MIT License
      0000 Updated Sep 11, 2024
    • Simple getting-started code examples for LLM applications powered by OctoAI
      Python
      MIT License
      164310 Updated Sep 10, 2024
    • mlc-llm

      Public
      Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
      Python
      Apache License 2.0
      1.6k5121 Updated Sep 10, 2024
    • FlashInfer: Kernel Library for LLM Serving
      Cuda
      Apache License 2.0
      162200 Updated Sep 9, 2024
    • Jupyter Notebook
      0000 Updated Sep 8, 2024
    • Multicloud Asset Code Review Public Repo example.
      Python
      MIT License
      0001 Updated Sep 5, 2024
    • Examples and recipes for Llama 2 model
      Jupyter Notebook
      2.3k100 Updated Sep 3, 2024
    • Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
      HTML
      Apache License 2.0
      806001 Updated Aug 21, 2024
    • msi-fe

      Public
      Python
      0000 Updated Aug 14, 2024
    • A framework for few-shot evaluation of autoregressive language models.
      Python
      MIT License
      2k000 Updated Aug 12, 2024
    • .github

      Public
      0101 Updated Aug 2, 2024
    • Python
      0000 Updated Jul 31, 2024
    • RULER

      Public
      This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
      Python
      Apache License 2.0
      53000 Updated Jul 25, 2024
    • TypeScript
      0000 Updated Jun 21, 2024
    • langchain

      Public
      ⚡ Building applications with LLMs through composability ⚡
      Python
      MIT License
      16k000 Updated May 17, 2024
    • Custom dyld version inherited from original Apple dyld implementation
      C++
      Other
      11200 Updated Apr 27, 2024
    • TypeScript
      0000 Updated Mar 8, 2024
    • A collection of OctoAI-based demos.
      TypeScript
      0511 Updated Mar 5, 2024
    • TFLint ruleset for terraform-provider-google
      Go
      Mozilla Public License 2.0
      19000 Updated Feb 23, 2024
    • Authentication server for Docker Registry 2
      Go
      Apache License 2.0
      305000 Updated Feb 5, 2024
    • go-jose

      Public
      An implementation of JOSE standards (JWE, JWS, JWT) in Go
      Go
      Apache License 2.0
      79000 Updated Feb 5, 2024
    • Pinecone + Vercel RAG application, showcasing a comparison between chat with no context and using a Pinecone index for context
      HTML
      21000 Updated Jan 25, 2024
    • A set of models you can build and deploy on OctoAI
      Python
      MIT License
      0000 Updated Jan 19, 2024
    • pre-commit hook which runs kustomize docker image (use with https://github.com/pre-commit/pre-commit)
      Dockerfile
      18100 Updated Jan 4, 2024
    • TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
      C++
      Apache License 2.0
      1k000 Updated Jan 3, 2024
    • go-oidc

      Public
      A Go OpenID Connect client.
      Go
      Apache License 2.0
      400000 Updated Dec 27, 2023
    • A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      5k000 Updated Dec 14, 2023