Skip to content
Change the repository type filter

All

    Repositories list

    • masader

      Public
      The largest public catalogue for Arabic NLP and speech datasets. There are +500 datasets annotated with more than 25 attributes.
      JavaScript
      GNU General Public License v3.0
      2515331Updated Dec 26, 2024Dec 26, 2024
    • Python
      0000Updated Dec 26, 2024Dec 26, 2024
    • Python
      0010Updated Dec 26, 2024Dec 26, 2024
    • Python
      MIT License
      5521Updated Dec 21, 2024Dec 21, 2024
    • Calliar

      Public
      A dataset for online Arabic calligraphy. A collection of 2500 annotated calligraphic styles.
      Jupyter Notebook
      MIT License
      1614620Updated Jun 24, 2024Jun 24, 2024
    • dar

      Public
      A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub.
      Python
      Apache License 2.0
      11110Updated Jun 23, 2024Jun 23, 2024
    • HTML
      2010Updated May 10, 2024May 10, 2024
    • .github

      Public
      1100Updated Apr 13, 2024Apr 13, 2024
    • CIDAR-v2

      Public
      Jupyter Notebook
      1520Updated Mar 30, 2024Mar 30, 2024
    • Python
      1110Updated Mar 3, 2024Mar 3, 2024
    • ARBML

      Public
      Implementation of many Arabic NLP and CV projects. Providing real time experience using many interfaces like web, command line and notebooks.
      JavaScript
      MIT License
      45395100Updated Mar 1, 2024Mar 1, 2024
    • CIDAR

      Public
      Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions.
      Jupyter Notebook
      33300Updated Feb 22, 2024Feb 22, 2024
    • Taqyim

      Public
      Python intefrace for evaluation on chatgpt models
      Jupyter Notebook
      MIT License
      31910Updated Feb 13, 2024Feb 13, 2024
    • evals

      Public
      Evals is a framework for evaluating OpenAI models and an open-source registry of benchmarks.
      Jupyter Notebook
      MIT License
      2.6k200Updated Feb 13, 2024Feb 13, 2024
    • nmatheg

      Public
      A simple strategy for training and finetuning NLP models for Arabic. Specify the parameters and just wait for the results. A simple design that makes use of the different tools in our NLP pipeline.
      Jupyter Notebook
      52110Updated Jan 27, 2024Jan 27, 2024
    • tkseem

      Public
      Arabic Tokenization Library. It provides many tokenization algorithms.
      Jupyter Notebook
      MIT License
      189542Updated Jan 4, 2024Jan 4, 2024
    • klaam

      Public
      Arabic speech recognition, classification and text-to-speech.
      Jupyter Notebook
      MIT License
      74376120Updated Sep 30, 2023Sep 30, 2023
    • tnkeeh

      Public
      Arabic cleaning, normalization and segmentation library.
      Python
      MIT License
      96220Updated Sep 28, 2023Sep 28, 2023
    • mat-bpe

      Public
      Jupyter Notebook
      0000Updated Aug 6, 2023Aug 6, 2023
    • Ashaar

      Public
      Arabic poetry analysis and generation.
      Jupyter Notebook
      22100Updated Jul 23, 2023Jul 23, 2023
    • Jupyter Notebook
      0200Updated Jun 4, 2023Jun 4, 2023
    • Python
      Apache License 2.0
      0200Updated Apr 3, 2023Apr 3, 2023
    • atmatah

      Public
      a repository containing scripts to automate processes, for instance configuring web-apps on remote machines
      Jinja
      MIT License
      0040Updated Jan 25, 2023Jan 25, 2023
    • qawafi

      Public
      Platform for Arabic Poetry Analysis using knowledge-based and deep learning approaches.
      Jupyter Notebook
      MIT License
      72740Updated Jan 3, 2023Jan 3, 2023
    • 0000Updated Dec 31, 2022Dec 31, 2022
    • whisperar

      Public
      Python
      34110Updated Dec 25, 2022Dec 25, 2022
    • Bohour

      Public
      Bohour, a package that abstracts arabic poetry science, Aroud
      Python
      MIT License
      0120Updated Dec 2, 2022Dec 2, 2022
    • adawat

      Public
      Jupyter Notebook
      GNU General Public License v3.0
      0300Updated Nov 17, 2022Nov 17, 2022
    • rasm

      Public
      Arabic Art using GANs
      Python
      Other
      31700Updated Aug 3, 2022Aug 3, 2022
    • bayanat

      Public
      Explore the content of Arabic text datasets.
      Jupyter Notebook
      MIT License
      31620Updated May 23, 2022May 23, 2022