HeshamHaroon's picture
Initial release: Arabic Function Calling Leaderboard
566d03e verified
|
raw
history blame
1.49 kB
metadata
title: Arabic Function Calling Leaderboard
emoji: ๐Ÿ†
colorFrom: green
colorTo: blue
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: true
license: apache-2.0
tags:
  - arabic
  - function-calling
  - leaderboard
  - llm-evaluation

๐Ÿ† Arabic Function Calling Leaderboard

ู„ูˆุญุฉ ุชู‚ูŠูŠู… ุงุณุชุฏุนุงุก ุงู„ุฏูˆุงู„ ุจุงู„ุนุฑุจูŠุฉ

Overview

The Arabic Function Calling Leaderboard (AFCL) evaluates Large Language Models on their ability to:

  1. Understand Arabic queries (MSA + Dialects)
  2. Select appropriate functions from available options
  3. Extract correct arguments from Arabic text
  4. Handle parallel and complex function calls
  5. Detect when no function should be called

Dataset

The benchmark includes 1,470+ samples across 10 categories:

  • Simple, Multiple, Parallel, Parallel Multiple
  • Irrelevance Detection
  • Dialect Handling (Egyptian, Gulf, Levantine)
  • Programming APIs (Java, JavaScript, REST, SQL)

๐Ÿ“Š Dataset: HeshamHaroon/Arabic_Function_Calling

Submit Your Model

To submit your model for evaluation:

  1. Go to the "Submit" tab
  2. Fill in your model details
  3. Your model will be added to the evaluation queue

Citation

@misc{afcl2024,
    title={Arabic Function Calling Leaderboard},
    author={Hesham Haroon},
    year={2024},
    url={https://huggingface.co/spaces/HeshamHaroon/Arabic-Function-Calling-Leaderboard}
}