HeshamHaroon

Initial release: Arabic Function Calling Leaderboard

566d03e verified 18 days ago

metadata

title: Arabic Function Calling Leaderboard
emoji: 🏆
colorFrom: green
colorTo: blue
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: true
license: apache-2.0
tags:
  - arabic
  - function-calling
  - leaderboard
  - llm-evaluation

🏆 Arabic Function Calling Leaderboard

لوحة تقييم استدعاء الدوال بالعربية (Arabic Function Calling Leaderboard)

Overview

The Arabic Function Calling Leaderboard (AFCL) evaluates Large Language Models on their ability to:

Understand Arabic queries in Modern Standard Arabic (MSA) and regional dialects
Select appropriate functions from available options
Extract correct arguments from Arabic text
Handle parallel and complex function calls
Detect when no function should be called
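
To make these abilities concrete, here is a minimal sketch of how a single model response might be checked against an expected call. The sample layout, the `get_weather` and `get_time` functions, and the `calls_match` helper are hypothetical illustrations. They are not taken from the AFCL dataset or its evaluation code.

```python
import json

# Hypothetical sample; the field names are illustrative, not the AFCL schema.
sample = {
    # "What is the weather in Cairo today?"
    "query": "ما حالة الطقس في القاهرة اليوم؟",
    "functions": [
        {"name": "get_weather", "parameters": {"city": "string"}},
        {"name": "get_time", "parameters": {"timezone": "string"}},
    ],
    "expected": [{"name": "get_weather", "arguments": {"city": "القاهرة"}}],
}


def normalize(calls):
    """Serialize each call canonically so that call order does not matter."""
    return sorted(json.dumps(c, sort_keys=True, ensure_ascii=False) for c in calls)


def calls_match(predicted, expected):
    """Return True when the predicted calls equal the expected ones.

    Both arguments are lists of {"name": ..., "arguments": {...}} dicts.
    An empty expected list models the "no function should be called" case.
    """
    return normalize(predicted) == normalize(expected)


prediction = [{"name": "get_weather", "arguments": {"city": "القاهرة"}}]
print(calls_match(prediction, sample["expected"]))  # True
print(calls_match([], sample["expected"]))          # False: a call was required
```

Comparing sorted, canonically serialized calls is one simple way to treat parallel calls as an unordered set. A real evaluator may well be more lenient, for example about argument types or spelling variants.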

Dataset

The benchmark includes 1,470+ samples across 10 categories:

Simple, Multiple, Parallel, Parallel Multiple
Irrelevance Detection
Dialect Handling (Egyptian, Gulf, Levantine)
Programming APIs (Java, JavaScript, REST, SQL)
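
Because the samples are grouped by category, results can be reported per category as well as overall. The sketch below assumes a hypothetical list of `(category, passed)` records. This README does not describe how the leaderboard aggregates scores, so the unweighted mean shown here is only one plausible choice.

```python
from collections import defaultdict


def per_category_accuracy(results):
    """Map each category to the fraction of its samples that passed.

    `results` is an iterable of (category, passed) pairs; this layout is
    an assumption made for illustration.
    """
    totals = defaultdict(int)
    passed = defaultdict(int)
    for category, ok in results:
        totals[category] += 1
        passed[category] += int(ok)
    return {c: passed[c] / totals[c] for c in totals}


def macro_average(accuracies):
    """Unweighted mean over categories, so small categories count equally."""
    return sum(accuracies.values()) / len(accuracies) if accuracies else 0.0


# Made-up outcomes, only to exercise the functions above.
results = [
    ("simple", True), ("simple", True), ("simple", False), ("simple", True),
    ("irrelevance", True), ("irrelevance", False),
]
scores = per_category_accuracy(results)
print(scores)                 # {'simple': 0.75, 'irrelevance': 0.5}
print(macro_average(scores))  # 0.625
```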

📊 Dataset: HeshamHaroon/Arabic_Function_Calling

Submit Your Model

To submit your model for evaluation:

Go to the "Submit" tab
Fill in your model details
Your model will be added to the evaluation queue

Citation

@misc{afcl2024,
    title={Arabic Function Calling Leaderboard},
    author={Hesham Haroon},
    year={2024},
    url={https://huggingface.co/spaces/HeshamHaroon/Arabic-Function-Calling-Leaderboard}
}