---
title: Arabic Function Calling Leaderboard
emoji: 🏆
colorFrom: green
colorTo: blue
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: true
license: apache-2.0
tags:
  - arabic
  - function-calling
  - leaderboard
  - llm-evaluation
---

# 🏆 Arabic Function Calling Leaderboard

لوحة تقييم استدعاء الدوال بالعربية

## Overview

The **Arabic Function Calling Leaderboard (AFCL)** evaluates Large Language Models on their ability to:

1. Understand Arabic queries (MSA + Dialects)
2. Select appropriate functions from available options
3. Extract correct arguments from Arabic text
4. Handle parallel and complex function calls
5. Detect when no function should be called

## Dataset

The benchmark includes **1,470+ samples** across 10 categories:
- Simple, Multiple, Parallel, Parallel Multiple
- Irrelevance Detection
- Dialect Handling (Egyptian, Gulf, Levantine)
- Programming APIs (Java, JavaScript, REST, SQL)

📊 **Dataset**: [HeshamHaroon/Arabic_Function_Calling](https://huggingface.co/datasets/HeshamHaroon/Arabic_Function_Calling)

## Submit Your Model

To submit your model for evaluation:
1. Go to the "Submit" tab
2. Fill in your model details
3. Your model will be added to the evaluation queue

## Citation

```bibtex
@misc{afcl2024,
    title={Arabic Function Calling Leaderboard},
    author={Hesham Haroon},
    year={2024},
    url={https://huggingface.co/spaces/HeshamHaroon/Arabic-Function-Calling-Leaderboard}
}
```