--- title: Arabic Function Calling Leaderboard emoji: 🏆 colorFrom: green colorTo: blue sdk: gradio sdk_version: 4.44.0 app_file: app.py pinned: true license: apache-2.0 tags: - arabic - function-calling - leaderboard - llm-evaluation --- # 🏆 Arabic Function Calling Leaderboard لوحة تقييم استدعاء الدوال بالعربية ## Overview The **Arabic Function Calling Leaderboard (AFCL)** evaluates Large Language Models on their ability to: 1. Understand Arabic queries (MSA + Dialects) 2. Select appropriate functions from available options 3. Extract correct arguments from Arabic text 4. Handle parallel and complex function calls 5. Detect when no function should be called ## Dataset The benchmark includes **1,470+ samples** across 10 categories: - Simple, Multiple, Parallel, Parallel Multiple - Irrelevance Detection - Dialect Handling (Egyptian, Gulf, Levantine) - Programming APIs (Java, JavaScript, REST, SQL) 📊 **Dataset**: [HeshamHaroon/Arabic_Function_Calling](https://huggingface.co/datasets/HeshamHaroon/Arabic_Function_Calling) ## Submit Your Model To submit your model for evaluation: 1. Go to the "Submit" tab 2. Fill in your model details 3. Your model will be added to the evaluation queue ## Citation ```bibtex @misc{afcl2024, title={Arabic Function Calling Leaderboard}, author={Hesham Haroon}, year={2024}, url={https://huggingface.co/spaces/HeshamHaroon/Arabic-Function-Calling-Leaderboard} } ```