Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models
Paper
โข
2602.02185
โข
Published
โข
116
๐๏ธ Creators of models with the most cumulative new downloads each month (users only, no orgs)
./llama-server -m /models/mistralai_Mistral-Small-3.2-24B-Instruct-2506-Q4_K_M.gguf --jinja --chat-template-file /models/Mistral-Small-3.2-24B-Instruct-2506.jinjaauthor_model-name