view article Article ColPali: Efficient Document Retrieval with Vision Language Models ๐ Jul 5, 2024 โข 306
UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation Paper โข 2408.11305 โข Published Aug 21, 2024 โข 1
WebThinker: Empowering Large Reasoning Models with Deep Research Capability Paper โข 2504.21776 โข Published Apr 30, 2025 โข 59
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM +2 Mar 12, 2025 โข 480
Running 3.62k The Ultra-Scale Playbook ๐ 3.62k The ultimate guide to training LLM on large GPU Clusters
Reasoning Datasets Collection Distilled synthetic Reasoning datasets โข 7 items โข Updated Feb 2, 2025 โข 61
microsoft/Phi-3-mini-128k-instruct Text Generation โข 4B โข Updated 22 days ago โข 60.7k โข 1.69k
๐ง Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community โข 24 items โข Updated May 19, 2025 โข 178
view post Post 2104 New smolagents example landed on Hugging Face cookbook ๐ค Learn how to create an inventory managing multi-agent system with smolagents, MongoDB and DeepSeek Chat ๐ https://huggingface.co/learn/cookbook/mongodb_smolagents_multi_micro_agents See translation ๐ฅ 7 7 ๐ค 4 4 ๐ 2 2 + Reply
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training Paper โข 2501.11425 โข Published Jan 20, 2025 โข 109
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation โข 33B โข Updated Feb 24, 2025 โข 2.75M โข โข 1.48k