S.F.'s picture

S.F.

search-facility

·

ipv6

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

black-forest-labs/FLUX.2-klein-4B

upvoted a paper 1 day ago

Ministral 3

upvoted a paper 7 days ago

The Illusion of Specialization: Unveiling the Domain-Invariant "Standing Committee" in Mixture-of-Experts Models

View all activity

Organizations

None yet

liked a model 1 day ago

black-forest-labs/FLUX.2-klein-4B

Image-to-Image • Updated 1 day ago • 3.06k • • 128

upvoted a paper 1 day ago

Ministral 3

Paper • 2601.08584 • Published 3 days ago • 40

upvoted 2 papers 7 days ago

The Illusion of Specialization: Unveiling the Domain-Invariant "Standing Committee" in Mixture-of-Experts Models

Paper • 2601.03425 • Published 10 days ago • 15

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published 8 days ago • 191

upvoted 4 papers 9 days ago

MiMo-V2-Flash Technical Report

Paper • 2601.02780 • Published 11 days ago • 31

LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published 10 days ago • 121

MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization

Paper • 2601.01554 • Published 12 days ago • 52

InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields

Paper • 2601.03252 • Published 10 days ago • 95

upvoted 2 papers 17 days ago

SpotEdit: Selective Region Editing in Diffusion Transformers

Paper • 2512.22323 • Published 21 days ago • 38

Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss

Paper • 2512.23447 • Published 18 days ago • 94

upvoted 3 papers 18 days ago

TimeBill: Time-Budgeted Inference for Large Language Models

Paper • 2512.21859 • Published 22 days ago • 24

ProEdit: Inversion-based Editing From Prompts Done Right

Paper • 2512.22118 • Published 21 days ago • 17

InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion

Paper • 2512.17504 • Published 29 days ago • 96

upvoted a paper 21 days ago

How Much 3D Do Video Foundation Models Encode?

Paper • 2512.19949 • Published 25 days ago • 9

upvoted 4 papers 23 days ago

QuantiPhy: A Quantitative Benchmark Evaluating Physical Reasoning Abilities of Vision-Language Models

Paper • 2512.19526 • Published 25 days ago • 11

Scaling Laws for Code: Every Programming Language Matters

Paper • 2512.13472 • Published Dec 15, 2025 • 10

INTELLECT-3: Technical Report

Paper • 2512.16144 • Published 30 days ago • 18

FaithLens: Detecting and Explaining Faithfulness Hallucination

Paper • 2512.20182 • Published 25 days ago • 8

upvoted a paper 24 days ago

WorldWarp: Propagating 3D Geometry with Asynchronous Video Diffusion

Paper • 2512.19678 • Published 25 days ago • 29

upvoted a paper 25 days ago

Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience

Paper • 2512.17260 • Published 29 days ago • 49