ZhenYE's picture

ZhenYE

ZhenYe234

·

https://github.com/zhenye234

zhenye234

AI & ML interests

None yet

Organizations

upvoted a paper 3 months ago

SpaceVista: All-Scale Visual Spatial Reasoning from mm to km

Paper • 2510.09606 • Published Oct 10, 2025 • 17

upvoted a collection 8 months ago

Canary-TTS

12 items • Updated Nov 18, 2025 • 3

upvoted a collection 10 months ago

Multimodal Reasoning

166 items • Updated 2 days ago • 36

upvoted a paper 10 months ago

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

Paper • 2503.04724 • Published Mar 6, 2025 • 72

upvoted an article 11 months ago

Article

From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages

Feb 11, 2025

•

33

upvoted a paper 11 months ago

Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis

Paper • 2502.04128 • Published Feb 6, 2025 • 27

upvoted a collection 11 months ago

Llasa

TTS foundation model compatible with Llama framework (160k hours tokenized speech data released) • 11 items • Updated May 11, 2025 • 20