Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
ZhenYE's picture
10 7 32

ZhenYE

ZhenYe234
21world's profile picture litagin's profile picture Csplk's profile picture
·
https://github.com/zhenye234
  • zhenye234

AI & ML interests

None yet

Organizations

HKUST Audio's profile picture

upvoted a paper 3 months ago

SpaceVista: All-Scale Visual Spatial Reasoning from mm to km

Paper • 2510.09606 • Published Oct 10, 2025 • 17
upvoted a collection 8 months ago

Canary-TTS

Collection
12 items • Updated Nov 18, 2025 • 3
upvoted a collection 10 months ago

Multimodal Reasoning

Collection
166 items • Updated 2 days ago • 36
upvoted a paper 10 months ago

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

Paper • 2503.04724 • Published Mar 6, 2025 • 72
upvoted an article 11 months ago
view article
Article

From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages

Feb 11, 2025
•
33
upvoted a paper 11 months ago

Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis

Paper • 2502.04128 • Published Feb 6, 2025 • 27
upvoted a collection 11 months ago

Llasa

Collection
TTS foundation model compatible with Llama framework (160k hours tokenized speech data released) • 11 items • Updated May 11, 2025 • 20
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs