Terrible Quality!

#16

by qpqpqpqpqpqp - opened 5 days ago

5 days ago

I tried all VibeVoice models, they all generate speech with disgusting noise! It is not so hard to run a denoiser to make clean audio files and then train on them to make a better, high-quality model like some did. Are you deaf, excuse me

stonewh1

5 days ago

Could you share the generated speech? If you used the WebSocket realtime demo, one possible reason is that the device’s inference capability couldn’t keep up with the speech playback, which resulted in noticeable noise.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment