(评论)
(comments)

原始链接: https://news.ycombinator.com/item?id=43683907

Hacker News 最新 | 过去 | 评论 | 提问 | 展示 | 招聘 | 提交 登录 AudioX:用于任何内容到音频生成的扩散变换器 (zeyuet.github.io) gnabgib 54分钟前 21分 | 隐藏 | 过去 | 收藏 | 1条评论 Fauntleroy 13分钟前 [–] 视频到音频的例子真的令人印象深刻!乐队演奏的视频展示了这种方法的一些明显缺点(人们对5个长号会发出什么样的声音会有非常精确的预期)——但网球的例子展示了它的优势(击球声的时机不错,大型室内空间的音响效果也令人惊讶地准确)。我非常期待看到这种技术在未来几篇论文中得到改进! 回复 加入我们,参加6月16日至17日在旧金山举办的AI创业学校! 指南 | 常见问题 | 列表 | API | 安全 | 法律 | 申请YC | 联系我们 搜索:


原文
Hacker News new | past | comments | ask | show | jobs | submit login
AudioX: Diffusion Transformer for Anything-to-Audio Generation (zeyuet.github.io)
21 points by gnabgib 54 minutes ago | hide | past | favorite | 1 comment










The video to audio examples are really impressive! The video featuring the band showcases some of the obvious shortcomings of this method (humans will have very precise expectations about the kinds of sounds 5 trombones will make)—but the tennis example shows its strengths (decent timing of hit sounds, eerily accurate acoustics for the large internal space). I'm very excited to see how this improves a few more papers down the line!






Join us for AI Startup School this June 16-17 in San Francisco!


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact



Search:
联系我们 contact @ memedata.com