Stability.ai – 稳定视频 3D 简介
Stability.ai – Introducing Stable Video 3D

原始链接: https://stability.ai/news/introducing-stable-video-3d

以下是所提供文本的 100 字摘要: 文本宣布发布 Stable Video 3D,这是一种基于 Stable Video Diffusion 的生成模型,可推动 3D 技术的发展。 它提供两个版本:SV3D_u 用于从单个图像输入生成轨道视频,无需相机调节;SV3D_p 用于处理单个图像和轨道视图,从而沿特定路径生成 3D 视频。 商业用途需要 Stability AI 会员资格,而非商业用户可以访问 Hugging Face 和研究论文上的模型权重。 与 Stable Zero123 等早期版本相比,Stable Video 3D 提供了更高的质量和多视图一致性,通过视频扩散模型和改进的 3D 优化方法提供了优势。

这是有关 Emad Mostaque 的维基百科页面的链接。 它包括有关他的争议和学术不端行为指控的信息。 但是,请记住,维基百科文章可能包含错误或偏见,因此批判性地接触此类来源并尽可能验证来自多个信誉良好的来源的信息非常重要。 应仔细评估这些说法的真实性。
相关文章

原文

SV3D takes a single object image as input and output novel multi-views of that object. We can then use those novel-views and SV3D to generate 3D meshes.

When we released Stable Video Diffusion, we highlighted the versatility of our video model across various applications. Building upon this foundation, we are excited to release Stable Video 3D. This new model advances the field of 3D technology, delivering greatly improved quality and multi-view when compared to the previously released Stable Zero123, as well as outperforming other open source alternatives such as Zero123-XL.

This release features two variants:

  • SV3D_u: This variant generates orbital videos based on single image inputs without camera conditioning.

  • SV3D_p: Extending the capability of SVD3_u, this variant accommodates both single images and orbital views, allowing for the creation of 3D video along specified camera paths.

Stable Video 3D can be used now for commercial purposes with a Stability AI Membership. For non-commercial use, you can download the model weights on Hugging Face and view our research paper here.

Advantages of Video Diffusion

By adapting our Stable Video Diffusion image-to-video diffusion model with the addition of camera path conditioning, Stable Video 3D is able to generate multi-view videos of an object. The use of video diffusion models, in contrast to image diffusion models as used in Stable Zero123, provides major benefits in generalization and view-consistency of generated outputs. Additionally, we propose improved 3D optimization leveraging this powerful capability of Stable Video 3D to generate arbitrary orbits around an object. By further implementing these techniques with disentangled illumination optimization as well as a new masked score distillation sampling loss function, Stable Video 3D is able to reliably output quality 3D meshes from single image inputs.

See the technical report here for more details on the Stable Video 3D models and experimental comparisons.

Novel-View Generation

Stable Video 3D introduces significant advancements in 3D generation, particularly in novel view synthesis (NVS). Unlike previous approaches that often grapple with limited perspectives and inconsistencies in outputs, Stable Video 3D is able to deliver coherent views from any given angle with proficient generalization. This capability not only enhances pose-controllability, but also ensures consistent object appearance across multiple views, further improving critical aspects of realistic and accurate 3D generations.

联系我们 contact @ memedata.com