August 2nd, 2024

PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation

PhysGen is a novel method for generating realistic videos from a single image using physical simulation and data-driven techniques, developed by researchers from the University of Illinois and Apple.

Read original articleLink Icon
PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation

PhysGen is a new method for generating videos from a single image and specific input conditions, such as applied forces and torques. Developed by researchers from the University of Illinois Urbana-Champaign and Apple, this approach combines model-based physical simulation with data-driven video generation to create realistic and temporally consistent videos. The system comprises three main components: an image understanding module that captures the image's geometry, materials, and physical parameters; a dynamics simulation module that applies rigid-body physics to simulate realistic motion; and a rendering module that uses generative video diffusion to produce high-quality video footage. The videos generated by PhysGen are not only visually realistic but also allow for precise control over the dynamics, outperforming existing image-to-video generation methods in both quantitative assessments and user studies. The framework's interleaved components work together to interpret the image's semantics, simulate physical interactions, and render the final video output, making it suitable for various applications, including creating animations from images and enabling user interaction with dynamic content. The research will be presented at the European Conference on Computer Vision (ECCV) in 2024.

Link Icon 0 comments