This is an open-source AI tool that renders a flat photo with a dynamic 3D parallax (depth) effect.

The authors propose a method for converting a single RGB-D input image into a 3D photo: a multi-layer representation for novel view synthesis that contains hallucinated color and depth structure in regions occluded in the original view. They use a Layered Depth Image with explicit pixel connectivity as the underlying representation and present a learning-based inpainting model that iteratively synthesizes new local color-and-depth content into the occluded regions in a spatially context-aware manner. The resulting 3D photos can be rendered efficiently with motion parallax using standard graphics engines. The authors validate the effectiveness of the method on a wide range of challenging everyday scenes.

Test environment

Linux (Ubuntu 18.04.4 LTS)
Anaconda
Python 3.7 (tested on 3.7.4)
PyTorch 1.4.0
Other Python dependencies:
opencv-python==4.2.0.32
vispy==0.6.4
moviepy==1.0.2
transforms3d==0.3.1
networkx==2.3
cynetworkx
scikit-image
Run the following commands to install:
conda create -n 3DP python=3.7 anaconda
conda activate 3DP
pip install -r requirements.txt
conda install pytorch==1.4.0 torchvision==0.5.0 cudatoolkit==10.1.243 -c pytorch
Download the model weights:
chmod +x download.sh
./download.sh
See the project homepage for more details.

For a longer introduction (in Chinese), see https://baijiahao.baidu.com/s?id=1663565761787257810

Links

Project page: https://shihmengli.github.io/3D-Photo-Inpainting/

GitHub: https://github.com/vt-vl-lab/3d-photo-inpainting


[CVPR 2020] 3D Photography using Context-aware Layered Depth Inpainting


[Paper] [Project Website] [Google Colab]

We propose a method for converting a single RGB-D input image into a 3D photo, i.e., a multi-layer representation for novel view synthesis that contains hallucinated color and depth structures in regions occluded in the original view. We use a Layered Depth Image with explicit pixel connectivity as the underlying representation, and present a learning-based inpainting model that iteratively synthesizes new local color-and-depth content into the occluded region in a spatial context-aware manner. The resulting 3D photos can be efficiently rendered with motion parallax using standard graphics engines. We validate the effectiveness of our method on a wide range of challenging everyday scenes and show fewer artifacts compared with the state of the art.
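To make the representation concrete, here is a minimal illustrative sketch (ours, not the authors' code) of a Layered Depth Image with explicit pixel connectivity; all class and field names are hypothetical:

    # Illustrative LDI sketch: each (x, y) location holds a depth-ordered list
    # of pixels, and each pixel stores explicit links to its 4-connected
    # neighbors (a link is None across a depth discontinuity).
    from dataclasses import dataclass
    from typing import Optional

    @dataclass
    class LDIPixel:
        color: tuple                          # (r, g, b)
        depth: float                          # depth of this layer at (x, y)
        up: Optional["LDIPixel"] = None
        down: Optional["LDIPixel"] = None
        left: Optional["LDIPixel"] = None
        right: Optional["LDIPixel"] = None

    # An LDI is then a 2D grid of depth-ordered pixel lists, e.g.:
    # ldi[y][x] == [foreground_pixel, inpainted_background_pixel]

Inpainting grows new pixels of this kind behind depth edges, so a novel viewpoint reveals plausible color and depth instead of holes.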

3D Photography using Context-aware Layered Depth Inpainting
Meng-Li Shih, Shih-Yang Su, Johannes Kopf, and Jia-Bin Huang
In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020.

Prerequisites

  • Linux (tested on Ubuntu 18.04.4 LTS)
  • Anaconda
  • Python 3.7 (tested on 3.7.4)
  • PyTorch 1.4.0 (tested on 1.4.0)

and the Python dependencies listed in requirements.txt

  • To get started, please run the following commands:
    conda create -n 3DP python=3.7 anaconda
    conda activate 3DP
    pip install -r requirements.txt
    conda install pytorch==1.4.0 torchvision==0.5.0 cudatoolkit==10.1.243 -c pytorch
  • Next, please download the model weights using the following commands:
    chmod +x download.sh
    ./download.sh
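After installation, a quick sanity check (ours, not part of the repository) confirms that the pinned versions are in place and whether a CUDA device is visible:

    # Environment sanity check: verify the versions pinned above and CUDA.
    import cv2
    import torch
    import torchvision
    import vispy

    print("PyTorch     :", torch.__version__)        # expect 1.4.0
    print("torchvision :", torchvision.__version__)  # expect 0.5.0
    print("OpenCV      :", cv2.__version__)          # expect 4.2.0
    print("vispy       :", vispy.__version__)        # expect 0.6.4
    print("CUDA usable :", torch.cuda.is_available())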

Quick start

Please follow the instructions in this section. This should allow you to reproduce our results. For more detailed instructions, please refer to DOCUMENTATION.md.

Execute

  1. Put .jpg files (e.g., test.jpg) into the image folder.
    • E.g., image/moon.jpg
  2. Run the following command:
    python main.py --config argument.yml
    • Note: The 3D photo generation process usually takes about 2-3 minutes depending on the available computing resources.
  3. The results are stored in the following directories:
    • Corresponding depth map estimated by MiDaS
      • E.g., depth/moon.npy and depth/moon.png
      • You can edit depth/moon.png manually (see the depth-editing sketch after this list).
        • Remember to set the following two flags if you want to use the manually edited depth/moon.png as input for the 3D photo:
          • depth_format: '.png'
          • require_midas: False
    • Inpainted 3D mesh (optional: enable the save_ply flag)
      • E.g. mesh/moon.ply
    • Rendered videos with zoom-in motion
      • E.g. video/moon_zoom-in.mp4
    • Rendered videos with swing motion
      • E.g. video/moon_swing.mp4
    • Rendered videos with circle motion
      • E.g. video/moon_circle.mp4
    • Rendered videos with dolly zoom-in effect
      • E.g. video/moon_dolly-zoom-in.mp4
      • Note: We assume that the object of focus is located at the center of the image.
  4. (Optional) To change the default configuration, please read DOCUMENTATION.md and modify argument.yml (see the config-driver sketch after this list).
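If you plan to hand-edit the estimated depth (step 3 above), a sketch like the following shows one way to load depth/moon.npy, clean it up, and write depth/moon.png back. This is our own helper, not part of the repo, and the exact PNG encoding the pipeline expects is an assumption; consult DOCUMENTATION.md:

    # Hypothetical depth-editing helper: load the raw MiDaS output, clip
    # outliers, and save a normalized 16-bit PNG for re-use as input.
    import cv2
    import numpy as np

    depth = np.load("depth/moon.npy")                # raw MiDaS estimate
    lo, hi = np.percentile(depth, 1), np.percentile(depth, 99)
    depth = np.clip(depth, lo, hi)                   # example edit: clip outliers
    norm = (depth - depth.min()) / (depth.max() - depth.min())
    cv2.imwrite("depth/moon.png", (norm * 65535).astype(np.uint16))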
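To then drive a run that consumes the edited depth map, you can flip the two flags documented above programmatically. This driver is a sketch: PyYAML is assumed to be installed, and depth_format and require_midas are the only argument.yml keys confirmed by this document. Note that yaml.safe_dump will drop any comments present in argument.yml:

    # Hypothetical driver: set the two documented flags, then run main.py.
    import subprocess
    import yaml  # PyYAML

    with open("argument.yml") as f:
        cfg = yaml.safe_load(f)

    cfg["depth_format"] = ".png"    # use the hand-edited PNG depth
    cfg["require_midas"] = False    # skip re-running MiDaS

    with open("argument.yml", "w") as f:
        yaml.safe_dump(cfg, f)

    subprocess.run(["python", "main.py", "--config", "argument.yml"], check=True)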

License

This work is licensed under the MIT License. See LICENSE for details.

If you find our code/models useful, please consider citing our paper:

@inproceedings{Shih3DP20,
  author = {Shih, Meng-Li and Su, Shih-Yang and Kopf, Johannes and Huang, Jia-Bin},
  title = {3D Photography using Context-aware Layered Depth Inpainting},
  booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  year = {2020}
}

