Computer Vision, 3D and the connecting factor - AI

About Me

AI Entrepreneur

My name is Peter. I've been enthusiastic about neural networks since I was a kid. I run the awesome reddit community:
/r/2D3DAI

Working on rendi.dev - FFmpeg as a Service

Co-Founded getmunch.com

Welcome to the journal about 2d, 3d and the connecting factor - AI

@2019 - All Right Reserved. Peter Naftaliev Abelians

3D scene reconstruction from single image

by Peter October 9, 2019


This paper by Facebook Research shows how to use neural networks to analyze a single image of a scene, segment it into the 3D objects visible in it and automatically create meshes/voxels of those objects from that one image.
Link to paper: https://arxiv.org/abs/1906.02739

Example of 3D scene reconstruction

Why single image?

Using multiple images would give better results and reconstruction accuracy, so why use only a single image?

It’s easier

Training datasets are more readily available for single images. The architecture of the neural network is easier to model and explain when the input is a single image, and training on single images requires fewer computational resources.

It’s more interesting

Once good reconstruction accuracy is reached with a single image, we know that the structure of the neural network is good. It is then possible to change this structure to accept more images as input, be it by changing the neural network itself, changing the input vector it receives, averaging over the outputs of the network or using other combination methods. So, in a sense, multi-image reconstruction can be built on top of single image reconstruction.
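One of the combination methods mentioned above, averaging over the output of the network, can be sketched roughly as follows. This is only an illustration and not the method from the paper: `predict_occupancy` is a hypothetical stand-in for a trained single image network, the grid size `GRID` is made up, and the sketch assumes that every per-view prediction is already expressed in the same coordinate frame.

```python
import numpy as np

GRID = 4  # hypothetical voxel grid resolution (GRID x GRID x GRID)


def predict_occupancy(image: np.ndarray) -> np.ndarray:
    """Hypothetical stand-in for a trained single image network.

    A real network would return a per-voxel occupancy probability in [0, 1].
    This fake version just fills the grid with the mean image brightness so
    that the example runs without any trained model.
    """
    brightness = float(image.mean())
    return np.full((GRID, GRID, GRID), brightness)


def fuse_views(images, threshold=0.5):
    """Average the per-view predictions, then threshold into a voxel grid."""
    if len(images) == 0:
        raise ValueError("need at least one image")
    # Shape: (number of views, GRID, GRID, GRID)
    probs = np.stack([predict_occupancy(img) for img in images])
    mean_probs = probs.mean(axis=0)
    occupied = mean_probs >= threshold
    return mean_probs, occupied


# Two fake grayscale views of the same scene (assumed already aligned).
views = [np.full((8, 8), 0.2), np.full((8, 8), 0.9)]
mean_probs, occupied = fuse_views(views)
print(mean_probs.shape, bool(occupied.all()))
```

With a single view in the list, `fuse_views` simply returns the single image prediction, so the single image network stays unchanged and extra views only change how its outputs are combined.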

AI and humans

Sometimes people are afraid that AI will replace us all. Well, if it is able to reach the singularity (you can read more here: https://waitbutwhy.com/2015/01/artificial-intelligence-revolution-1.html ) then yes, it could happen. But neural networks that turn 2D images into 3D models are not what is going to bring this change.
Current technological developments make it possible to minimize repetitive tasks for humans, and actually free up more time, money and energy for creative, valuable tasks.

This might lead to a change in the structure of the workforce in the future, creating new jobs and making older jobs obsolete, but so did the invention of the car (which almost eliminated the need for coachmen but allowed for more accessible transportation and created jobs for taxi/bus/truck drivers), the invention of the telegraph and many other inventions.

Imagine giving a 2D/3D artist a tool in which they can draw whatever shape they like in 2D, and software that creates a corresponding 3D representation. This might open new possibilities for modelling, art, VR/AR, printing and other industries that might pop up in the future. Or, just in the short term, it could make 3D scanning and modelling a much cheaper and faster process for makers.

Ideas for future research

AI which gets a point cloud (instead of an image) as input and reconstructs an accurate 3D mesh
Estimation of camera and lighting pose and parameters


Comments

back_to_code October 9, 2020 - 15:52

Just think about making scene graphs using these. Then, using graphs, you can do magic. (Google it)
