Sentiencelab

Depth of field

Problem Description

Estimating depth from 2D images is often used in scene reconstruction, 3D object recognition, segmentation, and detection. Based on a single RGB image as input, we are predicting a depth map for the entire scene.

Model

Encoder Takes an input image and generates a high-dimensional feature vector Decoder Takes a high-dimensional feature vector and generates a semantic segmentation mask There are 3 major building blocks: Convolution Down-Sampling Up-Sampling