PHOSA — SOTA Neural Network That Extracts 3D Models of People and Objects From an Image
PHOSA is the new state of the art in reconstructing 3D models of humans and surrounding objects from a single 2D image
Researchers from Carnegie Mellon University, Facebook AI Research, Argo AI, and the University of California have developed a neural network model that generates 3D models of people and surrounding objects from a single 2D image. The model also takes into account the spatial relationships between the people and objects in the scene.
PHOSA — More about the model
PHOSA (Perceiving 3D Human-Object Spatial Arrangements) works without scene- or object-level annotations. The model extracts relationships between the person in the image and other objects and lifts them into 3D space. The researchers introduced constraints into the model's optimization process that resolve ambiguous configurations during 3D model generation. For this, the loss function combines several terms responsible for:
- Scale: the estimated size of each object;
- Silhouette: agreement between the projected 3D model and the object's 2D silhouette in the image;
- Interaction: the spatial relationship between the person and other objects in the image.
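To make the three terms concrete, here is a minimal sketch (not the authors' code; all function names, inputs, and weights are hypothetical placeholders) of how such loss terms could be defined and combined into a single objective:

```python
import numpy as np

def scale_loss(object_scale, prior_scale):
    # Penalize deviation of the object's estimated size from a
    # per-category prior size (squared log-scale difference).
    return (np.log(object_scale) - np.log(prior_scale)) ** 2

def silhouette_loss(rendered_mask, instance_mask):
    # Penalize mismatch between the projected 3D model's silhouette
    # and the 2D instance-segmentation mask (1 - IoU).
    inter = np.logical_and(rendered_mask, instance_mask).sum()
    union = np.logical_or(rendered_mask, instance_mask).sum()
    return 1.0 - inter / union

def interaction_loss(human_points, object_points):
    # Pull interacting human and object surfaces together by
    # penalizing the distance between their closest points.
    dists = np.linalg.norm(
        human_points[:, None, :] - object_points[None, :, :], axis=-1
    )
    return dists.min()

def total_loss(object_scale, prior_scale, rendered_mask, instance_mask,
               human_points, object_points,
               w_scale=1.0, w_sil=1.0, w_inter=1.0):
    # Weighted sum of the individual terms; the weights are illustrative.
    return (w_scale * scale_loss(object_scale, prior_scale)
            + w_sil * silhouette_loss(rendered_mask, instance_mask)
            + w_inter * interaction_loss(human_points, object_points))
```

In an optimization-based setup, an objective like this would be minimized over each object's scale and placement, so that the terms trade off against each other rather than being satisfied in isolation.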
The proposed framework combines a 3D human pose estimation model, an instance segmentation model, and a differentiable 3D renderer.
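At a high level, these components fit together in a staged flow: per-instance 3D estimates are produced first, then their spatial arrangement is refined jointly. The sketch below illustrates that flow with placeholder stubs (the function names and returned fields are assumptions for illustration, not the authors' API):

```python
def estimate_human_pose(image):
    # Placeholder for a 3D human pose/mesh estimation model.
    return {"kind": "human", "scale": 1.0, "translation": [0.0, 0.0, 2.0]}

def segment_and_fit_objects(image):
    # Placeholder for instance segmentation followed by fitting a
    # per-category 3D mesh to each detected object.
    return [{"kind": "bicycle", "scale": 1.0, "translation": [0.5, 0.0, 2.0]}]

def jointly_optimize(instances):
    # Placeholder for the joint refinement step, where a differentiable
    # renderer would let image-space losses update each instance's scale
    # and translation. Here the instances are returned unchanged.
    return instances

def reconstruct(image):
    # Staged flow: independent per-instance estimates, then joint refinement.
    instances = [estimate_human_pose(image)] + segment_and_fit_objects(image)
    return jointly_optimize(instances)
```

The key design point is that the differentiable renderer sits in the last stage: because rendering is differentiable, image-space losses can propagate gradients back to the 3D placement parameters.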
PHOSA model performance evaluation
The researchers assessed the model's results with qualitative and quantitative metrics. The framework was tested on the COCO-2017 dataset. PHOSA produces results comparable to the state of the art on images in which people interact with common objects.