DEEP LEARNING
PHOSA — SOTA Neural Network That Extracts an Object 3D Model From an Image
PHOSA is the new state-of-the-art neural network for extracting 3D models from a single 2D image
Researchers from Carnegie Mellon University, Facebook AI Research, Argo AI and the University of California, Berkeley have developed a neural network model that generates 3D models of people and the objects around them from a single 2D image. In doing so, the model takes into account the spatial relationships between the people and the objects.
PHOSA — More about the model
PHOSA (Perceiving 3D Human-Object Spatial Arrangements) requires no 3D annotations at the scene or object level. The model extracts the relationships between the person in the image and the other objects and carries them over into 3D space. The researchers added constraints to the model's objective that allow it to resolve ambiguous cases when generating the 3D models. To this end, the loss function combines several terms, which are responsible for:
- Scale: the size of the object;