This paper describes a novel DNN-based system, named PD3net, that detects multiple people from single depth image, in real time. The proposed neural network processes image and outputs likelihood map coordinates, where each detection corresponds to Gaussian-shaped local distribution, centered at person’s head. encodes both the number of detected as well their position which 3D can be computed. ...