Measuring and alleviating the discrepancies between synthetic (source) real scene (target) data is core issue for domain adaptive semantic segmentation. Though recent works have introduced depth information in source to reinforce geometric knowledge transfer, they cannot extract intrinsic 3D of objects, including positions shapes, merely based on 2D estimated depth. In this work, we propose a n...