The rapid and accurate damage assessment of buildings plays a critical role in disaster response. Based on pairs pre- post-disaster remote sensing images, effective building level can be conducted. However, most existing methods are based Convolutional Neural Network, which has limited ability to learn the global context. An attention mechanism helps ameliorate this problem. Hierarchical Transf...