How do we know a generated patch is correct? This is a key challenging question that automated program repair (APR) systems struggle to address given the incompleteness of available test suites. Our intuition is that we can triage correct patches by checking whether each patch implements code changes (i.e., behavior) that are relevant to the bug it addresses. Such a bug is commonly specified by a failing test case. Towards predicting correctness ...