Data-driven individualized decision making has recently received increasing research interest. However, most existing methods rely on the assumption of no unmeasured confounding, which cannot be ensured in practice especially observational studies. Motivated by proposed proximal causal inference, we develop several learning to estimate optimal treatment regimes (ITRs) presence confounding. Expl...