Human organs constantly undergo anatomical changes due to a complex mix of short-term (e.g., heartbeat) and long-term (e.g., aging) factors. Evidently, prior knowledge of these factors will be beneficial when modeling the organs' future state, i.e., via image generation. However, most medical image generation tasks rely only on input from a single image, thus ignoring the sequential dependency even when longitudinal data i...