This article studies the adaptive optimal stationary control of continuous-time linear stochastic systems with both additive and multiplicative noises, using reinforcement learning techniques. Based on policy iteration, a novel off-policy algorithm, named optimistic least-squares-based is proposed, which able to find iteratively near-optimal policies problem directly from input/state data witho...