We consider a version of the continuum armed bandit where an action induces a filtered realisation of a non-homogeneous Poisson process. Point data in the sample are then revealed to the decision-maker, whose reward is the total number of points. Using knowledge of the function governing the filtering, but without knowledge of the intensity function, the decision-maker seeks to maximise the expected number of points over T rounds. We propose an upper confidence bound al...