Abstract Each of our sensory modalities — vision, touch, taste, etc. works on a slightly different timescale, with differing temporal resolutions and processing lag. This raises the question how, or indeed whether, these streams are co-ordinated ‘bound’ into coherent multisensory experience perceptual ‘now’. In this paper I evaluate one account how binding is achieved: windows hypothesis , conc...