Research on video anomaly detection has mainly been based on video data. However, many real-world cases involve users who can conceive of potential normal and abnormal situations within the domain. This domain knowledge can be conveniently expressed as text descriptions, such as “walking” or “people fighting”, which can be easily obtained, customized for specific applications, and applied to unseen videos not included in tr...