Unsupervised Segmentation of Categorical Time Series into Episodes
نویسندگان
چکیده
This paper describes an unsupervised olgorirhm f o r segmenting categorical time series inro episodes. The VOTING-EXPERTS algorithm first collects starisrics about the frequency and boundav entmpy of ngrams. then passes a window over rhe series and has two “expert methods ” decide where in rhe window boundaries should be drawn. The algorirhm successfully segments t a r into words in four languages. The algorithm also segments time series of mbot sensor data inro subsequences rhar represenr episodes in rhe life of rhe robot. We claim that VOTING-EXPERTS finds meaningful episodes in categorical time series because it exploits two statistical characteristics of meaningful episodes.
منابع مشابه
An Unsupervised Algorithm for Segmenting Categorical Timeseries into Episodes
This paper describes an unsupervised algorithm for segmenting categorical time series into episodes. The VOTINGEXPERTS algorithm first collects statistics about the frequency and boundary entropy of ngrams, then passes a window over the series and has two “expert methods” decide where in the window boundaries should be drawn. The algorithm successfully segments text into words in four languages...
متن کاملAn Unsupervised Algorithm for Segmenting Categorical Timeseries into Episodes
This paper describes an unsupervised algorithm for segmenting categorical time series into episodes. The Voting-Experts algorithm first collects statistics about the frequency and boundary entropy of ngrams, then passes a window over the series and has two “expert methods” decide where in the window boundaries should be drawn. The algorithm successfully segments text into words in four language...
متن کاملAn Unsupervised Algorithm for Finding Episode Boundaries
This paper describes an unsupervised algorithm for segmenting categorical time series into episodes. The VotingExperts algorithm rst collects statistics about the frequency and boundary entropy of ngrams, then passes a window over the series and has two \expert methods" decide where in the window boundaries should be drawn. The algorithm segments text into words successfully in four languages. ...
متن کاملAn Algorithm for Segmenting Categorical Time Series into Meaningful Episodes
This paper describes an unsupervised algorithm for segmenting categorical time series. The algorithm first collects statistics about the frequency and boundary entropy of ngrams, then passes a window over the series and has two “expert methods” decide where in the window boundaries should be drawn. The algorithm segments text into words successfully, and has also been tested with a data set of ...
متن کاملUnsupervised Audio Speech Segmentation Using the Voting Experts Algorithm
Human beings have an apparently innate ability to segment continuous audio speech into words, and that ability is present in infants as young as 8 months old. This propensity towards audio segmentation seems to lay the groundwork for language learning in human beings. To artificially reproduce this ability would be both practically useful and theoretically enlightening. In this paper we propose...
متن کامل