Mining Valid-Time Indeterminate Events
Authors
Abstract
In many temporally oriented applications, it is known that an event has occurred, but the exact time of its occurrence is not known. For example, a blood test of a diabetic patient may reveal that the patient's blood glucose level is above the safe threshold without telling exactly when that happened. Such events are said to have valid-time indeterminacy. Extensions to SQL for supporting valid-time indeterminacy in temporal databases have been studied. However, no prior research has applied mining techniques to find interesting patterns in valid-time indeterminate events. In this paper, we first provide background on valid-time indeterminacy. We then propose a measure, "ordering probability", for computing the probability of occurrence of an episode (an ordered list of items) in a given temporal sequence of indeterminate events. Bounds for this measure are shown, and its anti-monotonic and asymmetric properties are proved. Mining frequent patterns from indeterminate events requires computing this measure for many different sequences, so an efficient algorithm for computing the ordering probability of a given episode in a sequence is proposed. Finally, the use of this measure in two temporal data mining frameworks, namely (i) sequence mining and (ii) sequential pattern mining, is explained, and the corresponding extensions of the frequency of an episode in sequence mining and of the support for an episode in sequential pattern mining are shown. The research in this paper thus generalizes temporal data mining to allow valid-time indeterminacy.
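As a rough illustration of what a measure like the ordering probability captures, the sketch below computes the probability that a list of indeterminate events occurred in the given order. It is not the paper's algorithm: it assumes (hypothetically) that events are independent, that each event's time is uniformly distributed over an integer window `(lo, hi)` of time points, and that "in order" means strictly increasing times. The function name `ordering_probability` and the window representation are likewise assumptions made for this example.

```python
from fractions import Fraction


def ordering_probability(windows):
    """Probability that t_1 < t_2 < ... < t_k.

    Assumptions (not taken from the paper): events are independent and
    event i is uniform over the integer time points lo_i..hi_i inclusive.
    `windows` is a list of (lo, hi) pairs in episode order.
    """
    if not windows:
        return Fraction(1)  # an empty episode trivially occurs

    # current[t] = P(first i events are strictly ordered and event i is at t)
    lo, hi = windows[0]
    current = {t: Fraction(1, hi - lo + 1) for t in range(lo, hi + 1)}

    for lo, hi in windows[1:]:
        mass = Fraction(1, hi - lo + 1)  # uniform mass per time point
        nxt = {}
        for t in range(lo, hi + 1):
            # Total probability of ordered prefixes ending strictly before t.
            earlier = sum((p for s, p in current.items() if s < t), Fraction(0))
            if earlier:
                nxt[t] = mass * earlier
        current = nxt

    return sum(current.values(), Fraction(0))
```

Under these assumptions, two events with windows `(1, 3)` and `(2, 4)` give `Fraction(2, 3)` in that order and `Fraction(1, 9)` in the reverse order, which illustrates why such a measure is asymmetric. Extending an episode with a further event can only keep or lower the value, which is the intuition behind anti-monotonicity.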
Similar resources
Constraint Logic Programming and Logic Modality for Event's Valid-time Approximation
The Temporal Probabilistic (TP) Database management systems should provide support for valid-time indeterminacy of events, by proposing the concept of an indeterminate instant, that is, an interval of time-points (event’s time-window) with an associated, lower and upper, probability distribution. In particular, users should be able to control, via query language constructs, the amount of tempor...
Temporal Probabilistic Logic Programs: State and Revision
There are numerous applications where we have to deal with temporal uncertainty associated with events. The Temporal Probabilistic (TP) Logic Programs should provide support for valid-time indeterminacy of events, by proposing the concept of an indeterminate instant, that is, an interval of time-points (event’s time-window) with an associated, lower and upper, probability distribution. In parti...
Discovery of Time Series Event Patterns based on Time Constraints from Textual Data
This paper proposes a method that discovers time series event patterns from textual data with time information. The patterns are composed of sequences of events and each event is extracted from the textual data, where an event is characteristic content included in the textual data such as a company name, an action, and an impression of a customer. The method introduces 7 types of time constrain...
Asynchronous Periodic Patterns Mining in Temporal Databases
Mining periodic patterns in temporal databases is an important data mining problem with many applications. Previous studies have considered synchronous periodic patterns where misaligned occurrences are not allowed. However, asynchronous periodic pattern mining has received less attention and has only been discussed for a sequence of symbols where each time point contains one event. In this pape...
Mining State Dependencies Between Multiple Sensor Data Sources
Pattern mining over data streams is critical to a variety of applications such as prediction and evolution of weather phenomena or anomaly detection in security applications. Most of the current techniques attempt to discover associations between events appearing on the same data stream but are not able to discover associations over multiple heterogeneous data streams. In this work, we aim to i...