AAAI2026

SEQRET: Mining Rule Sets from Event Sequences

Aleena Siji, Joscha Cรผppers, Osman Mian, Jilles Vreeken

Abstract

Summarizing event sequences is a key aspect of data mining. Most existing methods neglect conditional dependencies and focus on discovering sequential patterns only. In this paper, we study the problem of discovering both conditional and unconditional dependencies from event sequence data. We do so by discovering rules of the form ๐‘‹ โ†’ ๐‘Œ where ๐‘‹ and ๐‘Œ are sequential patterns. Rules like these are simple to understand and provide a clear description of the relation between the antecedent and the consequent. To discover succinct and non-redundant sets of rules we formalize the problem in terms of the Minimum Description Length principle. As the search space is enormous and does not exhibit helpful structure, we propose the Seqret method to discover high-quality rule sets in practice. Through extensive empirical evaluation we show that unlike the state of the art, Seqret ably recovers the ground truth on synthetic datasets and finds useful rules from real datasets. In this section we introduce basic notation and give a short introduction to the MDL principle. Notation As data we consider a sequence database ๐ท of |๐ท | event sequences. A sequence ๐‘† โˆˆ ๐ท consists of |๐‘† | events drawn from a finite alphabet 1 https://eda.rg.cispa.io/prj/seqret/