TUM Logo

Efficient Online Sequence Prediction with Side Information

Sequence prediction is a key task in machine learning and data mining. It involves predicting the next symbol in a sequence given its previous symbols. Our motivating application is predicting the execution path of a process on an operating system in real-time. In this case, each symbol in the sequence represents a system call accompanied with arguments and a return value. We propose a novel online algorithm for predicting the next system call by leveraging both context and side information. The online update of our algorithm is efficient in terms of time cost and memory consumption. Experiments on real-world data sets showed that our method outperforms state-of-the-art online sequence prediction methods in both accuracy and efficiency, and incorporation of side information does significantly improve the predictive accuracy.

Efficient Online Sequence Prediction with Side Information

Authors: Han Xiao and Claudia Eckert
Year/month: 2013/12
Booktitle: IEEE International Conference on Data Mining (ICDM)
Address: Dallas, TX USA
Oranization: IEEE
Note: (AR: 19%)
Fulltext: hanxiao2013icdm-paper.pdf

Abstract

Sequence prediction is a key task in machine learning and data mining. It involves predicting the next symbol in a sequence given its previous symbols. Our motivating application is predicting the execution path of a process on an operating system in real-time. In this case, each symbol in the sequence represents a system call accompanied with arguments and a return value. We propose a novel online algorithm for predicting the next system call by leveraging both context and side information. The online update of our algorithm is efficient in terms of time cost and memory consumption. Experiments on real-world data sets showed that our method outperforms state-of-the-art online sequence prediction methods in both accuracy and efficiency, and incorporation of side information does significantly improve the predictive accuracy.

Bibtex:

@conference { hanxiao2013c,
author = { Han Xiao and Claudia Eckert },
title = { Efficient Online Sequence Prediction with Side Information },
year = { 2013 },
month = { December },
booktitle = { IEEE International Conference on Data Mining (ICDM) },
address = { Dallas, TX USA },
note = { (AR: 19%) },
organization = { IEEE },
url = {https://www.sec.in.tum.de/i20/publications/efficient-online-sequence-prediction-with-side-information/@@download/file/hanxiao2013icdm-paper.pdf}
}