Time Series Analysis : A Review of methods

  IJCTT-book-cover
 
International Journal of Computer Trends and Technology (IJCTT)          
 
© 2017 by IJCTT Journal
Volume-52 Number-1
Year of Publication : 2017
Authors : Diana Moses

MLA

Diana Moses "Time Series Analysis : A Review of methods". International Journal of Computer Trends and Technology (IJCTT) V52(1):22-28, October 2017. ISSN:2231-2803. www.ijcttjournal.org. Published by Seventh Sense Research Group.

Abstract -
Time series is an essential class of transient information articles and it very well may be effectively gotten from logical and budgetary applications. A time series is an accumulation of perceptions made sequentially. The idea of time series information incorporates: substantial in information estimate, high dimensionality and important to refresh constantly. In addition time series information, which is described by its numerical and nonstop nature, is constantly considered all in all rather than individual numerical field. The expanding utilization of time series information has started a lot of innovative work endeavors in the field of information mining. The copious research on time series information mining in the most recent decade could hamper the section of intrigued analysts, because of its unpredictability. In this paper, an extensive modification on the current time series information mining research is given. They are commonly classified into portrayal and ordering, comparability measure, division, perception and mining. Besides best in class examine issues are additionally featured. The essential goal of this paper is to fill in as a glossary for intrigued scientists to have a general picture on the ebb and flow time series information mining advancement and distinguish their potential research course to advance examination.

References
[1] Agrawal, R., Faloutsos,C., Swami,A., 1993. Efficient similarity search in sequence databases. In Proceedings of the Fourth International Conference on Foundations of Data Organization and Algorithms, pp.69–84.
[2] Berndt, D.J., Clifford,J., 1996. Finding patterns in timeseries : a dynamic programming approach. Advances in Knowledge Discovery and Data Mining, 229–248.
[3] Chan, K.P., Fu,A.C., 1999. Efficient timeserie matching by wavelets. In Proceedings of the 15th IEEE International Conference on Data Engineering, pp. 126–133.
[4] Faloutsos, C., Ranganathan,M., ManolopoulosY., 1994. Fast subsequence matching in time-series databases. In Proceedings of the 1994 ACM SIG MOD International Conference on Management of Data, pp.419–429.
[5] Keogh, E., 1997. A fast and robust method for pattern matching in timeseries databases. In Proceedings of the Ninth IEEE International Conference on Tools with Artificial Intelligence, pp.578–584.
[6] Keogh, E., 1997. Fast similarity search in the presence of longitudinal scaling in time series databases. In Proceedings of the Ninth IEEE International Conference on Tools with Artificial Intelligence, pp.578– 584.
[7] Keogh, E., 2002.Exact indexing of dynamic time warping. In Proceedings of the 28th International Conference on Very Large Databases, pp.406–417.
[8] Keogh, E., Kasetty,S., 2002.On the need for timeseries data mining benchmarks: a survey and empirical demonstration. In Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 102–111.
[9] Abonyi, J., Feil,B., Nemeth,S., Arva,P., 2005.Modified Gath–Geva clustering for fuzzing segmentation of multivariate time-series. Fuzzy Sets and Systems, Data Mining Special Issue, 149,39–56.
[10] Astrom, K.J., 1969. On the choice of sampling rates in parametric identification of time series. Information Sciences, 1 (3), 273–278.
[11] Yi, B., Faloutsos,C., 2000. Fast time sequence indexing for arbitrary Lpnorms. In Proceedings of the 26th International Conference on Knowledge Discovery and Data Mining Very Large Data Bases, pp. 385–394.
[12] Yi, B., Jagadish,H.V., Faloutsos,C., 1998. Efficient retrieval of similar time sequences under time warping. In Proceedings of the 14th IEEE International Conference on Data Engineering, pp.201–208.
[13] Yi, B.K., Sidiropoulos,N.D., Johnson,T., Jagadish,H.V., Faloutsos,C., Biliris,A., 2000. Online data mining for co-evolving time sequences. In Proceedings of the 16th IEEE International Conference on Data Engineering, pp.13–22.
[14] Lee, A.J.T., Wu,H.W., Lee,T.Y., Liu,Y.H., Chen,K.T., 2009.Mining closed patterns in multi-sequence time-series databases. Data and Knowledge Engineering, 68 (10), 1071–1090.
[15] Lee, S., Kwon,D., Lee,S., 2003.Dimensionality reduction for indexing timeseries based on the minimum distance. Journal of Information Science and Engineering 19,697–711.
[16] Ratanamahatana, C.A.and Keogh,E., 2004. Making time-series classification more accurate using learned constraints. In Proceedings of the Fourth SIAM International Conference on Data Mining, pp.11– 22.
[17] Ratanamahatana, C.A., Keogh,E., 2005. Three myths about dynamic time warping data mining. In Proceedings of the Fifth SIAM International Conference on Data Mining.
[18] Ratanamahatana, C.A., Keogh,E., Bagnall,A.J., Lonardi,S.A,2005.Novel bit level
[19] time series representation with implications for similarity search and clustering. In Proceedings of the Ninth Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp.771–777.
[20] Bagnall, A., Janacek,G., 2005.Clustering timeseries with clipped data. Machine Learning 58(2–3),151–178.
[21] Bagnall, A., Ratanamahatana,C.A., Keogh,E., Lonardi,S., Janacek,G.A., 2006.Bit level representation for timeseries data mining with shape based similarity. Data Mining and Knowledge Discovery 13 (1), 11–40.
[22] Smyth, P., Keogh,E., 1997. Clustering and mode classification of engineering time series data. In Proceedings of the Third ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp.24–30.
[23] Shatkay, H., Zdonik,S., 1996. Approximate queries and representations for large data sequences. In Proceedings of the 12th IEEE International Conference on Data Engineering, pp.536–545.
[24] Ge, X.P., 1998. Pattern matching in financial time series data. University of California, Irvine, Final Project Report for. ICS 278.
[25] Perng, C.S., Wang,H., Zhang,R., Parker,D., 2000. Landmarks: a new model for similarity-based pattern querying in timeseries databases. In Proceedings of the 16th IEEE International Conference on Data Engineering, pp.33–42.
[26] Megalooikonomou, V., Li,G., Wang,Q., 2004. A Dimensionality reduction technique for efficient similarity analysis of timeseries databases. In Proceedings of the 13th ACM International Conference on Information and Knowledge Management, pp.160–161.
[27] Megalooikonomou, V., Wang,Q., Li,G., Faloutsos,C., 2005.A Multiresolution symbolic representation of timeseries .In Proceedings of the 21st IEEE International Conference on Data Engineering, pp.668–679.
[28] Moon, Y., Whang,K., Loh,W., 2001.Duality-based subsequence matching in time- series databases. In Proceedings of the 17th IEEE International Conference on Data Engineering, pp.263–272.
[29] Moon, Y.S., Whang,K.Y., Han,W.S., 2002.General match: a subsequence matching method in time-series databases based on generalized windows. In Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, pp.382–393.
[30] Loh, W.K., Moon,Y.S., Srivastava,J., 2010.Distortion-free predictive streaming time-series matching.InformationSciences180(8),1458– 1476.
[31] Chung, F.L., Fu,T.C., Luk,R., Ng,V., 2001.Flexible timeseries pattern matching based on perceptually important points. In International Joint Conference on Artificial Intelligence Workshop on Learning from Temporal and Spatial Data, pp. 1–7.
[32] Chung,F.L., Fu,T.C., Ng,V., Luk,R., 8,2004. An evolutionary approach to pattern-based timeseries segmentation. IEEE Transactions on Evolutionary Computation, 471–489.
[33] van Wijk,J.J., vanSelow,E.R., 1999.Cluster and calendar based visualization of Time Series Data. In Proceedings of the IEEE Symposium on Information Visualization, pp.4–9.
[34] Schreck, Tekusova, Kohlhammer,T., FellnerD., J., 2007.Trajectorybased visual analysis of large financial timeseries data. ACM SIGKDD Explorations Newsletter, SpecialIssueonVisualAnalytics9(2),30–37.
[35] Weber, M., Alexa,M., Muller,W., 2001.Visualizing timeseries on spirals. In Proceedings of the IEEE Symposium on Information Visualization, pp.7–14.
[36] Hao, M.C., Dayal,U., Keim,D.A., Schreck,T., 2007.Multi-resolution techniques for visual exploration of large time-series data. In Proceedings of the Joint Eurographics—IEEE VGTC Symposium on Visualization, pp.27–34.
[37] Lin, J., Li,Y., 2009.Finding structural similarity in timeseries data using bag-of- patterns representation. In Proceedings of the 21st International Conference on Scientific and Statistical Database Management, pp.461–477.
[38] Lin, J., Keogh,E., Lonardi,S., 2005. Visualizing and discovering nontrivial patterns in large timeseries databases. Information Visualization 4 (2), 61–82.
[39] Lin,J., Keogh,E., Lonardi,S., Chiu,B., 2003. A Symbolic representation of timeseries , With implications for streaming algorithms. In Proceedings of the Eighth ACM SIGMOD International Conference on Management of Data Workshop on Research Issues in Data Mining and Knowledge Discovery, pp.2–11.
[40] Lin, J., Keogh,E., Lonardi,S., Patel,P., 2002. Finding motifs in timeseries .In Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 2nd Workshop on Temporal Data Mining, pp. 53–68.
[41] Lin, J., Keogh,E., Wei,L., Lonardi,S., 2007. Experiencing SAX: a novel symbolic representation of timeseries .Data Mining and Knowledge Discovery, 15(2), 107–144.
[42] Fu, T.C., Chung, F.L., Luk,R., Ng,C.M., 2008. Representing financial timeseries based on data point importance. Engineering Applications of Artificial Intelligence 21(2),277–300.
[43] Fu, T.C., Chung,F.L., Luk,R., Ng,C.M., 2007. Stock timeseries pattern matching: template-based vs. rule-based approaches. Engineering Applications of Artificial Intelligence20(3),347–364.
[44] Fu, T.C., Chung,F.L., Luk,R., Ng,V., 2001. Pattern discovery from stock timeseries using self-organizing maps. In Proceedings of the 7th ACMSIGKDD International Conference on Knowledge Discovery and Data Mining Workshop on Temporal Data Mining, pp.27–37.
[45] Fu, T.C., Chung,F.L., Ng,C.M., 2006. Financial timeseries segmentation based on specialized binary tree representation. In Proceedings of the 2006 International Conference on Data Mining, pp.3–9.
[46] Fu, T.C., Chung,F.L., Tang,P.Y., Luk,R., Ng,C.M., 2005. Incremental stock time series data delivery and visualization. In Proceedings of the 14th ACM Conference on Information and Knowledge Management, pp.279–280.
[47] Chiu, B., Keogh,E., Lonardi,S., 2003. Probabilistic discovery of timeseries motifs. In Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp.493–498.
[48] Tanaka,Y., Iwamoto,K., Uehara,K., 2005. Discovery of time-series motif from multi- Dimensional data based on MDL principle. Machine Learning, 58(2–3),269–300.
[49] Caraa-Valente, J.P., Lopez-Chavarrias,I., 2000. Discovering similar patterns in time series. In Proceedings of the Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data mining, pp.497–505.
[50] Lerner, A., Shasha,D., Wang,Z., Zhao,X., Zhu,Y., 2004. Fast algorithms for time series with applications to finance, physics, music, biology, and other suspects. In Proceedings of the 2004 ACM SIGMOD International Conference on Management of Data, pp.965– 968.
[51] Chan, P., Mahoney,M., 2005.Modelingmultiple timeseries for anomaly detection. In Proceedings of the Fifth IEEE International Conference on Data Mining, pp. 90–97.
[52] Kalpakis, K., Gada,D., Puttagunta,V., 2001. Distance measures for effective clustering of ARIMA time-series. In Proceedings of the IEEE International Conference on Data Mining,2001, pp.273–280.
[53] Panuccio, A., Bicego,M., Murino,V., 2002. A Hidden Markov Modelbased approach to sequential data clustering. In the Joint International Association for Pattern Recognition Workshops on Structural, Syntactic and Statistical Pattern Recognition, pp.734–742.
[54] Xiong, Y., Yeung,D.Y., 2004. timeseries clustering with ARMA mixtures. Pattern Recognition 37 (8), 1675–1689.
[55] Oates, T., 1999. Identifying distinctive subsequences in multivariate timeseries by clustering. In Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp.322–326.
[56] Oates, T., Firoiu,L., Cohen,P.R., 1999. Clustering timeseries with Hidden Markov Models and dynamic time warping. In Proceedings of the International Joint Conference on Artificial Intelligence Workshop on Sequence Learning.
[57] Duan, J., Wang,W., Liu,B., Xue,Y., Zhou,H., Shi,B., 2005. Incorporating with recursive model training in timeseries clustering. In Proceedings of the Fifth International Conference on Computer and Information Technology, pp. 105–109.
[58] Geurts, P., 2001. Pattern extraction for timeseries classification. In Proceedings of the Fifth European Conference on Principles and Practice of Knowledge Discovery in Databases, pp.115–127.
[59] Zhang, H., Ho,T.B., Lin,M.S., 2004. A Non-parametric wavelet feature extractor for time-series classification. In Proceedings of the Eighth Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp.595–603.
[60] Zhang, T., Yue,D., Gu,Y., Yu,G., 2007. Boolean representation based data-adaptive correlation analysis over timeseries streams. In Proceedings of the 16th ACM Conference on Information and Knowledge Management, pp.203–212.
[61] Kadous, M.W., Sammut,C., 2005. Classification of multivariate timeseries and Structured data using constructive induction. Machine Learning, 58(2–3), 179–216
[62] Yang, K., Shahabi,C., 2005. A Multi level distance-based index structure for multivariate timeseries. In Proceedings of the 12th IEEE International Symposium on Temporal Representation and Reasoning, pp.65–73.
[63] Yang, K., Shahabi,C., 2005. On the stationarity of multivariate timeseries for correlation-based data analysis. In Proceedings of the Fifth IEEE International Conference on Data Mining, pp.805–808.
[64] Yang, K., Yoon,H., Shahabi,C., 2005. CLeVer: a feature subset selection technique for multivariate timeseries. In Proceedings of the Ninth Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp.516–522.
[65] Povinelli, J., Feng,X., 1999. Datamining of multiple nonstationary timeseries. In Proceedings of Artificial Neural Networks in Engineering, pp.511–516.
[66] Rodriguez, J.J., Alonso,C.J., 2004. Interval and dynamic time warpingbased decision trees. In Proceedings of the 2004 ACM Symposium on Applied Computing, pp.548–552.
[67] Kahveci, T., Singh,A.K., 2004. Optimizing similarity search of arbitrary length time series queries. IEEE Transactions on Knowledge and Data Engineering 16 (4), 418–433.
[68] Kahveci, T., Sing,A., Gurel,A., 2002. Similarity searching for multiattribute sequences. In Proceedings of the 14th International Conference on Scientific and Statistical Database Management, pp.175–186.
[69] Gilbert, A.C., Kotidis,Y., Muthukrishnan,S., Strauss,M.J., 2001. Surfing wavelets on streams: one-pass summaries for approximate aggregate queries. In Proceedings of the 27th International Conference on Very Large Databases, pp.79–88.
[70] Moses, Diana. A survey of data mining algorithms used in cardiovascular disease diagnosis from multi-lead ECG data. Kuwait Journal of Science 42.2 (2015).
[71] Moses, Diana, and C. Deisy. "m-CADE: A mobile based cardiovascular abnormality detection engine using efficient multidomain feature combinations." Intelligent Data Analysis20.3 (2016): 575-596.
[72] Deisy, C., Diana, M., Feature Selection for Nominal, Categorical and ECG Data, ISBN : 978-93-85977-78-7
[73] Gao, L., Wang,X.S., 2002. Continually evaluating similarity-based pattern queries on a Streaming timeseries .In Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 370–381.
[74] Gao, L., Yao,Z., Wang,X.S., 2002. Evaluating continuous nearest neighbour queries for streaming timeseries via pre-fetching. In Proceedings of the 11th ACM International Conference on Information and Knowledge Management, pp. 485–492.
[75] Vlachos, M., Gunopulos,D., Kollios,G., 2002. Discovering similar multidimensional trajectories. In Proceedings of the 18th IEEE International Conference on Data Engineering, pp.673.

Keywords
Time Series Analysis, Classification, Segmentation, Pattern Matching, Subsequence matching