[PDF] Outlier Detection For Temporal Data eBook

Outlier Detection For Temporal Data Book in PDF, ePub and Kindle version is available to download in english. Read online anytime anywhere directly from your device. Click on the download button below to get a free pdf file of Outlier Detection For Temporal Data book. This book definitely worth reading, it is an incredibly well-written.

Outlier Detection for Temporal Data

Author : Manish Gupta
Publisher : Springer Nature
Page : 110 pages
File Size : 34,30 MB
Release : 2022-06-01
Category : Computers
ISBN : 3031019059

GET BOOK

Outlier (or anomaly) detection is a very broad field which has been studied in the context of a large number of research areas like statistics, data mining, sensor networks, environmental science, distributed systems, spatio-temporal mining, etc. Initial research in outlier detection focused on time series-based outliers (in statistics). Since then, outlier detection has been studied on a large variety of data types including high-dimensional data, uncertain data, stream data, network data, time series data, spatial data, and spatio-temporal data. While there have been many tutorials and surveys for general outlier detection, we focus on outlier detection for temporal data in this book. A large number of applications generate temporal datasets. For example, in our everyday life, various kinds of records like credit, personnel, financial, judicial, medical, etc., are all temporal. This stresses the need for an organized and detailed study of outliers with respect to such temporal data. In the past decade, there has been a lot of research on various forms of temporal data including consecutive data snapshots, series of data snapshots and data streams. Besides the initial work on time series, researchers have focused on rich forms of data including multiple data streams, spatio-temporal data, network data, community distribution data, etc. Compared to general outlier detection, techniques for temporal outlier detection are very different. In this book, we will present an organized picture of both recent and past research in temporal outlier detection. We start with the basics and then ramp up the reader to the main ideas in state-of-the-art outlier detection techniques. We motivate the importance of temporal outlier detection and brief the challenges beyond usual outlier detection. Then, we list down a taxonomy of proposed techniques for temporal outlier detection. Such techniques broadly include statistical techniques (like AR models, Markov models, histograms, neural networks), distance- and density-based approaches, grouping-based approaches (clustering, community detection), network-based approaches, and spatio-temporal outlier detection approaches. We summarize by presenting a wide collection of applications where temporal outlier detection techniques have been applied to discover interesting outliers. Table of Contents: Preface / Acknowledgments / Figure Credits / Introduction and Challenges / Outlier Detection for Time Series and Data Sequences / Outlier Detection for Data Streams / Outlier Detection for Distributed Data Streams / Outlier Detection for Spatio-Temporal Data / Outlier Detection for Temporal Network Data / Applications of Outlier Detection for Temporal Data / Conclusions and Research Directions / Bibliography / Authors' Biographies

Outlier Detection for Temporal Data

Author : Manish Gupta
Publisher :
Page : 0 pages
File Size : 28,75 MB
Release : 2014-03
Category : Outliers (Statistics)
ISBN : 9781627053754

GET BOOK

Outlier (or anomaly) detection is a very broad field which has been studied in the context of a large number of research areas like statistics, data mining, sensor networks, environmental science, distributed systems, spatio-temporal mining, etc. Initial research in outlier detection focused on time series-based outliers (in statistics). Since then, outlier detection has been studied on a large variety of data types including high-dimensional data, uncertain data, stream data, network data, time series data, spatial data, and spatio-temporal data. While there have been many tutorials and surveys for general outlier detection, we focus on outlier detection for temporal data in this book. A large number of applications generate temporal datasets. For example, in our everyday life, various kinds of records like credit, personnel, financial, judicial, medical, etc., are all temporal. This stresses the need for an organized and detailed study of outliers with respect to such temporal data. In the past decade, there has been a lot of research on various forms of temporal data including consecutive data snapshots, series of data snapshots and data streams. Besides the initial work on time series, researchers have focused on rich forms of data including multiple data streams, spatio-temporal data, network data, community distribution data, etc. Compared to general outlier detection, techniques for temporal outlier detection are very different. In this book, we will present an organized picture of both recent and past research in temporal outlier detection. We start with the basics and then ramp up the reader to the main ideas in state-of-the-art outlier detection techniques. We motivate the importance of temporal outlier detection and brief the challenges beyond usual outlier detection. Then, we list down a taxonomy of proposed techniques for temporal outlier detection. Such techniques broadly include statistical techniques (like AR models, Markov models, histograms, neural networks), distance- and density-based approaches, grouping-based approaches (clustering, community detection), network-based approaches, and spatio-temporal outlier detection approaches. We summarize by presenting a wide collection of applications where temporal outlier detection techniques have been applied to discover interesting outliers.

Outlier Detection: Techniques and Applications

Author : N. N. R. Ranga Suri
Publisher : Springer
Page : 214 pages
File Size : 19,73 MB
Release : 2019-01-10
Category : Technology & Engineering
ISBN : 3030051277

GET BOOK

This book, drawing on recent literature, highlights several methodologies for the detection of outliers and explains how to apply them to solve several interesting real-life problems. The detection of objects that deviate from the norm in a data set is an essential task in data mining due to its significance in many contemporary applications. More specifically, the detection of fraud in e-commerce transactions and discovering anomalies in network data have become prominent tasks, given recent developments in the field of information and communication technologies and security. Accordingly, the book sheds light on specific state-of-the-art algorithmic approaches such as the community-based analysis of networks and characterization of temporal outliers present in dynamic networks. It offers a valuable resource for young researchers working in data mining, helping them understand the technical depth of the outlier detection problem and devise innovative solutions to address related challenges.

Outlier Analysis

Author : Charu C. Aggarwal
Publisher : Springer
Page : 481 pages
File Size : 43,45 MB
Release : 2016-12-10
Category : Computers
ISBN : 3319475789

GET BOOK

This book provides comprehensive coverage of the field of outlier analysis from a computer science point of view. It integrates methods from data mining, machine learning, and statistics within the computational framework and therefore appeals to multiple communities. The chapters of this book can be organized into three categories: Basic algorithms: Chapters 1 through 7 discuss the fundamental algorithms for outlier analysis, including probabilistic and statistical methods, linear methods, proximity-based methods, high-dimensional (subspace) methods, ensemble methods, and supervised methods. Domain-specific methods: Chapters 8 through 12 discuss outlier detection algorithms for various domains of data, such as text, categorical data, time-series data, discrete sequence data, spatial data, and network data. Applications: Chapter 13 is devoted to various applications of outlier analysis. Some guidance is also provided for the practitioner. The second edition of this book is more detailed and is written to appeal to both researchers and practitioners. Significant new material has been added on topics such as kernel methods, one-class support-vector machines, matrix factorization, neural networks, outlier ensembles, time-series methods, and subspace methods. It is written as a textbook and can be used for classroom teaching.

Outlier Ensembles

Author : Charu C. Aggarwal
Publisher : Springer
Page : 288 pages
File Size : 31,40 MB
Release : 2017-04-06
Category : Computers
ISBN : 3319547658

GET BOOK

This book discusses a variety of methods for outlier ensembles and organizes them by the specific principles with which accuracy improvements are achieved. In addition, it covers the techniques with which such methods can be made more effective. A formal classification of these methods is provided, and the circumstances in which they work well are examined. The authors cover how outlier ensembles relate (both theoretically and practically) to the ensemble techniques used commonly for other data mining problems like classification. The similarities and (subtle) differences in the ensemble techniques for the classification and outlier detection problems are explored. These subtle differences do impact the design of ensemble algorithms for the latter problem. This book can be used for courses in data mining and related curricula. Many illustrative examples and exercises are provided in order to facilitate classroom teaching. A familiarity is assumed to the outlier detection problem and also to generic problem of ensemble analysis in classification. This is because many of the ensemble methods discussed in this book are adaptations from their counterparts in the classification domain. Some techniques explained in this book, such as wagging, randomized feature weighting, and geometric subsampling, provide new insights that are not available elsewhere. Also included is an analysis of the performance of various types of base detectors and their relative effectiveness. The book is valuable for researchers and practitioners for leveraging ensemble methods into optimal algorithmic design.

Anomaly Detection In Temporal Data Mining

Author : Mehmet Yavuz Onat
Publisher : LAP Lambert Academic Publishing
Page : 72 pages
File Size : 31,19 MB
Release : 2015-12-09
Category :
ISBN : 9783659797491

GET BOOK

Temporal data mining is a title for data mining techniques executed over temporal data. The major goals of temporal data mining are; indexing, clustering, classification, prediction, summarization, anomaly detection and segmentation. In temporal data, anomaly detection or novelty detection is the identification of interesting patterns. Several anomaly detection algorithms have been proposed in the literature. However, there are limited number of studies that compare these methods. In this study, Heuristically Ordered Time series using Symbolic Aggregate Approximation (HOT-SAX), Pattern Anomaly Value (PAV), Wavelet and Augmented Trie (WAT) and Multi-Scale Abnormal Pattern Detection Algorithm (MPAV) anomaly detection methods were compared by using synthetic and real temporal data sets. Also, temporal data representation techniques were compared in terms of anomaly detection. R statistical programming language was used for analysis.

Outlier Analysis

Author : Charu C. Aggarwal
Publisher : Springer Science & Business Media
Page : 457 pages
File Size : 24,87 MB
Release : 2013-01-11
Category : Computers
ISBN : 1461463963

GET BOOK

With the increasing advances in hardware technology for data collection, and advances in software technology (databases) for data organization, computer scientists have increasingly participated in the latest advancements of the outlier analysis field. Computer scientists, specifically, approach this field based on their practical experiences in managing large amounts of data, and with far fewer assumptions– the data can be of any type, structured or unstructured, and may be extremely large. Outlier Analysis is a comprehensive exposition, as understood by data mining experts, statisticians and computer scientists. The book has been organized carefully, and emphasis was placed on simplifying the content, so that students and practitioners can also benefit. Chapters will typically cover one of three areas: methods and techniques commonly used in outlier analysis, such as linear methods, proximity-based methods, subspace methods, and supervised methods; data domains, such as, text, categorical, mixed-attribute, time-series, streaming, discrete sequence, spatial and network data; and key applications of these methods as applied to diverse domains such as credit card fraud detection, intrusion detection, medical diagnosis, earth science, web log analytics, and social network analysis are covered.

Tensor-based Spatio-temporal Outlier Detection in Large Datasets

Author : Yanan Sun
Publisher :
Page : 322 pages
File Size : 13,22 MB
Release : 2014
Category :
ISBN :

GET BOOK

Spatio-Temporal data is inherently large since each spatial node has spatial attributes and may also be associated with large amounts of measurement data captured over time. In such large and multi-dimensional data identifying anomalies can be a challenge due to the massive data size and relationships among spatial objects. Discovering anomalies in spatio-temporal data is relevant in several domains, such as detecting rare disease outbreaks, detecting oil spills, discovering regions with highway traffic congestion. Most existing techniques for discovering anomalies in spatio-temporal data may find the spatial outliers first and then identify the spatio-temporal anomalies from the data of that specific spatial location. Alternatively, some approaches may discover anomalous time periods and then discover the unusual spatial location in them. This may lead to identifying incorrect spatio-temporal outliers or missing important spatio-temporal phenomena due to the elimination of information after each step. Thus, there is a need to address capturing both space and time simultaneously. A tensor is a multi-dimensional array. It is considered as a powerful tool to manipulate multi-dimensional and multi-variate data. It has a concise mathematical framework for formulating and solving complex data problems efficiently. Tensors can handle complex relationship in spatio-temporal data and their mathematical framework can help us detect spatio-temporal outliers in an effective and efficient manner. Tensors multiplication can integrate spatial and temporal aspects in the data at the same time. In this dissertation, we present our novel approach addressing the key limitation of existing spatio-temporal outlier detection methods by using an efficient tensor based model that supports complex relationships in spatio-temporal data to detect outliers by looking at space and time simultaneously as well as handling the scalability issue when it comes to manipulating large datasets. In this dissertation, we are going to present our novel spatio-temporal tensor model. Based on the spatio-temporal tensor model, we present our clustering-based neighborhood discovery algorithm, neighborhood-based spatial, and spatio-temporal outlier detection algorithms to discover different types of spatio-temporal outliers namely point based and window based outliers. We discuss detailed experimental results for each of the algorithms proposed and also present comparative results.

Identification of Outliers

Author : D. Hawkins
Publisher : Springer Science & Business Media
Page : 194 pages
File Size : 42,35 MB
Release : 2013-04-17
Category : Science
ISBN : 9401539944

GET BOOK

The problem of outliers is one of the oldest in statistics, and during the last century and a half interest in it has waxed and waned several times. Currently it is once again an active research area after some years of relative neglect, and recent work has solved a number of old problems in outlier theory, and identified new ones. The major results are, however, scattered amongst many journal articles, and for some time there has been a clear need to bring them together in one place. That was the original intention of this monograph: but during execution it became clear that the existing theory of outliers was deficient in several areas, and so the monograph also contains a number of new results and conjectures. In view of the enormous volume ofliterature on the outlier problem and its cousins, no attempt has been made to make the coverage exhaustive. The material is concerned almost entirely with the use of outlier tests that are known (or may reasonably be expected) to be optimal in some way. Such topics as robust estimation are largely ignored, being covered more adequately in other sources. The numerous ad hoc statistics proposed in the early work on the grounds of intuitive appeal or computational simplicity also are not discussed in any detail.