Congrui Huang

NuTime: Numerically Multi-Scaled Embedding for Large-Scale Time Series Pretraining

Oct 12, 2023
Chenguo Lin, Xumeng Wen, Wei Cao, Congrui Huang, Jiang Bian, Stephen Lin, Zhirong Wu

Recent research on time-series self-supervised models shows great promise in learning semantic representations. However, it has been limited to small-scale datasets, e.g., thousands of temporal sequences. In this work, we make key technical contributions that are tailored to the numerical properties of time-series data and allow the model to scale to large datasets, e.g., millions of temporal sequences. We adopt the Transformer architecture by first partitioning the input into non-overlapping windows. Each window is then characterized by its normalized shape and two scalar values denoting the mean and standard deviation within each window. To embed scalar values that may possess arbitrary numerical scales to high-dimensional vectors, we propose a numerically multi-scaled embedding module enumerating all possible scales for the scalar values. The model undergoes pretraining using the proposed numerically multi-scaled embedding with a simple contrastive objective on a large-scale dataset containing over a million sequences. We study its transfer performance on a number of univariate and multivariate classification benchmarks. Our method exhibits remarkable improvement against previous representation learning approaches and establishes the new state of the art, even compared with domain-specific non-learning-based methods.
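The windowing step described in the abstract (non-overlapping windows, each summarized by a normalized shape plus its mean and standard deviation) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function name `window_features`, the use of the population standard deviation, the `eps` stabilizer, and the choice to drop a trailing partial window are all assumptions. The multi-scaled embedding of the two scalars is not shown.

```python
import statistics


def window_features(series, window_size, eps=1e-8):
    """Split a series into non-overlapping windows and describe each one.

    Returns a list of (shape, mean, std) tuples, where `shape` is the
    window after subtracting its mean and dividing by its std.
    Assumption: a trailing window shorter than `window_size` is dropped.
    """
    features = []
    for start in range(0, len(series) - window_size + 1, window_size):
        window = series[start:start + window_size]
        mean = statistics.fmean(window)
        std = statistics.pstdev(window)  # population std (an assumption)
        # eps avoids division by zero for constant windows
        shape = [(v - mean) / (std + eps) for v in window]
        features.append((shape, mean, std))
    return features


# Two windows with the same shape but different numerical scales.
feats = window_features([1.0, 2.0, 3.0, 10.0, 20.0, 30.0], window_size=3)
```

In this example the two windows differ by a factor of ten in scale, so their normalized shapes nearly coincide while the mean and standard deviation carry the scale information separately.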

Protecting the Future: Neonatal Seizure Detection with Spatial-Temporal Modeling

Jul 02, 2023
Ziyue Li, Yuchen Fang, You Li, Kan Ren, Yansen Wang, Xufang Luo, Juanyong Duan, Congrui Huang, Dongsheng Li, Lili Qiu

Timely detection of seizures in newborn infants with electroencephalogram (EEG) has been a common yet life-saving practice in the Neonatal Intensive Care Unit (NICU). However, it requires great human effort for real-time monitoring, which calls for automated solutions to neonatal seizure detection. Moreover, current automated methods, which focus on adult epilepsy monitoring, often fail due to (i) dynamic seizure onset location in human brains; (ii) different montages on neonates; and (iii) huge distribution shift among different subjects. In this paper, we propose a deep learning framework, namely STATENet, to address these specific challenges with dedicated designs at the temporal, spatial and model levels. Experiments on a real-world large-scale neonatal EEG dataset illustrate that our framework achieves significantly better seizure detection performance.

* Accepted in IEEE International Conference on Systems, Man, and Cybernetics (SMC) 2023

Label-Efficient Interactive Time-Series Anomaly Detection

Dec 30, 2022
Hong Guo, Yujing Wang, Jieyu Zhang, Zhengjie Lin, Yunhai Tong, Lei Yang, Luoxing Xiong, Congrui Huang

Time-series anomaly detection is an important task and has been widely applied in industry. Since manual data annotation is expensive and inefficient, most applications adopt unsupervised anomaly detection methods, but the results are usually sub-optimal and unsatisfactory to end customers. Weak supervision is a promising paradigm for obtaining a considerable number of labels at low cost, which enables customers to label data by writing heuristic rules rather than annotating each instance individually. However, in the time-series domain, it is hard for people to write reasonable labeling functions, as time-series data is numerically continuous and difficult to understand. In this paper, we propose a Label-Efficient Interactive Time-Series Anomaly Detection (LEIAD) system, which enables a user to improve the results of unsupervised anomaly detection by performing only a small number of interactions with the system. To achieve this goal, the system integrates weak supervision and active learning collaboratively while generating labeling functions automatically using only a few labeled data. All of these techniques are complementary and can promote each other in a reinforced manner. We conduct experiments on three time-series anomaly detection datasets, demonstrating that the proposed system is superior to existing solutions in both weak supervision and active learning areas. Also, the system has been tested in a real scenario in industry to show its practicality.

Learning Timestamp-Level Representations for Time Series with Hierarchical Contrastive Loss

Jun 19, 2021
Zhihan Yue, Yujing Wang, Juanyong Duan, Tianmeng Yang, Congrui Huang, Bixiong Xu

This paper presents TS2Vec, a universal framework for learning timestamp-level representations of time series. Unlike existing methods, TS2Vec performs timestamp-wise discrimination, which learns a contextual representation vector directly for each timestamp. We find that the learned representations have superior predictive ability. A linear regression trained on top of the learned representations outperforms previous SOTAs for supervised time series forecasting. Also, the instance-level representations can be simply obtained by applying a max pooling layer on top of learned representations of all timestamps. We conduct extensive experiments on time series classification tasks to evaluate the quality of instance-level representations. As a result, TS2Vec achieves significant improvement compared with existing SOTAs of unsupervised time series representation on 125 UCR datasets and 29 UEA datasets. The source code is publicly available at https://github.com/yuezhihan/ts2vec.
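The max-pooling step mentioned in the abstract, which turns per-timestamp representations into a single instance-level vector, can be illustrated with a short sketch. The function name `instance_representation` and the list-of-lists input layout (one vector per timestamp) are assumptions for illustration; this is not the API of the linked repository.

```python
def instance_representation(timestamp_reprs):
    """Max-pool over the time axis.

    `timestamp_reprs` is a list with one representation vector per
    timestamp; the result keeps, for each dimension, the largest value
    seen across all timestamps.
    """
    # zip(*...) regroups the values by dimension instead of by timestamp
    return [max(dim_values) for dim_values in zip(*timestamp_reprs)]


# Three timestamps, two representation dimensions.
pooled = instance_representation([[1.0, 5.0], [3.0, 2.0], [0.0, 4.0]])
print(pooled)  # [3.0, 5.0]
```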

* 20 pages, 6 figures

Multivariate Time-series Anomaly Detection via Graph Attention Network

Sep 04, 2020
Hang Zhao, Yujing Wang, Juanyong Duan, Congrui Huang, Defu Cao, Yunhai Tong, Bixiong Xu, Jing Bai, Jie Tong, Qi Zhang

Anomaly detection on multivariate time-series is of great importance in both data mining research and industrial applications. Recent approaches have achieved significant progress in this topic, but there are remaining limitations. One major limitation is that they do not capture the relationships between different time-series explicitly, resulting in inevitable false alarms. In this paper, we propose a novel self-supervised framework for multivariate time-series anomaly detection to address this issue. Our framework considers each univariate time-series as an individual feature and includes two graph attention layers in parallel to learn the complex dependencies of multivariate time-series in both temporal and feature dimensions. In addition, our approach jointly optimizes a forecasting-based model and a reconstruction-based model, obtaining better time-series representations through a combination of single-timestamp prediction and reconstruction of the entire time-series. We demonstrate the efficacy of our model through extensive experiments. The proposed method outperforms other state-of-the-art models on three real-world datasets. Further analysis shows that our method has good interpretability and is useful for anomaly diagnosis.

* Accepted by ICDM 2020. 10 pages

Automated Model Selection for Time-Series Anomaly Detection

Aug 25, 2020
Yuanxiang Ying, Juanyong Duan, Chunlei Wang, Yujing Wang, Congrui Huang, Bixiong Xu

Time-series anomaly detection is a popular topic in both academia and industry. Many companies need to monitor thousands of temporal signals for their applications and services and require instant feedback and timely alerts for potential incidents. The task is challenging because of the complex characteristics of time-series, which are messy, stochastic, and often without proper labels. The lack of labels prohibits training supervised models, and a single model hardly fits different time series. In this paper, we propose a solution to address these issues. We present an automated model selection framework to automatically find the most suitable detection model with proper parameters for the incoming data. The model selection layer is extensible as it can be updated without too much effort when a new detector is available to the service. Finally, we incorporate a customized tuning algorithm to flexibly filter anomalies to meet customers' criteria. Experiments on real-world datasets show the effectiveness of our solution.

Time-Series Anomaly Detection Service at Microsoft

Jun 10, 2019
Hansheng Ren, Bixiong Xu, Yujing Wang, Chao Yi, Congrui Huang, Xiaoyu Kou, Tony Xing, Mao Yang, Jie Tong, Qi Zhang

Large companies need to monitor various metrics (for example, Page Views and Revenue) of their applications and services in real time. At Microsoft, we develop a time-series anomaly detection service which helps customers monitor their time-series continuously and alerts them to potential incidents in time. In this paper, we introduce the pipeline and algorithm of our anomaly detection service, which is designed to be accurate, efficient and general. The pipeline consists of three major modules: data ingestion, experimentation platform and online compute. To tackle the problem of time-series anomaly detection, we propose a novel algorithm based on Spectral Residual (SR) and Convolutional Neural Network (CNN). Our work is the first attempt to borrow the SR model from the visual saliency detection domain for time-series anomaly detection. Moreover, we innovatively combine SR and CNN to improve the performance of the SR model. Our approach achieves superior experimental results compared with state-of-the-art baselines on both public datasets and Microsoft production data.
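For readers unfamiliar with Spectral Residual, the saliency map can be sketched as it is commonly formulated: take the log-amplitude spectrum, subtract its local average, recombine the residual with the original phase, and invert the transform. The code below is a rough illustration under stated assumptions, not the service's implementation: the naive O(n^2) DFT, the centered moving average with truncated edges, the filter width `q`, the `eps` stabilizer, and all function names are hypothetical choices. The CNN stage described in the abstract is not shown.

```python
import cmath
import math


def dft(values, inverse=False):
    """Naive discrete Fourier transform (O(n^2)), for illustration only."""
    n = len(values)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        total = sum(
            values[t] * cmath.exp(sign * 2j * math.pi * k * t / n)
            for t in range(n)
        )
        out.append(total / n if inverse else total)
    return out


def moving_average(values, q):
    """Centered moving average of width q; windows are truncated at the edges."""
    half = q // 2
    out = []
    for i in range(len(values)):
        lo, hi = max(0, i - half), min(len(values), i + half + 1)
        out.append(sum(values[lo:hi]) / (hi - lo))
    return out


def spectral_residual_saliency(series, q=3, eps=1e-8):
    """Return one non-negative saliency value per input point."""
    spectrum = dft(series)
    log_amplitude = [math.log(abs(c) + eps) for c in spectrum]
    phase = [cmath.phase(c) for c in spectrum]
    # Spectral residual: log amplitude minus its local average
    smoothed = moving_average(log_amplitude, q)
    residual = [l - s for l, s in zip(log_amplitude, smoothed)]
    # Recombine residual amplitude with the original phase, then invert
    rebuilt = [cmath.exp(complex(r, p)) for r, p in zip(residual, phase)]
    return [abs(c) for c in dft(rebuilt, inverse=True)]


# A flat series with one injected spike (synthetic example data).
series = [1.0] * 16
series[8] = 10.0
saliency = spectral_residual_saliency(series)
```

In practice one would threshold the saliency values, or pass them to a learned discriminator as the abstract describes, to decide which points count as anomalies.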

* KDD 2019
