Renat Sergazinov

SwitchTab: Switched Autoencoders Are Effective Tabular Learners

Jan 04, 2024
Jing Wu, Suiyao Chen, Qi Zhao, Renat Sergazinov, Chen Li, Shengjie Liu, Chongchao Zhao, Tianpei Xie, Hanqing Guo, Cheng Ji, Daniel Cociorva, Hakan Brunzel

Self-supervised representation learning methods have achieved significant success in computer vision and natural language processing, where data samples exhibit explicit spatial or semantic dependencies. However, applying these methods to tabular data is challenging due to the less pronounced dependencies among data samples. In this paper, we address this limitation by introducing SwitchTab, a novel self-supervised method specifically designed to capture latent dependencies in tabular data. SwitchTab leverages an asymmetric encoder-decoder framework to decouple mutual and salient features among data pairs, resulting in more representative embeddings. These embeddings, in turn, contribute to better decision boundaries and lead to improved results in downstream tasks. To validate the effectiveness of SwitchTab, we conduct extensive experiments across various domains involving tabular data. The results showcase superior performance in end-to-end prediction tasks with fine-tuning. Moreover, we demonstrate that pre-trained salient embeddings can be utilized as plug-and-play features to enhance the performance of various traditional classification methods (e.g., Logistic Regression and XGBoost). Lastly, we highlight the capability of SwitchTab to create explainable representations through visualization of decoupled mutual and salient features in the latent space.

* Association for the Advancement of Artificial Intelligence (AAAI), 2024


Gluformer: Transformer-Based Personalized Glucose Forecasting with Uncertainty Quantification

Sep 09, 2022
Renat Sergazinov, Mohammadreza Armandpour, Irina Gaynanova


Deep learning models achieve state-of-the-art results in predicting blood glucose trajectories, with a wide range of architectures being proposed. However, the adoption of such models in clinical practice is slow, largely due to the lack of uncertainty quantification of provided predictions. In this work, we propose to model the future glucose trajectory conditioned on the past as an infinite mixture of basis distributions (e.g., Gaussian or Laplace). This change allows us to learn the uncertainty and predict more accurately in cases when the trajectory has a heterogeneous or multi-modal distribution. To estimate the parameters of the predictive distribution, we utilize the Transformer architecture. We empirically demonstrate the superiority of our method over existing state-of-the-art techniques both in terms of accuracy and uncertainty on synthetic and benchmark glucose data sets.


Machine learning approach to force reconstruction in photoelastic materials

Oct 17, 2020
Renat Sergazinov, Miroslav Kramar


Photoelastic techniques have a long tradition in both qualitative and quantitative analysis of the stresses in granular materials. Over the last two decades, computational methods for reconstructing forces between particles from their photoelastic response have been developed by many different experimental teams. Unfortunately, all of these methods are computationally expensive. This limits their use for processing extensive data sets that capture the time evolution of granular ensembles consisting of a large number of particles. In this paper, we present a novel approach to this problem that leverages the power of convolutional neural networks to recognize complex spatial patterns. The main drawback of using neural networks is that training them usually requires a large labeled data set, which is hard to obtain experimentally. We show that this problem can be successfully circumvented by pretraining the networks on a large synthetic data set and then fine-tuning them on much smaller experimental data sets. Due to our current lack of experimental data, we demonstrate the potential of our method by changing the size of the considered particles, which alters the exhibited photoelastic patterns more than typical experimental errors.

* 20 pages, 6 figures, 2 tables; changed formatting of the tables; reduced picture resolutions
