Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Fangzheng Lin

Institute of Construction Informatics, Dresden University of Technology, Deep Learning Center, Changzhou Microintelligence Co., Ltd

Multistage Spatial Context Models for Learned Image Compression

Feb 18, 2023
Fangzheng Lin, Heming Sun, Jinming Liu, Jiro Katto

Figure 1 for Multistage Spatial Context Models for Learned Image Compression

Figure 2 for Multistage Spatial Context Models for Learned Image Compression

Figure 3 for Multistage Spatial Context Models for Learned Image Compression

Figure 4 for Multistage Spatial Context Models for Learned Image Compression

Recent state-of-the-art Learned Image Compression methods feature spatial context models, achieving great rate-distortion improvements over hyperprior methods. However, the autoregressive context model requires serial decoding, limiting runtime performance. The Checkerboard context model allows parallel decoding at a cost of reduced RD performance. We present a series of multistage spatial context models allowing both fast decoding and better RD performance. We split the latent space into square patches and decode serially within each patch while different patches are decoded in parallel. The proposed method features a comparable decoding speed to Checkerboard while reaching the RD performance of Autoregressive and even also outperforming Autoregressive. Inside each patch, the decoding order must be carefully decided as a bad order negatively impacts performance; therefore, we also propose a decoding order optimization algorithm.

* Accepted to IEEE ICASSP 2023

Via

Access Paper or Ask Questions

A comparative study of attention mechanism and generative adversarial network in facade damage segmentation

Sep 27, 2022
Fangzheng Lin, Jiesheng Yang, Jiangpeng Shu, Raimar J. Scherer

Figure 1 for A comparative study of attention mechanism and generative adversarial network in facade damage segmentation

Figure 2 for A comparative study of attention mechanism and generative adversarial network in facade damage segmentation

Figure 3 for A comparative study of attention mechanism and generative adversarial network in facade damage segmentation

Figure 4 for A comparative study of attention mechanism and generative adversarial network in facade damage segmentation

Semantic segmentation profits from deep learning and has shown its possibilities in handling the graphical data from the on-site inspection. As a result, visual damage in the facade images should be detected. Attention mechanism and generative adversarial networks are two of the most popular strategies to improve the quality of semantic segmentation. With specific focuses on these two strategies, this paper adopts U-net, a representative convolutional neural network, as the primary network and presents a comparative study in two steps. First, cell images are utilized to respectively determine the most effective networks among the U-nets with attention mechanism or generative adversarial networks. Subsequently, selected networks from the first test and their combination are applied for facade damage segmentation to investigate the performances of these networks. Besides, the combined effect of the attention mechanism and the generative adversarial network is discovered and discussed.

Via

Access Paper or Ask Questions

Streaming-capable High-performance Architecture of Learned Image Compression Codecs

Aug 02, 2022
Fangzheng Lin, Heming Sun, Jiro Katto

Figure 1 for Streaming-capable High-performance Architecture of Learned Image Compression Codecs

Figure 2 for Streaming-capable High-performance Architecture of Learned Image Compression Codecs

Figure 3 for Streaming-capable High-performance Architecture of Learned Image Compression Codecs

Figure 4 for Streaming-capable High-performance Architecture of Learned Image Compression Codecs

Learned image compression allows achieving state-of-the-art accuracy and compression ratios, but their relatively slow runtime performance limits their usage. While previous attempts on optimizing learned image codecs focused more on the neural model and entropy coding, we present an alternative method to improving the runtime performance of various learned image compression models. We introduce multi-threaded pipelining and an optimized memory model to enable GPU and CPU workloads asynchronous execution, fully taking advantage of computational resources. Our architecture alone already produces excellent performance without any change to the neural model itself. We also demonstrate that combining our architecture with previous tweaks to the neural models can further improve runtime performance. We show that our implementations excel in throughput and latency compared to the baseline and demonstrate the performance of our implementations by creating a real-time video streaming encoder-decoder sample application, with the encoder running on an embedded device.

* Accepted to IEEE ICIP 2022

Via

Access Paper or Ask Questions

Fast Crack Detection Using Convolutional Neural Network

May 23, 2021
Jiesheng Yang, Fangzheng Lin, Yusheng Xiang, Peter Katranuschkov, Raimar J. Scherer

Figure 1 for Fast Crack Detection Using Convolutional Neural Network

Figure 2 for Fast Crack Detection Using Convolutional Neural Network

Figure 3 for Fast Crack Detection Using Convolutional Neural Network

Figure 4 for Fast Crack Detection Using Convolutional Neural Network

To improve the efficiency and reduce the labour cost of the renovation process, this study presents a lightweight Convolutional Neural Network (CNN)-based architecture to extract crack-like features, such as cracks and joints. Moreover, Transfer Learning (TF) method was used to save training time while offering comparable prediction results. For three different objectives: 1) Detection of the concrete cracks; 2) Detection of natural stone cracks; 3) Differentiation between joints and cracks in natural stone; We built a natural stone dataset with joints and cracks information as complementary for the concrete benchmark dataset. As the results show, our model is demonstrated as an effective tool for industry use.

* 10 pages, 11 figures

Via

Access Paper or Ask Questions

Crack Semantic Segmentation using the U-Net with Full Attention Strategy

Apr 29, 2021
Fangzheng Lin, Jiesheng Yang, Jiangpeng Shu, Raimar J. Scherer

Figure 1 for Crack Semantic Segmentation using the U-Net with Full Attention Strategy

Figure 2 for Crack Semantic Segmentation using the U-Net with Full Attention Strategy

Figure 3 for Crack Semantic Segmentation using the U-Net with Full Attention Strategy

Figure 4 for Crack Semantic Segmentation using the U-Net with Full Attention Strategy

Structures suffer from the emergence of cracks, therefore, crack detection is always an issue with much concern in structural health monitoring. Along with the rapid progress of deep learning technology, image semantic segmentation, an active research field, offers another solution, which is more effective and intelligent, to crack detection Through numerous artificial neural networks have been developed to address the preceding issue, corresponding explorations are never stopped improving the quality of crack detection. This paper presents a novel artificial neural network architecture named Full Attention U-net for image semantic segmentation. The proposed architecture leverages the U-net as the backbone and adopts the Full Attention Strategy, which is a synthesis of the attention mechanism and the outputs from each encoding layer in skip connection. Subject to the hardware in training, the experiments are composed of verification and validation. In verification, 4 networks including U-net, Attention U-net, Advanced Attention U-net, and Full Attention U-net are tested through cell images for a competitive study. With respect to mean intersection-over-unions and clarity of edge identification, the Full Attention U-net performs best in verification, and is hence applied for crack semantic segmentation in validation to demonstrate its effectiveness.

Via

Access Paper or Ask Questions