Yuxin Huang

Generation and Recombination for Multifocus Image Fusion with Free Number of Inputs

Sep 09, 2023
Huafeng Li, Dan Wang, Yuxin Huang, Yafei Zhang, Zhengtao Yu

Multifocus image fusion is an effective way to overcome the limitations of optical lenses. Many existing methods obtain fused results by generating decision maps. However, such methods often assume that the focused areas of the two source images are complementary, making it impossible to fuse multiple images simultaneously. Additionally, existing methods ignore the impact of hard pixels on fusion performance, which limits the visual quality of the fused image. To address these issues, a model combining generation and recombination, termed GRFusion, is proposed. In GRFusion, the focus property of each source image can be detected independently, enabling simultaneous fusion of multiple source images and avoiding the information loss caused by alternating fusion. This makes GRFusion independent of the number of inputs. To distinguish hard pixels in the source images, we identify them by considering the inconsistency among the focus-area detection results of the source images. Furthermore, a multi-directional gradient embedding method for generating full-focus images is proposed. Subsequently, a hard-pixel-guided recombination mechanism for constructing the fused result is devised, effectively integrating the complementary advantages of feature-reconstruction-based and focused-pixel-recombination-based methods. Extensive experimental results demonstrate the effectiveness and superiority of the proposed method. The source code will be released on https://github.com/xxx/xxx.
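The idea of flagging hard pixels from inconsistent focus detections can be sketched as follows. This is only an illustration and not the GRFusion implementation: the rule used here (a pixel is consistent when exactly one source image claims it as focused) and the name `hard_pixel_mask` are assumptions made for the example, and the abstract does not specify the actual criterion.

```python
from typing import List

Mask = List[List[int]]  # 1 = detected as in focus, 0 = detected as out of focus


def hard_pixel_mask(focus_masks: List[Mask]) -> Mask:
    """Flag pixels where the per-image focus detections disagree.

    Assumption (not from the paper): a pixel is consistent when exactly
    one source image claims it as focused; any other vote count is "hard".
    """
    if not focus_masks:
        raise ValueError("need at least one focus mask")
    height, width = len(focus_masks[0]), len(focus_masks[0][0])
    hard = [[0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            # Count how many source images claim this pixel as focused.
            votes = sum(mask[y][x] for mask in focus_masks)
            hard[y][x] = 0 if votes == 1 else 1
    return hard


# Three hypothetical 2x3 detection maps, one per source image.
masks = [
    [[1, 0, 0], [1, 1, 0]],
    [[0, 1, 0], [0, 1, 0]],
    [[0, 0, 0], [0, 0, 1]],
]
hard = hard_pixel_mask(masks)
```

Because each mask is computed per image and only combined at the voting step, the sketch works for any number of inputs, which mirrors the property the abstract emphasizes.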

MARS: An Instance-aware, Modular and Realistic Simulator for Autonomous Driving

Jul 27, 2023
Zirui Wu, Tianyu Liu, Liyi Luo, Zhide Zhong, Jianteng Chen, Hongmin Xiao, Chao Hou, Haozhe Lou, Yuantao Chen, Runyi Yang, Yuxin Huang, Xiaoyu Ye, Zike Yan, Yongliang Shi, Yiyi Liao, Hao Zhao

Nowadays, autonomous cars can drive smoothly in ordinary cases, and it is widely recognized that realistic sensor simulation will play a critical role in solving the remaining corner cases by simulating them. To this end, we propose an autonomous driving simulator based on neural radiance fields (NeRFs). Compared with existing works, ours has three notable features: (1) Instance-aware. Our simulator models the foreground instances and background environments separately with independent networks so that the static (e.g., size and appearance) and dynamic (e.g., trajectory) properties of instances can be controlled separately. (2) Modular. Our simulator allows flexible switching between different modern NeRF-related backbones, sampling strategies, input modalities, etc. We expect this modular design to boost academic progress and industrial deployment of NeRF-based autonomous driving simulation. (3) Realistic. Our simulator sets new state-of-the-art photo-realism results given the best module selection. Our simulator will be open-sourced while most of our counterparts are not. Project page: https://open-air-sun.github.io/mars/.

* CICAI 2023, project page with code: https://open-air-sun.github.io/mars/

Modeling Task Relationships in Multi-variate Soft Sensor with Balanced Mixture-of-Experts

May 25, 2023
Yuxin Huang, Hao Wang, Zhaoran Liu, Licheng Pan, Haozhe Li, Xinggao Liu

Accurate estimation of multiple quality variables is critical for building industrial soft sensor models, which have long been confronted with data efficiency and negative transfer issues. Methods that share backbone parameters among tasks address the data efficiency issue; however, they still fail to mitigate the negative transfer problem. To address this issue, a balanced Mixture-of-Experts (BMoE) is proposed in this work, which consists of a multi-gate mixture of experts (MMoE) module and a task gradient balancing (TGB) module. The MMoE module aims to portray task relationships, while the TGB module dynamically balances the gradients among tasks. The two modules cooperate to mitigate the negative transfer problem. Experiments on a typical sulfur recovery unit demonstrate that BMoE models task relationships and balances the training process effectively, and achieves significantly better performance than baseline models.
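A multi-gate mixture of experts can be sketched in a few lines. This is a minimal illustration of the general MMoE forward pass and not the BMoE code: the experts are plain linear maps for brevity, all names (`mmoe_forward`, `expert_weights`, `gate_weights`) are hypothetical, and the task gradient balancing module is not covered.

```python
import math
from typing import List

Vector = List[float]
Matrix = List[List[float]]


def matvec(w: Matrix, x: Vector) -> Vector:
    """Multiply matrix w (rows x cols) by vector x."""
    return [sum(wi * xi for wi, xi in zip(row, x)) for row in w]


def softmax(z: Vector) -> Vector:
    """Numerically stable softmax."""
    m = max(z)
    exps = [math.exp(v - m) for v in z]
    total = sum(exps)
    return [e / total for e in exps]


def mmoe_forward(x: Vector, expert_weights: List[Matrix],
                 gate_weights: List[Matrix]) -> List[Vector]:
    """Return one mixed feature vector per task.

    Experts are shared by all tasks; each task owns a gate that
    produces softmax weights over the experts.
    """
    expert_outputs = [matvec(w, x) for w in expert_weights]
    dim = len(expert_outputs[0])
    task_features = []
    for gate in gate_weights:  # one gate matrix per task
        mix = softmax(matvec(gate, x))  # one weight per expert
        task_features.append([
            sum(mix[e] * expert_outputs[e][d] for e in range(len(expert_outputs)))
            for d in range(dim)
        ])
    return task_features


# Two hypothetical experts (identity and coordinate swap) and two task gates.
x = [1.0, 2.0]
experts = [[[1.0, 0.0], [0.0, 1.0]], [[0.0, 1.0], [1.0, 0.0]]]
gates = [[[0.0, 0.0], [0.0, 0.0]],    # task 0: uniform mix of both experts
         [[10.0, 0.0], [0.0, 0.0]]]   # task 1: strongly prefers expert 0
features = mmoe_forward(x, experts, gates)
```

Because each task has its own gate, related tasks can weight the shared experts similarly while unrelated tasks diverge, which is how this family of models represents task relationships.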

AttentionMixer: An Accurate and Interpretable Framework for Process Monitoring

Feb 21, 2023
Hao Wang, Zhiyu Wang, Yunlong Niu, Zhaoran Liu, Haozhe Li, Yilin Liao, Yuxin Huang, Xinggao Liu

An accurate and explainable automatic monitoring system is critical for the safety of high-efficiency energy conversion plants that operate under extreme working conditions. Nonetheless, currently available data-driven monitoring systems often fall short of the requirements for either high accuracy or interpretability, which hinders their application in practice. To overcome this limitation, a data-driven approach, AttentionMixer, is proposed under a generalized message passing framework, with the goal of establishing an accurate and interpretable radiation monitoring framework for energy conversion plants. To improve model accuracy, the first technical contribution is the development of spatial and temporal adaptive message passing blocks, which capture spatial and temporal correlations, respectively; the two blocks are cascaded through a mixing operator. To enhance model interpretability, the second technical contribution is a sparse message passing regularizer, which eliminates spurious and noisy message passing routes. The effectiveness of the AttentionMixer approach is validated through extensive evaluations on a monitoring benchmark collected from the national radiation monitoring network for nuclear power plants, resulting in enhanced monitoring accuracy and interpretability in practice.

ConchShell: A Generative Adversarial Networks that Turns Pictures into Piano Music

Oct 11, 2022
Wanpeng Fan, Yuanzhi Su, Yuxin Huang

We present ConchShell, a multi-modal generative adversarial framework that takes pictures as input and generates piano music samples that match the picture context. Inspired by I3D, we introduce a novel image feature representation method: time-convolutional neural network (TCNN), which is used to forge features for images in the temporal dimension. Although our image data consists of only six categories, our proposed framework will be innovative and commercially meaningful. The project will provide technical ideas for work such as 3D game voice-overs, short-video soundtracks, and real-time generation of metaverse background music. We have also released a new dataset, the Beach-Ocean-Piano Dataset (BOPD), which contains more than 3,000 images and more than 1,500 piano pieces. This dataset will support multimodal image-to-music research.

* 5 pages

Intellectual Property Evaluation Utilizing Machine Learning

Aug 18, 2022
Jinxin Ding, Yuxin Huang, Keyang Ni, Xueyao Wang, Yinxiao Wang, Yucheng Wang

Intellectual property is increasingly important in economic development. To address the pain points of traditional methods in IP evaluation, we are developing a new technology with machine learning at its core. We have built an online platform and plan to expand our business in the Greater Bay Area.

* 5 pages, 2 figures

PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit

May 20, 2022
Hui Zhang, Tian Yuan, Junkun Chen, Xintong Li, Renjie Zheng, Yuxin Huang, Xiaojie Chen, Enlei Gong, Zeyu Chen, Xiaoguang Hu, Dianhai Yu, Yanjun Ma, Liang Huang

PaddleSpeech is an open-source all-in-one speech toolkit. It aims to facilitate the development and research of speech processing technologies by providing an easy-to-use command-line interface and a simple code structure. This paper describes the design philosophy and core architecture of PaddleSpeech to support several essential speech-to-text and text-to-speech tasks. PaddleSpeech achieves competitive or state-of-the-art performance on various speech datasets and implements the most popular methods. It also provides recipes and pretrained models to quickly reproduce the experimental results in this paper. PaddleSpeech is publicly available at https://github.com/PaddlePaddle/PaddleSpeech.

Towards Understanding Gender Bias in Relation Extraction

Nov 09, 2019
Andrew Gaut, Tony Sun, Shirlyn Tang, Yuxin Huang, Jing Qian, Mai ElSherief, Jieyu Zhao, Diba Mirza, Elizabeth Belding, Kai-Wei Chang, William Yang Wang

Recent developments in Neural Relation Extraction (NRE) have made significant strides towards Automated Knowledge Base Construction (AKBC). While much attention has been dedicated to improvements in accuracy, to our knowledge there have been no attempts in the literature to evaluate social biases in NRE systems. We create WikiGenderBias, a distantly supervised dataset with a human-annotated test set. WikiGenderBias has sentences specifically curated to analyze gender bias in relation extraction systems. We use WikiGenderBias to evaluate systems for bias, find that NRE systems exhibit gender-biased predictions, and lay the groundwork for future evaluation of bias in NRE. We also analyze how name anonymization, hard debiasing for word embeddings, and counterfactual data augmentation affect gender bias in predictions and performance.
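Counterfactual data augmentation, one of the mitigation techniques named above, can be sketched as follows. This is a minimal illustration under stated assumptions, not the procedure used in the paper: the word list `SWAPS` is a tiny hypothetical sample, and possessives such as "his"/"her" are deliberately left out because "her" is ambiguous between "his" and "him".

```python
import re
from typing import List

# Tiny illustrative swap list (an assumption for this sketch);
# a practical list would be far larger and curated.
SWAPS = {
    "he": "she", "she": "he",
    "husband": "wife", "wife": "husband",
    "father": "mother", "mother": "father",
    "son": "daughter", "daughter": "son",
}


def counterfactual(sentence: str) -> str:
    """Swap each gendered word for its counterpart, preserving capitalization."""
    def swap(match: "re.Match[str]") -> str:
        word = match.group(0)
        repl = SWAPS.get(word.lower())
        if repl is None:
            return word  # not a gendered word in our list
        return repl.capitalize() if word[0].isupper() else repl

    return re.sub(r"[A-Za-z]+", swap, sentence)


def augment(corpus: List[str]) -> List[str]:
    """Return the corpus plus a gender-swapped copy of every sentence that changes."""
    out = []
    for sentence in corpus:
        out.append(sentence)
        swapped = counterfactual(sentence)
        if swapped != sentence:
            out.append(swapped)
    return out


augmented = augment(["He is the husband of Alice.", "Paris is a city."])
```

Training on the original and swapped sentences together is intended to expose the model to both gender variants of each context, so that a relation label is less tied to the gendered words themselves.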

Mitigating Gender Bias in Natural Language Processing: Literature Review

Jun 21, 2019
Tony Sun, Andrew Gaut, Shirlyn Tang, Yuxin Huang, Mai ElSherief, Jieyu Zhao, Diba Mirza, Elizabeth Belding, Kai-Wei Chang, William Yang Wang

As Natural Language Processing (NLP) and Machine Learning (ML) tools rise in popularity, it becomes increasingly vital to recognize the role they play in shaping societal biases and stereotypes. Although NLP models have shown success in modeling various applications, they propagate and may even amplify gender bias found in text corpora. While the study of bias in artificial intelligence is not new, methods to mitigate gender bias in NLP are relatively nascent. In this paper, we review contemporary studies on recognizing and mitigating gender bias in NLP. We discuss gender bias based on four forms of representation bias and analyze methods for recognizing gender bias. Furthermore, we discuss the advantages and drawbacks of existing gender debiasing methods. Finally, we discuss future studies for recognizing and mitigating gender bias in NLP.

* Accepted to ACL 2019
