Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Huang

Qiao

Field Testing of a Stochastic Planner for ASV Navigation Using Satellite Images

Sep 26, 2023
Philip, Huang, Tony, Wang, Florian Shkurti, Timothy D. Barfoot

Figure 1 for Field Testing of a Stochastic Planner for ASV Navigation Using Satellite Images

Figure 2 for Field Testing of a Stochastic Planner for ASV Navigation Using Satellite Images

Figure 3 for Field Testing of a Stochastic Planner for ASV Navigation Using Satellite Images

Figure 4 for Field Testing of a Stochastic Planner for ASV Navigation Using Satellite Images

We introduce a multi-sensor navigation system for autonomous surface vessels (ASV) intended for water-quality monitoring in freshwater lakes. Our mission planner uses satellite imagery as a prior map, formulating offline a mission-level policy for global navigation of the ASV and enabling autonomous online execution via local perception and local planning modules. A significant challenge is posed by the inconsistencies in traversability estimation between satellite images and real lakes, due to environmental effects such as wind, aquatic vegetation, shallow waters, and fluctuating water levels. Hence, we specifically modelled these traversability uncertainties as stochastic edges in a graph and optimized for a mission-level policy that minimizes the expected total travel distance. To execute the policy, we propose a modern local planner architecture that processes sensor inputs and plans paths to execute the high-level policy under uncertain traversability conditions. Our system was tested on three km-scale missions on a Northern Ontario lake, demonstrating that our GPS-, vision-, and sonar-enabled ASV system can effectively execute the mission-level policy and disambiguate the traversability of stochastic edges. Finally, we provide insights gained from practical field experience and offer several future directions to enhance the overall reliability of ASV navigation systems.

* 33 pages, 20 figures. Project website https://pcctp.github.io. arXiv admin note: text overlap with arXiv:2209.11864

Via

Access Paper or Ask Questions

Nationality Bias in Text Generation

Feb 11, 2023
Pranav Narayanan Venkit, Sanjana Gautam, Ruchi Panchanadikar, Ting-Hao, Huang, Shomir Wilson

Figure 1 for Nationality Bias in Text Generation

Figure 2 for Nationality Bias in Text Generation

Figure 3 for Nationality Bias in Text Generation

Figure 4 for Nationality Bias in Text Generation

Little attention is placed on analyzing nationality bias in language models, especially when nationality is highly used as a factor in increasing the performance of social NLP models. This paper examines how a text generation model, GPT-2, accentuates pre-existing societal biases about country-based demonyms. We generate stories using GPT-2 for various nationalities and use sensitivity analysis to explore how the number of internet users and the country's economic status impacts the sentiment of the stories. To reduce the propagation of biases through large language models (LLM), we explore the debiasing method of adversarial triggering. Our results show that GPT-2 demonstrates significant bias against countries with lower internet users, and adversarial triggering effectively reduces the same.

* EACL 2023 Paper

Via

Access Paper or Ask Questions

Graph Neural Networks for Community Detection on Sparse Graphs

Nov 06, 2022
Luana Ruiz, Ningyuan, Huang, Soledad Villar

Figure 1 for Graph Neural Networks for Community Detection on Sparse Graphs

Figure 2 for Graph Neural Networks for Community Detection on Sparse Graphs

Figure 3 for Graph Neural Networks for Community Detection on Sparse Graphs

Figure 4 for Graph Neural Networks for Community Detection on Sparse Graphs

Spectral methods provide consistent estimators for community detection in dense graphs. However, their performance deteriorates as the graphs become sparser. In this work we consider a random graph model that can produce graphs at different levels of sparsity, and we show that graph neural networks can outperform spectral methods on sparse graphs. We illustrate the results with numerical examples in both synthetic and real graphs.

* Submitted to ICASSP 2023

Via

Access Paper or Ask Questions

Masked Autoencoders that Listen

Jul 13, 2022
Po-Yao, Huang, Hu Xu, Juncheng Li, Alexei Baevski, Michael Auli, Wojciech Galuba, Florian Metze, Christoph Feichtenhofer

Figure 1 for Masked Autoencoders that Listen

Figure 2 for Masked Autoencoders that Listen

Figure 3 for Masked Autoencoders that Listen

Figure 4 for Masked Autoencoders that Listen

This paper studies a simple extension of image-based Masked Autoencoders (MAE) to self-supervised representation learning from audio spectrograms. Following the Transformer encoder-decoder design in MAE, our Audio-MAE first encodes audio spectrogram patches with a high masking ratio, feeding only the non-masked tokens through encoder layers. The decoder then re-orders and decodes the encoded context padded with mask tokens, in order to reconstruct the input spectrogram. We find it beneficial to incorporate local window attention in the decoder, as audio spectrograms are highly correlated in local time and frequency bands. We then fine-tune the encoder with a lower masking ratio on target datasets. Empirically, Audio-MAE sets new state-of-the-art performance on six audio and speech classification tasks, outperforming other recent models that use external supervised pre-training. The code and models will be at https://github.com/facebookresearch/AudioMAE.

* Technical report

Via

Access Paper or Ask Questions

On Adversarial Robustness of Large-scale Audio Visual Learning

Mar 23, 2022
Juncheng B Li, Shuhui Qu, Xinjian Li, Po-Yao, Huang, Florian Metze

Figure 1 for On Adversarial Robustness of Large-scale Audio Visual Learning

Figure 2 for On Adversarial Robustness of Large-scale Audio Visual Learning

Figure 3 for On Adversarial Robustness of Large-scale Audio Visual Learning

Figure 4 for On Adversarial Robustness of Large-scale Audio Visual Learning

As audio-visual systems are being deployed for safety-critical tasks such as surveillance and malicious content filtering, their robustness remains an under-studied area. Existing published work on robustness either does not scale to large-scale dataset, or does not deal with multiple modalities. This work aims to study several key questions related to multi-modal learning through the lens of robustness: 1) Are multi-modal models necessarily more robust than uni-modal models? 2) How to efficiently measure the robustness of multi-modal learning? 3) How to fuse different modalities to achieve a more robust multi-modal model? To understand the robustness of the multi-modal model in a large-scale setting, we propose a density-based metric, and a convexity metric to efficiently measure the distribution of each modality in high-dimensional latent space. Our work provides a theoretical intuition together with empirical evidence showing how multi-modal fusion affects adversarial robustness through these metrics. We further devise a mix-up strategy based on our metrics to improve the robustness of the trained model. Our experiments on AudioSet and Kinetics-Sounds verify our hypothesis that multi-modal models are not necessarily more robust than their uni-modal counterparts in the face of adversarial examples. We also observe our mix-up trained method could achieve as much protection as traditional adversarial training, offering a computationally cheap alternative. Implementation: https://github.com/lijuncheng16/AudioSetDoneRight

* 2022 International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2022)

Via

Access Paper or Ask Questions

Design of EMG-driven Musculoskeletal Model for Volitional Control of a Robotic Ankle Prosthesis

Feb 17, 2022
Chinmay Shah, Aaron Fleming, Varun Nalam, He, Huang

Figure 1 for Design of EMG-driven Musculoskeletal Model for Volitional Control of a Robotic Ankle Prosthesis

Figure 2 for Design of EMG-driven Musculoskeletal Model for Volitional Control of a Robotic Ankle Prosthesis

Figure 3 for Design of EMG-driven Musculoskeletal Model for Volitional Control of a Robotic Ankle Prosthesis

Figure 4 for Design of EMG-driven Musculoskeletal Model for Volitional Control of a Robotic Ankle Prosthesis

Existing robotic lower-limb prostheses use autonomous control to address cyclic, locomotive tasks, but they are inadequate to operate the prosthesis for daily activities that are non-cyclic and unpredictable. To address this challenge, this study aims to design a novel electromyography (EMG)-driven musculoskeletal model for volitional control of a robotic ankle-foot prosthesis. This controller places the user in continuous control of the device, allowing them to freely manipulate the prosthesis behavior at will. The Hill-type muscle model was used to model a dorsiflexor and a plantarflexor, which functioned around a virtual ankle joint. The model parameters were determined by fitting the model prediction to the experimental data collected from an able-bodied subject. EMG signals recorded from ankle agonist and antagonist muscle pair were used to activate the virtual muscle models. This model was validated via offline simulations and real-time prosthesis control. Additionally, the feasibility of the proposed prosthesis control on assisting the user's functional tasks was demonstrated. The present control may further improve the function of robotic prosthesis for supporting versatile activities in individuals with lower-limb amputations.

* 6 page conference submission pre-print

Via

Access Paper or Ask Questions

Online Self-Calibration for Visual-Inertial Navigation Systems: Models, Analysis and Degeneracy

Jan 26, 2022
Yulin Yang, Patrick Geneva, Xingxing Zuo, Guoquan, Huang

Figure 1 for Online Self-Calibration for Visual-Inertial Navigation Systems: Models, Analysis and Degeneracy

Figure 2 for Online Self-Calibration for Visual-Inertial Navigation Systems: Models, Analysis and Degeneracy

Figure 3 for Online Self-Calibration for Visual-Inertial Navigation Systems: Models, Analysis and Degeneracy

Figure 4 for Online Self-Calibration for Visual-Inertial Navigation Systems: Models, Analysis and Degeneracy

In this paper, we study in-depth the problem of online self-calibration for robust and accurate visual-inertial state estimation. In particular, we first perform a complete observability analysis for visual-inertial navigation systems (VINS) with full calibration of sensing parameters, including IMU and camera intrinsics and IMU-camera spatial-temporal extrinsic calibration, along with readout time of rolling shutter (RS) cameras (if used). We investigate different inertial model variants containing IMU intrinsic parameters that encompass most commonly used models for low-cost inertial sensors. The observability analysis results prove that VINS with full sensor calibration has four unobservable directions, corresponding to the system's global yaw and translation, while all sensor calibration parameters are observable given fully-excited 6-axis motion. Moreover, we, for the first time, identify primitive degenerate motions for IMU and camera intrinsic calibration. Each degenerate motion profile will cause a set of calibration parameters to be unobservable and any combination of these degenerate motions are still degenerate. Extensive Monte-Carlo simulations and real-world experiments are performed to validate both the observability analysis and identified degenerate motions, showing that online self-calibration improves system accuracy and robustness to calibration inaccuracies. We compare the proposed online self-calibration on commonly-used IMUs against the state-of-art offline calibration toolbox Kalibr, and show that the proposed system achieves better consistency and repeatability. Based on our analysis and experimental evaluations, we also provide practical guidelines for how to perform online IMU-camera sensor self-calibration.

* We study the observability analysis and degeneracy for VINS with full-parameter calibration, including IMU/camera intrinsic and IMU-camera spatial-temporal calibration (with rolling shutter readout time). Extensive simulations and real-world experiments are performed to fully verify the analysis and the proposed system

Via

Access Paper or Ask Questions

Prospective Learning: Back to the Future

Jan 19, 2022
Joshua T. Vogelstein, Timothy Verstynen, Konrad P. Kording, Leyla Isik, John W. Krakauer, Ralph Etienne-Cummings, Elizabeth L. Ogburn, Carey E. Priebe, Randal Burns, Kwame Kutten, James J. Knierim, James B. Potash, Thomas Hartung, Lena Smirnova, Paul Worley, Alena Savonenko, Ian Phillips, Michael I. Miller, Rene Vidal, Jeremias Sulam, Adam Charles, Noah J. Cowan, Maxim Bichuch, Archana Venkataraman, Chen Li, Nitish Thakor, Justus M Kebschull, Marilyn Albert, Jinchong Xu, Marshall Hussain Shuler, Brian Caffo, Tilak Ratnanather, Ali Geisa, Seung-Eon Roh, Eva Yezerets, Meghana Madhyastha, Javier J. How, Tyler M. Tomita, Jayanta Dey, Ningyuan, Huang, Jong M. Shin, Kaleab Alemayehu Kinfu, Pratik Chaudhari, Ben Baker, Anna Schapiro, Dinesh Jayaraman, Eric Eaton, Michael Platt, Lyle Ungar, Leila Wehbe, Adam Kepecs, Amy Christensen, Onyema Osuagwu, Bing Brunton, Brett Mensh, Alysson R. Muotri, Gabriel Silva, Francesca Puppo, Florian Engert, Elizabeth Hillman, Julia Brown, Chris White, Weiwei Yang

Figure 1 for Prospective Learning: Back to the Future

Figure 2 for Prospective Learning: Back to the Future

Figure 3 for Prospective Learning: Back to the Future

Research on both natural intelligence (NI) and artificial intelligence (AI) generally assumes that the future resembles the past: intelligent agents or systems (what we call 'intelligence') observe and act on the world, then use this experience to act on future experiences of the same kind. We call this 'retrospective learning'. For example, an intelligence may see a set of pictures of objects, along with their names, and learn to name them. A retrospective learning intelligence would merely be able to name more pictures of the same objects. We argue that this is not what true intelligence is about. In many real world problems, both NIs and AIs will have to learn for an uncertain future. Both must update their internal models to be useful for future tasks, such as naming fundamentally new objects and using these objects effectively in a new context or to achieve previously unencountered goals. This ability to learn for the future we call 'prospective learning'. We articulate four relevant factors that jointly define prospective learning. Continual learning enables intelligences to remember those aspects of the past which it believes will be most useful in the future. Prospective constraints (including biases and priors) facilitate the intelligence finding general solutions that will be applicable to future problems. Curiosity motivates taking actions that inform future decision making, including in previously unmet situations. Causal estimation enables learning the structure of relations that guide choosing actions for specific outcomes, even when the specific action-outcome contingencies have never been observed before. We argue that a paradigm shift from retrospective to prospective learning will enable the communities that study intelligence to unite and overcome existing bottlenecks to more effectively explain, augment, and engineer intelligences.

Via

Access Paper or Ask Questions