Luca A. Lanzendörfer

CAESAR: Enhancing Federated RL in Heterogeneous MDPs through Convergence-Aware Sampling with Screening

Mar 29, 2024
Hei Yi Mak, Flint Xiaofeng Fan, Luca A. Lanzendörfer, Cheston Tan, Wei Tsang Ooi, Roger Wattenhofer

In this study, we delve into Federated Reinforcement Learning (FedRL) in the context of value-based agents operating across diverse Markov Decision Processes (MDPs). Existing FedRL methods typically aggregate agents' learning by averaging the value functions across them to improve their performance. However, this aggregation strategy is suboptimal in heterogeneous environments where agents converge to diverse optimal value functions. To address this problem, we introduce the Convergence-AwarE SAmpling with scReening (CAESAR) aggregation scheme designed to enhance the learning of individual agents across varied MDPs. CAESAR is an aggregation strategy used by the server that combines convergence-aware sampling with a screening mechanism. By exploiting the fact that agents learning in identical MDPs are converging to the same optimal value function, CAESAR enables the selective assimilation of knowledge from more proficient counterparts, thereby significantly enhancing the overall learning efficiency. We empirically validate our hypothesis and demonstrate the effectiveness of CAESAR in enhancing the learning efficiency of agents, using both a custom-built GridWorld environment and the classical FrozenLake-v1 task, each presenting varying levels of environmental heterogeneity.
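To make the motivation concrete, here is a minimal sketch of the plain value-function averaging that the abstract describes as the existing baseline, and of why it can hurt when agents face different MDPs. This is not CAESAR itself; the abstract does not give its sampling or screening rules. The tabular Q-function layout, the helper names, and the toy numbers are all assumptions made for illustration.

```python
from typing import List

QTable = List[List[float]]  # q[state][action]; tabular layout is an assumption


def average_q_tables(tables: List[QTable]) -> QTable:
    """Element-wise mean of equally shaped Q-tables (plain averaging baseline)."""
    n = len(tables)
    n_states = len(tables[0])
    n_actions = len(tables[0][0])
    return [
        [sum(t[s][a] for t in tables) / n for a in range(n_actions)]
        for s in range(n_states)
    ]


def greedy_actions(q: QTable) -> List[int]:
    """Greedy action per state; ties resolve to the lowest action index."""
    return [max(range(len(row)), key=lambda a: row[a]) for row in q]


# Hypothetical agents, one state and two actions each.
q_agent_a = [[1.0, 0.0]]  # in agent A's MDP, action 0 is optimal
q_agent_b = [[0.0, 1.0]]  # in agent B's (different) MDP, action 1 is optimal

# Averaging across different MDPs erases both agents' preferences.
q_mixed = average_q_tables([q_agent_a, q_agent_b])

# Averaging among agents that share an MDP keeps the preference intact.
q_same = average_q_tables([q_agent_a, q_agent_a])
```

In this toy case `q_mixed` rates both actions equally, so neither agent benefits from the aggregate, while `q_same` still favors action 0. That contrast is the situation the abstract says a selective aggregation scheme is meant to exploit.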

DISCO-10M: A Large-Scale Music Dataset

Jun 23, 2023
Luca A. Lanzendörfer, Florian Grötschla, Emil Funke, Roger Wattenhofer

Music datasets play a crucial role in advancing research in machine learning for music. However, existing music datasets are limited in size and accessibility and lack audio resources. To address these shortcomings, we present DISCO-10M, a novel and extensive music dataset that surpasses the largest previously available music dataset by an order of magnitude. To ensure high-quality data, we implement a multi-stage filtering process. This process incorporates similarities based on textual descriptions and audio embeddings. Moreover, we provide precomputed CLAP embeddings alongside DISCO-10M, facilitating direct application on various downstream tasks. These embeddings enable efficient exploration of machine learning applications on the provided data. With DISCO-10M, we aim to democratize and facilitate new research to help advance the development of novel machine learning models for music.

Siamese SIREN: Audio Compression with Implicit Neural Representations

Jun 22, 2023
Luca A. Lanzendörfer, Roger Wattenhofer

Implicit Neural Representations (INRs) have emerged as a promising method for representing diverse data modalities, including 3D shapes, images, and audio. While recent research has demonstrated successful applications of INRs in image and 3D shape compression, their potential for audio compression remains largely unexplored. Motivated by this, we present a preliminary investigation into the use of INRs for audio compression. Our study introduces Siamese SIREN, a novel approach based on the popular SIREN architecture. Our experimental results indicate that Siamese SIREN achieves superior audio reconstruction fidelity while utilizing fewer network parameters compared to previous INR architectures.
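For readers unfamiliar with SIREN, the following is a minimal sketch of a single sine-activated layer, the building block of the standard SIREN architecture that the abstract builds on. It does not show the Siamese variant, whose details are not given here. The frequency scale `w0 = 30`, the zero biases, and the helper names are assumptions chosen for illustration.

```python
import math
import random
from typing import List, Tuple


def init_sine_layer(
    n_in: int, n_out: int, w0: float, is_first: bool, rng: random.Random
) -> Tuple[List[List[float]], List[float]]:
    """Draw weights uniformly; the bound depends on whether this is the first layer."""
    bound = 1.0 / n_in if is_first else math.sqrt(6.0 / n_in) / w0
    weights = [[rng.uniform(-bound, bound) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out  # zero biases are a simplification for this sketch
    return weights, biases


def sine_layer(
    x: List[float], weights: List[List[float]], biases: List[float], w0: float = 30.0
) -> List[float]:
    """Compute sin(w0 * (W x + b)) for one input vector."""
    return [
        math.sin(w0 * (sum(w * xi for w, xi in zip(row, x)) + b))
        for row, b in zip(weights, biases)
    ]


# Map one normalized audio time coordinate t in [-1, 1] to four hidden features.
rng = random.Random(0)
weights, biases = init_sine_layer(n_in=1, n_out=4, w0=30.0, is_first=True, rng=rng)
hidden = sine_layer([0.25], weights, biases)
at_origin = sine_layer([0.0], weights, biases)
```

An implicit representation of audio stacks several such layers and a final linear layer so that the network maps a time coordinate to an amplitude; compressing the signal then amounts to storing the network's parameters.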

* Published as a workshop paper at ICML 2023 neural compression workshop

Examining the Emergence of Deductive Reasoning in Generative Language Models

May 31, 2023
Peter Belcak, Luca A. Lanzendörfer, Roger Wattenhofer

We conduct a preliminary inquiry into the ability of generative transformer models to deductively reason from premises provided. We observe notable differences in the performance of models coming from different training setups and find that the deductive reasoning ability increases with scale. Further, we discover that the performance generally does not decrease with the length of the deductive chain needed to reach the conclusion, with the exception of OpenAI GPT-3 and GPT-3.5 models. Our study considers a wide variety of transformer-decoder models, ranging from 117 million to 175 billion parameters in size.

* Accepted to the 1st Natural Language Reasoning and Structured Explanations Workshop (NLRSE@ACL'23). 8 pages, 4 figures, 3 tables
