Maciej Janowski

PriorBand: Practical Hyperparameter Optimization in the Age of Deep Learning

Jun 21, 2023
Neeratyoy Mallik, Edward Bergman, Carl Hvarfner, Danny Stoll, Maciej Janowski, Marius Lindauer, Luigi Nardi, Frank Hutter

Hyperparameters of Deep Learning (DL) pipelines are crucial for their downstream performance. While a large number of methods for Hyperparameter Optimization (HPO) have been developed, their incurred costs are often untenable for modern DL. Consequently, manual experimentation is still the most prevalent approach to optimize hyperparameters, relying on the researcher's intuition, domain knowledge, and cheap preliminary explorations. To resolve this misalignment between HPO algorithms and DL researchers, we propose PriorBand, an HPO algorithm tailored to DL, able to utilize both expert beliefs and cheap proxy tasks. Empirically, we demonstrate PriorBand's efficiency across a range of DL benchmarks and show its gains under informative expert input and robustness against poor expert beliefs.

Deep Power Laws for Hyperparameter Optimization

Feb 01, 2023
Arlind Kadra, Maciej Janowski, Martin Wistuba, Josif Grabocka

Hyperparameter optimization is an important subfield of machine learning that focuses on tuning the hyperparameters of a chosen algorithm to achieve peak performance. Recently, there has been a stream of methods that tackle hyperparameter optimization; however, most of these methods do not exploit the scaling law property of learning curves. In this work, we propose Deep Power Laws (DPL), an ensemble of neural network models conditioned to yield predictions that follow a power-law scaling pattern. Our method dynamically decides which configurations to pause and train incrementally by making use of gray-box evaluations. We compare our method against 7 state-of-the-art competitors on 3 benchmarks related to tabular, image, and NLP datasets covering 57 diverse tasks. Our method achieves the best any-time results across all benchmarks compared to all competitors.
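
To illustrate what a power-law scaling pattern on a learning curve can look like, here is a minimal sketch. It is not the DPL method itself, which the abstract describes as an ensemble of neural networks. It assumes the parameterization loss(b) = alpha + beta * b^(-gamma), a common form that the abstract does not spell out, and it fits beta and gamma by plain least squares in log space with alpha held fixed. The function names and the sample curve are hypothetical.

```python
import math


def power_law(budget, alpha, beta, gamma):
    """Loss predicted after `budget` epochs under the assumed power-law form."""
    return alpha + beta * budget ** (-gamma)


def fit_power_law(budgets, losses, alpha=0.0):
    """Fit beta and gamma by linear least squares in log space.

    alpha (the loss floor) is held fixed, and every loss must be
    strictly greater than alpha so the logarithm is defined.
    """
    xs = [math.log(b) for b in budgets]
    ys = [math.log(loss - alpha) for loss in losses]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # log(loss - alpha) = log(beta) - gamma * log(budget)
    return math.exp(intercept), -slope


# Hypothetical partial learning curve observed at a few small budgets.
budgets = [1, 2, 4, 8]
losses = [power_law(b, 0.0, 2.0, 0.5) for b in budgets]

beta, gamma = fit_power_law(budgets, losses)
# Extrapolate to a larger budget that has not been trained yet.
predicted = power_law(64, 0.0, beta, gamma)
```

An extrapolation like `predicted` is the kind of signal a gray-box optimizer could use when deciding which paused configuration to resume, although the selection rule DPL actually uses is not given in the abstract.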
