Alert button
Picture for Denis Tarasov

Denis Tarasov

Alert button

Distilling LLMs' Decomposition Abilities into Compact Language Models

Add code
Bookmark button
Alert button
Feb 02, 2024
Denis Tarasov, Kumar Shridhar

Viaarxiv icon

Katakomba: Tools and Benchmarks for Data-Driven NetHack

Add code
Bookmark button
Alert button
Jun 14, 2023
Vladislav Kurenkov, Alexander Nikulin, Denis Tarasov, Sergey Kolesnikov

Figure 1 for Katakomba: Tools and Benchmarks for Data-Driven NetHack
Figure 2 for Katakomba: Tools and Benchmarks for Data-Driven NetHack
Figure 3 for Katakomba: Tools and Benchmarks for Data-Driven NetHack
Figure 4 for Katakomba: Tools and Benchmarks for Data-Driven NetHack
Viaarxiv icon

Revisiting the Minimalist Approach to Offline Reinforcement Learning

Add code
Bookmark button
Alert button
May 16, 2023
Denis Tarasov, Vladislav Kurenkov, Alexander Nikulin, Sergey Kolesnikov

Figure 1 for Revisiting the Minimalist Approach to Offline Reinforcement Learning
Figure 2 for Revisiting the Minimalist Approach to Offline Reinforcement Learning
Figure 3 for Revisiting the Minimalist Approach to Offline Reinforcement Learning
Figure 4 for Revisiting the Minimalist Approach to Offline Reinforcement Learning
Viaarxiv icon

Anti-Exploration by Random Network Distillation

Add code
Bookmark button
Alert button
Jan 31, 2023
Alexander Nikulin, Vladislav Kurenkov, Denis Tarasov, Sergey Kolesnikov

Figure 1 for Anti-Exploration by Random Network Distillation
Figure 2 for Anti-Exploration by Random Network Distillation
Figure 3 for Anti-Exploration by Random Network Distillation
Figure 4 for Anti-Exploration by Random Network Distillation
Viaarxiv icon

Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size

Add code
Bookmark button
Alert button
Nov 20, 2022
Alexander Nikulin, Vladislav Kurenkov, Denis Tarasov, Dmitry Akimov, Sergey Kolesnikov

Figure 1 for Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size
Figure 2 for Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size
Figure 3 for Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size
Figure 4 for Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size
Viaarxiv icon

Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows

Add code
Bookmark button
Alert button
Nov 20, 2022
Dmitriy Akimov, Vladislav Kurenkov, Alexander Nikulin, Denis Tarasov, Sergey Kolesnikov

Figure 1 for Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows
Figure 2 for Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows
Figure 3 for Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows
Figure 4 for Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows
Viaarxiv icon

CORL: Research-oriented Deep Offline Reinforcement Learning Library

Add code
Bookmark button
Alert button
Oct 13, 2022
Denis Tarasov, Alexander Nikulin, Dmitry Akimov, Vladislav Kurenkov, Sergey Kolesnikov

Figure 1 for CORL: Research-oriented Deep Offline Reinforcement Learning Library
Figure 2 for CORL: Research-oriented Deep Offline Reinforcement Learning Library
Figure 3 for CORL: Research-oriented Deep Offline Reinforcement Learning Library
Figure 4 for CORL: Research-oriented Deep Offline Reinforcement Learning Library
Viaarxiv icon

Inception Architecture and Residual Connections in Classification of Breast Cancer Histology Images

Add code
Bookmark button
Alert button
Dec 10, 2019
Mohammad Ibrahim Sarker, Hyongsuk Kim, Denis Tarasov, Dinar Akhmetzanov

Figure 1 for Inception Architecture and Residual Connections in Classification of Breast Cancer Histology Images
Figure 2 for Inception Architecture and Residual Connections in Classification of Breast Cancer Histology Images
Figure 3 for Inception Architecture and Residual Connections in Classification of Breast Cancer Histology Images
Viaarxiv icon