Assaf Hallak

SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search

Jan 30, 2023
Gal Dalal, Assaf Hallak, Gugan Thoppe, Shie Mannor, Gal Chechik

SoftTreeMax: Policy Gradient with Tree Search

Sep 28, 2022
Gal Dalal, Assaf Hallak, Shie Mannor, Gal Chechik

Reinforcement Learning with a Terminator

May 30, 2022
Guy Tennenholtz, Nadav Merlis, Lior Shani, Shie Mannor, Uri Shalit, Gal Chechik, Assaf Hallak, Gal Dalal

Planning and Learning with Adaptive Lookahead

Jan 28, 2022
Aviv Rosenberg, Assaf Hallak, Shie Mannor, Gal Chechik, Gal Dalal

On Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning

Oct 13, 2021
Guy Tennenholtz, Assaf Hallak, Gal Dalal, Shie Mannor, Gal Chechik, Uri Shalit

Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction

Jul 04, 2021
Assaf Hallak, Gal Dalal, Steven Dalton, Iuri Frosio, Shie Mannor, Gal Chechik

Automatic Representation for Lifetime Value Recommender Systems

Feb 23, 2017
Assaf Hallak, Yishay Mansour, Elad Yom-Tov

Consistent On-Line Off-Policy Evaluation

Feb 23, 2017
Assaf Hallak, Shie Mannor

Generalized Emphatic Temporal Difference Learning: Bias-Variance Analysis

Nov 27, 2015
Assaf Hallak, Aviv Tamar, Remi Munos, Shie Mannor

Emphatic TD Bellman Operator is a Contraction

Aug 23, 2015
Assaf Hallak, Aviv Tamar, Shie Mannor
