Picture for Moontae Lee

Moontae Lee

LG AI Research & KAIST at EHRSQL 2024: Self-Training Large Language Models with Pseudo-Labeled Unanswerable Questions for a Reliable Text-to-SQL System on EHRs

May 18, 2024
Viaarxiv icon

Understanding the Capabilities and Limitations of Large Language Models for Cultural Commonsense

Add code
May 07, 2024
Viaarxiv icon

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Add code
May 02, 2024
Viaarxiv icon

Small Language Models Need Strong Verifiers to Self-Correct Reasoning

Add code
Apr 26, 2024
Viaarxiv icon

Reinforcement Learning from Reflective Feedback (RLRF): Aligning and Improving LLMs via Fine-Grained Self-Reflection

Add code
Mar 21, 2024
Figure 1 for Reinforcement Learning from Reflective Feedback (RLRF): Aligning and Improving LLMs via Fine-Grained Self-Reflection
Figure 2 for Reinforcement Learning from Reflective Feedback (RLRF): Aligning and Improving LLMs via Fine-Grained Self-Reflection
Figure 3 for Reinforcement Learning from Reflective Feedback (RLRF): Aligning and Improving LLMs via Fine-Grained Self-Reflection
Figure 4 for Reinforcement Learning from Reflective Feedback (RLRF): Aligning and Improving LLMs via Fine-Grained Self-Reflection
Viaarxiv icon

YTCommentQA: Video Question Answerability in Instructional Videos

Add code
Jan 30, 2024
Viaarxiv icon

Projection Regret: Reducing Background Bias for Novelty Detection via Diffusion Models

Add code
Dec 05, 2023
Viaarxiv icon

Code Models are Zero-shot Precondition Reasoners

Nov 16, 2023
Viaarxiv icon

From Heuristic to Analytic: Cognitively Motivated Strategies for Coherent Physical Commonsense Reasoning

Add code
Oct 24, 2023
Viaarxiv icon

Merging Generated and Retrieved Knowledge for Open-Domain QA

Add code
Oct 22, 2023
Figure 1 for Merging Generated and Retrieved Knowledge for Open-Domain QA
Figure 2 for Merging Generated and Retrieved Knowledge for Open-Domain QA
Figure 3 for Merging Generated and Retrieved Knowledge for Open-Domain QA
Figure 4 for Merging Generated and Retrieved Knowledge for Open-Domain QA
Viaarxiv icon