RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback

To use reinforcement learning from human feedback (RLHF) in practical applications, it is crucial to learn reward models from diverse sources of human feedback and to consider human factors involved in providing feedback of different types. However, the systematic study of learning from diverse typ Show more

Publication status

published

External links

https://icml.cc/virtual/2023/29811#abstract_details
https://openreview.net/forum?id=JvkZtzJBFQ

Book title

Interactive Learning with Implicit Human Feedback Workshop at ICML 2023

Publisher

OpenReview

Event

Interactive Learning from Implicit Human Feedback Workshop @ ICML 2023, Honolulu, HI, USA, July 29, 2023

Subject

Reinforcement learning; Human feedback; Human-AI communication; Human-in-the-loop learning

Organisational unit

09822 - El-Assady, Mennatallah / El-Assady, Mennatallah

Related publications and datasets

Is new version of: https://doi.org/10.48550/ARXIV.2308.04332

Notes

Poster presented on July 29, 2023.

More

Show all metadata

ETH Bibliography

yes

Altmetrics

Research Collection

Search

RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback Mendeley CSV RIS BibTeX

RLHF-Blender: A Configurable Interactive Interface for Learning from Diverse Human Feedback

Mendeley

CSV

RIS

BibTeX