Blog

Why RLHF is Unnecessary for Transformers Trained on Raw Medical Events

Reinforcement learning with human feedback (RLHF) is a powerful technique in LLMs. However, RHLF can be counter productive in the case of large medical model (LMM) transformers trained on raw medical event data

Ricky Sahu
Ricky Sahu2023-05-25