LLM

Session AI/ML

Improving Meta Generative Ad Text using Reinforcement Learning

Tuesday Nov 18 / 01:35PM PST

Reinforcement Learning with Performance Feedback (RLPF) unlocks a new way of turning generic GenAI models into customized models fine-tuned for specific tasks. This approach is especially powerful when combined with in-house data and performance metrics.

Speaker image - Alex Nikulkov

Alex Nikulkov

Research Scientist (RL lead for Monetization GenAI) @Meta