Reinforcement Learning
Session
AI/ML
Improving Meta Generative Ad Text using Reinforcement Learning
Tuesday Nov 18 / 01:35PM PST
Reinforcement Learning with Performance Feedback (RLPF) unlocks a new way of turning generic GenAI models into customized models fine-tuned for specific tasks. This approach is especially powerful when combined with in-house data and performance metrics.
Alex Nikulkov
Research Scientist (RL lead for Monetization GenAI) @Meta