
Speaker: Alex Nikulkov
Research Scientist (RL lead for Monetization GenAI) @Meta
Session
Improving Meta Generative Ad Text using Reinforcement Learning
Reinforcement Learning with Performance Feedback (RLPF) unlocks a new way of turning generic GenAI models into customized models fine-tuned for specific tasks. This approach is especially powerful when combined with in-house data and performance metrics.