A Batched Multi-Armed Bandit Approach to News Headline Testing
read the original abstract
Optimizing news headlines is important for publishers and media sites. A compelling headline will increase readership, user engagement and social shares. At Yahoo Front Page, headline testing is carried out using a test-rollout strategy: we first allocate equal proportion of the traffic to each headline variation for a defined testing period, and then shift all future traffic to the best-performing variation. In this paper, we introduce a multi-armed bandit (MAB) approach with batched Thompson Sampling (bTS) to dynamically test headlines for news articles. This method is able to gradually allocate traffic towards optimal headlines while testing. We evaluate the bTS method based on empirical impressions/clicks data and simulated user responses. The result shows that the bTS method is robust, converges accurately and quickly to the optimal headline, and outperforms the test-rollout strategy by 3.69% in terms of clicks.
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
On the Benefits of Free Exploration for Regret Minimization in Multi-Armed Bandits
Formalizes regret minimization with free exploration, introduces (α,β)-probably saving policies and UFE-KLUCB-H algorithm, and proves instance-dependent regret savings with upper and lower bounds in the logarithmic bu...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.