Table of Contents
Problem Definition
- Problem: CVR prediction problem during sales promotions during which there is a data drift
- Objective: RPM and CVR
Methods
- HDR (Historical Data Reuse), three components
- Automated data retrieval module
- represents each promo as a vector
- based on nearest neighbor, find the one
- Distribution shift correction module (to address the disparity of the conversion behavior between the retireved promos and target promos)
- TransBlock module
- fine-tune model with historical data
- Automated data retrieval module
Impact
- improve both ranking and calibration metrics
- ranking metric: AUC
- Calibration metrics:
- logloss
- ECE: Expected Calibration Error
- PCOC: predicted CVR Over the Actual CVR
- a lift of 9% RPM(Revenue Per Mille) and 16% CVR in Double 11 sales in 2022
Next Step
- Conversion differs between
- weekdays and weekends
- begin and end of month
- Utilize the data from other scenes
TODO & Questions & Further Reading
[ ] definition of ECE, PCOC [ ] to read: CTR, CVR papers (introduction section) [ ] one epoch problem