Implementation Details
1. Create separate prompt versions for rule-based and retrieved history approaches
2. Configure A/B tests with controlled datasets
3. Implement scoring metrics for prediction accuracy
4. Run batch tests across different event types
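
The steps above can be sketched roughly as follows. This is a minimal illustration, not the actual implementation: the two predictor functions stand in for calls to the two prompt versions, and the dataset, event types, labels, and every function name are hypothetical placeholders. It scores each variant on the same controlled dataset and reports accuracy per event type.

```python
from collections import defaultdict
from typing import Callable, Dict, List, Tuple

# Hypothetical labelled example: (event_type, input_text, expected_label).
Example = Tuple[str, str, str]


def accuracy_by_event_type(
    predict: Callable[[str], str], dataset: List[Example]
) -> Dict[str, float]:
    """Score one variant: fraction of correct predictions per event type."""
    correct: Dict[str, int] = defaultdict(int)
    total: Dict[str, int] = defaultdict(int)
    for event_type, text, expected in dataset:
        total[event_type] += 1
        if predict(text) == expected:
            correct[event_type] += 1
    return {etype: correct[etype] / total[etype] for etype in total}


def run_ab_test(
    variants: Dict[str, Callable[[str], str]], dataset: List[Example]
) -> Dict[str, Dict[str, float]]:
    """Run every variant over the same dataset so results are comparable."""
    return {name: accuracy_by_event_type(fn, dataset) for name, fn in variants.items()}


# Stand-ins for the two prompt versions (assumed; a real test would call a model).
def rule_based_predict(text: str) -> str:
    return "yes" if "beat" in text else "no"


def retrieved_history_predict(text: str) -> str:
    return "yes" if "beat" in text or "launch" in text else "no"


# Toy controlled dataset, invented purely for illustration.
DATASET: List[Example] = [
    ("earnings", "company beat estimates", "yes"),
    ("earnings", "company missed estimates", "no"),
    ("product", "new product launch announced", "yes"),
    ("product", "product recall issued", "no"),
]

results = run_ab_test(
    {"rule_based": rule_based_predict, "retrieved_history": retrieved_history_predict},
    DATASET,
)

for variant, scores in results.items():
    print(variant, scores)
```

Keeping the dataset fixed across variants is what makes the comparison controlled: any difference in per-event-type accuracy can then be attributed to the prompt version rather than to the inputs.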