Sharpe Ratio-Guided Active Learning for Preference Optimization | Best AI papers explained | Podwise