arxiv:2605.20552
Michal Valko
AI & ML interests
large language models, reasoning, fine-tuning, test-time computation, reinforcement learning with human feedback, world models
Recent Activity
authored a paper 1 day ago
Spectral bandits for smooth graph functions with applications in recommender systems updated a dataset 1 day ago
misovalko/my-research-papers authored a paper about 1 month ago
Budgeted Online Influence Maximization