https:// huggingface.co/papers/2605.06 130 … Outperforms prior skill-based and

Skill1

A unified framework that trains a single policy to simultaneously select, utilize, and distill skills from a shared reward signal, enabling persistent skill libraries for language agents.

https://huggingface.co/papers/2605.06 130
…
Outperforms prior skill-based and RL baselines on ALFWorld and WebShop by co-evolving skill selection, utilization, and distillation toward a shared task-outcome objective.