Meta’s SPICE framework pushes AI toward self-learning without human supervision
Meta researchers have unveiled a new reinforcement learning framework called SPICE (Self-Play in Corpus Environments) that enables large language models (LLMs) to improve their reasoning