The model uses more cycles during inference to generate more tokens and review responses, improving its performance on reasoning tasks.