GPT-5.5 «Spud»: fact-check de benchmarks, demos y supuestas filtraciones | Investigación profunda