Looks like the best way to keep improving the models is to come up with really u...

		dnw 43 days ago \| parent \| context \| favorite \| on: Gemini 3 Pro Model Card [pdf] Looks like the best way to keep improving the models is to come up with really useful benchmarks and make them popular. ARC-AGI-2 is a big jump, I'd be curious to find out how that transfers over to everyday tasks in various fields.