Google DeepMind’s Gemini 3.1 Pro has posted the highest scores yet recorded on two of the toughest AI reasoning benchmarks in existence, pulling ahead of OpenAI’s GPT 5.2 on both. The results sharpen ...
The latest Gemini model makes impressive strides in benchmarks, but forthcoming models could give it a reality check.
Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models.
Farmer Bir Virk tapped the iPad mounted beside his tractor's steering wheel and switched the vehicle to automatic mode. The machine moved forward and began harvesting potatoes on its own in the fields ...
This is the moment a humanoid robot accidentally kicked an engineer right in the groin during a movement test. Footage captured on December 24, 2025 in Shanghai, China shows a Unitree G1 humanoid ...
Microsoft (MSFT) announced today it has acquired Osmos, an agentic artificial intelligence data engineering platform designed to simplify complex data workflows. No financial details on the ...
by Taylor Soper on Jan 5, 2026 at 2:28 ...
Some tech leaders are concerned that the artificial intelligence race will exhaust available land and energy. The solution might lie in orbit. By Eli Tan and Ryan Mac ...
Over Valentine’s Day weekend 2021, running water was a luxury. When unpredictable freezing temperatures hit the state and the power grid failed, Texans’ survival skills were put to the test. Storm Uri ...
OpenAI’s massive data center plan, called Stargate, in Saline Township is what it says it is — hyperscale. The initial $7 billion buildout is on a scale the state has never seen. Other OpenAI/Oracle ...
The fog masking the direction of the American economy and future of the artificial-intelligence boom is starting to lift. After mounting scrutiny of stratospheric tech investments, as well as a ...