Exploiting the most prominent AI agent benchmarks
Article URL: https://rdi.berkeley.edu/blog/trustworthy-benchmarks-cont/ Comments URL: https://news.ycombinator.com/item?id=47733217 Points: 463 # Comments: 115
Tech
Welcome back to TechCrunch Mobility, your hub for the future of transportation and now, more than ever, how AI is playing a part.
Tech
The rise of AI has brought an avalanche of new terms and slang. Here is a glossary with definitions of some of the most important words and phrases you might encounter.
Tech
Anthropic was the star of the show at San Francisco's AI-centric conference.
Article URL: https://www.researchgate.net/publication/256935390_Eternity_in_six_hours_Intergalactic_spreading_of_intelligent_life_and_sharpening_the_Fermi_paradox Comments URL: https://news.ycombinator.com/item?id=47740315 Points: 13 # Comments: 5
Tech
Article URL: https://github.com/anthropics/claude-code/issues/45756 Comments URL: https://news.ycombinator.com/item?id=47739260 Points: 408 # Comments: 352
Tech
Article URL: https://perthirtysix.com/how-does-gps-work Comments URL: https://news.ycombinator.com/item?id=47738343 Points: 36 # Comments: 4
Tech
Slate Auto burst onto the scene in April 2025. Here is a timeline that covers its origins, backers, product, and other new details.
Tech
Article URL: https://www.the-independent.com/tech/renewable-energy-solar-nepal-bhutan-iceland-b2533699.html Comments URL: https://news.ycombinator.com/item?id=47739313 Points: 32 # Comments: 6
Here's hoping that it will return soon, as I really liked it. Comments URL: https://news.ycombinator.com/item?id=47739305 Points: 26 # Comments: 9
Tech
Article URL: https://github.com/anthropics/claude-code/issues/45756 Comments URL: https://news.ycombinator.com/item?id=47739260 Points: 160 # Comments: 93
Tech
Article URL: https://github.com/rochus-keller/OberonSystem3Native/releases Comments URL: https://news.ycombinator.com/item?id=47739174 Points: 7 # Comments: 1
Tech
Article URL: https://blogfontawesome.wpcomstaging.com/we-have-a-99-email-reputation-gmail-disagrees/ Comments URL: https://news.ycombinator.com/item?id=47738978 Points: 28 # Comments: 19
I just spent 1h+ debugging why my locally hosted GitLab runner would fail to create pipelines. The GitLab job output would just display weird TLS errors when trying to pull Docker images. After debugging GitLab and the runner, I realized after a wh
Article URL: https://nerdy.dev/why-ai-sucks-at-front-end Comments URL: https://news.ycombinator.com/item?id=47738864 Points: 26 # Comments: 20