Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
SoKamil
3 days ago
|
parent
|
context
|
favorite
| on:
Exploiting the most prominent AI agent benchmarks
The more research on this topic is created, the more knowledge how to game them will be stored in future training data. And since it comes from university, it is ranked higher in data corpus. It sounds like a self fulfilling prophecy.
help
abirch
3 days ago
[–]
Damned old Goodhart's Law: "When a measure becomes a target, it ceases to be a good measure".
https://en.wikipedia.org/wiki/Goodhart%27s_law
reply
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: