Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
smlacy
44 days ago
|
parent
|
context
|
favorite
| on:
Muse Spark: Scaling towards personal superintellig...
Post actual results, make a blog post. Don't just say "this sucks" without tangible evidence.
Otherwise you're doomed to "sample size of one" level of relevance.
thorum
44 days ago
|
next
[–]
I have the opposite experience: random HN/Reddit comments saying “this sucks” or “whoa this is a huge improvement” are the only benchmark that means anything. Standard benchmarks are all gamed and don’t capture the complexity of the real world.
titanomachy
44 days ago
|
prev
|
next
[–]
Then your internal benchmarks will be in the post-training set and you’ll have to make new ones.
_2d30
44 days ago
|
prev
[–]
I may already have but I'm pseudonymous on this website.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search:
Otherwise you're doomed to "sample size of one" level of relevance.