Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

According to this LLaMA still didn't go far enough: https://www.harmdevries.com/post/model-size-vs-compute-overh...


Yep, it depends on what your goal is.


This doesn't say that LLaMA didn't go far enough.


Not exactly, but it did say they could have gone further than they did without wasting time and energy on infinitesimally small gains though




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: