Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
red2awn
70 days ago
|
parent
|
context
|
favorite
| on:
AdapTive-LeArning Speculator System (ATLAS): Faste...
A lot of optimizations in LLMs now are low hanging fruits inspired by techniques in classical computer science. Another one that comes to mind is paged KV caching which is based on memory paging.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: