Hacker Newsnew | past | comments | ask | show | jobs | submit | dhruvdh's submissionslogin
1.Accelerating LLM Inference with Parallel Draft Models (PARD) (amd.com)
1 point by dhruvdh 10 months ago | past
2.Open-sourcing Three EXAONE 3.5 Models: 2.4B, 7.8B, 32B (lgresearch.ai)
13 points by dhruvdh on Dec 9, 2024 | past | 4 comments
3.The Tyranny of Possibilities in the Design of Task-Oriented LLM Systems (arxiv.org)
1 point by dhruvdh on Jan 2, 2024 | past | 3 comments
4.MoAI Platform – Scale PyTorch, TensorFlow, etc. to Thousands of GPU/NPUs (moreh.io)
1 point by dhruvdh on Oct 27, 2023 | past
5.Lamini LLM Finetuning on AMD ROCm: A Technical Recipe (lamini.ai)
6 points by dhruvdh on Oct 25, 2023 | past | 4 comments
6.ModuleFormer: Modularity Emerges from Mixture-of-Experts (arxiv.org)
1 point by dhruvdh on Sept 17, 2023 | past | 1 comment

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: