Submissions from twitter.com/tim_dettmers

		Model 4 bit inference 4.2x faster than 16 bit with full HF support (twitter.com/tim_dettmers)
		2 points by convexstrictly on July 11, 2023 \| past \| 1 comment
		99% ChatGPT performance on a consumer GPU (twitter.com/tim_dettmers)
		5 points by dragonwriter on May 25, 2023 \| past
		Guanaco, a chatbot on a single GPU, achieving 99% ChatGPT performance (twitter.com/tim_dettmers)
		10 points by rkwasny on May 24, 2023 \| past \| 1 comment
		Tim Dettmers: QLoRA finetunes a 65B model on a single 48 GB GPU (twitter.com/tim_dettmers)
		3 points by convexstrictly on May 12, 2023 \| past \| 1 comment
		Single desktop machine ML up to scales of 176B params (twitter.com/tim_dettmers)
		4 points by nl on Aug 11, 2022 \| past