Hi, Yes you can. The community creates quantized variants of these that can run ...

vtail · 2024-12-06T18:38:19 1733510299

Thank you Josh. Is there a resource you can point us to that helps answer "what kind of MacBook Pro memory do I need to run ABC model at XYZ quantization?"

jwitthuhn · 2024-12-06T20:39:46 1733517586

In general you can just use the parameter count to figure that out.

A 70B model at 8 bits per parameter would mean 70GB, 4 bits is 35GB, etc. But that is just for the raw weights. You also need some RAM to store the data that is passing through the model, and the OS eats up some, so add about a 10-15% buffer on top of that to make sure you're good.

Also, the quality falls off pretty quickly once you start quantizing below 4-bit, so be careful with that, but at 3-bit a 70B model (about 26GB of weights) should run fine on 32GB of RAM.
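To make the arithmetic concrete, here is a minimal sketch of that rule of thumb in Python. The function name and the 15% default are just illustrative assumptions taken from the estimate above, not a precise measurement of what any particular runtime needs.

```python
def estimate_memory_gb(params_billions: float, bits_per_param: float,
                       overhead: float = 0.15) -> float:
    """Rough RAM estimate in GB: raw weights plus a buffer.

    overhead is the assumed 10-15% cushion for data passing through
    the model and for the OS; it is a rule of thumb, not a measurement.
    """
    # 1 billion parameters at 8 bits each is roughly 1 GB of weights.
    weights_gb = params_billions * bits_per_param / 8
    return weights_gb * (1 + overhead)


if __name__ == "__main__":
    for bits in (8, 4, 3):
        total = estimate_memory_gb(70, bits)
        print(f"70B at {bits}-bit: roughly {total:.0f} GB")
```

With the 15% buffer, the 3-bit case comes out a little over 30GB, which is why 32GB is about the floor for a 70B model.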

profsummergig · 2024-12-07T21:25:27 1733606727

Does 70b mean there are 70 billion weights and biases in the model?

Filligree · 2024-12-06T20:08:32 1733515712

Look at the filesize, add a couple of GB.
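As a rough sketch of that in Python (the helper name and the 2GB default are assumptions for illustration, and "a couple of GB" is only a ballpark):

```python
import os


def estimate_from_file_gb(path: str, extra_gb: float = 2.0) -> float:
    """Estimate RAM needed as the model file size plus a fixed cushion."""
    size_gb = os.path.getsize(path) / 1e9  # bytes to (decimal) GB
    return size_gb + extra_gb

# Example with a hypothetical path:
#   estimate_from_file_gb("model.gguf")
```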

aiden3 · 2024-12-06T18:59:41 1733511581

How would the pricing on Databricks when using model serving compare to, say, the prices seen in the original post here (i.e., "3.3 70B is 25X cheaper than GPT4o")?

nickpsecurity · 2024-12-06T21:32:15 1733520735

I’ve been wanting to run into someone on the Databricks team. Can you ask whoever trains models like MPT to consider training an open model only on data clear of copyright claims? Specifically, one using only Gutenberg and the permissive code in The Stack? Or just Gutenberg?

Since I follow Christ, I can’t break the law or use what might be produced directly from infringement. I might be able to do more experiments if a free, legal model is available. Also, we can legally copy datasets like PG19 since they’re public domain, whereas most others contain works that I might need a license to distribute.

Please forward the request to the model trainers. Even a 7B model would let us do a lot of research on optimization algorithms, fine-tuning, etc.

evilduck · 2024-12-07T06:15:37 1733552137

I think you're looking for OLMo, https://allenai.org/olmo

nickpsecurity · 2024-12-07T07:14:00 1733555640

They appear to use Common Crawl in the DCLM dataset. Just downloading Common Crawl is probably copyright infringement, even before we consider specific terms in the licenses. arXiv papers have a mix of licenses, with some not allowing commercial use.

If I got the sources right, it’s already illegal with just two sources they scraped. That’s why I want one on Gutenberg content that has no restrictions.

profsummergig · 2024-12-06T18:36:33 1733510193

Thank you! Very helpful!