More

aloknnikhil · 2024-09-28T15:59:28 1727539168

I completely disagree with you. The fundamental problem with your concept of open source is it goes against what open source really is. The ability for you to completely change what a piece of software can do. IMO, even with LLMs, models are "executables" and weights are "configuration". Yes, of course you can tune the weights by changing the values, but that's the most I can do. Can I actually add "features" to the model? Perhaps you "open-sourced" an LLM model trained on the United States Constitution. Can I change the model to then be a specialist in real estate law? Not with weights. I need it to learn case histories to extend its "feature-set". Without data and the mechanism to reproduce the model, how is this "open-source"?

NitpickLawyer · 2024-09-28T16:44:52 1727541892

> Can I actually add "features" to the model?

Yes. You can use a number of libraries to add, mix, merge, etc. layers [1]

> Not with weights. I need it to learn case histories to extend its "feature-set".

Again, yes. You can add attention heads, other features, heck you can even add classification if you want [2]. Because you are working with an open architecture! What you think of weights are not binary blobs. That is a common missconception.

[1] - https://github.com/arcee-ai/mergekit

[2] - https://github.com/center-for-humans-and-machines/transforme...

aloknnikhil · 2024-09-28T18:22:49 1727547769

At first glance, that just seems like a bunch of libraries linked together to form a binary. That is not open-source. I completely agree with you that there is just not enough clarity out there. For my education, following up with my earlier example, can I remove the layers that have references to all chapters / laws in the constitution except for the ones meant for real-estate? How would I do that with the approaches you mentioned here?

Fundamentally, if I have to "reverse-engineer" something, then it's not open-source.

ErikBjare · 2024-10-01T19:17:46 1727810266

You would have to do the same fine-tuning as if you had the training data.

aloknnikhil · 2024-08-08T04:54:43 1723092883

I couldn't bother myself to read the whole article. Got GPT-4 to summarize the main points. Not as much insight as I thought I would get going in.

1. *Testing in Staging vs. Production*: - Most engineers prefer testing in staging due to a sense of control. - There's a misconception that it's an either/or situation between staging and production testing. In reality, both are necessary.

2. *Importance of Production Testing*: - Staging environments can’t replicate all possible real-world scenarios. - Production testing is essential to identify complex, real-world issues missed in staging.

3. *Uber's Approach to Testing*: - Uber tests its payment systems in production. - They have developed tools (Cerberus and Deputy) to facilitate transparent interaction with real systems and gather responses effectively.

4. *Every Deployment as an Experiment*: - Every deployment is treated as a hypothesis to be validated against business metrics. - Metrics and monitoring are crucial to determine the success of deployment.

5. *First Rollout Region*: - Uber chooses a specific first rollout region to minimize risk and impact. - Initial rollouts are conducted in regions that are small but significant for practical monitoring.

6. *Canary Deployments*: - Uber conducts canary deployments to a subset of users to detect and mitigate potential issues early. - This approach helps in identifying and fixing issues with minimal impact.

7. *Examples of Issues Discovered Early*: - Uber detected significant issues with GooglePay during its cautious rollout in Portugal, which would have been difficult to identify in a staging environment alone.

8. *Philosophy on Software Quality*: - True robustness and resiliency come from real-world usage and the continuous fixing of encountered issues. - Only production can provide the real stakes and conditions needed for thorough validation.

9. *Author and Newsletter*: - Alvaro Duran, author of “The Payments Engineer Playbook”, emphasizes the importance of sharing and learning from real-world experiences in payments systems. - Encourages readers to engage with the content and share it with colleagues for broader impact.

aloknnikhil · 2024-07-28T00:19:43 1722125983

We can help. Please reach out at: founders@omnistrate.com

https://omnistrate.com is our product

aloknnikhil · 2024-06-28T18:52:19 1719600739

It's down again today

aloknnikhil · 2024-06-07T20:06:10 1717790770

I thought the hottest place on the planet was the Lut Desert. https://whc.unesco.org/en/list/1505/

aloknnikhil · 2024-05-08T16:08:01 1715184481

Very cool! Thanks for sharing mydumper

aloknnikhil · 2024-05-08T15:45:24 1715183124

Fair question. My motivation was mainly to understand if there was something specific that drove that choice. Better question for you maybe: For any new project, which one would you choose and why?

gmiller123456 · 2024-05-08T17:54:28 1715190868

Since I'm not starting a new major project, it's a pointless question to ask as I'm not going to research what I should use. But you completely ignored my question, why do you choose Postgres?

aloknnikhil · 2024-03-18T03:50:38 1710733838

But with all the background OS activity, will this chip ever sustain turbo to 6.2 for noticeable periods? Pure benchmarking win imo.

vbezhenar · 2024-03-18T06:18:43 1710742723

You think 16 efficient cores will not make it?

happycube · 2024-03-18T12:51:14 1710766274

They're clocked lower.

aloknnikhil · 2024-01-01T18:42:46 1704134566

The USGS also has a live list of these: https://earthquake.usgs.gov/earthquakes/map/?extent=-29.5352...

aloknnikhil · on Oct 2, 2023

This is the original author / article - https://medium.com/@mbianchidev/2023-devops-is-terrible-ec88...