My experience is that Gemini works relatively well on larger contexts. Not perfect, but more reliable.

I guess the 90% figure is for the "benchmark", which is typically tailored to be challenging to parse.

It's a strict improvement over the pre-LLM technology, which usually just assigns random cryptic symbols.

No, it's really not a strict improvement. A meaningless name like `v2` at least conveys that you, as the analyst, haven't yet understood the variable's role well enough to rename it to something fitting its inferred purpose. If the LLM comes up with an "informative" variable name that doesn't match what the variable actually does, the name can waste your time by misleading you about its role.

I think Ghidra could do better even without any LLM involved. Ghidra will define local variables like this:

    SERVICE_TABLE_ENTRY* local_5c;

I wish it at least did something like:

    SERVICE_TABLE_ENTRY* local_5c_pServiceTableEntry;

Oh yeah, there’s probably some plugin or Python script to do this. But I just dabble with Ghidra in my spare time.

It would be great if it tracked the origin of each variable/parameter name and could show names in a different colour (or with some other visual distinction) based on that origin. That way you could easily distinguish “name manually assigned by analyst” (probably correct) from “name picked by some LLM” (much more tentative; could easily be a hallucination).


In my view one of the most pressing shortcomings of Ghidra is that it can't understand the lifetimes of multiple variables with overlapping stack addresses: https://github.com/NationalSecurityAgency/ghidra/issues/975
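For concreteness, here's a minimal C++ sketch of the pattern that trips this up (made-up values; whether the slot is actually reused is up to the compiler). Two locals with disjoint lifetimes may legally share one stack offset, and a decompiler that keys variables purely on stack offsets merges them into a single `local_XX` with conflicting int/float uses:

    #include <cstdio>

    int main() {
        {
            int count = 42;              // lifetime ends at the closing brace,
            std::printf("%d\n", count);  // freeing its stack slot
        }
        {
            float ratio = 1.5f;          // the compiler may reuse count's slot;
            std::printf("%f\n", ratio);  // Ghidra then sees one variable with
        }                                // both int and float uses
        return 0;
    }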

Ghidra does have an extensive scripting API, and I've used LLMs to help me write scripts for bulk changes like the one you've described. But you'd have to think about how to keep the name suffix synchronized as you retype variables during your analysis.


Yeah, I don't know why they don't use something like SSA: have every line of code that performs an assignment create a new local variable.

Although I suppose when decompiling to C, you need to translate it out of SSA form when you encounter loops or backwards control flow.
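A hand-written sketch of the difference (illustrative names, not actual Ghidra output):

    #include <cstdio>

    int main() {
        // Non-SSA decompiler output: one name carries three different values.
        int local_10 = 7;
        local_10 = local_10 * 2;
        local_10 = local_10 ^ 0x5A;
        std::printf("%d\n", local_10);

        // SSA-style renaming: every assignment defines a fresh variable,
        // so each name has exactly one meaning.
        int local_10_1 = 7;
        int local_10_2 = local_10_1 * 2;
        int local_10_3 = local_10_2 ^ 0x5A;
        std::printf("%d\n", local_10_3);
        return 0;
    }

At a loop head the same variable can receive a value from two predecessors, which SSA models with a phi node that has no direct C syntax; that's exactly where you'd have to translate back out.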


I disagree with it as a blanket statement. It’s related to the problem of hallucinations in LLMs: sometimes they come up with plausible, misleading bullshit. The user quickly comes to rely on the automatic labeling as a crutch, then exhausts mental effort trying to distinguish the bullshit from the accurate labels. The cryptic symbols convey no meaning, but consequently they can’t mislead you either.

I don’t reject the whole concept, and I’m bullish on AI-assisted decompilation. But the UX needs to help the user build confidence in the results, much as source-level static analyzers generate proofs.


Isn't that a table-valued function? IIRC, the SQL standard still doesn't have them, but they're an almost universally supported extension across vendors.

At least in Postgres, table-valued functions can't take tables as arguments, only scalars. That's the main difference: functors can not only return tables but also take tables satisfying some interface as arguments.

https://www.postgresql.org/docs/7.3/xfunc-tablefunctions.htm...

I thought I had written a footnote or appendix about this but I guess I forgot.


MSSQL can take tables as arguments if they are table variables declared to be of a specific user-defined table type. But that restriction limits their use a lot.

If there's any company that can afford "real-time LLM training" at this moment, I'm 100% sure they will win this AI race, since they probably have at least ~10x the compute of their competitors. Of course, no one can do that right now.


My take is that `auto` is basically a tool for reducing local redundancy rather than a typing convenience. Rule of thumb: avoid `auto` unless it actually improves readability (e.g., a significant reduction of syntactic redundancy) or there is no other option.
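A quick illustration of that rule of thumb (a sketch; the types are arbitrary):

    #include <map>
    #include <string>
    #include <vector>

    int main() {
        std::map<std::string, std::vector<int>> index;

        // Reasonable: spelling out the iterator type would only repeat
        // what the declaration of `index` already says.
        auto it = index.begin();

        // Dubious: there is no redundancy to remove, and `auto` hides
        // the one fact the reader needs.
        auto flags = 0x2u;  // clearer as: unsigned flags = 0x2u;

        (void)it;
        (void)flags;
        return 0;
    }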


Google seriously needs to scale up their generative models across all of their crawling/indexing/ranking infrastructure. Their current ranking models are not capable of dealing with a next-gen web that is 99% gen-AI crap. I think they know this too. The problem is the cost; they're hyper-focused on bringing it down, but it's not happening fast enough.


This is because programming is not work in a continuous solution space. Think of it this way: you're almost guaranteed to introduce obvious bugs by randomly changing just a single bit or token. Assemblers, compilers, stronger type systems, etc. all try to limit this by offering a view that is more coherent to human reasoning. But computation has an inherently emergent character that is hard to predict or prove at compile time (see Rice's theorem), so if you want safety guarantees by construction, this discreteness has to be much more visible.
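A one-token illustration (contrived, but the point generalizes):

    #include <cstdio>

    int main() {
        int a[4] = {1, 2, 3, 4};
        int sum = 0;
        // Change the single token `<` below to `<=` and the loop reads one
        // element past the end of `a`: undefined behavior, not a program
        // that is merely "slightly worse". Nearby points in program space
        // are not nearby in behavior.
        for (int i = 0; i < 4; ++i)
            sum += a[i];
        std::printf("%d\n", sum);
        return 0;
    }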


When you write something like that, you probably want to spell out what the "far more important issues to worry over" are and why pursuing this legislation would delay them. And just in case you're not aware: those "far more important issues" are usually much harder and more complex to build consensus on across stakeholders, and passing this kind of minor legislation doesn't really have any impact or cause any delay there. This is more of a "spare time" thing.


This is not an actual problem that needs to be solved in the first place. It's a total waste of tax dollars.


You may think that way, but to make such a strong statement you probably want to do your homework first, like looking up the discussions and their rationale. Usually there is a good reason behind a law that passes almost unanimously.


It apparently _is_ an actual problem, which is why an attempt at solving it is being made. A black market for reservations is bad not just for customers but also for the establishment, which loses out on business if the reservations aren't actually sold and nobody shows up.


They should go after Ticketmaster and Live Nation, which are a far, far bigger issue; except, of course, those companies lobby hard while the black-market reservation folks don’t. The reservation folks need to unionize or something and then lobby hard too :)


It's not a problem at all. Cook at home or go to another restaurant. These legislators should be put on a strict diet of water and bread for a few years until they learn.


Exactly. How is this so-called problem even possible without the restaurants allowing it? If they don't like it, they can stop allowing it. If they don't care, then why should the public care?


This is a prevalent misconception that assumes advertisers don't care how their money is spent! Advertisers and Google are actually concerned about SEO garbage. Nowadays, most advertisers tend to pay based on the number of conversions, their value, and ROI. Those spammy sites usually yield a garbage CVR even though their CTR looks great. Advertisers don't like this.

People don't seem to acknowledge that the number of clicks is no longer the important metric. It seemed important back when it was the only meaningful performance metric, but the ultimate metric that ties to money is advertiser budget allocation. If advertisers see Google Search performing worse in terms of conversions, they will cut their budgets there. And that is the real problem Google has.
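A toy calculation (all numbers invented) of how a high-CTR placement can still lose on the metric advertisers actually buy:

    #include <cstdio>

    int main() {
        const double impressions = 100000.0, cost = 500.0;

        const double ctr_spam = 0.05, cvr_spam = 0.002;  // many clicks, few buyers
        const double ctr_good = 0.01, cvr_good = 0.05;   // fewer, better clicks

        const double conv_spam = impressions * ctr_spam * cvr_spam;  // 10 conversions
        const double conv_good = impressions * ctr_good * cvr_good;  // 50 conversions

        std::printf("cost per conversion, spammy: %.0f\n", cost / conv_spam);  // 50
        std::printf("cost per conversion, quality: %.0f\n", cost / conv_good); // 10
        return 0;
    }

The spammy placement gets five times the clicks but costs five times as much per conversion, so the budget moves away from it.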


Different groups often look at different metrics. It’s possible that the sales group cares about one value while the dev group optimizes for another.


Where else is that money being spent?

