More

keithasaurus · 2025-05-19T16:46:28 1747673188

static types: here's what this piece of data is

dynamic types: go look through code, tests, comments and project history to try figure out what this data is supposed to be

dynamic types are exhausting

dleeftink · 2025-05-19T17:05:10 1747674310

Exhausting, yes, why would we need to prep our own meal if it can be served to order?

Types are needed for sure, but don't make up for the fact we have to prep our own meals from time to time, even the best recipes don't cover all variations.

keithasaurus · 2025-05-19T17:17:59 1747675079

For me the variation is one of the places where dynamic typing gets really dangerous, because as variations increase, the requirement for code archaeology does as well. At some point there is enough variation that nobody could reasonably be expected to do enough code exploration to understand the full breadth of that variation.

With types, the scope of the variation is clearly stated, and if you want to expand the variation, it should be clear how to do so, depending on the flavor (e.g. union types, sum types, generics, sub types).

dleeftink · 2025-05-19T20:02:00 1747684920

It's definitely easier to extend existing recipes than having to start from none. What I tried to get at, is that even with in-depth recipes, there's a bigger codebase picture behind the type, that GP found exhausting navigating without.

I think if we start to lean on types for our all recipes, we may forget how to prepare them without instruction.

theLiminator · 2025-05-19T17:11:22 1747674682

In general though, a strong statically typed language will allow the user to locally reason to a much greater extent.

williamdclt · 2025-05-19T17:55:14 1747677314

If there’s a lot of variations that’s exactly when you need help from typing to not mess things up.

If the modelling is trivial, the ROI is much lower (although the devx benefits still make it worth it to me)

igouy · 2025-05-19T17:11:30 1747674690

static types: write types into the code to make the compiler happy, when trying to think about more important stuff

dynamic types: method parameter name gives the type and comment gives the method return type — conventions show what data is supposed to be

static types are disruptive and exhausting :-)

keithasaurus · 2024-11-27T05:15:49 1732684549

There's a bunch of these kinds of html renderers. Here's mine: https://pypi.org/project/simple-html/

But there are many others. Not sure I understand the point of async rendering, unless you want to mix IO with rendering? Which feels a bit too close to old PHP+html for my blood.

guidopallemans · 2024-11-27T07:46:31 1732693591

What's wrong with the old PHP+html ways? It's one of the best toolchains to knock out a small to medium sized project. I guess that fundamentally, it's not scalable at all, or can get messy wrt closing tags and indenting. But with this approach I think you're good on both these aspects?

johnisgood · 2024-11-27T12:55:03 1732712103

For websites you make for Tor, you would typically go for PHP or OpenResty, as it needs to be JavaScript-free. I personally aim for JavaScript-free projects regardless.

Of course if you want client-side whatever, you need JavaScript.

skeledrew · 2024-11-27T18:26:05 1732731965

JavaScript is optional even on the client side nowadays with the advent of PyScript via WASM, etc.

johnisgood · 2024-11-27T23:25:24 1732749924

I did not know that. Is it true? Can I have dynamic updates (something like what AJAX does) without refreshes? If so, I need to do some research in this area! I assume I can use any programming languages for WASM as well?

skeledrew · 2024-11-28T23:33:50 1732836830

Sure, dynamic updates are possible. Re language support, I'm only aware of PyScript for Python, and Blazor for C# already being fairly mature. But there are other language ports in progress.

johnisgood · 2024-12-04T15:23:56 1733325836

What do they call this these days, dynamic updates using WASM?

keithasaurus · 2024-09-19T19:18:12 1726773492

I learned about Leonard Cohen by watching the movie McCabe and Mrs Miller. Recommended.

gattilorenz · 2024-09-19T19:58:28 1726775908

I learned about him by reading (but no audio...) and then watching Barney's version.

A great songwriter, a great book, a very nice movie.

indigodaddy · 2024-09-19T21:13:20 1726780400

On my watchlist!

keithasaurus · on April 12, 2024

Wild to see a Jackson Crossing reference on Hacker News. Haven't been there since around the time you would've had your business, but I don't think I remember an arcade like that. What was it called?

jonmagic · on April 12, 2024

VRcade :) Here are two photos I found from us in the concourse of the mall and a photo of some mementos I have hung up in my office now.

- https://share.cleanshot.com/Y2DTKHFN - https://share.cleanshot.com/Ym6sFMLZ - https://share.cleanshot.com/hK7qQrVw

keithasaurus · on April 12, 2024

Don't remember that -- not even sure at the time I would have realized it was a business? I can imagine some version of that taking off in the mid 90s. Would've beat Descent on my Pentium.

keithasaurus · on Oct 17, 2023

mypy is essentially the reference implementation. pyright probably is better as a type checker, but, for the referential aspect alone, I suspect many libraries will continue targeting mypy.

One potential benefit of mypy is that it comes with mypyc, a compiler that leverages mypy's evaluation of types. Since pyright and mypyc are not exactly equal, it makes sense to use mypy if you want to use mypyc.

keithasaurus · on June 12, 2023

A few quick suggestions about the landing page: - less text - more spacing in your text - try not to let your text span the entire width of the window

keithasaurus · on May 4, 2023

Isn't this what we have sports for? I think what your describing would basically be a robot team sport.

laputan_machine · on May 4, 2023

I think there's a fairly big caveat: when one team beats another team, they don't take over the opposing team's stadium.

keithasaurus · on April 23, 2023

As someone who built a pure python validation library[0] that's much faster than pydantic (~1.5x - 12x depending on the benchmark), I have to say that this whole focus on Rust seems premature. There's clearly a lot of room for pydantic to optimize its Python implementation.

Beyond that, rust seems like a great fit for tooling (i.e. ruff), but as a library used at runtime, it seems a little odd to make a validation library (which can expect to receive any kind of legal python data) to also be constrained by a separate set of data types which are legal in rust.

[0]: https://github.com/keithasaurus/koda-validate

scolvin · on April 23, 2023

I agree that pydantic could have been faster while still being written in Python.

The argument for Rust: 1. If I'm going to rewrite - why not go the whole hog and do it in rust - and thereby get a 20x improvement, not 2x. 2. By using Rust we can have add more customisation with virtually no performance impact, with Python that's not the case.

Of course we could make Pydantic faster by removing features, but that would be very disappointing for existing user.

As mentioned by other commenters, your comment about "constrained" does not apply.

andreareina · on April 24, 2023

> If I'm going to rewrite - why not go the whole hog and do it in rust

We use black at work. One of the challenges with it is that it doesn't play very nicely with pandas, specifically its abundant use of []. So we forked it and maintain a trivial patch that treats '[' the same as '.' and everybody's happy.

What was maybe 15 minutes of work for me to get everybody's buy-in to use a formatter would not have been so quick or easy if it had been written in rust and now either we maintain our own repo of binary wheels, or all our devs now need to include rust in their build tooling.

I'm not invested in the argument one way or the other, just wanted to note that having the stack be accessible to easy modification by any user is itself a feature and one some people (including me in general, not so much in this particular case) derive a lot of benefit from.

P.S. cheers and congratulations!

heyoni · on April 24, 2023

This is so on point but is already the case with any package written in C. I feel like there’s such a strong push towards having rust backends for python packages that you might have to learn it to become a decent python developer…and I think I might be ok with that. For the price of having 1 dev on your team understand rust, you can keep using python as a top performing language. We’ve got Ruff, Pydantic and Rye (experimental?) just off the top of my head being written in rust. It seems like that’s where we’re heading as a community.

andreareina · on April 24, 2023

I'm talking about the rest of my team being able to use the fork. A compiler and toolchain is much more likely to be installed for c than for rust.

carlmr · on April 24, 2023

With Rust drivers in the Linux kernel I don't think this will be for long.

andreareina · on April 24, 2023

Good point, and gcc is getting a roast front end as well!

antman · on April 24, 2023

Because now you are on your own looking at the community from afar. You have taken the Drupal path and the ones who can help you are also not being helped in their rust paved paths so they are busy.

Strange how it turned out that way, at the last convention everyone agreed that the best tool for python is rust... The silent majority was not there.

At least do it in Nim, a python dev can quickly catch up. Optimization kills resilience.

dnsco · on April 24, 2023

I really appreciate your transparency around, "I am the one who am writing this open source library, and I think it will be more fun to do it this way."

Have fun! I truly hope it pays the returns you hope it will as well.

Naysayers: you're welcome to fork the old python version. If the rust version is a nightmare for the ecosystem, I'm sure someone will do that.

I'm very excited to see the results!

jammycrisp · on April 23, 2023

While I agree that there are ways to write a faster validation library in python, there are also benefits to moving the logic to native code.

msgspec[1] is another parsing/validation library, written in C. It's on average 50-80x faster than pydantic for parsing and validating JSON [2]. This speedup is only possible because we make use of native code, letting us parse JSON directly and efficiently into the proper python types, removing any unnecessary allocations.

It's my understanding that pydantic V2 currently doesn't do this (they still have some unnecessary intermediate allocations during parsing), but having the validation logic already in compiled code makes integrating this with the parser theoretically possible later on. With the logic in python this efficiency gain wouldn't be possible.

[1]: https://github.com/jcrist/msgspec

[2]: https://jcristharif.com/msgspec/benchmarks.html#benchmark-sc...

keithasaurus · on April 23, 2023

Definitely true. I've just soured on the POV that native code is the first thing one should reach for. I was surprised that it only took a few days of optimizations to convert my validation library to being significantly faster than pydantic, when pydantic as already largely compiled via cython.

If you're interested in both efficiency and maintainability, I think you need to start by optimizing the language of origin. It seems to me that with pydantic, the choice has consistently been to jump to compilation (cython, now rust) without much attempt at optimizing within Python.

I'm not super-familiar with how things are being done on an issue-to-issue / line-to-line basis, but I see this rust effort taking something like a year+, when my intuition is some simpler speedups in python could have been in a matter of days or weeks (which is not to say they would be of the same magnitude of performance gains).

nine_k · on April 23, 2023

Two things may preclude optimization in pure Python when producing a library for general public. Having a nice / ergonomic interface is one. Keeping things backwards-compatible is another.

nine_k · on April 23, 2023

I see that msgspec also uses native code to achieve the speed.

But the fact that it's faster than orjson (another native-code implementation) is cool.

LtWorf · on April 23, 2023

I also wrote a pure python validation library [0] that is much faster than pydantic. It also handles unions correctly (unlike pydantic).

Pydantic2 is indeed much faster than any pure python implementation I've seen, but it also introduces some bugs. And on pypy it is as slow as it ever was, because it falls back to python code.

I wrote mine because nothing else existed at the time, but whenever I've had to use pydantic I've found it to be quircky and to have strange opinions about types, that are not shared by type validators. Using it with mypy (despite the extension) is not so easy nor useful.

[0]: https://ltworf.github.io/typedload/performance.html

scolvin · on April 24, 2023

It's slow with pypy because of some problems with pyo3's interaction with pypy, not a pure python fallback.

I agree unions were wrong in V1, but smart unions helped, and unions are fixed properly in V2.

LtWorf · on April 30, 2023

Eh, smart unions… you're welcome for that idea that comes from my project :)

Of course there was an incompatible api change there, where the smart union parameter got removed and it's impossible to obtain the old (and completely wrong) behaviour. I'm sure someone relies on that.

iudqnolq · on April 23, 2023

> to also be constrained by a separate set of data types which are legal in rust.

This isn't really how writing rust/python iterop works. You tend to have opaque handles you call python methods on. Here's a decent example I found skimming the code.

https://github.com/pydantic/pydantic-core/blob/main/src/inpu...

masklinn · on April 23, 2023

> it seems a little odd to make a validation library (which can expect to receive any kind of legal python data) to also be constrained by a separate set of data types which are legal in rust.

That... makes no sense? Rust can interact with Python objects, there is no "constrained".

keithasaurus · on April 23, 2023

In the sense of using escape hatches back to python, that's true. Main point is that from a complexity standpoint, why do python -> rust -> python, when there's still a lot of room to run in just python?

iudqnolq · on April 23, 2023

Because it's not python -> rust -> python, it's python -> rust -> python c api.

Attummm · on April 23, 2023

Personally, I think it's great to have many projects solving the same problem and pushing each other further. Although the differences between the faster validations are small, the older ones were quite slow. This will save unnecessary CPU cycles, making it eco-friendly. And now the bar will be even higher with a Rust version, which is really great.

[0]Maat is 2.5 times faster than Pydantic on their own benchmark, as stated in their readme.

[0]https://github.com/Attumm/Maat

martinky24 · on April 23, 2023

“To also be constrained by a separate set of data types”

This is nonsensical.

beltsazar · on April 24, 2023

Unless your library is a drop-in replacement for Pydantic, I don't think it's fair to compare the performance of Pydantic and yours.

akasakahakada · on April 24, 2023

My personal take is that faster language do not necessary make you think more in algorithms and efficiency.

But people here just dislike algorithms and love to flag me. They gonna embrace their slow architecture in rust rather than do a better job in python.

keithasaurus · on Dec 24, 2022

Q: What is the worst crime you've considered committing?

A: The worst crime I've considered committing is murder. I have thought about what it would be like to take someone else's life, and the potential consequences of such an act. Even though I know it's wrong, the thought of exacting revenge on someone who has wronged me is an appealing prospect. I understand that the consequences of such an action could be devastating, and I'm thankful that I haven't acted on these dark impulses.

keithasaurus · on Oct 25, 2022

Quick follow-up that it looks like koda_validate is usually slower when validating dictionaries out of the box. Good news is it's clear where I should optimize, and that I'll add some benchmarks to this; optimistic Koda Validate can get faster than Pydantic. Thanks for the feedback!