

jtbaker 11 months ago | on: The best way to use text embeddings portably is wi...

> The problem with Parquet is it’s static. Not good for use cases that involve continuous writes and updates. Although I have had good results with DuckDB and Parquet files in object storage. Fast load times.

You can use glob patterns in DuckDB to query remote Parquet files to get around this, though? Maybe break things up using a hive partitioning scheme or similar.
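A rough sketch of what that layout and query could look like. The bucket name, partition keys, and helper functions are all made up for illustration; the only DuckDB pieces assumed here are `read_parquet` with a glob pattern and its `hive_partitioning` option. The code just builds the paths and the SQL string, it does not run anything against DuckDB (reading from S3 would also need the httpfs extension and credentials).

```python
from posixpath import join


def hive_path(root, filename, **partitions):
    """Build a hive-style path: root/key1=val1/key2=val2/filename."""
    parts = [f"{key}={value}" for key, value in partitions.items()]
    return join(root, *parts, filename)


def glob_query(root, depth):
    """Build a DuckDB query that globs every Parquet file `depth` partition levels down."""
    stars = ["*"] * depth
    pattern = join(root, *stars, "*.parquet")
    return f"SELECT * FROM read_parquet('{pattern}', hive_partitioning = true)"


# Hypothetical bucket and partition keys, for illustration only.
ROOT = "s3://example-bucket/embeddings"

# New data arrives as new files, so nothing existing gets rewritten.
new_file = hive_path(ROOT, "part-0.parquet", year=2024, month="01")
query = glob_query(ROOT, depth=2)

print(new_file)
print(query)
```

Continuous writes then become "drop another file into the right partition", and the same glob query picks it up on the next read.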

memhole 11 months ago

I like the pattern described too. The only snag is deletes and updates. IME, you have to either delete the underlying file or create and maintain a view that exposes only the data you want visible.
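One way the view approach might look, as a minimal sketch: keep a small table of deleted ids and have the view filter them out, leaving the underlying data untouched. To keep this runnable with only the standard library it uses `sqlite3` as a stand-in; with DuckDB the raw table would instead be a `read_parquet(...)` over the files. Table, view, and function names are hypothetical.

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.executescript("""
    -- Stand-in for the immutable Parquet data (never modified below).
    CREATE TABLE embeddings_raw (id INTEGER, label TEXT);
    INSERT INTO embeddings_raw VALUES (1, 'a'), (2, 'b'), (3, 'c');

    -- Hypothetical tombstone table: ids to treat as deleted.
    CREATE TABLE deleted_ids (id INTEGER PRIMARY KEY);

    -- The view hides tombstoned rows; readers query this, not the raw data.
    CREATE VIEW embeddings_visible AS
    SELECT r.*
    FROM embeddings_raw AS r
    WHERE NOT EXISTS (SELECT 1 FROM deleted_ids AS d WHERE d.id = r.id);
""")


def soft_delete(con, row_id):
    """Mark a row as deleted without touching the underlying data."""
    con.execute("INSERT OR IGNORE INTO deleted_ids VALUES (?)", (row_id,))


soft_delete(con, 2)

visible = [row[0] for row in con.execute("SELECT id FROM embeddings_visible ORDER BY id")]
raw_count = con.execute("SELECT COUNT(*) FROM embeddings_raw").fetchone()[0]
print(visible, raw_count)
```

The "maintain" part is the cost: the tombstone table has to live somewhere writable, and the deleted rows stay in the files until someone rewrites them.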
