Show HN: I made a Pinterest clone using SigLIP image embeddings

yorwba · 2024-02-16T20:12:06 1708114326

Sometimes there are duplicate results, e.g. https://mood-amber.vercel.app/images/0b733fc2-7093-4443-8872... has two copies of https://mood-amber.vercel.app/images/f920a599-bbd7-4805-3317... right next to each other. (The link UUID is the same, so I assume this is an issue with the search algorithm, not simply duplicate data that got scraped.)
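
If the duplicates really do share a UUID, one stopgap is to filter the ranked results by ID before rendering. This is only a sketch under that assumption: the `id` key and the `dedupe_by_id` helper are hypothetical names, not anything taken from the site's actual code.

```python
from typing import Iterable


def dedupe_by_id(results: Iterable[dict], key: str = "id") -> list[dict]:
    """Keep the first occurrence of each ID, preserving the ranking order."""
    seen = set()
    unique = []
    for row in results:
        if row[key] in seen:
            continue  # a row with this ID was already emitted higher in the ranking
        seen.add(row[key])
        unique.append(row)
    return unique


if __name__ == "__main__":
    # Hypothetical search output with one repeated ID.
    rows = [
        {"id": "a", "score": 0.9},
        {"id": "b", "score": 0.8},
        {"id": "a", "score": 0.8},
    ]
    print([r["id"] for r in dedupe_by_id(rows)])
```

This hides the symptom rather than explaining why the search returns the same row twice, so it is probably worth checking the query as well.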

verse · 2024-02-16T20:31:52 1708115512

ah! thank you for pointing this out. will fix

lulzx · 2024-02-16T19:25:15 1708111515

Also, check https://same.energy/

wucaworld · 2024-02-16T11:36:33 1708083393

Very cool! How did you get the collage layout? I noticed images in each column don’t have the same size. I assume images get centre-cropped?

jkcxn · 2024-02-16T13:36:20 1708090580

It’s called a masonry grid. Images retain their aspect ratio so they don’t need to be cropped. You can kind of simulate it with CSS, but there are proposals to add a proper masonry layout to CSS.
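
For anyone curious how the placement can work, here is a rough sketch of one common approach: scale each image to the column width, then drop it into whichever column is currently shortest. The function and field names are made up for illustration, and this is not necessarily how the site in question does it.

```python
from dataclasses import dataclass


@dataclass
class Placement:
    index: int     # position of the image in the input list
    column: int    # column the image was assigned to
    x: float       # left offset in pixels
    y: float       # top offset in pixels
    width: float   # rendered width (always the column width)
    height: float  # rendered height, scaled to keep the aspect ratio


def masonry_layout(sizes, columns, column_width, gap=0.0):
    """Place images given as (width, height) pairs into the shortest column."""
    heights = [0.0] * columns  # running height of each column
    placements = []
    for i, (w, h) in enumerate(sizes):
        col = heights.index(min(heights))  # shortest column, leftmost on ties
        scaled_h = column_width * h / w    # preserve aspect ratio, no cropping
        placements.append(
            Placement(i, col, col * (column_width + gap), heights[col],
                      column_width, scaled_h)
        )
        heights[col] += scaled_h + gap
    return placements


if __name__ == "__main__":
    for p in masonry_layout([(100, 200), (100, 100), (100, 100)], 2, 100):
        print(p)
```

Because every image keeps its own height, the columns end up ragged at the bottom, which is what gives the collage look.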

verse · 2024-02-16T19:20:26 1708111226

yeah. I actually wrote the logic for the layout myself (wasn't really happy with the available libraries). may open source this if people are interested!

ReD_CoDE · 2024-02-16T22:16:45 1708121805

Can you share your GH to follow updates? Also, take a look at this, they have a layout too https://github.com/lit/lit/tree/main/packages/labs/virtualiz...

verse · 2024-02-16T22:24:58 1708122298

will post on twitter:

https://x.com/verse_

ReD_CoDE · 2024-02-18T06:03:33 1708236213

I loved your text effects! You did some cool side-projects

Isn't the time for some big movements? Get in touch

seattleeng · 2024-02-16T20:10:25 1708114225

Cool! I haven’t tried SigLIP out yet but it seems to be the new hotness over CLIP… I just don’t have a good project idea yet

Tiberium · 2024-02-16T07:21:10 1708068070

Is there a repo, especially for training? I'd like to see how SigLIP performs on a dataset of only anime images.

jarebear6expepj · 2024-02-16T17:35:33 1708104933

The vision training models are available here: https://github.com/google-research/big_vision/tree/main which I am assuming, based on the research paper, is what was used for the project.

gammalost · 2024-02-16T21:08:43 1708117723

There are some interesting images there. Why are you not including the source of the images?

GamerAlias · 2024-02-16T08:42:44 1708072964

Good stuff! Do you have any intuitive sense of whether SigLIP is particularly stronger than CLIP here? Also vector DB over Faiss index?

verse · 2024-02-16T19:24:43 1708111483

I haven't done much testing or anything, but it seems to me that siglip "understands" what it's looking at more than CLIP

also no, I just put everything on Supabase and added pgvector. super easy:

https://supabase.com/docs/guides/database/extensions/pgvecto...
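
To give a feel for what the similarity lookup boils down to, here is a brute-force sketch in plain Python of ranking by cosine distance, which is the metric pgvector exposes through its `<=>` operator. The toy two-dimensional vectors and the `nearest` helper are assumptions for illustration only; a real setup would run this as a query inside Postgres over the actual image embeddings.

```python
import math


def cosine_distance(a, b):
    """Return 1 - cosine similarity for two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def nearest(query, items, k=5):
    """Return the k item IDs whose embeddings are closest to the query.

    items maps an ID to its embedding; every item is scanned (no index).
    """
    ranked = sorted(items, key=lambda item_id: cosine_distance(query, items[item_id]))
    return ranked[:k]


if __name__ == "__main__":
    # Toy embeddings standing in for real image vectors.
    items = {"a": [1.0, 0.0], "b": [0.0, 1.0], "c": [1.0, 1.0]}
    print(nearest([1.0, 0.0], items, k=2))
```

A full scan like this is fine for small collections; the point of a vector extension or a dedicated index is to avoid it as the dataset grows.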

ReD_CoDE · 2024-02-16T19:30:42 1708111842

qdrant doesn't support vector DB over Faiss index?

Also, pgvector or qdrant? which is better?

squam · 2024-02-16T06:51:52 1708066312

Cool project! Thanks for sharing

Yenrabbit · 2024-02-16T05:20:16 1708060816

Neat! How many images are in the dataset out of curiosity?

convolvatron · 2024-02-16T20:55:16 1708116916

how far we've come since https://www.karlsims.com/genetic-images.html

quite a bit, but surprisingly not

ijhuygft776 · 2024-02-16T03:21:39 1708053699

nice, we always need more clones and improvements.... hope you get traction.

I never click Pinterest links because the experience is too bad.

karolist · 2024-02-16T19:00:23 1708110023

I use the unpinterested extension in Chrome to remove Pinterest from search results; I was annoyed so much at some point. Maybe their SEO spam is more under control now, not sure.