Thanks for your work, even if it got shut down. Good thing the price of gene sequencing continues to fall - eventualy a responsible and open group of academics will start a more accessible database. Illumina's new machine is reportedly $200 a person.[0] At this price 1 million entries in the dataset would cost two hundred million. GWAS starts being useful at a few hundred thousand entries. I honestly don't think 200 m is so much in the grand scheme, given that biotech companies are valued in the tens of billions. The price will also only continue to fall. Science doesn't acquiesce to ideology - it's the other way around.
GWAS is a very limited technique. Most people in genomics still have very simplified models of how complex organismal phenotypes are formed. My goal was to build an embedding between genotype and phenotype that had predictive and generative abilities. Based on everything I've seen, a few tens of millions of genomes for a few hundred highly specific molecular phenotypes is definitely doable if you had all the world's genomics data at your fingertips.
[0]https://www.bloomberg.com/news/articles/2022-09-29/illumina-...