I'm really excited about the state of data infrastructure and the emergence of t... | Hacker News

Hacker News new | past | comments | ask | show | jobs | submit

login

dm03514 on Oct 18, 2020 | parent | context | favorite | on: Emerging Architectures for Modern Data Infrastruct...

I'm really excited about the state of data infrastructure and the emergence of the data lake. I feel like the technical aspects of data engineering is reduced to getting data into some cloud storage (s3) as parquet. Transforms are "solved" using ELT from the data lake, or streaming using kafka/spark.

I think executing this in orgs with legacy data technologies is hard but it is much more a people problem than a tech problem. In orgs that have achieved this foundation it's really cool to see the business and analytic impact to the company.

chrisweekly on Oct 18, 2020 | [–]

"it is much more a people problem than a tech problem"

^ This holds true for nearly every aspect of nearly every company.

spullara on Oct 18, 2020 | [–]

Snowflake (and others) will let you either pull that in and query it or as an external query that queries it in place. You can, if it makes sense for your use case, now just T from the data lake.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact