techhub.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
A hub primarily for passionate technologists, but everyone is welcome

Administered by:

Server stats:

4.7K
active users

#datalake

0 posts0 participants0 posts today

Ah, the $10/month Lakehouses: because who wouldn't want a bargain-basement data lake with all the charm of a timeshare in purgatory? 🤔💸 Just add a sprinkle of buzzwords like "DuckLake" and "time travel" and voilà, you've got a tech article that feels like a 2-hour #infomercial for something you'll never use. 📈🔮
tobilg.com/the-age-of-10-dolla #Lakehouses #DuckLake #DataLake #TechTrends #HackerNews #ngated

tobilg.com · Welcome to the age of $10/month LakehousesBy Tobias Müller

Shifting Left isn’t just a buzzword - it’s the foundation for efficiency in your organization!

By making clean, reliable, and accessible data available across your organization, you reduce complexity and unlock time to focus on higher-value work.

💡 Data products are the foundation of this , enabling healthy, scalable data communication.

📖 Dive into the details in the article: bit.ly/3WHjxsf

Attended an event Brewing Data with Snowflake yesterday in Vilnius :blobcatnerd:

Some of they key insights:

  • Medallion Architecture (good or bad) is widespread.
  • Snowflake and Databricks are clear competitors, targeting similar landscape.
  • Open formats are trending: file format, table format, catalog, etc. - the more of them are open source, the better.
  • Time travel feature is important, many users already used it for disaster recovery.
  • Clear distinction of Storage from Compute (generic cloud approach).

Full text of one of the slides presented:

Strategic Architecture Outlook

  • Agility & Future-Proofing - Open, portable data means you can adopt new technologies or switch platforms with minimal friction. No single vendor can hold your data hostage, so you can evolve vour architecture as needed.
  • Multi-Cloud and Hybrid - An open data layer can span clouds and on-prem seamlessly. You avoid cloud vendor lock-in and leverage best-of-breed services on different clouds using the same data. This flexibility is key for resilience and optimization.
  • Accelerating Innovation - When any team can access data with the tools of their choice, experimentation flourishes. Open data fosters Al/ML and cross-domain analytics since data isn't locked in silos - more innovation and insights from the same data.
  • Vendor Leverage - Strategically, using open standards increases your leverage in vendor negotiations. You car opt in or out of services more freely, pushing vendors to provide value (since you're not irreversibly locked to them).

One of the most highlighted parts: "There is no need to move data. Data latency is minimised. Data can be transformed and analysed within a single platform.“

This is one of the reasons for 'Why ETL-Zero' :blobcoffee:

towardsdatascience.com/why-etl

Towards Data Science · Why ETL-Zero? Understanding the Shift in Data IntegrationBy Sarah Lea