#datawarehouse


Could data segregation help mitigate the impact of large-scale data incidents?

Looking at the Qantas breach of 6 million passenger records.

Taking a step back from the data warehouse model, what if data could be stored in different locations based on a set of criteria instead of in a single repository? Access to these systems could be isolated as well, so if one system got compromised it would not expose the entire data set.

The data could still be mined for business analytics, but it could be pseudonymized in the data warehouse. If access to the warehouse were compromised, privacy would not be impacted.
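A minimal sketch of the pseudonymization idea, assuming a keyed hash and made-up field names (not a real system design):

import hmac
import hashlib

# The secret key lives only in the isolated source system, never in the
# warehouse, so a warehouse compromise alone cannot reverse the tokens.
SECRET_KEY = b"kept-in-the-isolated-source-system"

def pseudonymize(value: str) -> str:
    # Deterministic keyed hash: the same input always yields the same
    # token, so joins and aggregations in the warehouse still work.
    return hmac.new(SECRET_KEY, value.encode("utf-8"), hashlib.sha256).hexdigest()

passenger = {"email": "jane@example.com", "flights_ytd": 14}

warehouse_row = {
    "passenger_token": pseudonymize(passenger["email"]),  # PII replaced
    "flights_ytd": passenger["flights_ytd"],              # analytics value kept
}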

This is a much more complex and expensive setup, but the cost could be weighed against the loss resulting from a compromise.

There is also the impact on real-time data interactions with PII: where it is stored, how it is accessed, etc. Lots of considerations.

Just a thought, though it may not be practical.

🚀 Join Our Free Demo on Snowflake Online Training!
Ready to upgrade your data skills with the cloud-based data platform trusted by top companies worldwide? Don’t miss our FREE Demo Session!
👉Attend Online #FreeDemo On #Snowflake by Mr. Krishna.
📅Demo on: 29th May, 2025 @ 7:00 AM (IST).
📲Contact us: +91 7032290546
🟢WhatsApp: wa.me/c/917032290546
🌐Visit: visualpath.in/snowflake-traini
📝 Blog: visualpathblogs.com/category/s

I’ve been working on a pretty gnarly data warehouse reporting problem for the past few days. It’s up-leveling my ability to do this kind of work. The tooling has always seemed so limited, and I am beginning to understand that it is me who is limited in my understanding of the tooling ecosystem.

There may or may not be a wonderful overlap of programming and data warehousing, but it’s clear that my not being aware of it doesn’t mean it doesn’t exist.

I just discovered that Snowflake (the company) got its name not because it makes for a beautiful snowflake logo, but because the snowflake schema is a pattern for storing information in data warehouses (alongside the star schema).
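A rough illustration of the difference, sketched as SQLAlchemy table definitions with invented names (a star schema keeps dimensions flat; a snowflake schema normalizes them into sub-dimensions):

from sqlalchemy import Column, ForeignKey, Integer, MetaData, Numeric, String, Table

metadata = MetaData()

# Star schema: one flat, denormalized dimension per axis of analysis.
dim_product_star = Table(
    "dim_product_star", metadata,
    Column("product_id", Integer, primary_key=True),
    Column("product_name", String),
    Column("category_name", String),  # category attributes stored inline
)

# Snowflake schema: the same dimension normalized into sub-dimensions,
# so the diagram "snowflakes" outward from the fact table.
dim_category = Table(
    "dim_category", metadata,
    Column("category_id", Integer, primary_key=True),
    Column("category_name", String),
)
dim_product_snow = Table(
    "dim_product_snow", metadata,
    Column("product_id", Integer, primary_key=True),
    Column("product_name", String),
    Column("category_id", Integer, ForeignKey("dim_category.category_id")),
)

# Either way, the fact table holds measures plus keys into the dimensions.
fact_sales = Table(
    "fact_sales", metadata,
    Column("sale_id", Integer, primary_key=True),
    Column("product_id", Integer, ForeignKey("dim_product_snow.product_id")),
    Column("amount", Numeric),
)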

An analysis of 100 Fortune 500 job postings reveals the tools and technologies shaping the data engineering field in 2025. Top skills in demand:
⁕ Programming Languages (196) - SQL (85), Python (76), Scala (14), Java (14)
⁕ ETL and Data Pipeline (136) - ETL (65), Data Integration (46)
⁕ Cloud Platforms (85) - AWS (45), GCP (26), Azure (14)
⁕ Data Modeling and Warehousing (83) - Data Modeling (40), Data Warehousing (22), Data Architecture (21)
⁕ Big Data Tools (67) - Spark (40), Big Data Tools (19), Hadoop (8)
⁕ DevOps, Version Control, and CI/CD (52) - Git (14), CI/CD (13), DevOps (7), Version Control (6), Terraform (6)
...

reddit.com/r/dataengineering/c

Part2:

I split all columns into strings and numerics by converting with the pandas function pd.to_numeric and checking whether errors occur.
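Roughly like this (a simplified sketch, not the full script):

import pandas as pd

df = pd.DataFrame({"a": ["1", "2"], "b": ["x", "y"]})  # sample data

numeric_cols, string_cols = [], []
for col in df.columns:
    try:
        df[col] = pd.to_numeric(df[col], errors="raise")
        numeric_cols.append(col)          # conversion succeeded -> numeric
    except (ValueError, TypeError):
        string_cols.append(col)           # conversion failed -> keep as string

print(numeric_cols, string_cols)  # ['a'] ['b']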

In Power BI I load one table with date indexes for slices and create a second table with the latest slice.

SQLAlchemy type mapping:

from sqlalchemy import DateTime, Float, Integer, String

# Map pandas dtypes to SQLAlchemy column types
dtype_mapping = {
    'object': String,
    'float64': Float,
    'int64': Integer,
    'datetime64[ns]': DateTime,
    'datetime64': DateTime,
}
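Roughly how the mapping gets used with to_sql (a simplified sketch with a stand-in engine and a made-up table name):

import pandas as pd
from sqlalchemy import create_engine

engine = create_engine("sqlite://")  # stand-in engine for the example

df = pd.DataFrame({"name": ["a"], "value": [1.5]})

# Build a per-column dtype dict for to_sql from the mapping above
dtype = {col: dtype_mapping[str(df[col].dtype)] for col in df.columns}
df.to_sql("measurements", engine, if_exists="replace", index=False, dtype=dtype)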

Part1:

This week I installed Power BI and connected it to a remote PostgreSQL database.
I asked an AI to compare open-source data sources for Power BI by:
- Ease of setup on Linux: SQLite > PostgreSQL > MySQL > Redis > MongoDB
- Performance:
  + For large datasets: MongoDB > PostgreSQL > MySQL > Redis > SQLite
  + For real-time operations: Redis > MongoDB > MySQL > PostgreSQL > SQLite

For PostgreSQL I prepare the data in a Python script (sketched below) that uses:
- pandas - for converting types to datetime and numeric
- sqlalchemy - for simplifying type conversion
- asyncpg - the SQLAlchemy backend for connecting to PostgreSQL
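A simplified sketch of that script, assuming made-up credentials, table, and column names:

import asyncio

import pandas as pd
from sqlalchemy.ext.asyncio import create_async_engine

# asyncpg as the SQLAlchemy backend (credentials are placeholders)
engine = create_async_engine("postgresql+asyncpg://user:pass@localhost/dbname")

async def load(df: pd.DataFrame) -> None:
    # pandas: convert types to datetime and numeric
    df["ts"] = pd.to_datetime(df["ts"])
    df["value"] = pd.to_numeric(df["value"], errors="coerce")

    # to_sql is synchronous, so run it on the async connection via run_sync
    async with engine.begin() as conn:
        await conn.run_sync(
            lambda sync_conn: df.to_sql("metrics", sync_conn, if_exists="replace", index=False)
        )

asyncio.run(load(pd.DataFrame({"ts": ["2025-05-01"], "value": ["1.5"]})))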

Talend, probably the only mature open-source Extract-Transform-Load (ETL) tool for working with data, is no longer maintained and has been retired :ablobcatcry:

Apparently, a year ago Qlik, which owns Talend, said the open-source version of Talend Studio "does not contribute to Qlik's commercial products".

It's so sad, because DBT, which some dare to call an ETL tool (in fact it is more of a templating engine), is far from the functionality an ETL tool is supposed to offer :blobcatthink:

One of the most highlighted parts: "There is no need to move data. Data latency is minimised. Data can be transformed and analysed within a single platform."

This is one of the reasons for 'Why ETL-Zero' :blobcoffee:

towardsdatascience.com/why-etl

Towards Data Science · Why ETL-Zero? Understanding the Shift in Data Integration, by Sarah Lea

In a data warehouse you store structured & organized data. In a data lake you can additionally store unstructured data. And what, then, is a data lakehouse?

Think of a combination of the strengths of both previous data platforms. :blobcoffee:

towardsdatascience.com/sql-and

Towards Data Science · SQL and Data Modelling in Action: A Deep Dive into Data Lakehouses, by Sarah Lea