techhub.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
A hub primarily for passionate technologists, but everyone is welcome

Administered by:

Server stats:

4.8K
active users

#DataEngineering

18 posts16 participants2 posts today

In this article, Vitalii Honchar explained how to build AI-powered apps that can chat with uploaded PDF files. He showed how to implement Retrieval Augmented Generation (RAG) using FastAPI for the API and LangChain to interact with OpenAI.

vitaliihonchar.com/insights/py

Python RAG API Tutorial with LangChain & FastAPI – Complete Guide
Vitalii HoncharPython RAG API Tutorial with LangChain & FastAPI – Complete GuideLearn how to build a Retrieval-Augmented Generation (RAG) PDF chat service using FastAPI, Postgres pgvector, and OpenAI API in this step-by-step tutorial.

🎯 Master Microsoft Fabric – The Future of Data Analytics!
Boost your career with in-demand skills in Data Synapse + Microsoft Fabric. Get hands-on experience with real-time projects, led by expert trainers.

📊 What You'll Learn:
✅ Power BI
✅ Data Factory
✅ Data Engineering
✅ Business Intelligence
✅ Data Integration

🚀 Online Free Demo

📞 For More Info: +91 7032290546
🌐 Visit Website: visualpath.in/online-microsoft
💬 WhatsApp Chat: wa.me/c/917032290546

🔔 Slides zu Legal Data Engineering 🔔

Was ist Legal Data Engineering? Wie sieht die Praxis juristischer Daten in Deutschland aus? Welche rechtlichen Probleme ergeben sich im Zusammenhang mit Legal Data Engineering? Diese Präsentation bietet eine Einführung zu Legal Data Engineering und sucht Antworten auf diese Fragen.

Slides: zenodo.org/records/15575231/fi

Legal Data Engineering ist der Schwerpunkt eines jeden Legal Data Science Projekts. Kern von Data Engineering ist der ETL-Prozess: Extraktion, Transformation und das (Hoch-)Laden von Daten. Die Slides bieten dazu einen allgemeinverständlichen Überblick.

Weitere praktische Themen sind die Verfügbarkeit juristischer Daten in Deutschland (insbesondere strukturierter Daten und Programmierschnittstellen), Probleme bei der Tokenisierung in Large Language Models und die Fehlerkennung von Gen-Namen in Microsoft Excel.

Bei den rechtlichen Fragen des Legal Data Engineering behandle ich die tradierte Rechtslage, das neue Datennutzungsgesetz (DNG) und Bayern als Negativbeispiel einer verschlossenen juristischen Datenkultur. Eine Diskussion der Datenschutzklage gegen OpenJur und der Open Data-Klage der Gesellschaft für Freiheitsrechte (GFF) gegen die Bundespolizei klären über aktuelle Entwicklungen in diesem Rechtsbereich auf.

Continued thread

1/10 🚀 Just curated the 10 most impactful data engineering resources you'll need in 2025:

Modern data stacks

Petabyte-scale feature stores

ML pipeline optimizations
All links in the final tweet ↓

🚀 Want to break into Data Engineering or boost your portfolio? Check out these 5 practical project ideas covering ETL, real-time streaming, web scraping, cloud workflows & dashboards. Tools used include Airflow, Kafka, BigQuery, and Power BI. Build real-world skills that employers love! 🔧📊 Checkout in.pinterest.com/pin/107340489 #DataEngineering #Python #ETL #BigData #PortfolioProjects #TechCareers #MastodonTech

PinterestPin on Tech upskillThis Pin was discovered by Browsejobs Technologies. Discover (and save!) your own Pins on Pinterest.

LangGraph is a powerful library for creating advanced conversational AI workflows. If you're interested in learning how to use it and want to build your own AI agent, check out the freeCodeCamp course by Vaibhav Mehra. It's a great starting point to explore LangGraph in action.

youtube.com/watch?v=jGg_1h0qzaM

The @llamaindex ecosystem has over 650 Python packages in a monorepo, making dependency management and publishing a challenge. To solve this, the team built LlamaDev and used uv it for improved performance. In this post, Massimiliano Pippi explains the issues they faced, what they tried and why they ultimately created LlamaDev.

llamaindex.ai/blog/python-tool

www.llamaindex.aiPython Tooling at Scale: LlamaIndex’s Monorepo Overhaul — LlamaIndex - Build Knowledge Assistants over your Enterprise DataLlamaIndex is a simple, flexible framework for building knowledge assistants using LLMs connected to your enterprise data.