techhub.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
A hub primarily for passionate technologists, but everyone is welcome

Administered by:

Server stats:

5.3K
active users

#datascience

132 posts90 participants0 posts today

🚀 **Exciting News!** After 15 years of developing #Blosc/#Blosc2, we're thrilled to announce the beta program for Cat2Cloud! 🎉

- 🔄 Share complex data securely and effortlessly
- 🗜️ Access to the best compression algorithms available
- ⚡ Perform advanced computations directly in the cloud

...and more!

ironarray.io/cat2cloud

Join our beta program today and be among the first to experience the power of Cat2Cloud!

#DataScience #Compression #SaaS #CloudComputing #BetaProgram

Share Data Faster!⚡

Can you believe it? A major podcasting achievement has been unlocked with episode 200 of the @rstats @rweekly Highlights podcast! serve.podhome.fm/episodepage/r

🛒 Text analysis and prediction with LLMs: {mall} does it all (Camila Livio) @Posit
📊 The guide to gradients in R and ggplot2 @jimjamslam

Plus great listener feedback from @maurolepore and our usual mix of aha moments and perhaps a few lame jokes 😅

h/t @mike_thomas & @jonmcalder 🙏

R Weekly HighlightsIssue 2025-W14 HighlightsBy some minor miracle (even on April Fools) the R Weekly Highlights podcast has made it to episode 200! We go "virtual" shopping for LLM-powered text…

🔔 Reminder: HMC Information Event 🔔

Time is ticking! Join us for the second HMC Information Event on 04/04/2025 at 11:30 AM to to explore the HMC Project Call 2025, discuss your ideas, and connect with others in the data science and metadata fields.

👉 Register here: events.hifis.net/event/2303

👉 More info about the call: helmholtz-metadaten.de/en/proj

Don't miss this great opportunity!

#HMC #Metadata #DataScience #Innovation

@association @helmholtz_hmc @hidadigital @HelmholtzOpenScienceOffice

Embedding Models Misunderstand Language:
➡️ Text embeddings have blind spots, like capitalization misunderstandings, numerical inaccuracies, inability to detect negations, and confusion with ranges.
➡️Industry stories show dramatic consequences.
➡️ A hybrid approach—combining embedding models with rule-based methods and domain-specific classifiers—proves more reliable.

hackernoon.com/hallucination-b

hackernoon.comHallucination by Design: How Embedding Models Misunderstand Language | HackerNoonEmbedding needs to be tested and evaluated; otherwise, hallucinations will happen. Experiment and evaluation on custom data is a must

Pour information, le site code.gouv.fr maintient un annuaire des logiciels libres remarquables activement développés et financés par des organismes publics : code.gouv.fr/sources/#/awesome
En particulier Onyxia qui intéressera les chercheur·e·s ayant des besoins en matière de science des données : code.gouv.fr/sources/#/awesome #veilleESR #OpenData #DataScience #ScienceOuverte

code.gouv.fr>code.gouv.fr - Codes sources du secteur public - French Public Sector Sources CodesAccès aux codes sources du secteur public