#ner

🔠 Panel: More than Chatbots: Multimodal Large Language Models in Humanities Workflows

At #DHd2025, Nina Rastinger explores how well #AI handles abbreviations & NER:

✅ NER works well, even with small, low-cost models
❌ Abbreviations are tricky: costs & resource demands skyrocket
🚀 GPT o1 improves performance, even on abbreviations, but remains resource-intensive
Balancing accuracy & efficiency in text processing remains a challenge! ⚖️

🥁 We are happy to announce that we just published our first preprint on arXiv: "NER4all or Context is All You Need: Using LLMs for low-effort, high-performance NER on historical texts. A humanities informed approach".🎉

👉 arxiv.org/abs/2502.04351 👈

It is also our first foray into collaborative work with such a large number of collaborators & contributors from the Chair of Digital History, NFDI4Memory's Methods Innovation Lab, & AI-Skills.

arXiv.org: NER4all or Context is All You Need: Using LLMs for low-effort, high-performance NER on historical texts. A humanities informed approach

Named entity recognition (NER) is a core task for historical research in automatically establishing all references to people, places, events and the like. Yet, due to the high linguistic and genre diversity of sources, the limited canonisation of spellings, the level of historical domain knowledge required, and the scarcity of annotated training data, established approaches to natural language processing (NLP) have been both extremely expensive and have yielded unsatisfactory results in terms of recall and precision. Our paper introduces a new approach. We demonstrate how readily available, state-of-the-art LLMs significantly outperform two leading NLP frameworks, spaCy and flair, for NER in historical documents, with seven to twenty-two percent higher F1 scores. Our ablation study shows how providing historical context for the task and a bit of persona modelling that shifts the focus away from a purely linguistic approach are core to a successful prompting strategy. We also demonstrate that, contrary to our expectations, providing increasing numbers of examples in few-shot approaches does not improve recall or precision below a threshold of 16 shots. In consequence, our approach democratises access to NER for all historians by removing the barrier of scripting languages and computational skills required for established NLP tools, instead leveraging natural-language prompts and consumer-grade tools and frontends.
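The prompting strategy the abstract describes, providing historical context plus a persona rather than a purely linguistic task framing, can be sketched as a simple prompt builder. The persona wording, the context wording, and the inline entity tags below are illustrative assumptions, not the paper's actual prompts:

```python
# Sketch of a context- and persona-based NER prompt for an LLM.
# The phrasing and the inline-tag output format are assumptions made
# for illustration; the paper's exact prompts may differ.

def build_ner_prompt(text: str, year: int, genre: str) -> str:
    # Persona modelling: address the model as a domain expert.
    persona = (
        "You are a historian with deep knowledge of "
        f"{genre}s from around {year}."
    )
    # Historical context: tell the model what kind of source this is.
    context = (
        f"The following passage comes from a {genre} published in {year}. "
        "Spellings may be non-standardised."
    )
    # Task framing: ask for inline entity tags instead of linguistic labels.
    task = (
        "Mark every named entity inline using [PER: ...], [LOC: ...] and "
        "[ORG: ...] tags and return the annotated passage only."
    )
    return "\n\n".join([persona, context, task, text])

prompt = build_ner_prompt(
    "Von Basel fährt man über Freiburg nach Karlsruhe.",
    year=1921,
    genre="travel guide",
)
print(prompt)
```

The resulting string would then be sent to any chat-style LLM endpoint; the point of the sketch is only that context and persona live in the prompt, so no training data or scripting pipeline is needed.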

The ReadMe2KG: GitHub README to Knowledge Graph #Challenge has been published as part of the Natural Scientific Language Processing and Research Knowledge Graphs #NSLP2025 workshop, co-located with #eswc2025. This #NER task aims to complement the NFDI4DataScience KG via information extraction from GitHub README files.

task description: nfdi4ds.github.io/nslp2025/doc
website: codabench.org/competitions/539

@eswc_conf @GenAsefa @shufan @NFDI4DS #NFDIrocks #knowledgegraphs #semanticweb #nlp #informationextraction

As part of a project, we digitised the papers of Joseph von #Laßberg, produced full texts with #eScriptorium, and also ran #NER with spaCy (published as research data) and Google's NL. An exciting project; we often found that the open-source alternatives are not there yet and require many conversion steps. Still, it was completed successfully and is now publicly available.

digital.blb-karlsruhe.de/lassb

digital.blb-karlsruhe.de: Joseph von Laßberg [1770–1855]
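Publishing NER output "as research data", as in the project above, usually means serialising entity spans in a tool-independent format rather than keeping them locked inside one tool. A minimal sketch of standoff annotations as JSON; the sentence and the spans are made-up examples, not taken from the Laßberg corpus:

```python
import json

# Standoff annotation: each entity is recorded as character offsets into
# the source text plus a label, so the annotations survive independently
# of the NER tool (spaCy, a cloud API, ...) that produced them.
text = "Joseph von Lassberg lebte lange in Meersburg."
entities = [
    {"start": 0, "end": 19, "label": "PER"},
    {"start": 35, "end": 44, "label": "LOC"},
]

# Sanity check: the offsets must reproduce the surface strings.
for ent in entities:
    ent["surface"] = text[ent["start"]:ent["end"]]

record = {"text": text, "entities": entities}
print(json.dumps(record, ensure_ascii=False, indent=2))
```

Storing the surface string alongside the offsets makes corrupted or shifted offsets easy to detect later, which matters when OCR full texts are re-exported.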

Named Entity Recognition is a computer-assisted method for detecting and classifying proper names in texts. Historical texts pose particular challenges for NER, for example due to non-standardised spellings.

Selina Galka has tried to train her own #NER models for the memoirs of the Countess of Schwerin. The results are mixed:

memoiren.hypotheses.org/609

#NER, but prompto! 🤖

At tomorrow's #DigitalHistoryOFK, Torsten Hiltmann, Martin Dröge & Nicole Dresselhaus (HU Berlin, #4Memory) will use the 1921 Baedeker travel guide to demonstrate the potential of #LargeLanguageModels & prompt-based approaches for #NamedEntityRecognition in historical text sources.

Open to everyone!

🔜 When? Wed, 26.06., 4–6 pm, Zoom
ℹ️ Abstract: dhistory.hypotheses.org/7870
____
#DigitalHistory #promptoNER #LLM #genAI @nfdi4memory @histodons