nicdex @nicdex

**SODa**

💡 How can machine learning methods be put to good use in collection digitization and research?

In this interview, our colleague @mathias_zinnen talks about:

🔍 ML methods for text, images and structured data
🛠️ Open-source tools
💡 Outreach and training offerings on ML tools in the collections context
🌱 Why resource-efficient ML approaches matter

➡️ https://sammlungen.io/blog/interview-mit-fachexpertise-2d-und-machine-learning?utm_campaign=coschedule&utm_source=mastodon&utm_medium=SODa%40fedihum.org

#MachineLearning #2D #SODaZentrum #OCR #ML

**Taffer 🇨🇦 :godot:**

Related to my epub DRM gripe, gImageReader (a front-end for Tesseract) does a great job of OCRing screenshots.

I'd still prefer a sane, reliable way to strip Adobe's DRM from epubs, or for publishers to stop using it.

#ocr #tesseract #gimagereader

**alcea**

#GoodLuck #OCR ing that... #Firefish works better iirc #alttext

**Patrick Drechsler**

Looking for a self-hosted app with good OCR for printed cookbooks. Trying to digitize them.

Any recommendations?

#selfhosting #ocr #cooking #books

**Adi Keinan-Schoonbaert**

My colleague Valentina Vavassori asked heritage organisations about how they do Automatic Text Recognition - the results were super interesting and rich - here's the data and a brief analysis: https://blogs.bl.uk/digital-scholarship/2025/07/automatic-text-recognition-in-cultural-heritage-institutions-survey-analysis.html

#britishlibrary @BL_DigiSchol #OCR #HTR #ATR

**Caio**

Today's #TerSoftware topic is "paper management". I recently tested OCR for digitizing tables and... I wasn't very happy with the result.

I believe #OCR works best when it stays closely tied to the scanned document (for example, making a PDF file searchable), but for text extraction it is still a big "it depends".

In my short journey I tested #Tesseract and #Docling. Maybe it works with well-written code, but I ended up giving in and doing it by hand.

Tesseract seems quite easy to install on Linux (even on #openSUSE Leap, which has its limitations because it derives from enterprise SUSE, I found it easy), but Docling required some juggling with Python environments (using conda and pip).

For running text, Tesseract already seems quite sufficient. It can be run from the command line and, at least on openSUSE Leap, several dictionaries are packaged to make things easier.
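
The post above mentions running Tesseract from the command line and making a scanned page searchable as a PDF. A minimal sketch of that, assuming the `tesseract` binary is on the PATH and the Portuguese language data (`por`) is installed; the function names and file names here are hypothetical, not from the post:

```python
import shutil
import subprocess
from pathlib import Path


def tesseract_command(image, out_base, lang="por", searchable_pdf=True):
    """Build the argument list for one Tesseract CLI run.

    Tesseract takes an input image, an output base name (no extension),
    an optional language, and optional config names such as "pdf".
    """
    cmd = ["tesseract", str(image), str(out_base), "-l", lang]
    if searchable_pdf:
        # The "pdf" config asks for a PDF with an invisible text layer.
        cmd.append("pdf")
    return cmd


def run_ocr(image, out_base, lang="por", searchable_pdf=True):
    """Run Tesseract and return the path of the file it should have written."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract was not found on PATH")
    subprocess.run(
        tesseract_command(image, out_base, lang, searchable_pdf), check=True
    )
    suffix = ".pdf" if searchable_pdf else ".txt"
    return Path(str(out_base) + suffix)
```

Calling `run_ocr("scan.png", "scan")` would write `scan.pdf` next to the input; passing `searchable_pdf=False` yields plain text in `scan.txt` instead. This does nothing special for tables, which is where the post reports poor results.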

**Dissent Doe :cupofcoffee:**

HHS' Office for Civil Rights Settles HIPAA Privacy and Security Rule Investigation with Deer Oaks Behavioral Health for $225k and a Corrective Action Plan:

https://databreaches.net/2025/07/08/hhs-office-for-civil-rights-settles-hipaa-privacy-and-security-rule-investigation-with-deer-oaks-behavioral-health-for-225k-and-a-corrective-action-plan/

This was a ransomware attack in 2023 claimed by LockBit. Deer Oaks was already under investigation for a prior breach and HHS OCR expanded their case.

#databreach #healthsec #HIPAA #cybersecurity #ransomware #LockBit #HHS #OCR

**Pyrzout :vm:**

Convert Any Book to a DIY Audiobook? https://hackaday.com/2025/07/06/convert-any-book-to-a-diy-audiobook/

#ArtificialIntelligence #RaspberryPiZero2W #GoogleGemini2.5 #speechsynthesis #RaspberryPi #PiperVoice #webcam #GenAI #CV2 #ocr #ai

**Tiago F. R. Ribeiro**

Nanonets-OCR-s from Nanonets is an OCR model that turns documents into structured markdown, ideal for LLMs. Features include recognition of LaTeX equations, intelligent image description, detection of signatures and watermarks, checkbox handling, and extraction of complex tables.

📎 https://huggingface.co/nanonets/Nanonets-OCR-s
📎 https://github.com/NanoNets/docext
📎 https://idp-leaderboard.org/details/

#OCR #AI #MachineLearning #Markdown #ProcessamentoDeDocumentos #LaTeX #BigData

**IT News**

Convert Any Book to a DIY Audiobook? - If the idea of reading a physical book sounds like hard work, [Nick Bild’s] latest... - https://hackaday.com/2025/07/06/convert-any-book-to-a-diy-audiobook/

#artificialintelligence #raspberrypizero2w #googlegemini2.5 #speechsynthesis #raspberrypi #pipervoice #webcam #genai #cv2 #ocr #ai

**hansamann 🔥❄️🚀 (he/him)**

Survived the Kulmbach Beast! 21km of pure pain and glory. Blog post with all the details here: https://open.substack.com/pub/hansamann/p/conquering-the-kulmbach-beast-a-spartan?r=1sq3ws&utm_campaign=post&utm_medium=web&showWelcomeOnShare=true

#SpartanRace #OCR

**Rainer Simon** @aboutgeo@vis.social · Jul 1 *

If you haven’t checked in on #IMMARKUS lately (understandable—there’s been a lot going on!)—we’ve added even more transcription service options.

You can now run OCR or full-text transcription with a single click using:

• Anthropic Claude
• Azure Computer Vision
• Google Gemini
• Google Vision OCR
• LLaMA & Qwen via kluster.ai
• OCR.space
• OpenAI GPT
• Volcano Engine Doubao 1.5 Vision Pro

Try it out here: https://immarkus.xmarkus.org

#OCR #DigitalHumanities #ImageAnnotation

**Táta Geek** @tatageek@witter.cz · Jun 30

It's up.
The #ocr training course is finished. I just should have used a slightly longer pipe; for the kids it could be about 10 cm lower still.

**なかはらいちろう** @lithium03@mastodon.lithium03.info · Jun 30

#OCR
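I tried reading a stone monument, and I've got to the point where I could almost win. I get the feeling it probably isn't possible at all without evening light coming in from the side.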

**Praveen Yadav** @Praveen_Yadav343@mastodon.social · Jun 29

**panigrc** @panigrc@mastodon.social · Jun 28

@_DigitalWriter_ is it an alternative to #paperlessngx?

#papra #ocr #selfhosting

**Oliver Ammann** @oa@swiss.social · Jun 27

In my blog post on ETHeritage I describe how we used #NamedEntityRecognition and #NamedEntityLinking on e-rara.

https://etheritage.ethz.ch/2025/06/27/drachenkopf/

#erara #ner #nel

**Terence Eden** @Edent@mastodon.social · Jun 26 *

I have a genuine AI problem which I can't find an easy solution for.

We use Google's Vision API to OCR inscriptions from photographs.

Most of the time it is great, but sometimes it includes homographic characters.

For example - https://openbenches.org/bench/38224

It has misdetected the letters in "ΤΟΝΙΑ" as Greek rather than Latin.

I can't send a language hint, because people upload images from all over the world.

Is there a good (preferably free) OCR which wouldn't make this mistake?

OpenBenches: In Loving Memory of ΤΟΝΙΑ HENDRIKS 21st November 2023

#AI #OCR
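
One possible workaround for the problem described above is to post-process the OCR output rather than switch engines. The sketch below is a heuristic and not something the post proposes: if the text is mostly Latin, any word made up only of Latin letters and Greek capitals that look like Latin capitals is mapped back to Latin. The lookalike table and the function names are assumptions for illustration, and a genuinely Greek word inside a mostly Latin inscription would be converted wrongly.

```python
import unicodedata

# Greek capitals that are visually identical to Latin capitals.
# Written as escapes so the code points are unambiguous.
LOOKALIKES = {
    "\u0391": "A", "\u0392": "B", "\u0395": "E", "\u0396": "Z",
    "\u0397": "H", "\u0399": "I", "\u039A": "K", "\u039C": "M",
    "\u039D": "N", "\u039F": "O", "\u03A1": "P", "\u03A4": "T",
    "\u03A5": "Y", "\u03A7": "X",
}
TABLE = str.maketrans(LOOKALIKES)


def script_of(ch):
    """Return a rough script label such as 'LATIN' or 'GREEK', or None."""
    if not ch.isalpha():
        return None
    # Unicode names start with the script, e.g. "GREEK CAPITAL LETTER TAU".
    return unicodedata.name(ch, "").split(" ", 1)[0]


def fix_greek_homoglyphs(text):
    """Map lookalike Greek capitals to Latin when the text is mostly Latin."""
    scripts = [script_of(ch) for ch in text]
    if scripts.count("LATIN") <= scripts.count("GREEK"):
        return text  # probably real Greek text, leave it alone
    fixed = []
    for token in text.split(" "):
        letters = [ch for ch in token if ch.isalpha()]
        convertible = all(
            ch in LOOKALIKES or script_of(ch) == "LATIN" for ch in letters
        )
        if letters and convertible:
            token = token.translate(TABLE)
        fixed.append(token)
    return " ".join(fixed)
```

Applied to the bench inscription, the Greek-coded name comes back as plain Latin "TONIA", while a line that is entirely Greek is returned unchanged.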

**athmane mokraoui [BoF] ⏚ꝃ⌁⁂** @ButterflyOfFire@mstdn.fr · Jun 26

Seven months ago, the computer science department of the University of Tizi-Ouzou published a video of a project by two students on #OCR for #Kabyle.

However, I can't find any trace of the project on GitHub.

https://www.youtube.com/watch?v=drhr2v3lLtY

YouTube: Development of an Optical Character Recognition (#OCR) system for the #Kabyle language, by Computer Science Department

**Rainer Simon** @aboutgeo@vis.social · Jun 25 *

Small enhancement to #IMMARKUS’ Google Vision integration: You can now choose the level of detail when importing OCR results—words, paragraphs, or full blocks.

#OCR #IIIF #ImageAnnotation
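
To illustrate what choosing a level of detail can mean, here is a rough sketch that flattens a Google Vision style full-text annotation at word, paragraph, or block level. It assumes the JSON nesting of pages, blocks, paragraphs, words and symbols; the function names and the sample data are hypothetical, and this is not IMMARKUS' actual implementation. Words are simply joined with spaces, which ignores the break information the API can attach to symbols.

```python
LEVELS = ("words", "paragraphs", "blocks")


def word_text(word):
    """Join the symbols (single characters) of one word."""
    return "".join(symbol["text"] for symbol in word.get("symbols", []))


def paragraph_text(paragraph):
    """Join the words of one paragraph with spaces."""
    return " ".join(word_text(w) for w in paragraph.get("words", []))


def block_text(block):
    """Join the paragraphs of one block with newlines."""
    return "\n".join(paragraph_text(p) for p in block.get("paragraphs", []))


def extract_text(annotation, level="paragraphs"):
    """Return one string per word, paragraph, or block."""
    if level not in LEVELS:
        raise ValueError(f"level must be one of {LEVELS}")
    results = []
    for page in annotation.get("pages", []):
        for block in page.get("blocks", []):
            if level == "blocks":
                results.append(block_text(block))
                continue
            for paragraph in block.get("paragraphs", []):
                if level == "paragraphs":
                    results.append(paragraph_text(paragraph))
                else:
                    results.extend(
                        word_text(w) for w in paragraph.get("words", [])
                    )
    return results


def _word(text):
    """Build a word entry for the made-up sample below."""
    return {"symbols": [{"text": ch} for ch in text]}


# Made-up sample: one block containing two paragraphs.
SAMPLE = {"pages": [{"blocks": [{"paragraphs": [
    {"words": [_word("Hello"), _word("OCR")]},
    {"words": [_word("Second"), _word("line")]},
]}]}]}
```

With this sample, `extract_text(SAMPLE, "words")` gives four entries, `"paragraphs"` gives two, and `"blocks"` gives a single entry with the two paragraphs separated by a newline.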
