techhub.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
A hub primarily for passionate technologists, but everyone is welcome

Administered by:

Server stats:

4.6K
active users

#lmeval

0 posts0 participants0 posts today
InfoQ<p>Introducing <a href="https://techhub.social/tags/LMEval" class="mention hashtag" rel="tag">#<span>LMEval</span></a> – a tool that helps AI researchers &amp; developers compare the performance of different <a href="https://techhub.social/tags/LLMs" class="mention hashtag" rel="tag">#<span>LLMs</span></a>.</p><p>Designed to be accurate, multimodal, and easy to use, LMEval has already been used to evaluate major models in terms of safety and security.</p><p>Dive deeper: <a href="https://bit.ly/3T7fgfk" target="_blank" rel="nofollow noopener" translate="no"><span class="invisible">https://</span><span class="">bit.ly/3T7fgfk</span><span class="invisible"></span></a> </p><p><a href="https://techhub.social/tags/AI" class="mention hashtag" rel="tag">#<span>AI</span></a> <a href="https://techhub.social/tags/opensource" class="mention hashtag" rel="tag">#<span>opensource</span></a> <a href="https://techhub.social/tags/Google" class="mention hashtag" rel="tag">#<span>Google</span></a> <a href="https://techhub.social/tags/InfoQ" class="mention hashtag" rel="tag">#<span>InfoQ</span></a></p>
KINEWS24<p>🔍 Was ist Google LMEval? Entdecke das neue KI-Test-Framework!</p><p>Einheitliche Modellbewertung<br>Multimodal &amp; anbieterübergreifend<br>Effiziente, inkrementelle Tests</p><p><a href="https://mastodon.social/tags/ai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ai</span></a> <a href="https://mastodon.social/tags/ki" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ki</span></a> <a href="https://mastodon.social/tags/artificialintelligence" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>artificialintelligence</span></a> <a href="https://mastodon.social/tags/Google" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Google</span></a> <a href="https://mastodon.social/tags/LMEval" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>LMEval</span></a> <a href="https://mastodon.social/tags/LLM" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>LLM</span></a></p><p>Jetzt LIKEN, teilen, LESEN und FOLGEN! </p><p><a href="https://kinews24.de/google-lmeval-llms-ki-modelle-2025-clever-testen/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">kinews24.de/google-lmeval-llms</span><span class="invisible">-ki-modelle-2025-clever-testen/</span></a></p>
Giskard<p>At Giskard, we've integrated LMEval into our Phare LLM benchmark (phare.giskard.ai) to independently evaluate popular models' security and safety dimensions - through rigorous testing.</p><p>Read the announcement: <a href="https://opensource.googleblog.com/2025/05/announcing-lmeval-an-open-ource-framework-cross-model-evaluation.html" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">opensource.googleblog.com/2025</span><span class="invisible">/05/announcing-lmeval-an-open-ource-framework-cross-model-evaluation.html</span></a> </p><p><a href="https://fosstodon.org/tags/LMEval" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>LMEval</span></a> <a href="https://fosstodon.org/tags/AISecurity" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AISecurity</span></a> <a href="https://fosstodon.org/tags/LLMEvaluation" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>LLMEvaluation</span></a> <a href="https://fosstodon.org/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenSource</span></a></p>