InfoQ<p>Introducing <a href="https://techhub.social/tags/LMEval" class="mention hashtag" rel="tag">#<span>LMEval</span></a> – a tool that helps AI researchers & developers compare the performance of different <a href="https://techhub.social/tags/LLMs" class="mention hashtag" rel="tag">#<span>LLMs</span></a>.</p><p>Designed to be accurate, multimodal, and easy to use, LMEval has already been used to evaluate major models in terms of safety and security.</p><p>Dive deeper: <a href="https://bit.ly/3T7fgfk" target="_blank" rel="nofollow noopener" translate="no"><span class="invisible">https://</span><span class="">bit.ly/3T7fgfk</span><span class="invisible"></span></a> </p><p><a href="https://techhub.social/tags/AI" class="mention hashtag" rel="tag">#<span>AI</span></a> <a href="https://techhub.social/tags/opensource" class="mention hashtag" rel="tag">#<span>opensource</span></a> <a href="https://techhub.social/tags/Google" class="mention hashtag" rel="tag">#<span>Google</span></a> <a href="https://techhub.social/tags/InfoQ" class="mention hashtag" rel="tag">#<span>InfoQ</span></a></p>