How this open source LLM chatbot runner hit the gas on x86, Arm CPUs

日本 ニュース ニュース

How this open source LLM chatbot runner hit the gas on x86, Arm CPUs
日本 最新ニュース,日本 見出し
  • 📰 TheRegister
  • ⏱ Reading Time:
  • 42 sec. here
  • 2 min. at publisher
  • 📊 Quality Score:
  • News: 20%
  • Publisher: 61%

Way to whip that LLaMA's ass

A handy open source tool for packaging up LLMs into single universal chatbot executables that are easy to distribute and run has apparently had a 30 to 500 percent CPU performance boost on x86 and Arm systems.

You can build and run llama.cpp natively, give it a model to load, and then interact with that LLM in various ways. Where it gets tricky is the model files involved are usually quite large in size, and it can be a bit confusing to know which variant is best to use. For instance, on an Intel Skylake Core i9-9900, prompt processing jumped 50 percent versus llama.cpp, whereas evaluation stayed the same.

While the performance gains that come with MKL are great, the fact that it's closed source is less than ideal for this open source effort, according to Tunney. She noted that"integrating foreign BLAS libraries into llama.cpp isn't that practical, due to the way its threading model works." And since MKL is closed source, it's not possible to just look at it and see how it can be improved.

このニュースをすぐに読めるように要約しました。ニュースに興味がある場合は、ここで全文を読むことができます。 続きを読む:

TheRegister /  🏆 67. in UK

日本 最新ニュース, 日本 見出し

Similar News:他のニュース ソースから収集した、これに似たニュース記事を読むこともできます。

Databricks claims its open source foundational LLM outsmarts GPT-3.5Databricks claims its open source foundational LLM outsmarts GPT-3.5In the AI gold rush, analytics outfit wants to provide the shovels
続きを読む »

Can generative AI truly transform healthcare into a more personalized experience?Can generative AI truly transform healthcare into a more personalized experience?The current literature on large language model (LLM)-based evaluation metrics for healthcare chatbots.
続きを読む »

You got legal trouble? Better call SauLM-7BYou got legal trouble? Better call SauLM-7BCooked in a math lab, here's an open source LLM that knows the law
続きを読む »

Intel is still top dog: ships 3x more CPUs than AMD and Apple combinedIntel is still top dog: ships 3x more CPUs than AMD and Apple combinedAndy built his first gaming PC at the tender age of 12, when IDE cables were a thing and high resolution wasn't. After spending over 15 years in the production industry overseeing a variety of live and recorded projects, he started writing his own PC hardware blog for a year in the hope that people might send him things. Sometimes they did.
続きを読む »

First Armv9 automotive CPUs aim to power AI-enabled vehiclesFirst Armv9 automotive CPUs aim to power AI-enabled vehiclesVehicle electronics and software becoming ever more complex
続きを読む »

Beijing issues list of approved CPUs – with no Intel or AMDBeijing issues list of approved CPUs – with no Intel or AMD2024 may be the year of Linux On The Arm-or-RISC-desktop as China moves away from Western tech
続きを読む »



Render Time: 2025-03-31 10:48:52