According to analyst Gartner, small language models (SLMs) offer a potentially cost-effective alternative for generative artificial intelligence (GenAI) development and deployment, because they are easier to fine-tune, more efficient to serve and more straightforward to control.

In its Explore small language models for specific AI scenarios report, published in August 2024, Gartner explores how the definitions of "small" and "large" in AI language models have changed and evolved.

Gartner notes that there are estimates that GPT-4 (OpenAI, March 2023), Gemini 1.5 (Google, February 2024), Llama 3.1 405B (Meta, July 2024) and Claude 3 Opus (Anthropic, March 2024) have half a trillion to two trillion parameters. On the opposite end of the spectrum, models such as Mistral 7B (Mistral AI, September 2023), Phi-3-mini 3.8B and Phi-3-small 7B (Microsoft, April 2024), Llama 3.1 8B (Meta, July 2024) and Gemma 2 9B (Google, June 2024) are estimated to have 10 billion parameters or fewer.

Looking at one example of the computational resources used by a small language model compared with those used by a large language model, Gartner reports that Llama 3 8B (8 billion parameters) requires a fraction of the graphics processing unit (GPU) memory needed by Llama 3 70B (70 billion parameters), which requires 160GB.

The more GPU memory needed, the greater the cost. For instance, at current GPU prices, a server capable of running the complete 671 billion parameter DeepSeek-R1 model in-memory will cost over $100,000.
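As a rough back-of-the-envelope illustration (our own arithmetic, not a figure from the Gartner report), the bulk of that memory requirement is simply the parameter count multiplied by the bytes needed per parameter at a given numeric precision. A minimal Python sketch:

    BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "int8": 1, "int4": 0.5}

    def weight_memory_gb(params_billions: float, precision: str = "fp16") -> float:
        # Memory (GB) to hold the weights alone; real deployments also need
        # room for the KV cache, activations and runtime overhead.
        return params_billions * BYTES_PER_PARAM[precision]

    for name, size in [("Llama 3 8B", 8), ("Llama 3 70B", 70), ("DeepSeek-R1 671B", 671)]:
        print(f"{name}: ~{weight_memory_gb(size):.0f}GB of weights at fp16")

At 16-bit precision, that puts DeepSeek-R1's weights alone at roughly 1.3TB, which is why serving it entirely in-memory demands a multi-GPU server.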

Knowledge distillation

The fact that a large language model is several times larger than a small language model – in terms of the parameters used during training to build the data model used for AI inference – implies that SLMs are trained on only a subset of the data used by LLMs. This suggests there are likely to be holes in their knowledge, and that they will sometimes be unable to provide the best answer to a particular query.

Distilled SLMs improve response quality and reasoning while using a fraction of the compute of LLMs

Jarrod Vawdrey, Domino Data Lab

Jarrod Vawdrey, field chief data scientist at Domino Data Lab, an enterprise AI platform provider, notes that SLMs can benefit from a kind of knowledge transfer with LLMs. The technique, known as knowledge distillation (see box below), enables effective transfer of capabilities from LLMs to SLMs.

“This knowledge transfer represents one of the most promising approaches to democratising advanced language capabilities,” he says. “Distilled SLMs improve response quality and reasoning while using a fraction of the compute of LLMs.”

Vawdrey says knowledge distillation from LLMs to SLMs begins with two key components: a pre-trained LLM that serves as the “teacher”, and a smaller architecture that will become the SLM “student”. The smaller architecture is typically initialised either randomly or with basic pre-training.
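Those two components map onto a standard training loop. The sketch below (a minimal illustration in PyTorch; the variables are placeholders, not Domino's implementation) shows the core of the technique: the student is trained against a blend of the teacher's softened output distribution and the ordinary hard-label loss.

    import torch.nn.functional as F

    def distillation_loss(student_logits, teacher_logits, labels,
                          temperature=2.0, alpha=0.5):
        # Soften both distributions; a higher temperature exposes more of the
        # teacher's knowledge about how plausible the "wrong" tokens are.
        soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
        log_student = F.log_softmax(student_logits / temperature, dim=-1)
        # The KL term is scaled by T^2 to keep gradients comparable across temperatures.
        kl = F.kl_div(log_student, soft_teacher, reduction="batchmean") * temperature**2
        ce = F.cross_entropy(student_logits, labels)  # hard-label loss
        return alpha * kl + (1 - alpha) * ce

    # Per batch: teacher_logits come from the frozen, pre-trained LLM "teacher";
    # student_logits come from the smaller SLM "student", whose weights are updated.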

Augmenting SLMs

Neither an LLM nor an SLM alone may deliver everything an organisation needs. Enterprise users will typically want to combine the data held in their corporate IT systems with an AI model.

According to Dominik Tomicevic, CEO of graph database provider Memgraph, context lies at the core of the entire model debate. “For very general, homework-grade problems, an LLM works fine, but the moment you need a language-based AI to be truly useful, you have to go with an SLM,” he says.

For instance, the way a company mixes paint, builds internet of things (IoT) networks or schedules deliveries is unique. “The AI does not need to recall who won the World Cup in 1930,” he adds. “You need it to help you optimise for a particular problem in your corporate domain.”

As Tomicevic notes, an SLM can be trained to detect queries about orders in an e-commerce system and, within the supply chain, gain deep knowledge of that specific area – making it well suited to answering relevant questions. Another benefit is that for mid-sized and smaller operations, training an SLM is significantly cheaper – considering the cost of GPUs and power – than training an LLM.

However, according to Tomicevic, getting supply chain data into a focused small language model is technically a major hurdle. “Until the basic architecture that both LLMs and SLMs share – the transformer – evolves, updating a language model remains difficult,” he says. “These models prefer to be trained in one big batch, absorbing all the data at once and then reasoning only with what they have learned.”

This means updating or keeping an SLM fresh, no matter how well focused it is on the use cases for the business, remains a challenge. “The context window still needs to be fed with relevant information,” he adds.

For Tomicevic, this is where an additional element comes in – organisations repeatedly find that a knowledge graph is the best data model to sit alongside an SLM, acting as an interpreter between corporate data and the model.

Retrieval augmented generation (RAG) powered by graph technology can bridge structured and unstructured data. Tomicevic says this allows AI systems to retrieve the most relevant insights with lower costs and higher accuracy. “It also enhances reasoning by dynamically fetching data from an up-to-date database, eliminating static storage and ensuring responses are aligned with the latest information,” he says.
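As a concrete illustration of the pattern Tomicevic describes (a toy sketch: the graph contents and prompt wording are invented, and the networkx library stands in for a graph database such as Memgraph), retrieval walks the graph around the entity mentioned in a query and serialises the facts into the model's context window:

    import networkx as nx

    # Toy supply-chain graph; in production this lives in a graph database
    # that is updated as orders, stock and deliveries change.
    g = nx.DiGraph()
    g.add_edge("order:1042", "customer:acme", relation="placed_by")
    g.add_edge("order:1042", "sku:valve-7", relation="contains")
    g.add_edge("sku:valve-7", "warehouse:leeds", relation="stocked_at")

    def retrieve_facts(entity: str, depth: int = 2) -> list[str]:
        # Breadth-first walk around the entity, serialised as plain-text triples.
        return [f"{s} -[{g.edges[s, t]['relation']}]-> {t}"
                for s, t in nx.bfs_edges(g, entity, depth_limit=depth)]

    def build_prompt(question: str, entity: str) -> str:
        facts = "\n".join(retrieve_facts(entity))
        return f"Answer using only these facts:\n{facts}\n\nQuestion: {question}"

    # The assembled prompt is then passed to the domain-tuned SLM:
    print(build_prompt("Where will order 1042 ship from?", "order:1042"))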

The resource efficiency of SLMs allows them to run on standard hardware while delivering specialised intelligence exactly where it is needed, according to Chris Mahl, CEO of enterprise knowledge management platform provider Pryon.

“This transforms how organisations deploy AI, bringing powerful capabilities to environments previously considered impractical for advanced computing, and democratising access across geographical and infrastructure barriers,” he says.

According to Mahl, RAG provides a pipeline that cuts through the noise to deliver precise, relevant context to small language models.

Reducing errors and hallucinations

While LLMs are recognised as powerful, they suffer from errors known as hallucinations, whereby they effectively make things up.

Rami Luisto, healthcare AI lead data scientist at Digital Workforce, a provider of business automation and technology solutions, says SLMs provide a higher degree of transparency into their inner workings and their outputs. “When explainability and trust are crucial, auditing an SLM can be much simpler compared with trying to extract the reasons for an LLM's behaviour,” he says.

While there is a lot of industry hype around the subject of agentic AI, a major barrier to using AI agents to automate complex workflows is that these systems are prone to errors, leading to incorrect decisions and automated actions. This inaccuracy will improve over time, but there is little evidence that enterprise applications are being developed with tolerance for the potential errors introduced by agentic AI systems.

In a recent Computer Weekly podcast, Anushree Verma, a director analyst at Gartner, noted that there is a shift towards domain-specific language models and lighter models that can be fine-tuned. Over time, it is likely these smaller AI models will work like experts to complement more general agentic AI systems, which may help to improve accuracy.



The analogy is rather like someone who is not a specialist in a particular field asking an expert for advice, a bit like the “phone a friend” lifeline in the TV game show Who Wants to be a Millionaire?

DeepMind CEO Demis Hassabis envisages a world where multiple AI agents coordinate their activities to deliver a goal. So, while an SLM may have been transferred knowledge from an LLM through knowledge distillation, thanks to techniques like RAG and its ability to be optimised for a specific domain, the SLM may eventually be called on as an expert to help a more general LLM answer a domain-specific question.
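A minimal sketch of that “phone a friend” pattern might look like the following (entirely illustrative: the expert registry, routing rule and call_model stub are hypothetical, not a published API):

    EXPERTS = {"supply_chain": "acme/supply-chain-slm"}  # hypothetical model IDs

    def call_model(model_id: str, prompt: str) -> str:
        # Placeholder: wire this to your model-serving endpoint.
        raise NotImplementedError

    def answer(question: str) -> str:
        # In practice the router is itself a small classifier (or the general
        # LLM); keyword matching keeps this sketch self-contained.
        if any(w in question.lower() for w in ("order", "delivery", "stock")):
            return call_model(EXPERTS["supply_chain"], question)  # domain expert SLM
        return call_model("general-llm", question)  # generalist fallback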
