
๐ŸŒŸ ๐‹๐‹๐Œ ๐๐ซ๐จ๐ฏ๐ข๐๐ž๐ซ'๐ฌ ๐‘๐ž๐ฅ๐ž๐š๐ฌ๐ž (2024 1H) & ๐Ÿ“ 2024 ๐‹๐‹๐Œ ๐’๐ฎ๐ซ๐ฏ๐ž๐ฒ (on Training / Data / RAG / Serving / Agent)

🌟 LLM Provider's Release (2024 1H): A Comprehensive Overview

Curiosity: What patterns can we retrieve from the rapid pace of LLM releases in 2024? How do these innovations connect to the broader evolution of the field?

2024's first half witnessed an unprecedented surge in LLM releases, with 21 major models from leading providers. This comprehensive overview retrieves insights from release patterns, technical innovations, and market dynamics to understand where the field is heading.

Release Timeline Overview

```mermaid
gantt
    title LLM Releases 2024 1H Timeline
    dateFormat YYYY-MM-DD
    section Major Releases
    GPT-4o (OpenAI)           :2024-05-13, 1d
    Llama-3 (Meta)            :2024-04-18, 1d
    Claude-3 (Anthropic)      :2024-03-04, 1d
    Gemini-1.5 (Google)       :2024-03-08, 1d
    section Open Source
    Qwen-2 (Alibaba)          :2024-06-07, 1d
    DeepSeek-V2               :2024-05-07, 1d
    Phi-3 (Microsoft)         :2024-04-22, 1d
    section Specialized
    Solar-Mini-ja (Upstage)   :2024-05-22, 1d
    Mistral-Large             :2024-02-26, 1d
```

21 LLM Releases: Complete Catalog

| # | Model | Provider | Release Date | Key Features | News | Paper |
|---|-------|----------|--------------|--------------|------|-------|
| 1 | Qwen-2 | Alibaba Group | 2024.06.07 | Multilingual, large-scale | Link | - |
| 2 | Solar-Mini-ja | Upstage | 2024.05.22 | Japanese-optimized | Link | - |
| 3 | Yi-Large | 01.AI | 2024.05.13 | Large-scale model | Link | - |
| 4 | Yi-1.5 | 01.AI | 2024.05.13 | Enhanced version | Link | arXiv |
| 5 | GPT-4o | OpenAI | 2024.05.13 | Omni-modal, faster | Link | - |
| 6 | Qwen-Max | Alibaba Group | 2024.05.11 | Maximum performance | Link | - |
| 7 | DeepSeek-V2 | DeepSeek | 2024.05.07 | Efficient architecture | Link | arXiv |
| 8 | Snowflake-Arctic | Snowflake | 2024.04.24 | Enterprise-focused | Link | - |
| 9 | Phi-3 | Microsoft | 2024.04.22 | Small language model | Link | arXiv |
| 10 | Llama-3 | Meta | 2024.04.18 | Open-source leader | Link | - |
| 11 | Mixtral-8x22B | Mistral AI | 2024.04.17 | Mixture of experts | Link | - |
| 12 | Reka-Core | Reka AI | 2024.04.15 | Multimodal | Link | arXiv |
| 13 | Command-R-Plus | Cohere | 2024.04.04 | Enterprise RAG | Link | - |
| 14 | DBRX | Databricks | 2024.03.27 | Open-source SOTA | Link | - |
| 15 | Gemini-1.5 | Google | 2024.03.08 | Long context | Link | arXiv |
| 16 | Claude-3 | Anthropic | 2024.03.04 | Safety-focused | Link | - |
| 17 | Mistral-Large | Mistral AI | 2024.02.26 | European leader | Link | - |
| 18 | Gemma | Google | 2024.02.21 | Open models | Link | arXiv |
| 19 | Qwen-1.5 | Alibaba Group | 2024.02.04 | Multilingual | Link | - |
| 20 | Solar-Mini | Upstage | 2024.01.25 | Efficient Korean | Link | - |
| 21 | Solar-10.7B | Upstage | 2023.12.23 | Top pre-trained | Link | arXiv |

Provider Distribution

```mermaid
pie title LLM Releases by Provider (2024 1H)
    "Alibaba Group" : 3
    "Upstage" : 3
    "Google" : 2
    "Mistral AI" : 2
    "01.AI" : 2
    "Others" : 9
```

Retrieve: Analysis of release patterns reveals several key trends:

  1. Open Source Acceleration: Major releases from Meta (Llama-3), Alibaba (Qwen series), and Databricks (DBRX)
  2. Multimodal Expansion: GPT-4o, Gemini-1.5, Reka-Core emphasize vision capabilities
  3. Efficiency Focus: Phi-3, Solar-Mini demonstrate small model excellence
  4. Regional Specialization: Solar-Mini-ja (Japanese), Qwen series (Chinese)

Innovate: These releases show the field moving toward:

  • More efficient architectures (DeepSeek-V2, Phi-3)
  • Better multilingual support (Qwen, Solar)
  • Enterprise-ready solutions (Snowflake Arctic, Command-R-Plus)
  1. ๐๐ฐ๐ž๐ง-2 (โ€‹Alibaba Groupโ€‹, 2024.06.07)
  2. ๐’๐จ๐ฅ๐š๐ซ-๐Œ๐ข๐ง๐ข-๐ฃ๐š (โ€‹Upstageโ€‹, 2024.05.22)
  3. ๐˜๐ข-๐‹๐š๐ซ๐ ๐ž (โ€‹01.AIโ€‹, 2024.05.13)
  4. ๐˜๐ข-1.5 (โ€‹01.AIโ€‹, 2024.05.13)
  5. ๐†๐๐“-4๐จ (โ€‹OpenAIโ€‹, 2024.05.13)
  6. ๐๐ฐ๐ž๐ง-๐Œ๐š๐ฑ (โ€‹Alibaba Groupโ€‹, 2024.05.11)
  7. ๐ƒ๐ž๐ž๐ฉ๐’๐ž๐ž๐ค-๐•2 (DeepSeek, 2024.05.07)
  8. ๐’๐ง๐จ๐ฐ๐Ÿ๐ฅ๐š๐ค๐ž-๐€๐ซ๐œ๐ญ๐ข๐œ (โ€‹Snowflakeโ€‹, 2024.04.24)
  9. ๐๐ก๐ข-3 (โ€‹Microsoftโ€‹, 2024.04.22)
  10. ๐‹๐ฅ๐š๐ฆ๐š-3 (โ€‹Meta Facebookโ€‹, 2024.04.18)
  11. ๐Œ๐ข๐ฑ๐ญ๐ซ๐š๐ฅ-8๐ฑ22๐ (โ€‹Mistral AIโ€‹, 2024.04.17)
  12. ๐‘๐ž๐ค๐š-๐‚๐จ๐ซ๐ž (โ€‹Reka AIโ€‹โ€‹, 2024.04.15)
  13. ๐‚๐จ๐ฆ๐ฆ๐š๐ง๐-๐‘-๐๐ฅ๐ฎ๐ฌ (โ€‹Cohereโ€‹, 2024.04.04)
  14. ๐ƒ๐๐‘๐— (โ€‹Databricksโ€‹, 2024.03.27)
  15. ๐†๐ž๐ฆ๐ข๐ง๐ข-1.5 (โ€‹Googleโ€‹, 2024.03.08)
  16. ๐‚๐ฅ๐š๐ฎ๐๐ž-3 (โ€‹Anthropicโ€‹, 2024.03.04)
  17. ๐Œ๐ข๐ฌ๐ญ๐ซ๐š๐ฅ-๐‹๐š๐ซ๐ ๐ž (โ€‹Mistral AIโ€‹, 2024.02.26)
  18. ๐†๐ž๐ฆ๐ฆ๐š (โ€‹Googleโ€‹, 2024.02.21)
  19. ๐๐ฐ๐ž๐ง-1.5 (โ€‹Alibaba Groupโ€‹, 2024.02.04)
  20. ๐’๐จ๐ฅ๐š๐ซ-๐Œ๐ข๐ง๐ข (โ€‹Upstageโ€‹, 2024.01.25)
  21. ๐’๐จ๐ฅ๐š๐ซ-10.7๐ (โ€‹Upstageโ€‹, 2023.12.23)

๐Ÿ“ 2024 LLM Survey: Comprehensive Research Overview

Retrieve: What are the latest research trends across training, data, RAG, serving, and agents? This section compiles essential survey papers that capture the state of the art.

Essential Reading: These surveys provide comprehensive overviews of rapidly evolving LLM research areas.

*Figure: LLM 2024 survey overview*

Survey Categories Overview

```mermaid
graph TB
    A[2024 LLM Surveys] --> B[Training]
    A --> C[Data]
    A --> D[RAG]
    A --> E[Serving]
    A --> F[Agent]

    B --> B1[Self-Evolution]
    B --> B2[Continual Learning]
    B --> B3[Pre-trained Models]

    C --> C1[Datasets]
    C --> C2[Data Selection]
    C --> C3[Instruction Tuning]

    D --> D1[RALM Survey]
    D --> D2[AIGC RAG]
    D --> D3[LLM RAG]

    E --> E1[Inference]
    E --> E2[Invocation Methods]
    E --> E3[Resource Efficiency]

    F --> F1[Multimodal Agents]
    F --> F2[Multi-Agents]
    F --> F3[Personal Agents]

    style A fill:#e1f5ff
    style B fill:#fff3cd
    style C fill:#d4edda
    style D fill:#f8d7da
    style E fill:#e7d4f8
    style F fill:#ffe5e5
```

📚 Training Surveys

Retrieve: How do LLMs evolve and adapt? These surveys explore self-evolution, continual learning, and transfer learning.

| Survey | Date | Focus | arXiv | GitHub |
|--------|------|-------|-------|--------|
| Self-Evolution of LLMs | 2024.04.22 | Autonomous improvement mechanisms | Link | Repo |
| Continual Learning of LLMs | 2024.04.25 | Lifelong learning approaches | Link | Repo |
| Continual Learning with PTMs | 2024.01.29 | Pre-trained model adaptation | Link | Repo |

📊 Data Surveys

Innovate: Data quality and selection are critical for LLM performance. These surveys explore dataset curation and optimization.

| Survey | Date | Focus | arXiv | GitHub |
|--------|------|-------|-------|--------|
| Datasets for LLMs | 2024.02.28 | Comprehensive dataset catalog | Link | Repo |
| Data Selection for LMs | 2024.02.26 | Selection strategies | Link | Repo |
| Data Selection for Instruction Tuning | 2024.02.04 | Instruction data curation | Link | Repo |

๐Ÿ” RAG Surveys

Retrieve: Retrieval-Augmented Generation is transforming how LLMs access knowledge. These surveys cover the latest RAG research.

| Survey | Date | Focus | arXiv | GitHub |
|--------|------|-------|-------|--------|
| RAG and RAU Survey | 2024.04.30 | RALM in NLP | Link | Repo |
| RAG for AIGC | 2024.02.29 | AI-generated content | Link | Repo |
| RAG for LLMs | 2023.12.18 | Comprehensive RAG overview | Link | Repo |

⚡ Serving Surveys

Innovate: Efficient inference and serving are crucial for production deployment. These surveys explore optimization strategies.

| Survey | Date | Focus | arXiv | GitHub |
|--------|------|-------|-------|--------|
| LLM Inference Unveiled | 2024.02.26 | Roofline model insights | Link | Repo |
| Effective LLM Service Invocation | 2024.02.05 | LLMaaS strategies | Link | Repo |
| Resource-Efficient LLMs | 2024.01.01 | Efficiency optimization | Link | Repo |

🤖 Agent Surveys

Retrieve: AI agents represent the next frontier. These surveys explore multimodal, multi-agent, and personal agent systems.

| Survey | Date | Focus | arXiv | GitHub |
|--------|------|-------|-------|--------|
| Large Multimodal Agents | 2024.02.23 | Vision-language agents | Link | Repo |
| LLM-based Multi-Agents | 2024.01.21 | Multi-agent systems | Link | Repo |
| Personal LLM Agents | 2024.01.10 | Personalization & security | Link | Repo |

```mermaid
graph LR
    A[2024 LLM Research] --> B[Training<br/>3 surveys]
    A --> C[Data<br/>3 surveys]
    A --> D[RAG<br/>3 surveys]
    A --> E[Serving<br/>3 surveys]
    A --> F[Agent<br/>3 surveys]

    style A fill:#e1f5ff
    style B fill:#fff3cd
    style C fill:#d4edda
    style D fill:#f8d7da
    style E fill:#e7d4f8
    style F fill:#ffe5e5
```

Key Takeaways

Retrieve: These 15 comprehensive surveys cover the essential areas of LLM research: training methodologies, data strategies, RAG systems, serving optimization, and agent architectures.

Innovate: By studying these surveys, you can retrieve the latest research insights and innovate on your own LLM applications, staying at the forefront of this rapidly evolving field.

Curiosity โ†’ Retrieve โ†’ Innovation: Start with curiosity about LLM capabilities, retrieve knowledge from these surveys, and innovate by applying cutting-edge techniques to your projects.

Information about Tokens in LLMs

Why do we keep talking about โ€œtokensโ€ in LLMs instead of words?

It turns out to be much more efficient, in terms of model performance, to break words into sub-word units (tokens)!

The approach used in most modern LLMs since GPT-1 is Byte Pair Encoding (BPE). The idea is to use as tokens the sub-word units that appear most often in the training data. The algorithm works as follows (a minimal code sketch follows the list):

  • We start with a character-level tokenization
  • We count the pair frequencies
  • We merge the most frequent pair
  • We repeat the process until the dictionary is as big as we want it to be
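
Here is a minimal, self-contained Python sketch of that merge loop over a toy corpus. The corpus and helper names (`get_pair_counts`, `merge_pair`) are illustrative only, and the sketch omits details real tokenizers handle, such as end-of-word markers and byte-level fallback.

```python
from collections import Counter

def get_pair_counts(words):
    """Count adjacent symbol pairs, weighted by word frequency."""
    pairs = Counter()
    for symbols, freq in words.items():
        for a, b in zip(symbols, symbols[1:]):
            pairs[(a, b)] += freq
    return pairs

def merge_pair(words, pair):
    """Rewrite every word, replacing occurrences of `pair` with one merged symbol."""
    merged = {}
    for symbols, freq in words.items():
        out, i = [], 0
        while i < len(symbols):
            if i < len(symbols) - 1 and (symbols[i], symbols[i + 1]) == pair:
                out.append(symbols[i] + symbols[i + 1])
                i += 2
            else:
                out.append(symbols[i])
                i += 1
        merged[tuple(out)] = freq
    return merged

# Toy corpus: word -> frequency, starting from a character-level tokenization.
corpus = {"lower": 5, "lowest": 2, "newer": 6, "wider": 3}
words = {tuple(w): f for w, f in corpus.items()}

num_merges = 10  # the dictionary-size knob discussed below
merges = []
for _ in range(num_merges):
    pairs = get_pair_counts(words)
    if not pairs:
        break
    best = max(pairs, key=pairs.get)  # most frequent adjacent pair
    merges.append(best)
    words = merge_pair(words, best)

print(merges)  # learned merge rules, most frequent first
print(words)   # corpus rewritten with merged sub-word tokens
```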

The size of the dictionary becomes a hyperparameter that we can adjust based on our training data. For example, GPT-1 uses a dictionary of roughly 40K merges; GPT-2, GPT-3, and ChatGPT use roughly 50K; and Llama 3 uses about 128K.
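
To see such a dictionary in practice, you can inspect a real BPE vocabulary with OpenAI's tiktoken package (a quick illustration, assuming tiktoken is installed; the exact splits and vocabulary size depend on the encoding you pick):

```python
import tiktoken  # pip install tiktoken

# cl100k_base is the BPE vocabulary used by GPT-3.5/GPT-4-era models.
enc = tiktoken.get_encoding("cl100k_base")

word = "tokenization"
ids = enc.encode(word)

print(enc.n_vocab)                     # dictionary size (~100K for cl100k_base)
print(ids)                             # token ids for the word
print([enc.decode([i]) for i in ids])  # sub-word pieces, e.g. something like ['token', 'ization']
```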

*Figure: Tokens from words in LLMs*

This post is licensed under CC BY 4.0 by the author.