
[2502.09992] Large Language Diffusion Models - arXiv.org
Feb 14, 2025 · Autoregressive models (ARMs) are widely regarded as the cornerstone of large language models (LLMs). We challenge this notion by introducing LLaDA, a diffusion model …
[2402.06196] Large Language Models: A Survey - arXiv.org
Feb 9, 2024 · In this paper, we review some of the most prominent LLMs, including three popular LLM families (GPT, LLaMA, PaLM), and discuss their characteristics, contributions and …
[2303.18223] A Survey of Large Language Models - arXiv.org
Mar 31, 2023 · In this survey, we review the recent advances of LLMs by introducing the background, key findings, and mainstream techniques. In particular, we focus on four major …
GitHub - Hannibal046/Awesome-LLM: Awesome-LLM: a curated …
Here is a curated list of papers about large language models, especially relating to ChatGPT. It also contains frameworks for LLM training, tools to deploy LLM, courses and tutorials about …
The 7 best arXiv papers to learn how LLMs work - Deepgram
Feb 11, 2025 · These papers cover various aspects of LLMs, including their architectures, pre-training methods, scaling properties, few-shot learning capabilities, and applications in tasks …
A Review on Large Language Models: Architectures, Applications ...
Feb 13, 2024 · This article thoroughly overviews LLMs, including their history, architectures, transformers, resources, training methods, applications, impacts, challenges, etc. This paper …
AGI-Edgerunners/LLM-Planning-Papers - GitHub
Aug 12, 2023 · LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models. Chan Hee Song, Jiaman Wu, Clayton Washington, Brian M. Sadler, Wei …
LLM Research Papers: The 2024 List
Dec 8, 2024 · A curated list of interesting LLM-related research papers from 2024, shared for those looking for something to read over the holidays.
Training Large Language Models to Reason in a Continuous Latent …
Dec 9, 2024 · Large language models (LLMs) are restricted to reason in the "language space", where they typically express the reasoning process with a chain-of-thought (CoT) to solve a …
zjunlp/LLMAgentPapers: Must-read Papers on LLM Agents. - GitHub
Must-read Papers on LLM Agents. Contribute to zjunlp/LLMAgentPapers development by creating an account on GitHub.