DeepSeek Native Sparse Attention: Advanced Attention Mechanism for LLMs ...
Figure 1 from LLMs as Sparse Retrievers: A Framework for First-Stage ...
Exploring In-Context Reinforcement Learning in LLMs with Sparse ...
[Paper Reading] Leave it to the Specialist: Repair Sparse LLMs with Sparse Fine ...
Finding Sparse Linear Connections between Features in LLMs — AI ...
Finding Sparse Linear Connections between Features in LLMs — LessWrong
Sparse Llama: Revolutionizing LLMs with 70% Sparsity
Sparse Fine-Tuning: Revolutionizing Inference Speeds in LLMs | by AI ...
Sparse LLMs at inference: 6x faster transformers! | DEJAVU paper ...
Dynamic sparse attention makes LLMs run faster on regular GPUs by being ...
(PDF) Less is More: Sparse Watermarking in LLMs with Enhanced Text Quality
Optimizing Sparse LLMs
DeepSparse Sparse LLMs - a neuralmagic Collection
LServe: efficient system for serving LLMs with hybrid sparse attention ...
[Paper Review] Sparse Logit Sampling: Accelerating Knowledge Distillation in LLMs
A Window Into LLMs | Sparse Autoencoders Explained - YouTube
Figure 1 from Mistral-SPLADE: LLMs for better Learned Sparse Retrieval ...
Mixture-of-Experts (MoE) Routing Algorithms for Sparse LLMs - ML Journey
Enhancing Multiple Dimensions of Trustworthiness in LLMs via Sparse ...
FlashAttention, Sparse Attention & How Modern LLMs Handle 100K+ Token ...
[Paper Review] SparsePO: Controlling Preference Alignment of LLMs via Sparse ...
🚀Native Sparse Attention for Long Context LLMs | by Tahir | Dec, 2025 ...
Memory-Efficient Fine-Tuning of LLMs through Sparse Adapter and Mixture ...
DeepMind's Leap in Interpreting LLMs with Sparse Autoencoders
[Paper Review] How LLMs Learn: Tracing Internal Representations with Sparse ...
Scaling Sparse and Dense Retrieval in Decoder-Only LLMs
Table 1 from LLMs as Sparse Retrievers: A Framework for First-Stage ...
Hoagy Cunningham — Finding distributed features in LLMs with sparse ...
Scaling laws for sparse LLMs 😮 🔍 Researchers are studying the ...
The Sparse Frontier: Sparse Attention Trade-offs in Transformer LLMs ...
Uncovering Cross-Linguistic Disparities in LLMs using Sparse Autoencoders
DeepSeek’s Native Sparse Attention (NSA): A Breakthrough in Efficient ...
Figure 1 from EBFT: Effective and Block-Wise Fine-Tuning for Sparse ...
Uni-MoE: A Unified Multimodal LLM based on Sparse MoE Architecture ...
Formulation of Feature Circuits with Sparse Autoencoders in LLM ...
An Intuitive Explanation of Sparse Autoencoders for LLM ...
LLM Interpretability and Sparse Autoencoders: Research from OpenAI and ...
[R] Enabling sparse, foundational LLMs for faster and more efficient ...
MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via ...
Sparse LLM Inference on CPU
How LLMs Work: From Neural Networks to Real-World Uses
Demystifying Sparse Attention: A Comprehensive Guide from Scratch | by ...
Support for Sparse LLM Operations
Understanding Sparse and Efficient Attention Mechanisms in Large ...
(PDF) SLoPe: Double-Pruned Sparse Plus Lazy Low-Rank Adapter ...
(PDF) ELO-Mask: Effective and Layerwise Optimization of Mask for sparse ...
Pushing the Boundaries of LLMs: Sparse & Flash Attention, Quantisation ...
Introducing Sparse Llama: 70% Smaller, 3x Faster, Full Accuracy - Cerebras
Exploring the Sparse Frontier: How Researchers are Rethinking Attention ...
Large Language Models: A Brief Analysis of DeepSeek (Part 4): Native Sparse Attention (NSA) Explained - 第七子007 - cnblogs
Sparse-dLLM: Accelerating Diffusion LLMs with Dynamic Cache Eviction ...
Sparse LLM Inference on CPU | Kevin Farinango Cinilin
The Evolution of LLM MoE: From Basic Simplified MoE, to Sparse MoE, to DeepSeek's share_expert sparse ...
[2405.16325] SLoPe: Double-Pruned Sparse Plus Lazy Low-Rank Adapter ...
Dynamic Low-Rank Sparse Adaptation for Large Language Models | AI ...
Sparse Attention in LLMs: Making AI More Efficient | nat.io
Open the Artificial Brain: Sparse Autoencoders for LLM Inspection ...
MInference: Million-Tokens Prompt Inference for LLMs
GitHub - NimbleEdge/sparse_transformers: Sparse Inferencing for ...
ReLU^2 Wins: Discovering Efficient Activation Functions for Sparse ...
The research paper explores the concept of sparse activation in large ...
[Paper Review] Sparse Brains are Also Adaptive Brains: Cognitive-Load-Aware ...
Sparse Compiler: Unlocking New Frontiers in LLM Inference | A100 GPU ...
SparseServe: Unlocking Parallelism for Dynamic Sparse Attention in Long ...
(PDF) A new sparse convex combination of ZA-LLMS and RZA-LLMS algorithms
Advanced Modern LLM Part 3: Sparse Attention and Application to Long ...
Implementing Dense and Sparse LLMs from Scratch - Zhihu
LLM optimizations for sparse matrix processing on Jetson Orin and other ...
Figure 1 from Uncovering Cross-Linguistic Disparities in LLMs using ...
Open the Artificial Brain: Sparse Autoencoders for LLM Inspection | by ...
Mastering LLM Techniques: Inference Optimization – GIXtools
Q-Sparse: A New Artificial Intelligence AI Approach to Enable Full ...
[arXiv] Microsoft Research introduces Q-Sparse: a breakthrough in ...
[PDF] SpQR: A Sparse-Quantized Representation for Near-Lossless LLM ...
Paper page - ReLU^2 Wins: Discovering Efficient Activation Functions ...
DeepSeek AI Researchers Introduce Engram: A Conditional Memory Axis For ...
7 Steps to Mastering Large Language Models (LLMs) - KDnuggets
SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight ...
A Survey of LLM Inference Optimization Techniques: Methods for Improving LLM Inference Performance - CSDN Blog
Scaling LLMs: GPT-3 and Beyond | AI Tutorial | Next Electronics
On How Sparse Autoencoders Are Reshaping LLM Interpretability - Zhihu
LLM Inference Series: 2. The two-phase process behind LLMs’ responses ...
[2402.03804] ReLU2 Wins: Discovering Efficient Activation Functions for ...
Relocate-Vote: Using Sparsity Information to Exploit Ciphertext Side ...
GitHub - zyxxmu/DSnoT: Official Pytorch Implementation of Our Paper ...
Publications | Tongji University ADMIS Lab
Introducing NVFP4 for Efficient and Accurate Low-Precision Inference ...
Turbo Sparse: Balancing LLM Inference Performance and Speed - Zhihu
Figure 9 from MInference 1.0: Accelerating Pre-filling for Long-Context ...
Q-Sparse: a sparsely-activated LLM by Microsoft | Microsoft Research ...
Fully Sparsely-Activated Large Models: Q-Sparse Breaks Through LLM Inference Efficiency - Microsoft Research
Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large ...
Enable Efficient LLM Inference with SqueezeLLM
Figure 3 from SpQR: A Sparse-Quantized Representation for Near-Lossless ...
GitHub - nanowell/Q-Sparse-LLM: My Implementation of Q-Sparse: All ...