
MetaCluster: Enabling Deep Compression of Kolmogorov-Arnold Network

Abstract

Kolmogorov-Arnold Networks (KANs) replace scalar weights with per-edge vectors of basis coefficients, thereby boosting expressivity and accuracy, but at the cost of a multiplicative increase in parameters and memory. We propose MetaCluster, a framework that makes KANs highly compressible without sacrificing accuracy. Specifically, a lightweight meta-learner, trained jointly with the KAN, maps low-dimensional embeddings to coefficient vectors, shaping them to lie on a low-dimensional manifold that is amenable to clustering. We then run K-means in coefficient space and replace per-edge vectors with shared centroids. Afterwards, the meta-learner can be discarded, and a brief fine-tuning of the centroid codebook recovers any residual accuracy loss. The resulting model stores only a small codebook and per-edge indices, exploiting the vector nature of KAN parameters to amortize storage across multiple coefficients. On MNIST, CIFAR-10, and CIFAR-100, across standard KANs and ConvKANs using multiple basis functions, MetaCluster achieves a reduction of up to 80$\times$ in parameter storage, with no loss in accuracy. Code will be released upon publication.
Authors (3)
Matthew Raffel
Adwaith Renjith
Lizhong Chen
Submitted
October 21, 2025
arXiv Category
cs.LG
arXiv PDF

Key Contributions

This paper proposes MetaCluster, a framework for deeply compressing Kolmogorov-Arnold Networks (KANs) without sacrificing accuracy. A lightweight meta-learner, trained jointly with the KAN, maps low-dimensional embeddings to coefficient vectors, shaping them onto a manifold amenable to clustering; K-means then replaces per-edge coefficient vectors with shared centroids, so the model stores only a small codebook and per-edge indices.
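The clustering-and-codebook step described above can be sketched in a few lines. This is an illustrative example, not the authors' implementation: the array shapes (`n_edges`, `n_coeffs`) and centroid count are arbitrary stand-ins, and random data substitutes for the meta-learner-shaped coefficients.

```python
# Illustrative sketch (not the paper's code): compress per-edge KAN
# coefficient vectors into a shared centroid codebook via K-means.
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
n_edges, n_coeffs, n_centroids = 1024, 8, 32  # hypothetical sizes

# Stand-in for trained per-edge coefficient vectors; in MetaCluster these
# are first shaped onto a low-dimensional manifold by the jointly trained
# meta-learner, which makes them cluster well.
coeffs = rng.normal(size=(n_edges, n_coeffs)).astype(np.float32)

# Cluster in coefficient space; keep only centroids + per-edge indices.
km = KMeans(n_clusters=n_centroids, n_init=10, random_state=0).fit(coeffs)
codebook = km.cluster_centers_.astype(np.float32)  # (n_centroids, n_coeffs)
indices = km.labels_.astype(np.int32)              # (n_edges,)

# Reconstruct the (lossy) coefficients used at inference time; a brief
# fine-tuning of the codebook would then recover residual accuracy loss.
compressed = codebook[indices]
assert compressed.shape == coeffs.shape
```

After this step the meta-learner is discarded; only `codebook` and `indices` need to be stored.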

Business Value

Enables the deployment of highly expressive KAN models on resource-constrained devices or in applications requiring lower memory footprints, expanding their practical utility.
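A quick back-of-envelope calculation shows where the memory savings come from. The numbers below are hypothetical (they do not reproduce the paper's 80$\times$ figure, which depends on model and basis choice); the point is that codebook-plus-indices storage grows with the number of edges only through a small index, while the full coefficient vectors are amortized across the codebook.

```python
# Illustrative storage comparison: dense per-edge coefficient vectors
# vs. a shared codebook plus per-edge indices. All sizes are assumptions.
n_edges, n_coeffs, n_centroids = 100_000, 8, 256

dense_bytes = n_edges * n_coeffs * 4        # float32 vector on every edge
codebook_bytes = n_centroids * n_coeffs * 4 # shared float32 centroids
index_bytes = n_edges * 1                   # one 8-bit index per edge (256 centroids)
compressed_bytes = codebook_bytes + index_bytes

ratio = dense_bytes / compressed_bytes
print(f"{ratio:.1f}x smaller")  # prints "29.6x smaller" for these sizes
```

Larger coefficient vectors (e.g. richer basis functions) raise `dense_bytes` but barely change the index cost, which is how the vector nature of KAN parameters amortizes storage.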