
【ICCV 2025】Any-SSR: How Recursive Least Squares Works in Continual Learning of Large Language Model

Kai Tong, Kang Pan, Xiao Zhang, Erli Meng, Run He, Yawen Cui, Nuoyan Guo, Huiping Zhuang*

Introduction

This is the official implementation of Any-SSR: How Recursive Least Squares Works in Continual Learning of Large Language Model.

Overview

Environment

We recommend using Anaconda to set up the development environment.

git clone --depth=1 https://github.com/ZHUANGHP/Any-SSR.git

cd Any-SSR
conda env create -f environment.yaml

Quick Start

All processed data can be downloaded from the Trace Benchmark.

You should specify the directories of the dataset and the pretrained model (we used Llama-2-7b-chat-hf). You can download the pretrained weights via the code or directly from Hugging Face.

After downloading the dataset and the pretrained weights, run

python train_router_ana_continual.py

to train the router weights recursively, then run

python eval_router_ana.py

to evaluate the routing accuracy.

In our experiments, the router achieves nearly 100% accuracy.
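The recursive training above rests on recursive least squares (RLS): the router weights are updated sample by sample, so new tasks can be absorbed without revisiting old data. A minimal numpy sketch of the RLS update, not the repository's implementation (dimensions and the regularizer `gamma` are illustrative):

```python
import numpy as np

def rls_fit(X, Y, gamma=1e-3):
    """Fit weights W recursively; equivalent to the ridge solution
    W = (X^T X + gamma I)^{-1} X^T Y, but one sample at a time."""
    d, c = X.shape[1], Y.shape[1]
    W = np.zeros((d, c))
    P = np.eye(d) / gamma                    # inverse regularized autocorrelation
    for x, y in zip(X, Y):
        x = x[:, None]                       # (d, 1)
        k = P @ x / (1.0 + x.T @ P @ x)      # gain vector (d, 1)
        W = W + k @ (y[None, :] - x.T @ W)   # correct with the prediction error
        P = P - k @ (x.T @ P)                # rank-1 update of the inverse
    return W

# Sanity check: the recursive result matches the closed-form ridge solution.
rng = np.random.default_rng(0)
X = rng.standard_normal((200, 8))
Y = rng.standard_normal((200, 3))
gamma = 1e-3
W_rls = rls_fit(X, Y, gamma)
W_ridge = np.linalg.solve(X.T @ X + gamma * np.eye(8), X.T @ Y)
print(np.allclose(W_rls, W_ridge, atol=1e-6))  # True
```

Because each recursive step is exact (by the matrix inversion lemma), the streaming solution equals the batch solution, which is why old data never needs to be replayed.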

LoRA Model Training

You can use

bash train_lora.sh

to train a LoRA model for each task in the Trace dataset.
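The script wraps the actual training; the idea behind a LoRA module is a frozen pretrained weight plus a trainable low-rank residual. A minimal numpy sketch, with dimensions and scaling chosen for illustration rather than taken from the repo's configuration:

```python
import numpy as np

rng = np.random.default_rng(0)
d_in, d_out, r = 16, 16, 4                 # r << d gives the low-rank bottleneck

W = rng.standard_normal((d_in, d_out))     # frozen pretrained weight
A = rng.standard_normal((d_in, r)) * 0.01  # trainable down-projection
B = np.zeros((r, d_out))                   # trainable up-projection, zero-init
alpha = 8.0                                # LoRA scaling hyperparameter

def lora_forward(x):
    # Base path plus scaled low-rank residual: x W + (alpha / r) x A B
    return x @ W + (alpha / r) * (x @ A @ B)

x = rng.standard_normal((2, d_in))
print(np.allclose(lora_forward(x), x @ W))  # True: B starts at zero

B += rng.standard_normal((r, d_out))        # pretend one training step
print(np.allclose(lora_forward(x), x @ W))  # now False: the adapter is active
```

Zero-initializing `B` makes the adapted model start out identical to the base model, so each task's adapter only stores the low-rank delta learned for that task.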

Evaluate

bash scripts/inference.sh

This command starts inference. Before running it, please specify the directories of the LoRA models, the router weights, and other resources in the script.
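Conceptually, inference routes each input to the task the router predicts and then decodes with that task's LoRA adapter. A toy sketch of the dispatch logic, where every name and shape is hypothetical rather than the script's actual interface:

```python
import numpy as np

rng = np.random.default_rng(0)
num_tasks, d = 3, 8

# Hypothetical stand-ins: router weights plus one adapter delta per task.
W_router = rng.standard_normal((d, num_tasks))
W_base = rng.standard_normal((d, d))
task_deltas = [rng.standard_normal((d, d)) for _ in range(num_tasks)]

def infer(x):
    task = int(np.argmax(x @ W_router))            # router picks the task
    out = x @ (W_base + task_deltas[task])         # apply that task's adapter
    return task, out

x = rng.standard_normal(d)
task, out = infer(x)
print(task, out.shape)
```

Because the router is near-perfect, each input almost always reaches the adapter trained on its own task, which is what keeps the per-task performance from interfering.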

From a New Branch: Analytic Continual Learning

This is the first LLM work in our continual learning branch, Analytic Continual Learning. We have published over 20 papers in this branch (see My Scholar)!

Cite our paper

If you find our paper or this repository useful, please consider citing our paper.

@InProceedings{Tong_2025_ICCV,
    author    = {Tong, Kai and Pan, Kang and Zhang, Xiao and Meng, Erli and He, Run and Cui, Yawen and Guo, Nuoyan and Zhuang, Huiping},
    title     = {Any-SSR: How Recursive Least Squares Works in Continual Learning of Large Language Model},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2025},
    pages     = {3047-3057}
}

About

This is the official code for Any-SSR "Analytic Subspace Routing: How Recursive Least Squares Works in Continual Learning of Large Language Model"
