Orion: Enabling Self-adaptive Memory Management for On-device Online Continual Learning

Cong Liu; Nikil Dutt; Zexin Li

arxiv: 2605.26473 · v1 · pith:WEZ6WGOCnew · submitted 2026-05-26 · 📡 eess.SY · cs.SY

Orion: Enabling Self-adaptive Memory Management for On-device Online Continual Learning

Zexin Li , Nikil Dutt , Cong Liu This is my paper

Pith reviewed 2026-06-29 16:23 UTC · model grok-4.3

classification 📡 eess.SY cs.SY

keywords online continual learningmemory managementon-device learningself-adaptive systemsrobotic applicationsbuckets effectreplay bufferstraining latency

0 comments

The pith

Orion uses a buckets-effect runtime indicator to dynamically reallocate memory across online continual learning components for on-device robotic deployment.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Online continual learning must adapt to new data in real time on memory-limited devices such as robots, but fixed memory allocations create unacceptable trade-offs in speed, plasticity, and stability. Orion supplies a self-adaptive framework that continuously redistributes memory among batch processing, replay buffers, and optimizers at both operating-system and application layers. The mechanism rests on URGE, an indicator that identifies the current scarcest resource and triggers coordinated reallocation plus prefetching. Prototype tests across multiple algorithms, benchmarks, and robotic hardware show faster training with preserved balance and low added cost. Integration into an actual autonomous navigation robot confirms practical feasibility under changing workloads.

Core claim

Orion is a holistic framework that leverages URGE, a unified runtime indicator grounded in the Buckets effect, to dynamically reallocate memory across OCL components by jointly coordinating batch processing, replay buffers, and optimization strategies at both the OS and application level, together with system-level data prefetching, thereby enabling feasible on-device deployment while achieving significant training speedups with minimal runtime, memory, and energy overhead.

What carries the argument

URGE, a unified runtime indicator grounded in the Buckets effect principle that system performance is bounded by its scarcest resource, which dynamically coordinates memory reallocation across OCL components.

If this is right

State-of-the-art OCL algorithms become deployable on memory-constrained robotic hardware without offline retuning.
Training latency decreases while plasticity-stability balance is preserved across changing data distributions.
Memory pressure shifts during learning are handled automatically rather than through manual intervention.
System-level prefetching further reduces I/O stalls without extra application changes.
The same prototype integrates with Avalanche-lib and runs on standard autonomous-robot platforms.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Similar indicator-driven reallocation could be tested on other edge devices such as drones or wearable sensors facing variable memory pressure.
The buckets-effect logic may generalize to joint management of compute, bandwidth, and energy in multi-task on-device learning.
Developers could explore whether URGE-style indicators improve non-continual on-device training under tight memory caps.

Load-bearing premise

URGE can reliably coordinate memory reallocation across OCL components at OS and application levels without introducing instability or unacceptable overhead in real robotic workloads.

What would settle it

A controlled run on the target robotic platform in which URGE-driven reallocation produces either measurable instability, increased energy draw, or slower convergence compared with static allocation under identical data streams.

Figures

Figures reproduced from arXiv: 2605.26473 by Cong Liu, Nikil Dutt, Zexin Li.

**Figure 1.** Figure 1: No silver bullet. Visualization of the complex tradeoff space among three key performance metrics of on-device OCL. Memory-agnostic co-optimizing training latency, plasticity, and stability could easily raise out-of-memory (OOM) concerns under stringent memory constraints. resource-constrained platforms [2]. Memory selection (GSS) favors informative samples [5]. Constraintand regularization-based methods,… view at source ↗

**Figure 2.** Figure 2: Impact of key hyperparameter choices on OCL metrics under memory constraints. [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 3.** Figure 3: Design overview of Orion. as URGEt = 1 1 + e kp(Pt−Pth) · 1 1 + e ks(St−Sth) · 1 1 + e−kl(Lt−Lth) · 1 1 + e km(Mt−Mmax) (1) where kp, ks, kl , and km are scaling factors that control the sensitivity of the URGE to changes in plasticity, stability, training latency, and memory pressure, respectively. The terms in the URGE equation (Eq. 1) reflect the system’s state. If plasticity is low, the first term is h… view at source ↗

**Figure 4.** Figure 4: Overall effectiveness of Orion using the ER algorithm evaluated on five benchmarks. Crosses indicate OOM. 11 [PITH_FULL_IMAGE:figures/full_fig_p011_4.png] view at source ↗

**Figure 5.** Figure 5: Overall effectiveness of Orion on four OCL algorithms on the CORe50-NC benchmark. Crosses indicate OOM. 5.2 Overall Effectiveness This section evaluates the effectiveness of Orion across three platforms, benchmarks, and OCL algorithms. The evaluation is divided into three subsets: (1) performance across benchmarks using a state-of-the-art OCL algorithm, (2) performance across representative OCL algorithms … view at source ↗

**Figure 6.** Figure 6: A realistic case study based on an NVIDIA Jetson GPU-enabled autonomous navigation robot [PITH_FULL_IMAGE:figures/full_fig_p016_6.png] view at source ↗

**Figure 7.** Figure 7: Left: Memory breakdown on GSS algorithm in the case study. The red line represents Xavier’s maximum memory. Right: System-level memory usage profiling for GSS algorithm. The red line represents Xavier’s maximum memory. Red crosses indicate the OOM point. Orion to achieve substantial gains in training latency performance while maintaining acceptable plasticity and stability. 5.5 Overhead Analysis Execution … view at source ↗

**Figure 8.** Figure 8: Ablation study of data prefetching [PITH_FULL_IMAGE:figures/full_fig_p017_8.png] view at source ↗

read the original abstract

Online continual learning (OCL) enables real-time adaptation to new data, making it crucial for dynamic robotic applications. However, its practical deployment is hindered by memory constraints in resource-limited systems, which affect key trade-offs in training latency, plasticity, and stability. Unlike offline parameter tuning, which cannot account for the dynamic shift in memory pressure and workload complexity as OCL progresses, an online and self-adaptive approach is essential for robust on-device deployment. This paper proposes Orion, a holistic framework designed to co-optimize training latency, plasticity, and stability of state-of-the-art OCL models under strict memory constraints, enabling feasible on-device deployment. At its core, Orion leverages URGE, a unified runtime indicator grounded in the ``Buckets effect'' principle that system performance is bounded by its scarcest resource, to dynamically reallocate memory across OCL components by jointly coordinating batch processing, replay buffers, and optimization strategies at both the OS and application level. Furthermore, Orion introduces system-level data prefetching techniques to maximize efficiency. A system prototype of Orion has been implemented using the widely adopted \texttt{Avalanche-lib} and thoroughly evaluated across a diverse range of OCL algorithms, benchmarks, and hardware platforms commonly used in autonomous robotic applications. To further demonstrate its practical utility, Orion is integrated into a realistic autonomous navigational robot powered by OCL. The results show that Orion achieves significant training speedups while maintaining balanced performance and effectively adapting to various scenarios, all with minimal runtime, memory, and energy overhead, making Orion a practical solution for on-device continual learning.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Orion adds a concrete runtime with URGE for memory reallocation in on-device OCL but the stability evidence under dynamic loads stays thin.

read the letter

Orion is a framework that uses a buckets-effect indicator called URGE to dynamically shift memory among batch processing, replay buffers, and optimization at both OS and application levels, plus some prefetching, all aimed at on-device online continual learning for robots.

It does a decent job of shipping a working prototype on Avalanche-lib and testing it across multiple OCL algorithms, benchmarks, and hardware platforms, then wiring it into a real autonomous navigation robot. That end-to-end piece is the part that actually matters for people who need to run this stuff on constrained hardware.

The soft spot is the robustness claim. The central promise is that URGE coordinates everything without introducing instability or big overhead when memory pressure changes, yet the abstract gives no variance numbers, no sudden-pressure test results, and no clear separation between measured outcomes and the parameters used to tune the system. The stress-test note is on target here; without those data the self-adaptive part is still an assumption.

This paper is for embedded-ML and robotics groups that want a runnable memory manager rather than another theoretical bound. A reader who needs to deploy continual learning on a robot would get practical value from the implementation and the robot demo.

It deserves a serious referee because it has shipped code, real hardware runs, and a system-level integration that most papers skip. The evaluation needs tightening on dynamic stability, but the work is grounded enough to review.

Referee Report

2 major / 1 minor

Summary. The paper proposes Orion, a holistic framework for co-optimizing training latency, plasticity, and stability in on-device online continual learning under memory constraints. It introduces URGE, a unified runtime indicator based on the buckets effect principle, to dynamically reallocate memory by jointly coordinating batch processing, replay buffers, and optimization strategies at both OS and application levels, along with system-level data prefetching. The framework is implemented using Avalanche-lib and evaluated across OCL algorithms, benchmarks, and hardware platforms, with additional integration into an autonomous navigational robot, claiming significant speedups with balanced performance and minimal overhead.

Significance. If the claims hold with independent measurements, Orion would represent a practical advance for on-device OCL in dynamic robotic settings by providing online self-adaptation to varying memory pressure, which static offline tuning cannot achieve. The prototype implementation, evaluation across algorithms/benchmarks/hardware, and real-robot integration are strengths that support applicability.

major comments (2)

[Abstract] Abstract: the central claim that Orion 'achieves significant training speedups while maintaining balanced performance' with 'minimal runtime, memory, and energy overhead' is load-bearing for the practicality assertion, yet no quantitative results, baselines, error bars, or stability metrics (e.g., variance under memory pressure shifts) are supplied, preventing verification that numbers are independent of tuning parameters.
[Evaluation] Evaluation: the assumption that URGE can jointly coordinate reallocation without instability is load-bearing for the self-adaptive claim, but no concrete metrics on robustness (such as plasticity/stability trade-off variance during sudden memory pressure changes in robotic workloads) are reported.

minor comments (1)

[Abstract] The 'Buckets effect' principle is invoked without a citation or short explanation of its mapping to OCL memory components.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback. We address the two major comments below and will revise the manuscript to improve clarity and support for the claims.

read point-by-point responses

Referee: [Abstract] Abstract: the central claim that Orion 'achieves significant training speedups while maintaining balanced performance' with 'minimal runtime, memory, and energy overhead' is load-bearing for the practicality assertion, yet no quantitative results, baselines, error bars, or stability metrics (e.g., variance under memory pressure shifts) are supplied, preventing verification that numbers are independent of tuning parameters.

Authors: We agree that the abstract would be strengthened by including quantitative highlights. The evaluation sections of the manuscript report detailed results with baselines, multiple runs, error bars, and overhead measurements across algorithms, benchmarks, and hardware. In the revision we will update the abstract to summarize key quantitative outcomes (e.g., observed speedups and overhead ranges) with explicit pointers to the corresponding figures and tables. revision: yes
Referee: [Evaluation] Evaluation: the assumption that URGE can jointly coordinate reallocation without instability is load-bearing for the self-adaptive claim, but no concrete metrics on robustness (such as plasticity/stability trade-off variance during sudden memory pressure changes in robotic workloads) are reported.

Authors: The referee correctly notes the absence of explicit robustness metrics for sudden memory-pressure shifts. While the current evaluation demonstrates adaptation across robotic workloads and varying conditions, it does not isolate variance in the plasticity/stability trade-off under abrupt pressure changes. We will add these metrics in a revised evaluation section, including targeted analysis or additional runs that quantify the trade-off variance during memory-pressure transitions. revision: yes

Circularity Check

0 steps flagged

No significant circularity; claims rest on independent empirical evaluation

full rationale

The abstract describes Orion as a framework that uses the URGE indicator (grounded in the external 'Buckets effect' principle) to coordinate memory reallocation across OCL components, with results from prototype evaluation on Avalanche-lib across algorithms/benchmarks/hardware and robot integration. No equations, fitted parameters, self-citations, or derivation steps are presented that would reduce claimed speedups, overhead, or stability metrics to quantities defined by the same inputs or by construction. The performance assertions appear as outcomes of system implementation and testing rather than tautological renamings or self-referential fits, making the derivation self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract only; no explicit free parameters, axioms, or invented entities are described. The buckets-effect principle is invoked as grounding for URGE but its precise formalization is not given.

pith-pipeline@v0.9.1-grok · 5820 in / 1154 out tokens · 37544 ms · 2026-06-29T16:23:49.651719+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

78 extracted references · 21 canonical work pages · 4 internal anchors

[1]

Continual learning with tiny episodic memories

Arslan Chaudhry, Marcus Rohrbach, Mohamed Elhoseiny, Thalaiyasingam Ajanthan, P Dokania, P Torr, and M Ranzato. Continual learning with tiny episodic memories. InWorkshop on Multi-Task and Lifelong Reinforcement Learning, 2019

2019
[2]

Computationally budgeted continual learning: What does matter? In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 3698– 3707, 2023

Ameya Prabhu, Hasan Abed Al Kader Hammoud, Puneet K Dokania, Philip HS Torr, Ser-Nam Lim, Bernard Ghanem, and Adel Bibi. Computationally budgeted continual learning: What does matter? In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 3698– 3707, 2023

2023
[3]

Gradient episodic memory for continual learning.Ad- vances in neural information processing systems, 30, 2017

David Lopez-Paz and Marc’Aurelio Ranzato. Gradient episodic memory for continual learning.Ad- vances in neural information processing systems, 30, 2017

2017
[4]

Efficient Lifelong Learning with A-GEM

Arslan Chaudhry, Marc’Aurelio Ranzato, Marcus Rohrbach, and Mohamed Elhoseiny. Efficient life- long learning with a-gem.arXiv preprint arXiv:1812.00420, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018
[5]

Gradient based sample selection for online continual learning.Advances in neural information processing systems, 32, 2019

Rahaf Aljundi, Min Lin, Baptiste Goujaud, and Yoshua Bengio. Gradient based sample selection for online continual learning.Advances in neural information processing systems, 32, 2019

2019
[6]

Overcoming catastrophic forgetting in neural networks.Proceedings of the national academy of sciences, 114(13): 3521–3526, 2017

James Kirkpatrick, Razvan Pascanu, Neil Rabinowitz, Joel Veness, Guillaume Desjardins, Andrei A Rusu, Kieran Milan, John Quan, Tiago Ramalho, Agnieszka Grabska-Barwinska, et al. Overcoming catastrophic forgetting in neural networks.Proceedings of the national academy of sciences, 114(13): 3521–3526, 2017

2017
[7]

Learning without forgetting.IEEE transactions on pattern analysis and machine intelligence, 40(12):2935–2947, 2017

Zhizhong Li and Derek Hoiem. Learning without forgetting.IEEE transactions on pattern analysis and machine intelligence, 40(12):2935–2947, 2017

2017
[8]

A comprehensive empirical evaluation on online continual learn- ing

Albin Soutif-Cormerais, Antonio Carta, Andrea Cossu, Julio Hurtado, Vincenzo Lomonaco, Joost Van de Weijer, and Hamed Hemati. A comprehensive empirical evaluation on online continual learn- ing. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 3518–3528, 2023

2023
[9]

Online active continual learning for robotic lifelong object recognition.IEEE Transactions on Neural Networks and Learning Systems, 2023

Xiangli Nie, Zhiguang Deng, Mingdong He, Mingyu Fan, and Zheng Tang. Online active continual learning for robotic lifelong object recognition.IEEE Transactions on Neural Networks and Learning Systems, 2023

2023
[10]

Covio: Online continual learning for visual-inertial odometry

Niclas Vödisch, Daniele Cattaneo, Wolfram Burgard, and Abhinav Valada. Covio: Online continual learning for visual-inertial odometry. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 2464–2473, 2023. 19

2023
[11]

Codeps: Online continual learning for depth estimation and panoptic segmentation.arXiv preprint arXiv:2303.10147, 2023

Niclas Vödisch, Kürsat Petek, Wolfram Burgard, and Abhinav Valada. Codeps: Online continual learning for depth estimation and panoptic segmentation.arXiv preprint arXiv:2303.10147, 2023

work page arXiv 2023
[12]

From continual learning to causal discovery in robotics

Luca Castri, Sariah Mghames, and Nicola Bellotto. From continual learning to causal discovery in robotics. InAAAI Bridge Program on Continual Causality, pages 85–91. PMLR, 2023

2023
[13]

Continual learning for au- tonomous robots: A prototype-based approach.arXiv preprint arXiv:2404.00418, 2024

Elvin Hajizada, Balachandran Swaminathan, and Yulia Sandamirskaya. Continual learning for au- tonomous robots: A prototype-based approach.arXiv preprint arXiv:2404.00418, 2024

work page arXiv 2024
[14]

Online con- tinual learning in image classification: An empirical survey.Neurocomputing, 469:28–51, 2022

Zheda Mai, Ruiwen Li, Jihwan Jeong, David Quispe, Hyunwoo Kim, and Scott Sanner. Online con- tinual learning in image classification: An empirical survey.Neurocomputing, 469:28–51, 2022

2022
[15]

Online continual learning with maximal interfered retrieval.Advances in neural information processing systems, 32, 2019

Rahaf Aljundi, Eugene Belilovsky, Tinne Tuytelaars, Laurent Charlin, Massimo Caccia, Min Lin, and Lucas Page-Caccia. Online continual learning with maximal interfered retrieval.Advances in neural information processing systems, 32, 2019

2019
[16]

Real-time evaluation in online continual learn- ing: A new hope

Yasir Ghunaim, Adel Bibi, Kumail Alhamoud, Motasem Alfarra, Hasan Abed Al Kader Hammoud, Ameya Prabhu, Philip HS Torr, and Bernard Ghanem. Real-time evaluation in online continual learn- ing: A new hope. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 11888–11897, 2023

2023
[17]

Lifelearner: Hardware-aware meta continual learning system for embedded computing platforms.arXiv preprint arXiv:2311.11420, 2023

Young D Kwon, Jagmohan Chauhan, Hong Jia, Stylianos I Venieris, and Cecilia Mascolo. Lifelearner: Hardware-aware meta continual learning system for embedded computing platforms.arXiv preprint arXiv:2311.11420, 2023

work page arXiv 2023
[18]

Ekya: Continuous learning of video analytics models on edge compute servers

Romil Bhardwaj, Zhengxu Xia, Ganesh Ananthanarayanan, Junchen Jiang, Yuanchao Shu, Nikolaos Karianakis, Kevin Hsieh, Paramvir Bahl, and Ion Stoica. Ekya: Continuous learning of video analytics models on edge compute servers. In19th USENIX Symposium on Networked Systems Design and Implementation (NSDI 22), pages 119–135, 2022

2022
[19]

Latent replay for real-time continual learning

Lorenzo Pellegrini, Gabriele Graffieti, Vincenzo Lomonaco, and Davide Maltoni. Latent replay for real-time continual learning. In2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 10203–10209. IEEE, 2020

2020
[20]

Saunders Philadelphia, 1971

Eugene Pleasants Odum, Gary W Barrett, et al.Fundamentals of ecology, volume 3. Saunders Philadelphia, 1971

1971
[21]

Avalanche: A pytorch library for deep continual learning.Journal of Machine Learning Research, 24(363):1–6, 2023

Antonio Carta, Lorenzo Pellegrini, Andrea Cossu, Hamed Hemati, and Vincenzo Lomonaco. Avalanche: A pytorch library for deep continual learning.Journal of Machine Learning Research, 24(363):1–6, 2023. URLhttp://jmlr.org/papers/v24/23-0130.html

2023
[22]

Hayes, Matthias De Lange, Marc Masana, Jary Pomponi, Gido van de Ven, Martin Mundt, Qi She, Keiland Cooper, Jeremy Forest, Eden Belouadah, Simone Calderara, German I

Vincenzo Lomonaco, Lorenzo Pellegrini, Andrea Cossu, Antonio Carta, Gabriele Graffieti, Tyler L. Hayes, Matthias De Lange, Marc Masana, Jary Pomponi, Gido van de Ven, Martin Mundt, Qi She, Keiland Cooper, Jeremy Forest, Eden Belouadah, Simone Calderara, German I. Parisi, Fabio Cuz- zolin, Andreas Tolias, Simone Scardapane, Luca Antiga, Subutai Amhad, Adri...

2021
[23]

Learning multiple layers of features from tiny images

Alex Krizhevsky, Geoffrey Hinton, et al. Learning multiple layers of features from tiny images. 2009

2009
[24]

Fine-grained continual learning.arXiv preprint arXiv:1907.03799, 1, 2019

Vincenzo Lomonaco, Davide Maltoni, Lorenzo Pellegrini, et al. Fine-grained continual learning.arXiv preprint arXiv:1907.03799, 1, 2019. 20

work page arXiv 1907
[25]

Core50: a new dataset and benchmark for continual object recogni- tion

V Lomanco and Davide Maltoni. Core50: a new dataset and benchmark for continual object recogni- tion. InProceedings of the 1st Annual Conference on Robot Learning, pages 17–26, 2017

2017
[26]

A procedural world generation framework for systematic evaluation of continual learning.arXiv preprint arXiv:2106.02585, 2021

Timm Hess, Martin Mundt, Iuliia Pliushch, and Visvanathan Ramesh. A procedural world generation framework for systematic evaluation of continual learning.arXiv preprint arXiv:2106.02585, 2021

work page arXiv 2021
[27]

Jetson agx xavier.https://developer.nvidia.com/embedded/ jetson-agx-xavier, 2020

NVIDIA. Jetson agx xavier.https://developer.nvidia.com/embedded/ jetson-agx-xavier, 2020

2020
[28]

Jetson agx orin.https://www.nvidia.com/en-us/autonomous-machines/ embedded-systems/jetson-orin/, 2022

NVIDIA. Jetson agx orin.https://www.nvidia.com/en-us/autonomous-machines/ embedded-systems/jetson-orin/, 2022

2022
[29]

Person re-identification for robot person following with online continual learning.IEEE Robotics and Automation Letters, 2024

Hanjing Ye, Jieting Zhao, Yu Zhan, Weinan Chen, Li He, and Hong Zhang. Person re-identification for robot person following with online continual learning.IEEE Robotics and Automation Letters, 2024

2024
[30]

Incremental learning of retrievable skills for efficient continual task adaptation.Advances in Neural Information Processing Systems, 37:17286–17312, 2024

Daehee Lee, Minjong Yoo, Woo Kyung Kim, Wonje Choi, and Honguk Woo. Incremental learning of retrievable skills for efficient continual task adaptation.Advances in Neural Information Processing Systems, 37:17286–17312, 2024

2024
[31]

Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour

Priya Goyal, Piotr Dollár, Ross Girshick, Pieter Noordhuis, Lukasz Wesolowski, Aapo Kyrola, Andrew Tulloch, Yangqing Jia, and Kaiming He. Accurate, large minibatch sgd: Training imagenet in 1 hour. arXiv preprint arXiv:1706.02677, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017
[32]

Autoware on board: Enabling autonomous vehicles with embedded systems

Shinpei Kato, Shota Tokunaga, Yuya Maruyama, Seiya Maeda, Manato Hirabayashi, Yuki Kitsukawa, Abraham Monrroy, Tomohito Ando, Yusuke Fujii, and Takuya Azumi. Autoware on board: Enabling autonomous vehicles with embedded systems. In2018 ACM/IEEE 9th International Conference on Cyber-Physical Systems (ICCPS), pages 287–296. IEEE, 2018

2018
[33]

Deep learning for autonomous vehicles

Branislav Kisa ˇcanin. Deep learning for autonomous vehicles. In2017 IEEE 47th International Sym- posium on Multiple-Valued Logic (ISMVL), pages 142–142. IEEE, 2017

2017
[34]

Nvradarnet: Real-time radar obstacle and free space detection for autonomous driving.arXiv preprint arXiv:2209.14499, 2022

Alexander Popov, Patrik Gebhardt, Ke Chen, Ryan Oldja, Heeseok Lee, Shane Murray, Ruchi Bhar- gava, and Nikolai Smolyanskiy. Nvradarnet: Real-time radar obstacle and free space detection for autonomous driving.arXiv preprint arXiv:2209.14499, 2022

work page arXiv 2022
[35]

Duckiebot (db-j).https://get.duckietown.com/products/ duckiebot-db21, 2022

NVIDIA. Duckiebot (db-j).https://get.duckietown.com/products/ duckiebot-db21, 2022

2022
[36]

Sparkfun jetbot ai kit.https://www.sparkfun.com/products/18486, 2022

NVIDIA. Sparkfun jetbot ai kit.https://www.sparkfun.com/products/18486, 2022

2022
[37]

Waveshare jetbot ai kit.https://www.amazon.com/ Waveshare-JetBot-AI-Kit-Accessories/dp/B07V8JL4TF/, 2022

NVIDIA. Waveshare jetbot ai kit.https://www.amazon.com/ Waveshare-JetBot-AI-Kit-Accessories/dp/B07V8JL4TF/, 2022

2022
[38]

Deep residual learning for image recog- nition

Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Deep residual learning for image recog- nition. InProceedings of the IEEE conference on computer vision and pattern recognition, pages 770–778, 2016

2016
[39]

ROBOTIS, 2024

ROBOTIS.TurtleBot3 Overview. ROBOTIS, 2024. URLhttps://emanual.robotis.com/ docs/en/platform/turtlebot3/overview/. Accessed: 15 April 2024

2024
[40]

ImageNet: A large- scale hierarchical image database

Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. Imagenet: A large-scale hierarchical image database. In2009 IEEE Conference on Computer Vision and Pattern Recognition, pages 248–255, 2009. doi: 10.1109/CVPR.2009.5206848. 21

work page doi:10.1109/cvpr.2009.5206848 2009
[41]

Cloc: Confident initial estimation of long-term visual localization using a few sequential images in large- scale spaces.IEEE Sensors Journal, 23(8):8613–8629, 2023

Joohyung Kim, Janghun Hyeon, Hyunga Choi, Bumchul Jang, Bokyeon Jeong, and Nakju Doh. Cloc: Confident initial estimation of long-term visual localization using a few sequential images in large- scale spaces.IEEE Sensors Journal, 23(8):8613–8629, 2023. doi: 10.1109/JSEN.2023.3253872

work page doi:10.1109/jsen.2023.3253872 2023
[42]

Continual variational autoencoder via continual generative knowledge distillation

Fei Ye and Adrian G Bors. Continual variational autoencoder via continual generative knowledge distillation. InProceedings of the AAAI Conference on Artificial Intelligence, volume 37, pages 10918– 10926, 2023

2023
[43]

Continual learning on a diet: Learning from sparsely labeled streams under constrained computation.arXiv preprint arXiv:2404.12766, 2024

Wenxuan Zhang, Youssef Mohamed, Bernard Ghanem, Philip HS Torr, Adel Bibi, and Mohamed Elhoseiny. Continual learning on a diet: Learning from sparsely labeled streams under constrained computation.arXiv preprint arXiv:2404.12766, 2024

work page arXiv 2024
[44]

A unified architecture for accelerating distributed{DNN}training in heterogeneous{GPU/CPU}clusters

Yimin Jiang, Yibo Zhu, Chang Lan, Bairen Yi, Yong Cui, and Chuanxiong Guo. A unified architecture for accelerating distributed{DNN}training in heterogeneous{GPU/CPU}clusters. In14th USENIX Symposium on Operating Systems Design and Implementation (OSDI 20), pages 463–479, 2020

2020
[45]

Varuna: scalable, low-cost training of massive deep learning models

Sanjith Athlur, Nitika Saran, Muthian Sivathanu, Ramachandran Ramjee, and Nipun Kwatra. Varuna: scalable, low-cost training of massive deep learning models. InProceedings of the Seventeenth Euro- pean Conference on Computer Systems, pages 472–487, 2022

2022
[46]

Large scale distributed deep networks.Advances in neural information processing systems, 25, 2012

Jeffrey Dean, Greg Corrado, Rajat Monga, Kai Chen, Matthieu Devin, Mark Mao, Marc’aurelio Ran- zato, Andrew Senior, Paul Tucker, Ke Yang, et al. Large scale distributed deep networks.Advances in neural information processing systems, 25, 2012

2012
[47]

Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism

Mohammad Shoeybi, Mostofa Patwary, Raul Puri, Patrick LeGresley, Jared Casper, and Bryan Catan- zaro. Megatron-lm: Training multi-billion parameter language models using model parallelism.arXiv preprint arXiv:1909.08053, 2019

work page internal anchor Pith review Pith/arXiv arXiv 1909
[48]

Pipefisher: Efficient training of large language models using pipelining and fisher information matrices.Proceedings of Machine Learning and Systems, 5, 2023

Kazuki Osawa, Shigang Li, and Torsten Hoefler. Pipefisher: Efficient training of large language models using pipelining and fisher information matrices.Proceedings of Machine Learning and Systems, 5, 2023

2023
[49]

Design and implemen- tation of a lightweight on-device ai-based real-time fault diagnosis system using continual learning

Youngjun Kim, Taewan Kim, Suhyun Kim, Seongjae Lee, and Taehyoun Kim. Design and implemen- tation of a lightweight on-device ai-based real-time fault diagnosis system using continual learning. IEMEK Journal of Embedded Systems and Applications, 19(3):151–158, 2024

2024
[50]

Contauth: Continual learning framework for behavioral-based user authentication.Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 4(4):1–23, 2020

Jagmohan Chauhan, Young D Kwon, Pan Hui, and Cecilia Mascolo. Contauth: Continual learning framework for behavioral-based user authentication.Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 4(4):1–23, 2020

2020
[51]

Continual learning for on-device speech recognition using disentangled con- formers

Anuj Diwan, Ching-Feng Yeh, Wei-Ning Hsu, Paden Tomasello, Eunsol Choi, David Harwath, and Abdelrahman Mohamed. Continual learning for on-device speech recognition using disentangled con- formers. InICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Pro- cessing (ICASSP), pages 1–5. IEEE, 2023

2023
[52]

Continual learning at the edge: Real-time training on smartphone devices.arXiv preprint arXiv:2105.13127, 2021

Lorenzo Pellegrini, Vincenzo Lomonaco, Gabriele Graffieti, and Davide Maltoni. Continual learning at the edge: Real-time training on smartphone devices.arXiv preprint arXiv:2105.13127, 2021

work page arXiv 2021
[53]

Memory-efficient domain incremental learning for in- ternet of things

Yuqing Zhao, Divya Saxena, and Jiannong Cao. Memory-efficient domain incremental learning for in- ternet of things. InProceedings of the 20th ACM Conference on Embedded Networked Sensor Systems, pages 1175–1181, 2022. 22

2022
[54]

Online continual learning for human activity recognition.Pervasive and Mobile Computing, 93:101817, 2023

Martin Schiemer, Lei Fang, Simon Dobson, and Juan Ye. Online continual learning for human activity recognition.Pervasive and Mobile Computing, 93:101817, 2023

2023
[55]

Mistify: Automating{DNN}model porting for{On-Device} inference at the edge

Peizhen Guo, Bo Hu, and Wenjun Hu. Mistify: Automating{DNN}model porting for{On-Device} inference at the edge. In18th USENIX Symposium on Networked Systems Design and Implementation (NSDI 21), pages 705–719, 2021

2021
[56]

Gemel: Model merging for{Memory- Efficient},{Real-Time}video analytics at the edge

Arthi Padmanabhan, Neil Agarwal, Anand Iyer, Ganesh Ananthanarayanan, Yuanchao Shu, Niko- laos Karianakis, Guoqing Harry Xu, and Ravi Netravali. Gemel: Model merging for{Memory- Efficient},{Real-Time}video analytics at the edge. In20th USENIX Symposium on Networked Systems Design and Implementation (NSDI 23), pages 973–994, 2023

2023
[57]

Once-for-all: Train one network and specialize it for efficient deployment.arXiv preprint arXiv:1908.09791,

Han Cai, Chuang Gan, Tianzhe Wang, Zhekai Zhang, and Song Han. Once-for-all: Train one network and specialize it for efficient deployment.arXiv preprint arXiv:1908.09791, 2019

work page arXiv 1908
[58]

Continual prune- and-select: class-incremental learning with specialized subnetworks.Applied Intelligence, 53(14): 17849–17864, 2023

Aleksandr Dekhovich, David MJ Tax, Marcel HF Sluiter, and Miguel A Bessa. Continual prune- and-select: class-incremental learning with specialized subnetworks.Applied Intelligence, 53(14): 17849–17864, 2023

2023
[59]

Esacl: Efficient continual learning of sparse models.arXiv preprint arXiv:2401.05667, 2024

Weijieying Ren and Vasant G Honavar. Esacl: Efficient continual learning of sparse models.arXiv preprint arXiv:2401.05667, 2024

work page arXiv 2024
[60]

Impact patterns of combining model pruning and continual learning on model performance

Xueyang Zhang, Hang Li, Xi Chen, and Xue Liu. Impact patterns of combining model pruning and continual learning on model performance. In2021 IEEE Third International Conference on Cognitive Machine Intelligence (CogMI), pages 27–33. IEEE, 2021

2021
[61]

Continual learn- ing for task-oriented dialogue system with iterative network pruning, expanding and masking.arXiv preprint arXiv:2107.08173, 2021

Binzong Geng, Fajie Yuan, Qiancheng Xu, Ying Shen, Ruifeng Xu, and Min Yang. Continual learn- ing for task-oriented dialogue system with iterative network pruning, expanding and masking.arXiv preprint arXiv:2107.08173, 2021

work page arXiv 2021
[62]

Sparcl: Sparse continual learning on the edge.Advances in Neural Information Processing Systems, 35:20366–20380, 2022

Zifeng Wang, Zheng Zhan, Yifan Gong, Geng Yuan, Wei Niu, Tong Jian, Bin Ren, Stratis Ioannidis, Yanzhi Wang, and Jennifer Dy. Sparcl: Sparse continual learning on the edge.Advances in Neural Information Processing Systems, 35:20366–20380, 2022

2022
[63]

Learn it or leave it: module composition and pruning for continual learning.arXiv preprint arXiv:2406.18708, 2024

Mingyang Wang, Heike Adel, Lukas Lange, Jannik Strötgen, and Hinrich Schütze. Learn it or leave it: module composition and pruning for continual learning.arXiv preprint arXiv:2406.18708, 2024

work page arXiv 2024
[64]

Positcl: Compact continual learning with posit aware quantization

Vedant Karia, Abdullah Zyarah, and Dhireesha Kudithipudi. Positcl: Compact continual learning with posit aware quantization. InProceedings of the Great Lakes Symposium on VLSI 2024, pages 645–650, 2024

2024
[65]

Continual learning via bit-level information preserving

Yujun Shi, Li Yuan, Yunpeng Chen, and Jiashi Feng. Continual learning via bit-level information preserving. InProceedings of the IEEE/CVF conference on Computer Vision and Pattern Recognition, pages 16674–16683, 2021

2021
[66]

A tinyml platform for on-device continual learning with quantized latent replays.IEEE Journal on Emerging and Selected Topics in Circuits and Systems, 11(4):789–802, 2021

Leonardo Ravaglia, Manuele Rusci, Davide Nadalini, Alessandro Capotondi, Francesco Conti, and Luca Benini. A tinyml platform for on-device continual learning with quantized latent replays.IEEE Journal on Emerging and Selected Topics in Circuits and Systems, 11(4):789–802, 2021

2021
[67]

Andreas Griewank and Andrea Walther. Algorithm 799: revolve: an implementation of checkpointing for the reverse or adjoint mode of computational differentiation.ACM Transactions on Mathematical Software (TOMS), 26(1):19–45, 2000. 23

2000
[68]

Cost- effective on-device continual learning over memory hierarchy with miro

Xinyue Ma, Suyeon Jeong, Minjia Zhang, Di Wang, Jonghyun Choi, and Myeongjae Jeon. Cost- effective on-device continual learning over memory hierarchy with miro. InProceedings of the 29th Annual International Conference on Mobile Computing and Networking, pages 1–15, 2023

2023
[69]

Gpipe: Efficient training of giant neural networks using pipeline parallelism.Advances in neural information processing systems, 32, 2019

Yanping Huang, Youlong Cheng, Ankur Bapna, Orhan Firat, Dehao Chen, Mia Chen, HyoukJoong Lee, Jiquan Ngiam, Quoc V Le, Yonghui Wu, et al. Gpipe: Efficient training of giant neural networks using pipeline parallelism.Advances in neural information processing systems, 32, 2019

2019
[70]

Melon: Breaking the memory wall for resource-efficient on-device machine learning

Qipeng Wang, Mengwei Xu, Chao Jin, Xinran Dong, Jinliang Yuan, Xin Jin, Gang Huang, Yunxin Liu, and Xuanzhe Liu. Melon: Breaking the memory wall for resource-efficient on-device machine learning. InProceedings of the 20th Annual International Conference on Mobile Systems, Applications and Services, pages 450–463, 2022

2022
[71]

Training Deep Nets with Sublinear Memory Cost

Tianqi Chen, Bing Xu, Chiyuan Zhang, and Carlos Guestrin. Training deep nets with sublinear memory cost.arXiv preprint arXiv:1604.06174, 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016
[72]

E2-train: Training state-of-the-art cnns with over 80% energy savings.Advances in Neural Information Processing Systems, 32, 2019

Yue Wang, Ziyu Jiang, Xiaohan Chen, Pengfei Xu, Yang Zhao, Yingyan Lin, and Zhangyang Wang. E2-train: Training state-of-the-art cnns with over 80% energy savings.Advances in Neural Information Processing Systems, 32, 2019

2019
[73]

Selective freezing for efficient continual learning

Amelia Sorrenti, Giovanni Bellitto, Federica Proietto Salanitri, Matteo Pennisi, Concetto Spamp- inato, and Simone Palazzo. Selective freezing for efficient continual learning. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 3550–3559, 2023

2023
[74]

Egeria: Efficient dnn training with knowledge-guided layer freezing

Yiding Wang, Decang Sun, Kai Chen, Fan Lai, and Mosharaf Chowdhury. Egeria: Efficient dnn training with knowledge-guided layer freezing. InProceedings of the Eighteenth European Conference on Computer Systems, pages 851–866, 2023

2023
[75]

Sequel: A continual learning library in pytorch and jax.arXiv preprint arXiv:2304.10857, 2023

Nikolaos Dimitriadis, Francois Fleuret, and Pascal Frossard. Sequel: A continual learning library in pytorch and jax.arXiv preprint arXiv:2304.10857, 2023

work page arXiv 2023
[76]

Continuum: Simple management of complex continual learn- ing scenarios.arXiv preprint arXiv:2102.06253, 2021

Arthur Douillard and Timothée Lesort. Continuum: Simple management of complex continual learn- ing scenarios.arXiv preprint arXiv:2102.06253, 2021

work page arXiv 2021
[77]

Sequoia: A software framework to unify continual learning research.arXiv preprint arXiv:2108.01005, 2021

Fabrice Normandin, Florian Golemo, Oleksiy Ostapenko, Pau Rodriguez, Matthew D Riemer, Julio Hurtado, Khimya Khetarpal, Ryan Lindeborg, Lucas Cecchi, Timothée Lesort, et al. Sequoia: A software framework to unify continual learning research.arXiv preprint arXiv:2108.01005, 2021

work page arXiv 2021
[78]

icarl: In- cremental classifier and representation learning

Sylvestre-Alvise Rebuffi, Alexander Kolesnikov, Georg Sperl, and Christoph H Lampert. icarl: In- cremental classifier and representation learning. InProceedings of the IEEE conference on Computer Vision and Pattern Recognition, pages 2001–2010, 2017. 24

2001

[1] [1]

Continual learning with tiny episodic memories

Arslan Chaudhry, Marcus Rohrbach, Mohamed Elhoseiny, Thalaiyasingam Ajanthan, P Dokania, P Torr, and M Ranzato. Continual learning with tiny episodic memories. InWorkshop on Multi-Task and Lifelong Reinforcement Learning, 2019

2019

[2] [2]

Computationally budgeted continual learning: What does matter? In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 3698– 3707, 2023

Ameya Prabhu, Hasan Abed Al Kader Hammoud, Puneet K Dokania, Philip HS Torr, Ser-Nam Lim, Bernard Ghanem, and Adel Bibi. Computationally budgeted continual learning: What does matter? In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 3698– 3707, 2023

2023

[3] [3]

Gradient episodic memory for continual learning.Ad- vances in neural information processing systems, 30, 2017

David Lopez-Paz and Marc’Aurelio Ranzato. Gradient episodic memory for continual learning.Ad- vances in neural information processing systems, 30, 2017

2017

[4] [4]

Efficient Lifelong Learning with A-GEM

Arslan Chaudhry, Marc’Aurelio Ranzato, Marcus Rohrbach, and Mohamed Elhoseiny. Efficient life- long learning with a-gem.arXiv preprint arXiv:1812.00420, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018

[5] [5]

Gradient based sample selection for online continual learning.Advances in neural information processing systems, 32, 2019

Rahaf Aljundi, Min Lin, Baptiste Goujaud, and Yoshua Bengio. Gradient based sample selection for online continual learning.Advances in neural information processing systems, 32, 2019

2019

[6] [6]

Overcoming catastrophic forgetting in neural networks.Proceedings of the national academy of sciences, 114(13): 3521–3526, 2017

James Kirkpatrick, Razvan Pascanu, Neil Rabinowitz, Joel Veness, Guillaume Desjardins, Andrei A Rusu, Kieran Milan, John Quan, Tiago Ramalho, Agnieszka Grabska-Barwinska, et al. Overcoming catastrophic forgetting in neural networks.Proceedings of the national academy of sciences, 114(13): 3521–3526, 2017

2017

[7] [7]

Learning without forgetting.IEEE transactions on pattern analysis and machine intelligence, 40(12):2935–2947, 2017

Zhizhong Li and Derek Hoiem. Learning without forgetting.IEEE transactions on pattern analysis and machine intelligence, 40(12):2935–2947, 2017

2017

[8] [8]

A comprehensive empirical evaluation on online continual learn- ing

Albin Soutif-Cormerais, Antonio Carta, Andrea Cossu, Julio Hurtado, Vincenzo Lomonaco, Joost Van de Weijer, and Hamed Hemati. A comprehensive empirical evaluation on online continual learn- ing. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 3518–3528, 2023

2023

[9] [9]

Online active continual learning for robotic lifelong object recognition.IEEE Transactions on Neural Networks and Learning Systems, 2023

Xiangli Nie, Zhiguang Deng, Mingdong He, Mingyu Fan, and Zheng Tang. Online active continual learning for robotic lifelong object recognition.IEEE Transactions on Neural Networks and Learning Systems, 2023

2023

[10] [10]

Covio: Online continual learning for visual-inertial odometry

Niclas Vödisch, Daniele Cattaneo, Wolfram Burgard, and Abhinav Valada. Covio: Online continual learning for visual-inertial odometry. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 2464–2473, 2023. 19

2023

[11] [11]

Codeps: Online continual learning for depth estimation and panoptic segmentation.arXiv preprint arXiv:2303.10147, 2023

Niclas Vödisch, Kürsat Petek, Wolfram Burgard, and Abhinav Valada. Codeps: Online continual learning for depth estimation and panoptic segmentation.arXiv preprint arXiv:2303.10147, 2023

work page arXiv 2023

[12] [12]

From continual learning to causal discovery in robotics

Luca Castri, Sariah Mghames, and Nicola Bellotto. From continual learning to causal discovery in robotics. InAAAI Bridge Program on Continual Causality, pages 85–91. PMLR, 2023

2023

[13] [13]

Continual learning for au- tonomous robots: A prototype-based approach.arXiv preprint arXiv:2404.00418, 2024

Elvin Hajizada, Balachandran Swaminathan, and Yulia Sandamirskaya. Continual learning for au- tonomous robots: A prototype-based approach.arXiv preprint arXiv:2404.00418, 2024

work page arXiv 2024

[14] [14]

Online con- tinual learning in image classification: An empirical survey.Neurocomputing, 469:28–51, 2022

Zheda Mai, Ruiwen Li, Jihwan Jeong, David Quispe, Hyunwoo Kim, and Scott Sanner. Online con- tinual learning in image classification: An empirical survey.Neurocomputing, 469:28–51, 2022

2022

[15] [15]

Online continual learning with maximal interfered retrieval.Advances in neural information processing systems, 32, 2019

Rahaf Aljundi, Eugene Belilovsky, Tinne Tuytelaars, Laurent Charlin, Massimo Caccia, Min Lin, and Lucas Page-Caccia. Online continual learning with maximal interfered retrieval.Advances in neural information processing systems, 32, 2019

2019

[16] [16]

Real-time evaluation in online continual learn- ing: A new hope

Yasir Ghunaim, Adel Bibi, Kumail Alhamoud, Motasem Alfarra, Hasan Abed Al Kader Hammoud, Ameya Prabhu, Philip HS Torr, and Bernard Ghanem. Real-time evaluation in online continual learn- ing: A new hope. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 11888–11897, 2023

2023

[17] [17]

Lifelearner: Hardware-aware meta continual learning system for embedded computing platforms.arXiv preprint arXiv:2311.11420, 2023

Young D Kwon, Jagmohan Chauhan, Hong Jia, Stylianos I Venieris, and Cecilia Mascolo. Lifelearner: Hardware-aware meta continual learning system for embedded computing platforms.arXiv preprint arXiv:2311.11420, 2023

work page arXiv 2023

[18] [18]

Ekya: Continuous learning of video analytics models on edge compute servers

Romil Bhardwaj, Zhengxu Xia, Ganesh Ananthanarayanan, Junchen Jiang, Yuanchao Shu, Nikolaos Karianakis, Kevin Hsieh, Paramvir Bahl, and Ion Stoica. Ekya: Continuous learning of video analytics models on edge compute servers. In19th USENIX Symposium on Networked Systems Design and Implementation (NSDI 22), pages 119–135, 2022

2022

[19] [19]

Latent replay for real-time continual learning

Lorenzo Pellegrini, Gabriele Graffieti, Vincenzo Lomonaco, and Davide Maltoni. Latent replay for real-time continual learning. In2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 10203–10209. IEEE, 2020

2020

[20] [20]

Saunders Philadelphia, 1971

Eugene Pleasants Odum, Gary W Barrett, et al.Fundamentals of ecology, volume 3. Saunders Philadelphia, 1971

1971

[21] [21]

Avalanche: A pytorch library for deep continual learning.Journal of Machine Learning Research, 24(363):1–6, 2023

Antonio Carta, Lorenzo Pellegrini, Andrea Cossu, Hamed Hemati, and Vincenzo Lomonaco. Avalanche: A pytorch library for deep continual learning.Journal of Machine Learning Research, 24(363):1–6, 2023. URLhttp://jmlr.org/papers/v24/23-0130.html

2023

[22] [22]

Hayes, Matthias De Lange, Marc Masana, Jary Pomponi, Gido van de Ven, Martin Mundt, Qi She, Keiland Cooper, Jeremy Forest, Eden Belouadah, Simone Calderara, German I

Vincenzo Lomonaco, Lorenzo Pellegrini, Andrea Cossu, Antonio Carta, Gabriele Graffieti, Tyler L. Hayes, Matthias De Lange, Marc Masana, Jary Pomponi, Gido van de Ven, Martin Mundt, Qi She, Keiland Cooper, Jeremy Forest, Eden Belouadah, Simone Calderara, German I. Parisi, Fabio Cuz- zolin, Andreas Tolias, Simone Scardapane, Luca Antiga, Subutai Amhad, Adri...

2021

[23] [23]

Learning multiple layers of features from tiny images

Alex Krizhevsky, Geoffrey Hinton, et al. Learning multiple layers of features from tiny images. 2009

2009

[24] [24]

Fine-grained continual learning.arXiv preprint arXiv:1907.03799, 1, 2019

Vincenzo Lomonaco, Davide Maltoni, Lorenzo Pellegrini, et al. Fine-grained continual learning.arXiv preprint arXiv:1907.03799, 1, 2019. 20

work page arXiv 1907

[25] [25]

Core50: a new dataset and benchmark for continual object recogni- tion

V Lomanco and Davide Maltoni. Core50: a new dataset and benchmark for continual object recogni- tion. InProceedings of the 1st Annual Conference on Robot Learning, pages 17–26, 2017

2017

[26] [26]

A procedural world generation framework for systematic evaluation of continual learning.arXiv preprint arXiv:2106.02585, 2021

Timm Hess, Martin Mundt, Iuliia Pliushch, and Visvanathan Ramesh. A procedural world generation framework for systematic evaluation of continual learning.arXiv preprint arXiv:2106.02585, 2021

work page arXiv 2021

[27] [27]

Jetson agx xavier.https://developer.nvidia.com/embedded/ jetson-agx-xavier, 2020

NVIDIA. Jetson agx xavier.https://developer.nvidia.com/embedded/ jetson-agx-xavier, 2020

2020

[28] [28]

Jetson agx orin.https://www.nvidia.com/en-us/autonomous-machines/ embedded-systems/jetson-orin/, 2022

NVIDIA. Jetson agx orin.https://www.nvidia.com/en-us/autonomous-machines/ embedded-systems/jetson-orin/, 2022

2022

[29] [29]

Person re-identification for robot person following with online continual learning.IEEE Robotics and Automation Letters, 2024

Hanjing Ye, Jieting Zhao, Yu Zhan, Weinan Chen, Li He, and Hong Zhang. Person re-identification for robot person following with online continual learning.IEEE Robotics and Automation Letters, 2024

2024

[30] [30]

Incremental learning of retrievable skills for efficient continual task adaptation.Advances in Neural Information Processing Systems, 37:17286–17312, 2024

Daehee Lee, Minjong Yoo, Woo Kyung Kim, Wonje Choi, and Honguk Woo. Incremental learning of retrievable skills for efficient continual task adaptation.Advances in Neural Information Processing Systems, 37:17286–17312, 2024

2024

[31] [31]

Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour

Priya Goyal, Piotr Dollár, Ross Girshick, Pieter Noordhuis, Lukasz Wesolowski, Aapo Kyrola, Andrew Tulloch, Yangqing Jia, and Kaiming He. Accurate, large minibatch sgd: Training imagenet in 1 hour. arXiv preprint arXiv:1706.02677, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017

[32] [32]

Autoware on board: Enabling autonomous vehicles with embedded systems

Shinpei Kato, Shota Tokunaga, Yuya Maruyama, Seiya Maeda, Manato Hirabayashi, Yuki Kitsukawa, Abraham Monrroy, Tomohito Ando, Yusuke Fujii, and Takuya Azumi. Autoware on board: Enabling autonomous vehicles with embedded systems. In2018 ACM/IEEE 9th International Conference on Cyber-Physical Systems (ICCPS), pages 287–296. IEEE, 2018

2018

[33] [33]

Deep learning for autonomous vehicles

Branislav Kisa ˇcanin. Deep learning for autonomous vehicles. In2017 IEEE 47th International Sym- posium on Multiple-Valued Logic (ISMVL), pages 142–142. IEEE, 2017

2017

[34] [34]

Nvradarnet: Real-time radar obstacle and free space detection for autonomous driving.arXiv preprint arXiv:2209.14499, 2022

Alexander Popov, Patrik Gebhardt, Ke Chen, Ryan Oldja, Heeseok Lee, Shane Murray, Ruchi Bhar- gava, and Nikolai Smolyanskiy. Nvradarnet: Real-time radar obstacle and free space detection for autonomous driving.arXiv preprint arXiv:2209.14499, 2022

work page arXiv 2022

[35] [35]

Duckiebot (db-j).https://get.duckietown.com/products/ duckiebot-db21, 2022

NVIDIA. Duckiebot (db-j).https://get.duckietown.com/products/ duckiebot-db21, 2022

2022

[36] [36]

Sparkfun jetbot ai kit.https://www.sparkfun.com/products/18486, 2022

NVIDIA. Sparkfun jetbot ai kit.https://www.sparkfun.com/products/18486, 2022

2022

[37] [37]

Waveshare jetbot ai kit.https://www.amazon.com/ Waveshare-JetBot-AI-Kit-Accessories/dp/B07V8JL4TF/, 2022

NVIDIA. Waveshare jetbot ai kit.https://www.amazon.com/ Waveshare-JetBot-AI-Kit-Accessories/dp/B07V8JL4TF/, 2022

2022

[38] [38]

Deep residual learning for image recog- nition

Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Deep residual learning for image recog- nition. InProceedings of the IEEE conference on computer vision and pattern recognition, pages 770–778, 2016

2016

[39] [39]

ROBOTIS, 2024

ROBOTIS.TurtleBot3 Overview. ROBOTIS, 2024. URLhttps://emanual.robotis.com/ docs/en/platform/turtlebot3/overview/. Accessed: 15 April 2024

2024

[40] [40]

ImageNet: A large- scale hierarchical image database

Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. Imagenet: A large-scale hierarchical image database. In2009 IEEE Conference on Computer Vision and Pattern Recognition, pages 248–255, 2009. doi: 10.1109/CVPR.2009.5206848. 21

work page doi:10.1109/cvpr.2009.5206848 2009

[41] [41]

Cloc: Confident initial estimation of long-term visual localization using a few sequential images in large- scale spaces.IEEE Sensors Journal, 23(8):8613–8629, 2023

Joohyung Kim, Janghun Hyeon, Hyunga Choi, Bumchul Jang, Bokyeon Jeong, and Nakju Doh. Cloc: Confident initial estimation of long-term visual localization using a few sequential images in large- scale spaces.IEEE Sensors Journal, 23(8):8613–8629, 2023. doi: 10.1109/JSEN.2023.3253872

work page doi:10.1109/jsen.2023.3253872 2023

[42] [42]

Continual variational autoencoder via continual generative knowledge distillation

Fei Ye and Adrian G Bors. Continual variational autoencoder via continual generative knowledge distillation. InProceedings of the AAAI Conference on Artificial Intelligence, volume 37, pages 10918– 10926, 2023

2023

[43] [43]

Continual learning on a diet: Learning from sparsely labeled streams under constrained computation.arXiv preprint arXiv:2404.12766, 2024

Wenxuan Zhang, Youssef Mohamed, Bernard Ghanem, Philip HS Torr, Adel Bibi, and Mohamed Elhoseiny. Continual learning on a diet: Learning from sparsely labeled streams under constrained computation.arXiv preprint arXiv:2404.12766, 2024

work page arXiv 2024

[44] [44]

A unified architecture for accelerating distributed{DNN}training in heterogeneous{GPU/CPU}clusters

Yimin Jiang, Yibo Zhu, Chang Lan, Bairen Yi, Yong Cui, and Chuanxiong Guo. A unified architecture for accelerating distributed{DNN}training in heterogeneous{GPU/CPU}clusters. In14th USENIX Symposium on Operating Systems Design and Implementation (OSDI 20), pages 463–479, 2020

2020

[45] [45]

Varuna: scalable, low-cost training of massive deep learning models

Sanjith Athlur, Nitika Saran, Muthian Sivathanu, Ramachandran Ramjee, and Nipun Kwatra. Varuna: scalable, low-cost training of massive deep learning models. InProceedings of the Seventeenth Euro- pean Conference on Computer Systems, pages 472–487, 2022

2022

[46] [46]

Large scale distributed deep networks.Advances in neural information processing systems, 25, 2012

Jeffrey Dean, Greg Corrado, Rajat Monga, Kai Chen, Matthieu Devin, Mark Mao, Marc’aurelio Ran- zato, Andrew Senior, Paul Tucker, Ke Yang, et al. Large scale distributed deep networks.Advances in neural information processing systems, 25, 2012

2012

[47] [47]

Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism

Mohammad Shoeybi, Mostofa Patwary, Raul Puri, Patrick LeGresley, Jared Casper, and Bryan Catan- zaro. Megatron-lm: Training multi-billion parameter language models using model parallelism.arXiv preprint arXiv:1909.08053, 2019

work page internal anchor Pith review Pith/arXiv arXiv 1909

[48] [48]

Pipefisher: Efficient training of large language models using pipelining and fisher information matrices.Proceedings of Machine Learning and Systems, 5, 2023

Kazuki Osawa, Shigang Li, and Torsten Hoefler. Pipefisher: Efficient training of large language models using pipelining and fisher information matrices.Proceedings of Machine Learning and Systems, 5, 2023

2023

[49] [49]

Design and implemen- tation of a lightweight on-device ai-based real-time fault diagnosis system using continual learning

Youngjun Kim, Taewan Kim, Suhyun Kim, Seongjae Lee, and Taehyoun Kim. Design and implemen- tation of a lightweight on-device ai-based real-time fault diagnosis system using continual learning. IEMEK Journal of Embedded Systems and Applications, 19(3):151–158, 2024

2024

[50] [50]

Contauth: Continual learning framework for behavioral-based user authentication.Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 4(4):1–23, 2020

Jagmohan Chauhan, Young D Kwon, Pan Hui, and Cecilia Mascolo. Contauth: Continual learning framework for behavioral-based user authentication.Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 4(4):1–23, 2020

2020

[51] [51]

Continual learning for on-device speech recognition using disentangled con- formers

Anuj Diwan, Ching-Feng Yeh, Wei-Ning Hsu, Paden Tomasello, Eunsol Choi, David Harwath, and Abdelrahman Mohamed. Continual learning for on-device speech recognition using disentangled con- formers. InICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Pro- cessing (ICASSP), pages 1–5. IEEE, 2023

2023

[52] [52]

Continual learning at the edge: Real-time training on smartphone devices.arXiv preprint arXiv:2105.13127, 2021

Lorenzo Pellegrini, Vincenzo Lomonaco, Gabriele Graffieti, and Davide Maltoni. Continual learning at the edge: Real-time training on smartphone devices.arXiv preprint arXiv:2105.13127, 2021

work page arXiv 2021

[53] [53]

Memory-efficient domain incremental learning for in- ternet of things

Yuqing Zhao, Divya Saxena, and Jiannong Cao. Memory-efficient domain incremental learning for in- ternet of things. InProceedings of the 20th ACM Conference on Embedded Networked Sensor Systems, pages 1175–1181, 2022. 22

2022

[54] [54]

Online continual learning for human activity recognition.Pervasive and Mobile Computing, 93:101817, 2023

Martin Schiemer, Lei Fang, Simon Dobson, and Juan Ye. Online continual learning for human activity recognition.Pervasive and Mobile Computing, 93:101817, 2023

2023

[55] [55]

Mistify: Automating{DNN}model porting for{On-Device} inference at the edge

Peizhen Guo, Bo Hu, and Wenjun Hu. Mistify: Automating{DNN}model porting for{On-Device} inference at the edge. In18th USENIX Symposium on Networked Systems Design and Implementation (NSDI 21), pages 705–719, 2021

2021

[56] [56]

Gemel: Model merging for{Memory- Efficient},{Real-Time}video analytics at the edge

Arthi Padmanabhan, Neil Agarwal, Anand Iyer, Ganesh Ananthanarayanan, Yuanchao Shu, Niko- laos Karianakis, Guoqing Harry Xu, and Ravi Netravali. Gemel: Model merging for{Memory- Efficient},{Real-Time}video analytics at the edge. In20th USENIX Symposium on Networked Systems Design and Implementation (NSDI 23), pages 973–994, 2023

2023

[57] [57]

Once-for-all: Train one network and specialize it for efficient deployment.arXiv preprint arXiv:1908.09791,

Han Cai, Chuang Gan, Tianzhe Wang, Zhekai Zhang, and Song Han. Once-for-all: Train one network and specialize it for efficient deployment.arXiv preprint arXiv:1908.09791, 2019

work page arXiv 1908

[58] [58]

Continual prune- and-select: class-incremental learning with specialized subnetworks.Applied Intelligence, 53(14): 17849–17864, 2023

Aleksandr Dekhovich, David MJ Tax, Marcel HF Sluiter, and Miguel A Bessa. Continual prune- and-select: class-incremental learning with specialized subnetworks.Applied Intelligence, 53(14): 17849–17864, 2023

2023

[59] [59]

Esacl: Efficient continual learning of sparse models.arXiv preprint arXiv:2401.05667, 2024

Weijieying Ren and Vasant G Honavar. Esacl: Efficient continual learning of sparse models.arXiv preprint arXiv:2401.05667, 2024

work page arXiv 2024

[60] [60]

Impact patterns of combining model pruning and continual learning on model performance

Xueyang Zhang, Hang Li, Xi Chen, and Xue Liu. Impact patterns of combining model pruning and continual learning on model performance. In2021 IEEE Third International Conference on Cognitive Machine Intelligence (CogMI), pages 27–33. IEEE, 2021

2021

[61] [61]

Continual learn- ing for task-oriented dialogue system with iterative network pruning, expanding and masking.arXiv preprint arXiv:2107.08173, 2021

Binzong Geng, Fajie Yuan, Qiancheng Xu, Ying Shen, Ruifeng Xu, and Min Yang. Continual learn- ing for task-oriented dialogue system with iterative network pruning, expanding and masking.arXiv preprint arXiv:2107.08173, 2021

work page arXiv 2021

[62] [62]

Sparcl: Sparse continual learning on the edge.Advances in Neural Information Processing Systems, 35:20366–20380, 2022

Zifeng Wang, Zheng Zhan, Yifan Gong, Geng Yuan, Wei Niu, Tong Jian, Bin Ren, Stratis Ioannidis, Yanzhi Wang, and Jennifer Dy. Sparcl: Sparse continual learning on the edge.Advances in Neural Information Processing Systems, 35:20366–20380, 2022

2022

[63] [63]

Learn it or leave it: module composition and pruning for continual learning.arXiv preprint arXiv:2406.18708, 2024

Mingyang Wang, Heike Adel, Lukas Lange, Jannik Strötgen, and Hinrich Schütze. Learn it or leave it: module composition and pruning for continual learning.arXiv preprint arXiv:2406.18708, 2024

work page arXiv 2024

[64] [64]

Positcl: Compact continual learning with posit aware quantization

Vedant Karia, Abdullah Zyarah, and Dhireesha Kudithipudi. Positcl: Compact continual learning with posit aware quantization. InProceedings of the Great Lakes Symposium on VLSI 2024, pages 645–650, 2024

2024

[65] [65]

Continual learning via bit-level information preserving

Yujun Shi, Li Yuan, Yunpeng Chen, and Jiashi Feng. Continual learning via bit-level information preserving. InProceedings of the IEEE/CVF conference on Computer Vision and Pattern Recognition, pages 16674–16683, 2021

2021

[66] [66]

A tinyml platform for on-device continual learning with quantized latent replays.IEEE Journal on Emerging and Selected Topics in Circuits and Systems, 11(4):789–802, 2021

Leonardo Ravaglia, Manuele Rusci, Davide Nadalini, Alessandro Capotondi, Francesco Conti, and Luca Benini. A tinyml platform for on-device continual learning with quantized latent replays.IEEE Journal on Emerging and Selected Topics in Circuits and Systems, 11(4):789–802, 2021

2021

[67] [67]

Andreas Griewank and Andrea Walther. Algorithm 799: revolve: an implementation of checkpointing for the reverse or adjoint mode of computational differentiation.ACM Transactions on Mathematical Software (TOMS), 26(1):19–45, 2000. 23

2000

[68] [68]

Cost- effective on-device continual learning over memory hierarchy with miro

Xinyue Ma, Suyeon Jeong, Minjia Zhang, Di Wang, Jonghyun Choi, and Myeongjae Jeon. Cost- effective on-device continual learning over memory hierarchy with miro. InProceedings of the 29th Annual International Conference on Mobile Computing and Networking, pages 1–15, 2023

2023

[69] [69]

Gpipe: Efficient training of giant neural networks using pipeline parallelism.Advances in neural information processing systems, 32, 2019

Yanping Huang, Youlong Cheng, Ankur Bapna, Orhan Firat, Dehao Chen, Mia Chen, HyoukJoong Lee, Jiquan Ngiam, Quoc V Le, Yonghui Wu, et al. Gpipe: Efficient training of giant neural networks using pipeline parallelism.Advances in neural information processing systems, 32, 2019

2019

[70] [70]

Melon: Breaking the memory wall for resource-efficient on-device machine learning

Qipeng Wang, Mengwei Xu, Chao Jin, Xinran Dong, Jinliang Yuan, Xin Jin, Gang Huang, Yunxin Liu, and Xuanzhe Liu. Melon: Breaking the memory wall for resource-efficient on-device machine learning. InProceedings of the 20th Annual International Conference on Mobile Systems, Applications and Services, pages 450–463, 2022

2022

[71] [71]

Training Deep Nets with Sublinear Memory Cost

Tianqi Chen, Bing Xu, Chiyuan Zhang, and Carlos Guestrin. Training deep nets with sublinear memory cost.arXiv preprint arXiv:1604.06174, 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016

[72] [72]

E2-train: Training state-of-the-art cnns with over 80% energy savings.Advances in Neural Information Processing Systems, 32, 2019

Yue Wang, Ziyu Jiang, Xiaohan Chen, Pengfei Xu, Yang Zhao, Yingyan Lin, and Zhangyang Wang. E2-train: Training state-of-the-art cnns with over 80% energy savings.Advances in Neural Information Processing Systems, 32, 2019

2019

[73] [73]

Selective freezing for efficient continual learning

Amelia Sorrenti, Giovanni Bellitto, Federica Proietto Salanitri, Matteo Pennisi, Concetto Spamp- inato, and Simone Palazzo. Selective freezing for efficient continual learning. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 3550–3559, 2023

2023

[74] [74]

Egeria: Efficient dnn training with knowledge-guided layer freezing

Yiding Wang, Decang Sun, Kai Chen, Fan Lai, and Mosharaf Chowdhury. Egeria: Efficient dnn training with knowledge-guided layer freezing. InProceedings of the Eighteenth European Conference on Computer Systems, pages 851–866, 2023

2023

[75] [75]

Sequel: A continual learning library in pytorch and jax.arXiv preprint arXiv:2304.10857, 2023

Nikolaos Dimitriadis, Francois Fleuret, and Pascal Frossard. Sequel: A continual learning library in pytorch and jax.arXiv preprint arXiv:2304.10857, 2023

work page arXiv 2023

[76] [76]

Continuum: Simple management of complex continual learn- ing scenarios.arXiv preprint arXiv:2102.06253, 2021

Arthur Douillard and Timothée Lesort. Continuum: Simple management of complex continual learn- ing scenarios.arXiv preprint arXiv:2102.06253, 2021

work page arXiv 2021

[77] [77]

Sequoia: A software framework to unify continual learning research.arXiv preprint arXiv:2108.01005, 2021

Fabrice Normandin, Florian Golemo, Oleksiy Ostapenko, Pau Rodriguez, Matthew D Riemer, Julio Hurtado, Khimya Khetarpal, Ryan Lindeborg, Lucas Cecchi, Timothée Lesort, et al. Sequoia: A software framework to unify continual learning research.arXiv preprint arXiv:2108.01005, 2021

work page arXiv 2021

[78] [78]

icarl: In- cremental classifier and representation learning

Sylvestre-Alvise Rebuffi, Alexander Kolesnikov, Georg Sperl, and Christoph H Lampert. icarl: In- cremental classifier and representation learning. InProceedings of the IEEE conference on Computer Vision and Pattern Recognition, pages 2001–2010, 2017. 24

2001