This codebase is originally forked from IDSIA/modern-srwm which we modified for continual learning (also including improved practical settings for self-referential weight matrices, e.g., better ...