TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
November 1, 2024