Matrix-vector multiplication implemented in off-the-shelf DRAM for Low-Bit LLMs May 4, 2025 by kamal Comments