Lossless LLM compression for efficient GPU inference via dynamic-length float
April 25, 2025 by kamal