Google is collecting troves of data from downgraded Nest thermostats

The second-gen Nest Thermostat launched in 2012. Google officially turned off remote control functionality for early Nest Learning Thermostats last month, but it hasn’t stopped collecting a stream of data from these downgraded devices. After digging into the backend, security researcher Cody Kociemba found that the first- and second-generation Nest Learning Thermostats are still sending … Read more

IGQ-ViT: Instance-Aware Group Quantization for Low-Bit Vision Transformers

Table of Links Abstract and 1. Introduction Related work Method 3.1. Uniform quantizer 3.2. IGQ-ViT 3.3. Group size allocation Experiments 4.1. Implementation details and 4.2. Results 4.3. Discussion Conclusion, Acknowledgements, and References Supplementary Material A. More implementation details B. Compatibility with existing hardwares C. Latency on practical devices D. Application to DETR A. More implementation … Read more

Why Dynamic Grouping Beats Traditional Quantizers for Vision Transformers

Table of Links Abstract and 1. Introduction Related work Method 3.1. Uniform quantizer 3.2. IGQ-ViT 3.3. Group size allocation Experiments 4.1. Implementation details and 4.2. Results 4.3. Discussion Conclusion, Acknowledgements, and References Supplementary Material A. More implementation details B. Compatibility with existing hardwares C. Latency on practical devices D. Application to DETR 4.3. Discussion Comparison … Read more

Instance-Aware Grouped Quantization (IGQ-ViT) Sets New Benchmarks for ViT PTQ

Table of Links Abstract and 1. Introduction Related work Method 3.1. Uniform quantizer 3.2. IGQ-ViT 3.3. Group size allocation Experiments 4.1. Implementation details and 4.2. Results 4.3. Discussion Conclusion, Acknowledgements, and References Supplementary Material A. More implementation details B. Compatibility with existing hardwares C. Latency on practical devices D. Application to DETR 4. Experiments In … Read more

Why Uniform Quantizers Break ViTs

Table of Links Abstract and 1. Introduction Related work Method 3.1. Uniform quantizer 3.2. IGQ-ViT 3.3. Group size allocation Experiments 4.1. Implementation details and 4.2. Results 4.3. Discussion Conclusion, Acknowledgements, and References Supplementary Material A. More implementation details B. Compatibility with existing hardwares C. Latency on practical devices D. Application to DETR 3. Method In … Read more

What Makes Vision Transformers Hard to Quantize?

Table of Links Abstract and 1. Introduction Related work Method 3.1. Uniform quantizer 3.2. IGQ-ViT 3.3. Group size allocation Experiments 4.1. Implementation details and 4.2. Results 4.3. Discussion Conclusion, Acknowledgements, and References Supplementary Material A. More implementation details B. Compatibility with existing hardwares C. Latency on practical devices D. Application to DETR 2. Related work … Read more

Instance-Aware Group Quantization for Vision Transformers

:::info Authors: (1) Jaehyeon Moon, Yonsei University and Articron; (2) Dohyung Kim, Yonsei University; (3) Junyong Cheon, Yonsei University; (4) Bumsub Ham, a Corresponding Author from Yonsei University. ::: Table of Links Abstract and 1. Introduction Related work Method 3.1. Uniform quantizer [3.2. IGQ-ViT]() [3.3. Group size allocation]() Experiments 4.1. Implementation details and 4.2. Results … Read more

ISO 24027 Explained: A Practical Guide to Bias-Free, Ethical, and Compliant AI Systems

In the ever-growing constellation of ISO best-practice standards, ISO 24027:2021 gives AI technical and governance teams tools and techniques to establish whether their AI systems are fair and bias-free. It’s not the typical heavy corporate read, but a rather short and practical best-practice standard. I’ll try to demystify ISO 24027 and break it down in … Read more

Building a Distributed Timer Service at Scale: Handling 100K Timers Per Second

TL;DR We built a distributed timer service capable of handling 100,000 timer creations per second with high precision and at least once delivery guarantees. The architecture separates concerns between a stateless Timer Service API (for CRUD operations) and horizontally scalable Timer Processors (for expiration handling). Workers scan their partitions for soon-to-expire timers (2-3 minute look-ahead … Read more

The Fatal Math Error Killing Every AI Architecture – Including The New Ones

LLMs are already museum pieces Why JEPA, transformers, and LLMs all fail in the same Euclidean flatland, and the toroidal geometry that fixes them Yes, ChatGPT, Claude, Gemini, all of them. Brilliant fossils of a linguistic age that’s already ending. They’re decomposing in public, billions are still being spent to polish their coffins: bigger models, longer … Read more