Where Glitch Tokens Hide: Common Patterns in LLM Tokenizer Vocabularies

Table of Links Abstract and 1. Introduction Methods 2.1 Tokenizer analysis 2.2 Indicators for detecting under-trained tokens and 2.3 Verification of candidate tokens Results 3.1 Effectiveness of indicators and verification 3.2 Common observations 3.3 Model-specific observations Closed-source models Discussion, Acknowledgments, and References A. Verification details B. A short primer on UTF-8 encoding C. Outputs for … Read more

How Many Glitch Tokens Hide in Popular LLMs? Revelations from Large-Scale Testing


Comprehensive Detection of Untrained Tokens in Language Model Tokenizers

:::info Authors: (1) Sander Land, Cohere (sander@cohere.com); (2) Max Bartolo, Cohere (max@cohere.com). ::: Table of Links Abstract and 1. Introduction Methods 2.1 Tokenizer analysis 2.2 Indicators for detecting under-trained tokens and 2.3 Verification of candidate tokens Results 3.1 Effectiveness of indicators and verification 3.2 Common observations 3.3 Model-specific observations Closed-source models Discussion, Acknowledgments, and References … Read more

Ethereum flips Coca-Cola and Alibaba as ETH gains 42% in 5 days

Ether’s market capitalization surged 42% in five days following the successful launch of Ethereum’s Pectra upgrade on its mainnet. On May 12, the company data tracker 8marketcap showed Ether (ETH) surpassing Coca-Cola and Alibaba, ranking as the world’s 39th-largest asset by market capitalization. ETH was trading at about $2,550 at publication time, with a market … Read more

Stablecoins Will Expand Beyond Crypto Trading, Become Part of Mainstream Economy, Citi Predicts

The stablecoin market could soon eclipse the entire crypto trading ecosystem that gave birth to it as regulatory tailwinds allow for the integration of the fixed-value tokens into the mainstream economy, according to predictions from global bank Citi. Above and beyond their role as tokenized cash for the crypto trading community, stablecoins — digital tokens … Read more

What is social engineering in crypto (and how to protect yourself)?

Social engineering in crypto, explained In the world of cryptocurrency, security goes beyond just protecting your wallet with a password or private key. One of the most deceptive and increasingly dangerous threats to crypto users today is social engineering. While you might think of cyberattacks as highly technical affairs, social engineering manipulates the most vulnerable … Read more

Crypto custodian BitGo secures MiCA license in Germany

Goldman Sachs-backed cryptocurrency custody firm BitGo has become the latest cryptocurrency company to secure regulatory approval to operate across the European Union. Germany’s financial regulator, the Federal Financial Supervisory Authority (BaFin), granted BitGo Europe a Markets in Crypto-Assets Regulation (MiCA) license to provide digital asset services in the EU, the firm announced on May 12. … Read more

How Aleph Cloud Helped HyperSwap Mitigate a DDoS Attack and Save Millions in Cryptocurrencies

On the night of May 5th to 6th, HyperSwap was hit by a massive DDoS attack that impacted both their website and application. As HyperSwap was already using some of our cloud solutions, we quickly stepped in to help them migrate their front end and redirect the attack traffic to mitigate its effects. TL;DR Implemented a … Read more

Bitcoin set for $150K BTC price rally as US, China agree to slash tariffs

Key takeaways: Bitcoin broke above $105,700 after the US and China agreed to slash tariffs. A confirmed bull flag breakout on the weekly chart projects a $150,000 target. Bitwise’s sentiment index warns of potential short-term overheating. Bitcoin (BTC) bulls cheered a major development in the ongoing US-China tariff talks, with the cryptocurrency climbing over the $105,700 … Read more

Windsurf Shakes Up AI Coding Tool Market With Generous Free Tier and GPT-4.1 Access

What’s happened: Windsurf’s Offering Revamp In April 2025, Windsurf (formerly Codeium) rolled out significant pricing updates that made waves in the developer community. First, the company made OpenAI’s o4-mini and GPT-4.1 (with a 1-million-token context window) freely available to all users for a two-week period. Immediately after this free trial period, on … Read more

Can you stake Bitcoin (BTC)? Here’s what you need to know

Key takeaways Though Bitcoin doesn’t support native staking, holders can earn yield through centralized lending platforms, Wrapped Bitcoin (WBTC) on Ethereum, and Bitcoin-related networks like Babylon and Stacks. WBTC allows BTC holders to participate in lending, liquidity pools and yield farming on Ethereum-based DeFi platforms like Aave and Curve but introduces bridge and smart contract … Read more

The Top 5 Cybersecurity Podcasts to Look Out for In 2025

Keeping yourself informed on cybersecurity trends isn’t optional; it’s a necessity. But let’s be real, not everyone has the time to sift through lengthy reports or research papers. That’s where podcasts come in. Here are the top 5 cybersecurity podcasts you should begin listening to immediately! Prefer watching instead of reading? Here’s a quick video guide … Read more

US and China slash tariffs

The pause aims to provide time for further trade negotiations. The United States and China have mutually agreed to a 90-day reduction in the tariffs implemented in April, marking a significant attempt to de-escalate the trade war between the world’s two largest economies. The deal was hashed out by US and Chinese officials in Geneva … Read more

Microsoft Is Making a Push For Passwordless

Passwords are ubiquitous, from email and social media to banking and work accounts. But let’s face it: passwords are annoying. They’re easy to forget and not very secure. That’s why Microsoft is taking a significant step by making passwordless authentication the default for all new Microsoft accounts. In this blog post, … Read more

WhatsApp Encrypts Your Messages—But What About the Backups?

End-to-end encryption (E2EE) is usually touted as the best method to secure our messages and information. Signal, WhatsApp, and other apps feature E2EE as a means to guarantee nobody—not hackers, not the app developers, nor even governments—can access your messages. What if there exists a loophole that quietly bypasses this robust security? That loophole is … Read more

AI Has Become My Co-Founder

This article is based on my experience building Linkeme.ai, a platform helping professionals maintain their digital presence through AI-driven content creation and publishing. :::info I am the CEO & founder of Easylab AI and Linkeme.ai, both of which are referenced in this article. ::: When people ask me how I built Linkeme.ai, I usually tell them … Read more

Building Reusable Components in React: Best Practices and Patterns

React is one of the most popular JavaScript frontend libraries. Its component-based architecture helps developers build user-friendly interfaces. React components are flexible, independent, and reusable, which makes the user interface more maintainable and scalable. However, it is up to developers to build components that are clean and … Read more