Month: January 2025
Tenable CEO Amit Yoran dies
Longtime entrepreneur and cybersecurity executive Amit Yoran passed away Friday after a battle with cancer. Cybersecurity company Tenable, where Yoran was CEO and chairman, announced his death in a press release. Before becoming Tenable’s CEO in 2016, he held a number of roles including president of RSA, founding CEO of NetWitness, and CEO of In-Q-Tel. … Read more
PagedAttention and vLLM Explained: What Are They?
Table of Links Abstract and 1 Introduction 2 Background and 2.1 Transformer-Based Large Language Models 2.2 LLM Service & Autoregressive Generation 2.3 Batching Techniques for LLMs 3 Memory Challenges in LLM Serving 3.1 Memory Management in Existing Systems 4 Method and 4.1 PagedAttention 4.2 KV Cache Manager 4.3 Decoding with PagedAttention and vLLM 4.4 Application … Read more
General Model Serving Systems and Memory Optimizations Explained
Table of Links Abstract and 1 Introduction 2 Background and 2.1 Transformer-Based Large Language Models 2.2 LLM Service & Autoregressive Generation 2.3 Batching Techniques for LLMs 3 Memory Challenges in LLM Serving 3.1 Memory Management in Existing Systems 4 Method and 4.1 PagedAttention 4.2 KV Cache Manager 4.3 Decoding with PagedAttention and vLLM 4.4 Application … Read more
Applying the Virtual Memory and Paging Technique: A Discussion
Table of Links Abstract and 1 Introduction 2 Background and 2.1 Transformer-Based Large Language Models 2.2 LLM Service & Autoregressive Generation 2.3 Batching Techniques for LLMs 3 Memory Challenges in LLM Serving 3.1 Memory Management in Existing Systems 4 Method and 4.1 PagedAttention 4.2 KV Cache Manager 4.3 Decoding with PagedAttention and vLLM 4.4 Application … Read more
A Return to Polymathy (2015) [pdf]
Comments
The ‘MicroStrategy of Dogecoin’ Launches DOGE Yield Strategy, Eyes Bitcoin and Solana Expansion
Spirit Blockchain Capital says it is rolling out a yield-bearing strategy for its Dogecoin holdings while considering other treasury assets.
Evaluating vLLM’s Design Choices With Ablation Experiments
Table of Links Abstract and 1 Introduction 2 Background and 2.1 Transformer-Based Large Language Models 2.2 LLM Service & Autoregressive Generation 2.3 Batching Techniques for LLMs 3 Memory Challenges in LLM Serving 3.1 Memory Management in Existing Systems 4 Method and 4.1 PagedAttention 4.2 KV Cache Manager 4.3 Decoding with PagedAttention and vLLM 4.4 Application … Read more
Encryption: Ciphers, Digests, Salt, and IV – What You Need to Know
What Is Encryption? Encryption is a method of turning data into an unusable form that can be made useful only by means of decryption. The purpose is to make data available solely to those who can decrypt it (i.e., make it usable). Typically, data needs to be encrypted to make sure it cannot be obtained … Read more
Lyft will credit NYC riders for congestion fee throughout January
New York City’s congestion pricing is scheduled to take effect Sunday — but for the first month, Lyft said it will be crediting riders who pay the fee. New York’s program, which is supposed to reduce traffic in lower Manhattan while also raising funding for mass transit, was paused by Governor Kathy Hochul in June, … Read more
Twelve South’s travel-friendly Bluetooth dongle is on sale for its best price yet
The AirFly SE might not be the only way to enjoy in-flight entertainment with your own headphones, but it’s one of the most reliable. | Image: Twelve South The Twelve South AirFly SE is one of those gadgets that can make long flights go by just a little faster, allowing you to eschew the shoddy … Read more
$19 trillion in transactions settled on the Bitcoin network in 2024
Bitcoin has a current market capitalization of roughly $1.9 trillion and surpassed silver’s $1.6 trillion market cap in 2024.
How We Implemented a Chatbot Into Our LLM
Table of Links Abstract and 1 Introduction 2 Background and 2.1 Transformer-Based Large Language Models 2.2 LLM Service & Autoregressive Generation 2.3 Batching Techniques for LLMs 3 Memory Challenges in LLM Serving 3.1 Memory Management in Existing Systems 4 Method and 4.1 PagedAttention 4.2 KV Cache Manager 4.3 Decoding with PagedAttention and vLLM 4.4 Application … Read more
Arlo’s monthly subscriptions are going up again
Arlo’s cloud storage subscriptions get another price hike. | Image: Arlo Arlo has once again increased the monthly subscription pricing for its smart home cameras’ Arlo Secure cloud storage plan. The company now charges $9.99 per month (up from $7.99) to store a single camera’s recordings and $19.99 a month (up from $17.99) for unlimited … Read more
The HackerNoon Newsletter: Adaptive Lighting – An Example of HACS (1/4/2025)
How are you, hacker? 🪐 What’s happening in tech today, January 4, 2025? The HackerNoon Newsletter brings the HackerNoon homepage straight to your inbox. On this day, Zuck launched Facebook in his Harvard dorm room in 2004, World’s largest and deepest tunnel was opened in 2010, “Great Society” program aimed to eliminate poverty was launch … Read more
How Effective is vLLM When a Prefix Is Thrown Into the Mix?
Table of Links Abstract and 1 Introduction 2 Background and 2.1 Transformer-Based Large Language Models 2.2 LLM Service & Autoregressive Generation 2.3 Batching Techniques for LLMs 3 Memory Challenges in LLM Serving 3.1 Memory Management in Existing Systems 4 Method and 4.1 PagedAttention 4.2 KV Cache Manager 4.3 Decoding with PagedAttention and vLLM 4.4 Application … Read more
What will this year bring in VC? We asked a few investors
A new year brings with it hope for a better tomorrow — kind of, at least. In the world of venture capital, nothing is quite predictable. The number of firms in the U.S. has taken a sharp dip as risk-averse institutional investors splash money on only the biggest names in Silicon Valley, as reported by … Read more
Trump Wants All Future Bitcoin Mined in US—Is That Even Possible?
Donald Trump will be inaugurated this month—but how likely is the President-elect’s pledge that all Bitcoin will be mined in the States?
This Year, RISC-V Laptops Arrive
Comments
What to expect at CES 2025
Image: Cath Virginia / The Verge It’s time for the biggest tech show of the year. CES 2025 officially kicks off next week, with most of the industry’s biggest names gathering in Las Vegas to announce new products and demonstrate some of the most exciting tech they have coming throughout the year. CES is traditionally … Read more