August 2024 – Page 49 – Kamal Reader

Police Chief Says Cops Have a 5th Amendment Right to Leave Body Cameras Off

August 25, 2024

San Francisco officials weigh in on departure of X headquarters: ‘Good riddance’

August 25, 2024

Show HN: Z80 Sans

August 25, 2024

Japan’s Real First Console? Bandai’s TV Jack 5000

August 25, 2024

Rollstack (YC W23) Is Hiring TypeScript Engineers – US/Canada (East Coast Only)

August 25, 2024

Deriving the DPO Objective Under the Plackett-Luce Model

August 25, 2024

Authors: (1) Rafael Rafailov, Stanford University (equal contribution; more junior authors listed earlier); (2) Archit Sharma, Stanford University (equal contribution; more junior authors listed earlier); (3) Eric Mitchell, Stanford University (equal contribution; more junior authors listed earlier); (4) Stefano Ermon, CZ Biohub; (5) Christopher D. Manning, Stanford University; (6) Chelsea Finn, …

Deriving the DPO Objective Under the Bradley-Terry Model

August 25, 2024

Authors: (1) Rafael Rafailov, Stanford University (equal contribution; more junior authors listed earlier); (2) Archit Sharma, Stanford University (equal contribution; more junior authors listed earlier); (3) Eric Mitchell, Stanford University (equal contribution; more junior authors listed earlier); (4) Stefano Ermon, CZ Biohub; (5) Christopher D. Manning, Stanford University; (6) Chelsea Finn, …

Deriving the Optimum of the KL-Constrained Reward Maximization Objective

August 25, 2024

Authors: (1) Rafael Rafailov, Stanford University (equal contribution; more junior authors listed earlier); (2) Archit Sharma, Stanford University (equal contribution; more junior authors listed earlier); (3) Eric Mitchell, Stanford University (equal contribution; more junior authors listed earlier); (4) Stefano Ermon, CZ Biohub; (5) Christopher D. Manning, Stanford University; (6) Chelsea Finn, …

Behind the Scenes: The Team Behind DPO

August 25, 2024

Authors: (1) Rafael Rafailov, Stanford University (equal contribution; more junior authors listed earlier); (2) Archit Sharma, Stanford University (equal contribution; more junior authors listed earlier); (3) Eric Mitchell, Stanford University (equal contribution; more junior authors listed earlier); (4) Stefano Ermon, CZ Biohub; (5) Christopher D. Manning, Stanford University; (6) Chelsea Finn, …

GPT-4 vs. Humans: Validating AI Judgment in Language Model Training

August 25, 2024

Authors: (1) Rafael Rafailov, Stanford University (equal contribution; more junior authors listed earlier); (2) Archit Sharma, Stanford University (equal contribution; more junior authors listed earlier); (3) Eric Mitchell, Stanford University (equal contribution; more junior authors listed earlier); (4) Stefano Ermon, CZ Biohub; (5) Christopher D. Manning, Stanford University; (6) Chelsea Finn, …

Theoretical Analysis of Direct Preference Optimization

August 25, 2024

Authors: (1) Rafael Rafailov, Stanford University (equal contribution; more junior authors listed earlier); (2) Archit Sharma, Stanford University (equal contribution; more junior authors listed earlier); (3) Eric Mitchell, Stanford University (equal contribution; more junior authors listed earlier); (4) Stefano Ermon, CZ Biohub; (5) Christopher D. Manning, Stanford University; (6) Chelsea Finn, …

Bypassing the Reward Model: A New RLHF Paradigm

August 25, 2024

Authors: (1) Rafael Rafailov, Stanford University (equal contribution; more junior authors listed earlier); (2) Archit Sharma, Stanford University (equal contribution; more junior authors listed earlier); (3) Eric Mitchell, Stanford University (equal contribution; more junior authors listed earlier); (4) Stefano Ermon, CZ Biohub; (5) Christopher D. Manning, Stanford University; (6) Chelsea Finn, …

How AI Learns from Human Preferences

August 25, 2024

Authors: (1) Rafael Rafailov, Stanford University (equal contribution; more junior authors listed earlier); (2) Archit Sharma, Stanford University (equal contribution; more junior authors listed earlier); (3) Eric Mitchell, Stanford University (equal contribution; more junior authors listed earlier); (4) Stefano Ermon, CZ Biohub; (5) Christopher D. Manning, Stanford University; (6) Chelsea Finn, …

Simplifying AI Training: Direct Preference Optimization vs. Traditional RL

August 25, 2024

Authors: (1) Rafael Rafailov, Stanford University (equal contribution; more junior authors listed earlier); (2) Archit Sharma, Stanford University (equal contribution; more junior authors listed earlier); (3) Eric Mitchell, Stanford University (equal contribution; more junior authors listed earlier); (4) Stefano Ermon, CZ Biohub; (5) Christopher D. Manning, Stanford University; (6) Chelsea Finn, …

Writing a Rust compiler in C

August 25, 2024

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

August 25, 2024

Authors: (1) Rafael Rafailov, Stanford University (equal contribution; more junior authors listed earlier); (2) Archit Sharma, Stanford University (equal contribution; more junior authors listed earlier); (3) Eric Mitchell, Stanford University (equal contribution; more junior authors listed earlier); (4) Stefano Ermon, CZ Biohub; (5) Christopher D. Manning, Stanford University; (6) Chelsea Finn, …

Martin Shkreli must surrender his Wu-Tang album copies

August 25, 2024

Former pharmaceutical executive Martin Shkreli must turn over his copies of the Wu-Tang Clan’s album Once Upon a Time in Shaolin to comply with a preliminary injunction issued by federal Judge Pamela Chen in an ongoing lawsuit, ArtNet reported on Friday. PleasrDAO, an NFT collective and the current owner of Shaolin …

Pi Pico 2 Extreme Teardown

August 25, 2024

This Week in Crypto Games: ‘Catizen’ Airdrop With HashKey, ‘Ragnarok’ Ronin Beta

August 25, 2024

Catch up on this week’s biggest crypto and NFT gaming news and find some weekend reads in our latest roundup.

Database “sharding” came from Ultima Online? (2009)

August 25, 2024

The Power of Attraction: How Beauty Influences Startup Investments

August 25, 2024

Olivetti Programma 101: At the Origins of the Personal Computer

August 25, 2024

Big Pharma claims lower prices means giving up miracle medications. Ignore them

August 25, 2024

MacOS X Malware Development

August 25, 2024

Essays: NSA Surveillance: A Guide to Staying Secure – Schneier on Security

August 25, 2024

Telegram issues official statement on Pavel Durov detention

August 25, 2024

The Telegram team disputes reports that Durov had reason to avoid traveling within Europe.

Forgotten Runiverse Open Beta Billed as ‘Coming Out Party’ for Ethereum Franchise

August 25, 2024

The co-founder of Forgotten Runes believes the Ronin-based MMORPG will help push the project to become a “global franchise.”

Is Telegram really an encrypted messaging app?

August 25, 2024

Stripe Data vs. Open‐Source Alternatives: A MRR Example

August 25, 2024

Telegram abides by EU laws, including the Digital Services Act

August 25, 2024