Using Language Models to Simulate Human Samples: Appendix

:::info Authors: (1) TIMNIT GEBRU, Black in AI; (2) JAMIE MORGENSTERN, University of Washington; (3) BRIANA VECCHIONE, Cornell University; (4) JENNIFER WORTMAN VAUGHAN, Microsoft Research; (5) HANNA WALLACH, Microsoft Research; (6) HAL DAUMÉ III, Microsoft Research; University of Maryland; (7) KATE CRAWFORD, Microsoft Research. ::: Table of Links 1 Introduction 1.1 Objectives 2 Development Process … Read more

Using Language Models to Simulate Human Samples: Acknowledgments and References

:::info Authors: (1) TIMNIT GEBRU, Black in AI; (2) JAMIE MORGENSTERN, University of Washington; (3) BRIANA VECCHIONE, Cornell University; (4) JENNIFER WORTMAN VAUGHAN, Microsoft Research; (5) HANNA WALLACH, Microsoft Research; (6) HAL DAUMÉ III, Microsoft Research; University of Maryland; (7) KATE CRAWFORD, Microsoft Research. ::: Table of Links 1 Introduction 1.1 Objectives 2 Development Process … Read more

Datasheets for Datasets: Impact and Adoption Across Academic and Industry Sectors

:::info Authors: (1) TIMNIT GEBRU, Black in AI; (2) JAMIE MORGENSTERN, University of Washington; (3) BRIANA VECCHIONE, Cornell University; (4) JENNIFER WORTMAN VAUGHAN, Microsoft Research; (5) HANNA WALLACH, Microsoft Research; (6) HAL DAUMÉ III, Microsoft Research; University of Maryland; (7) KATE CRAWFORD, Microsoft Research. ::: Table of Links 1 Introduction 1.1 Objectives 2 Development Process … Read more

Ensuring Dataset Health: Strategies for Effective Maintenance and Support

:::info Authors: (1) TIMNIT GEBRU, Black in AI; (2) JAMIE MORGENSTERN, University of Washington; (3) BRIANA VECCHIONE, Cornell University; (4) JENNIFER WORTMAN VAUGHAN, Microsoft Research; (5) HANNA WALLACH, Microsoft Research; (6) HAL DAUMÉ III, Microsoft Research; University of Maryland; (7) KATE CRAWFORD, Microsoft Research. ::: Table of Links 1 Introduction 1.1 Objectives 2 Development Process … Read more

Guidelines for Sharing AI Datasets Responsibly

:::info Authors: (1) TIMNIT GEBRU, Black in AI; (2) JAMIE MORGENSTERN, University of Washington; (3) BRIANA VECCHIONE, Cornell University; (4) JENNIFER WORTMAN VAUGHAN, Microsoft Research; (5) HANNA WALLACH, Microsoft Research; (6) HAL DAUMÉ III, Microsoft Research; University of Maryland; (7) KATE CRAWFORD, Microsoft Research. ::: Table of Links 1 Introduction 1.1 Objectives 2 Development Process … Read more

Evaluating WikiSP on WikiWebQuestions: The Experiments We Ran

:::info Authors: (1) Silei Xu, Computer Science Department, Stanford University Stanford, CA with equal contribution {silei@cs.stanford.edu}; (2) Shicheng Liu, Computer Science Department, Stanford University Stanford, CA with equal contribution {shicheng@cs.stanford.edu}; (3) Theo Culhane, Computer Science Department, Stanford University Stanford, CA {tculhane@cs.stanford.edu}; (4) Elizaveta Pertseva, Computer Science Department, Stanford University Stanford, CA, {pertseva@cs.stanford.edu}; (5) Meng-Hsi Wu, … Read more

Implementation Details of the Entity Linker and the WikiSP Semantic Parser

:::info Authors: (1) Silei Xu, Computer Science Department, Stanford University Stanford, CA with equal contribution {silei@cs.stanford.edu}; (2) Shicheng Liu, Computer Science Department, Stanford University Stanford, CA with equal contribution {shicheng@cs.stanford.edu}; (3) Theo Culhane, Computer Science Department, Stanford University Stanford, CA {tculhane@cs.stanford.edu}; (4) Elizaveta Pertseva, Computer Science Department, Stanford University Stanford, CA, {pertseva@cs.stanford.edu}; (5) Meng-Hsi Wu, … Read more

WikiWebQuestions (WWQ) Dataset: What Is It?

:::info Authors: (1) Silei Xu, Computer Science Department, Stanford University Stanford, CA with equal contribution {silei@cs.stanford.edu}; (2) Shicheng Liu, Computer Science Department, Stanford University Stanford, CA with equal contribution {shicheng@cs.stanford.edu}; (3) Theo Culhane, Computer Science Department, Stanford University Stanford, CA {tculhane@cs.stanford.edu}; (4) Elizaveta Pertseva, Computer Science Department, Stanford University Stanford, CA, {pertseva@cs.stanford.edu}; (5) Meng-Hsi Wu, … Read more

Unlocking the Secrets of URL: A Journey for All

:::info Ideogram Prompt: Create a pixel-perfect Insta logo on a vibrant mushroom. The typography above mario reads “Beneath the Links” written in contra sega games letters., 3d render, illustration, typography, vibrant, anime ::: Have you ever wondered what happens behind the scenes when you click a link? The Internet is a vast network, and web … Read more

How to Improve the Parallelization of Torch Dataloaders Using Torch.multiprocessing

Introduction PyTorch’s DataLoader (torch.utils.data.Dataloader) is already a useful tool for efficiently loading and preprocessing data for training deep learning models. By default, PyTorch uses a single-worker process (num_workers=0), but users can specify a higher number to leverage parallelism and speed up data loading. However, since it is a general-purpose dataloader, and even though it offers … Read more

What Frontend Devs Want (From Backend Devs)

This one goes to all the backend developers who create APIs for the frontend developers and want their team to succeed, become more productive, and rock everybody’s socks. Here are a few simple things that can decrease your time-to-market or improve other fancy metrics your managers want you to improve. I will tell it from … Read more

The Security and Authenticity of NFTs

Non-fungible tokens represent an exciting new development in the world of digital assets that has a lot of people excited. Among those people are hackers who thus far have been successful in exploiting a range of vulnerabilities. From the very beginning, it sounded suspicious. The auction was offering an NFT from a famous artist – … Read more

2024 Complete Full-Stack Developers Roadmap

Getting into tech may be nutcracking, especially if you don’t know your way about it. Things may be a bit confusing, and most developers wish they could go back in time and start their tech journey from scratch. I have saved you from your future what-ifs by writing this complete Full-Stack Developer road map, which … Read more

Kafka Schema Evolution: A Guide to the Confluent Schema Registry

While applications are producing and consuming messages to and fro Kafka, you’ll notice that new consumers of existing topics start emerging. These new consumers (applications) might have been written by the same engineers who wrote the original producer of those messages or by people you don’t know. The emergence of these consumers is perfectly normal. … Read more

Defeating TLS Fingerprinting: Bypassing Firewall Protection for HTTPS Requests

I was utterly frustrated with scraping HTTP data from a firewall-protected website. Despite using residential proxies from multiple providers, my requests kept getting blocked without any clear reason. Sometimes, the script worked on my local machine, but it would fail when running on a cloud server. After extensive research, I stumbled upon the concept of … Read more

At the Potomac, Where DC, the Analog Political National Capital, and VC, the Digital Capital, Meet

By Jeff Garzik and Ralph Benko It may seem paradoxical. But sometimes shortages can be a shortcut to abundance. Economists call that “a forcing function,” explained by the Interaction Design Foundation as “an aspect of a design that prevents the user from taking an action without consciously considering information relevant to that action. It forces … Read more

Polkadot Allocates $14.4M to Support DeFi Growth through Hydration Project

Polkadot has allocated 2 million DOT tokens, equivalent to $14.4 million, to its leading DeFi project, Hydration. The funds are aimed at enhancing liquidity and trading efficiency on Hydration’s single-sided liquidity provisioning platform, Omnipool. The allocation of the DOT tokens will be split into two parts. One million DOT will be used over the course … Read more

New DeFi Platform Jellyverse Launches on Sei Blockchain, Introducing DeFi 3.0 Features

Jellyverse, a decentralized finance (DeFi) platform, has launched its ecosystem on the Sei blockchain, introducing a suite of DeFi tools and services. The platform aims to bring DeFi 3.0 capabilities to users by integrating real-world assets and offering enhanced portfolio diversification options. The Jellyverse ecosystem consists of three main components: JellySwap, a decentralized exchange (DEX) … Read more

dWallet Network Teams with Aptos to Revolutionize DeFi and Gaming with Zero Trust

dWallet Network, a pioneer in native multi-chain technology, has announced its expansion onto the Aptos blockchain. This move introduces Zero Trust Protocols (ZTPs) to the DeFi and gaming sectors within Aptos, enhancing multi-chain interactions without the need for traditional bridging or wrapping methods. By leveraging dWallet’s Zero Trust architecture, these protocols will allow for transactions … Read more

Using Laravel Facades for Cleaner, Testable Code

For one reason or another, Laravel Facades don’t get much love. I often read about how they are not a true facade implementation, and let me tell you, they’re not 🤷. They’re more like proxies than facades. If you think about it, they simply forward calls to their respective classes, effectively intercepting the request. When … Read more

How You Like Them Sandwiches?

You probably like sandwiches as a snack, but what about being sandwiched by your competitors on Google Search? Elastic Email (a Mailchimp competitor) has been advertising on Google for months. Google Ads constitute ~40% of their ad spend, so they must get fantastic results there, right? WRONG!! If you Google Elastic Email right now, here’s … Read more

Effortlessly Deploy Django Apps to the Cloud with GitHub Actions and Heroku

Continuous integration and continuous delivery (CI/CD) capabilities are basic expectations for modern development teams who want fast feedback on their changes and rapid deployment to the cloud. In recent years, we’ve seen the growing adoption of GitHub Actions, a feature-rich CI/CD system that dovetails nicely with cloud hosting platforms such as Heroku. In this article, … Read more

The Earned Secret: Why the Era of Serendipitous Success is Over

Hi All! n n Here is my weekly letter on mental models, performance, business, and entrepreneurship. If you love this content (please share it), but also… :::tip Check out my Podcast, connect with me on YouTube / Twitter. You can also subscribe to my DAILY newsletter here. ::: What’s in today’s newsletter? The days of stumbling upon a billion-dollar idea while … Read more

Revisiting Separation of Variables in Hamilton-Jacobi Equations: Conclusion, Statements and More

:::info Author: (1) V V Obukhov, Institute of research and development, Tomsk State Pedagogical University, Kievskaya str. 60, Tomsk 634061, Russia & International laboratory of theoretical cosmology, Tomsk State University of Control Systems and Radioelectronics, Lenina pr. 40, Tomsk 634050, Russia (E-mail: obukhov@tspu.edu.ru); (2) K E Osetrin, Center for Mathematical and Computer Physics, Tomsk State … Read more

Criminal IP Unveils Innovative Fraud Detection Data Products On Snowflake Marketplace

TORRANCE, United States / California, June 10th, 2024, CyberNewsWire/–AI SPERA, a leader in Cyber Threat Intelligence (CTI) solutions, announced that it has started selling its paid threat detection data from its CTI search engine ‘Criminal IP” on the Snowflake Marketplace. Criminal IP is committed to offering advanced cybersecurity solutions through Snowflake, the leading cloud-based data … Read more