Understanding Related Research on Tool-Augmented Learning

:::info Authors: (1) Nicholas Farn, Microsoft Corporation {Microsoft Corporation {nifarn@microsoft.com}; (2) Richard Shin, Microsoft Corporation {eush@microsoft.com}. ::: Table of Links Abstract and Intro Dataset Design Evaluation Methodology Experiments and Analysis Related Work Conclusion, Reproducibility, and References A. Complete list of tools B. Scenario Prompt C. Unrealistic Queries D. Nuances comparing prior work 5 RELATED WORK … Read more

Analyzing AI Assistant Performance: Lessons from ToolTalk’s Analysis of GPT-3.5 and GPT-4

:::info Authors: (1) Nicholas Farn, Microsoft Corporation {Microsoft Corporation {nifarn@microsoft.com}; (2) Richard Shin, Microsoft Corporation {eush@microsoft.com}. ::: Table of Links Abstract and Intro Dataset Design Evaluation Methodology Experiments and Analysis Related Work Conclusion, Reproducibility, and References A. Complete list of tools B. Scenario Prompt C. Unrealistic Queries D. Nuances comparing prior work 4 EXPERIMENTS AND … Read more

EU’s ChatGPT taskforce offers first look at detangling the AI chatbot’s privacy compliance

A data protection taskforce that’s spent over a year considering how the European Union’s data protection rulebook applies to OpenAI’s viral chatbot, ChatGPT, reported preliminary conclusions Friday. The top-line takeaway is that the working group of privacy enforcers remains undecided on crux legal issues, such as the lawfulness and fairness of OpenAI’s processing. The issue … Read more

Action vs Non-action Tools: Evaluating AI Assistant Correctness

:::info Authors: (1) Nicholas Farn, Microsoft Corporation {Microsoft Corporation {nifarn@microsoft.com}; (2) Richard Shin, Microsoft Corporation {eush@microsoft.com}. ::: Table of Links Abstract and Intro Dataset Design Evaluation Methodology Experiments and Analysis Related Work Conclusion, Reproducibility, and References A. Complete list of tools B. Scenario Prompt C. Unrealistic Queries D. Nuances comparing prior work 3 EVALUATION METHODOLOGY … Read more

5 days left to get your early-bird Disrupt passes

The countdown to early-bird savings for TechCrunch Disrupt, taking place October 28–30 in San Francisco, continues. You have just five days left to save up to $800 on the price of admission. The window closes on Friday, May 31 at 11:59 p.m. PDT. Grab your tickets today. Find the best early-bird pass for you Why … Read more

LatAm startups: Apply to Startup Battlefield 200

Here’s a shoutout to LatAm early-stage startup founders! We want YOU to apply for the Startup Battlefield 200 at TechCrunch Disrupt 2024. But you’d better hurry — time is running out. The application window closes on June 10 at 11:59 p.m. PDT. LatAm — Join the Startup Battlefield 200 Startup Battlefield 200 (SB 200) is … Read more