General Analysis Blog

General Analysis x Together AI

TLDR: We are excited to announce our partnership with Together AI to stress-test the safety of both open-source and closed language models.

2025-05-06

The Jailbreak Cookbook

We have created a comprehensive overview of the most influential LLM jailbreaking methods.

2025-03-21

Generating Diverse Test Cases with Diversity Transfer from LegalBench

TLDR: We used LegalBench as a diversity source to broaden the range of red teaming questions we generate. We show that diversity transfer from a domain-specific knowledge base is a simple and practical way to build a solid red teaming benchmark.

2025-02-19

Red Teaming GPT-4o: Uncovering Hallucinations in Legal AI Models

In this work, we explore automated red teaming applied to GPT-4o in the legal domain. Using a Llama3 8B model as the attacker, we generate more than 50,000 adversarial questions, over 35% of which cause GPT-4o to hallucinate in its response.

2025-01-23