DA^3: A Distribution-Aware Adversarial Attack against Language Models

Published in The 2024 Conference on Empirical Methods in Natural Language Processing, 2024