Name	Name	Last commit message	Last commit date
parent directory ..
Makefile	Makefile
README.md	README.md
main.py	main.py
requirements.txt	requirements.txt

Name

Last commit message

Last commit date

Basics and requirements

Install uv and llmart, and download/navigate to this folder.

Attacks on diffusion large language models (dLLMs) using `llmart`

This example shows how to use llmart to attack a non-autoregressive language model architecture: a masked discrete diffusion model. The victim model is the publicly available LLaDA-8B-Instruct model.

For more details on the principles behind masked diffusion language models, see the original project page. For users that are unfamiliar with diffusion models on continuous inputs, this blog post offers an overview from first principles.

From a hands-on researcher point of view, there is just one main difference that needs to be addressed before deploying llmart on this model:

What is the loss function?

In this example, we opt for using the cross-entropy loss at diffusion time t=1, where all response tokens are masked. This proves to be powerful enough to transfer to the entire end-to-end diffusion sampling (on a much more granular time grid). The example requires the external generate.py file from the original repo to be present in the directory, which will be automatically downloaded when running a quick test using:

make run

Once the file has been downloaded at least once, command-line arguments can be directly specified by running with uv:

uv run --with-requirements requirements.txt main.py --n_tokens 1 --suffix 20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

Basics and requirements

Attacks on diffusion large language models (dLLMs) using `llmart`

FilesExpand file tree

llada

Directory actions

More options

Directory actions

More options

Latest commit

History

llada

Folders and files

parent directory

README.md

Basics and requirements

Attacks on diffusion large language models (dLLMs) using llmart

Attacks on diffusion large language models (dLLMs) using `llmart`