
Attack on an HF model split into multiple GPUs #798

Open
RealPolitiX opened this issue Aug 2, 2024 · 1 comment
Comments


RealPolitiX commented Aug 2, 2024

Does TextAttack allow running the attack calculation on a single model doing distributed inference across multiple GPUs?

For example, one can load a large HuggingFace model with device_map="auto", which distributes it across multiple GPUs, and then wrap it in HuggingFaceModelWrapper. However, if a single instance of a large model is split across GPUs this way, running the attack, e.g. with attacker.attack_dataset(), fails with a RuntimeError like the following (when two GPUs are present):
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cuda:1!
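
A minimal sketch of the setup described above, for reproduction; the checkpoint, recipe, and dataset names here are illustrative placeholders, not taken from this thread:

import transformers
import textattack
from textattack.models.wrappers import HuggingFaceModelWrapper

name = "textattack/bert-base-uncased-imdb"  # example checkpoint
model = transformers.AutoModelForSequenceClassification.from_pretrained(
    name,
    device_map="auto",  # shards the model across all visible GPUs (requires accelerate)
)
tokenizer = transformers.AutoTokenizer.from_pretrained(name)
model_wrapper = HuggingFaceModelWrapper(model, tokenizer)

dataset = textattack.datasets.HuggingFaceDataset("imdb", split="test")
attack = textattack.attack_recipes.TextFoolerJin2019.build(model_wrapper)

attacker = textattack.Attacker(attack, dataset)
attacker.attack_dataset()  # raises the RuntimeError above when the model spans GPUs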

@RealPolitiX RealPolitiX changed the title Attack on HF models split into multiple GPUs Attack on an HF model split into multiple GPUs Aug 2, 2024

RealPolitiX commented Aug 11, 2024

Solved this problem by commenting out the following two lines in the attacker.py file:

if torch.cuda.is_available():
    self.attack.cuda_()

It seems that cuda_() enforces a specific mapping of the neural network onto GPU memory, which presumably conflicts with the device placement chosen by device_map="auto".
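
A minimal sketch of an equivalent workaround that avoids editing the installed package, assuming the cuda_() call in attacker.py is the only place that forces the device placement (untested):

# Shadow cuda_() on this attack instance with a no-op, so that Attacker's
# "if torch.cuda.is_available(): self.attack.cuda_()" has no effect and the
# placement from device_map="auto" is left alone.
attack.cuda_ = lambda: None

attacker = textattack.Attacker(attack, dataset)
attacker.attack_dataset()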
