Attacks API¶

This page documents the adversarial attack components of the Segmentation Robustness Framework.

Attack Classes¶

`segmentation_robustness_framework.attacks.attack` ¶

Classes¶

`AdversarialAttack(model: nn.Module)` ¶

Bases: ABC

Define the base class for adversarial attacks.

Attributes:

Name	Type	Description
`model`	`Module`	Segmentation model to be attacked.
`device`	`str \| device`	The device to use for the attack.

Initialize the adversarial attack.

Parameters:

Name	Type	Description	Default
`model`	`Module`	Segmentation model to be attacked.	required

Source code in segmentation_robustness_framework/attacks/attack.py

def __init__(self, model: nn.Module):
    """Initialize the adversarial attack.

    Args:
        model (nn.Module): Segmentation model to be attacked.
    """
    self.model = model

    try:
        self.device = next(model.parameters()).device
    except Exception:
        self.device = torch.device("cpu")
        logger.warning("Failed to detect model device. Using CPU. You can try `set_device()`")
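The device lookup in the constructor can be tried in isolation. The following is a minimal sketch rather than the framework's code: `detect_device` is a hypothetical helper that mirrors the try/except above, but catches only `StopIteration`, which is what `next()` raises for a module that has no parameters.

```python
import torch
from torch import nn


def detect_device(model: nn.Module) -> torch.device:
    """Return the device of the first parameter, or CPU if there is none."""
    try:
        return next(model.parameters()).device
    except StopIteration:
        # Modules without parameters (e.g. a bare activation) end up here
        return torch.device("cpu")


conv = nn.Conv2d(3, 2, kernel_size=1)
print(detect_device(conv))       # device the convolution weights live on
print(detect_device(nn.ReLU()))  # no parameters, so the CPU fallback is used
```

When the fallback is taken, calling `set_device()` afterwards sets the intended device explicitly.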

Functions¶

`set_device(device: str | torch.device) -> None` ¶

Set the device for the attack.

Parameters:

Name	Type	Description	Default
`device`	`str \| device`	The device to use for the attack.	required

Source code in segmentation_robustness_framework/attacks/attack.py

def set_device(self, device: str | torch.device) -> None:
    """Set the device for the attack.

    Args:
        device (str | torch.device): The device to use for the attack.
    """
    self.device = torch.device(device) if isinstance(device, str) else device
    logger.info(f"Attack device set to: {self.device}")

`apply(image: torch.Tensor, labels: torch.Tensor) -> torch.Tensor` `abstractmethod` ¶

Perform an attack on the segmentation model.

This method should be implemented by subclasses to define the attack logic.

Parameters:

Name	Type	Description	Default
`image`	`Tensor`	The input image tensor to be perturbed.	required
`labels`	`Tensor`	The true or target labels for the image.	required

Returns:

Type	Description
`Tensor`	The perturbed image tensor.

Source code in segmentation_robustness_framework/attacks/attack.py

@abstractmethod
def apply(self, image: torch.Tensor, labels: torch.Tensor) -> torch.Tensor:
    """Perform an attack on the segmentation model.

    This method should be implemented by subclasses to define the attack logic.

    Args:
        image (torch.Tensor): The input image tensor to be perturbed.
        labels (torch.Tensor): The true or target labels for the image.

    Returns:
        torch.Tensor: The perturbed image tensor.
    """
    pass

`segmentation_robustness_framework.attacks.fgsm` ¶

Classes¶

`FGSM(model: nn.Module, eps: float = 2 / 255)` ¶

Bases: AdversarialAttack

Fast Gradient Sign Method (FGSM) from "Explaining and Harnessing Adversarial Examples". Paper: https://arxiv.org/abs/1412.6572

Attributes:

Name	Type	Description
`model`	`Module`	The model that the adversarial attack will be applied to.
`eps`	`float`	The magnitude of the perturbation.

Initialize FGSM attack.

Parameters:

Name	Type	Description	Default
`model`	`Module`	The model that the adversarial attack will be applied to.	required
`eps`	`float`	The magnitude of the perturbation. Defaults to 2/255.	`2 / 255`

Source code in segmentation_robustness_framework/attacks/fgsm.py

def __init__(
    self,
    model: nn.Module,
    eps: float = 2 / 255,
):
    """Initialize FGSM attack.

    Args:
        model (nn.Module): The model that the adversarial attack will be applied to.
        eps (float): The magnitude of the perturbation. Defaults to 2/255.
    """
    super().__init__(model)
    self.eps = eps

Functions¶

`get_params() -> dict[str, float]` ¶

Get attack parameters.

Returns:

Type	Description
`dict[str, float]`	Dictionary containing attack parameters.

Source code in segmentation_robustness_framework/attacks/fgsm.py

def get_params(self) -> dict[str, float]:
    """Get attack parameters.

    Returns:
        dict[str, float]: Dictionary containing attack parameters.
    """
    return {"epsilon": self.eps}

`apply(image: torch.Tensor, labels: torch.Tensor) -> torch.Tensor` ¶

Apply FGSM attack to input images.

Parameters:

Name	Type	Description	Default
`image`	`Tensor`	Input image tensor [B, C, H, W].	required
`labels`	`Tensor`	Target labels tensor [B, H, W].	required

Returns:

Type	Description
`Tensor`	Adversarial image tensor [B, C, H, W].

Source code in segmentation_robustness_framework/attacks/fgsm.py

def apply(self, image: torch.Tensor, labels: torch.Tensor) -> torch.Tensor:
    """Apply FGSM attack to input images.

    Args:
        image (torch.Tensor): Input image tensor [B, C, H, W].
        labels (torch.Tensor): Target labels tensor [B, H, W].

    Returns:
        torch.Tensor: Adversarial image tensor [B, C, H, W].
    """
    self.model.eval()

    image = image.to(self.device, non_blocking=True)
    image.requires_grad = True
    labels = labels.to(self.device, non_blocking=True)

    valid_mask = labels >= 0

    if not torch.any(valid_mask):
        return image.detach()

    outputs = self.model(image)
    self.model.zero_grad()

    # Reshape outputs and labels for loss computation
    # outputs: [B, C, H, W] -> [B*H*W, C]
    # labels: [B, H, W] -> [B*H*W]
    B, C, H, W = outputs.shape
    outputs_flat = outputs.permute(0, 2, 3, 1).reshape(-1, C)  # [B*H*W, C]
    labels_flat = labels.reshape(-1)  # [B*H*W]

    # Only compute loss on valid pixels
    valid_indices = valid_mask.reshape(-1)  # [B*H*W]
    valid_outputs = outputs_flat[valid_indices]  # [N_valid, C]
    valid_labels = labels_flat[valid_indices]  # [N_valid]

    # Compute loss only on valid pixels
    loss = torch.nn.CrossEntropyLoss()
    cost = loss(valid_outputs, valid_labels)
    cost.backward()

    adv_image = image + self.eps * image.grad.sign()
    adv_image = torch.clamp(adv_image, 0, 1)

    # Memory cleanup
    del outputs, outputs_flat, labels_flat, valid_outputs, valid_labels, cost
    if self.device == "cuda":
        torch.cuda.empty_cache()

    return adv_image
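The flatten-and-mask step in `apply` keeps only pixels whose label is non-negative before computing the loss. As a sketch under the assumption that ignored pixels are marked with `-1`, the snippet below checks that this gives the same value as PyTorch's built-in `ignore_index` handling. The tensors are random stand-ins, not framework data.

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)
num_classes = 3
logits = torch.randn(2, num_classes, 4, 4)          # [B, C, H, W]
labels = torch.randint(0, num_classes, (2, 4, 4))   # [B, H, W]
labels[:, 0, :] = -1                                # mark the first row as ignored

# Flatten to per-pixel rows, then keep only the valid pixels
flat_logits = logits.permute(0, 2, 3, 1).reshape(-1, num_classes)  # [B*H*W, C]
flat_labels = labels.reshape(-1)                                   # [B*H*W]
valid = flat_labels >= 0
masked_loss = F.cross_entropy(flat_logits[valid], flat_labels[valid])

# Same quantity using the built-in ignore mechanism
ignore_loss = F.cross_entropy(logits, labels, ignore_index=-1)

print(masked_loss.item(), ignore_loss.item())
```

Because the mask is `labels >= 0`, a non-negative ignore value such as 255 would be treated as a valid class, so datasets using that convention would need their ignore label remapped to a negative value first.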

`segmentation_robustness_framework.attacks.pgd` ¶

Classes¶

`PGD(model: nn.Module, eps: float = 2 / 255, alpha: float = 2 / 255, iters: int = 10, targeted: bool = False)` ¶

Bases: AdversarialAttack

Projected Gradient Descent (PGD) method from "Towards Deep Learning Models Resistant to Adversarial Attacks". Paper: https://arxiv.org/abs/1706.06083

Attributes:

Name	Type	Description
`model`	`Module`	The model that the adversarial attack will be applied to.
`eps`	`float`	The magnitude of the perturbation.
`alpha`	`float`	The step size for each iteration.
`iters`	`int`	The number of iterations.
`targeted`	`bool`	Indicates whether the attack is targeted or not.

Initializes PGD attack.

Parameters:

Name	Type	Description	Default
`model`	`Module`	The model that the adversarial attack will be applied to.	required
`eps`	`float`	The magnitude of the perturbation. Defaults to 2/255.	`2 / 255`
`alpha`	`float`	The step size for each iteration. Defaults to 2/255.	`2 / 255`
`iters`	`int`	The number of iterations. Defaults to 10.	`10`
`targeted`	`bool`	If True, performs a targeted attack; otherwise, performs an untargeted attack. Defaults to False.	`False`

Source code in segmentation_robustness_framework/attacks/pgd.py

def __init__(
    self,
    model: nn.Module,
    eps: float = 2 / 255,
    alpha: float = 2 / 255,
    iters: int = 10,
    targeted: bool = False,
):
    """Initializes PGD attack.

    Args:
        model (nn.Module): The model that the adversarial attack will be applied to.
        eps (float, optional): The magnitude of the perturbation. Defaults to 2/255.
        alpha (float, optional): The step size for each iteration. Defaults to 2/255.
        iters (int, optional): The number of iterations. Defaults to 10.
        targeted (bool, optional): If True, performs a targeted attack; otherwise, performs
            an untargeted attack. Defaults to False.
    """
    super().__init__(model)
    self.eps = eps
    self.alpha = alpha
    self.iters = iters
    self.targeted = targeted
    self.device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

Functions¶

`get_params() -> dict[str, float]` ¶

Get attack parameters.

Returns:

Type	Description
`dict[str, float]`	Dictionary containing attack parameters.

Source code in segmentation_robustness_framework/attacks/pgd.py

def get_params(self) -> dict[str, float]:
    """Get attack parameters.

    Returns:
        dict[str, float]: Dictionary containing attack parameters.
    """
    return {"epsilon": self.eps, "alpha": self.alpha, "iters": self.iters, "targeted": self.targeted}

`apply(images: torch.Tensor, labels: torch.Tensor) -> torch.Tensor` ¶

Apply PGD attack to a batch of images.

Parameters:

Name	Type	Description	Default
`images`	`Tensor`	Batch of input images [B, C, H, W].	required
`labels`	`Tensor`	Batch of target labels [B, H, W].	required

Returns:

Type	Description
`Tensor`	Batch of adversarial images [B, C, H, W].

Source code in segmentation_robustness_framework/attacks/pgd.py

def apply(self, images: torch.Tensor, labels: torch.Tensor) -> torch.Tensor:
    """Apply PGD attack to a batch of images.

    Args:
        images (torch.Tensor): Batch of input images [B, C, H, W].
        labels (torch.Tensor): Batch of target labels [B, H, W].

    Returns:
        torch.Tensor: Batch of adversarial images [B, C, H, W].
    """
    self.model.eval()

    images = images.to(self.device, non_blocking=True)
    labels = labels.to(self.device, non_blocking=True)

    if labels.dim() == 4 and labels.shape[1] == 1:
        labels = labels.squeeze(1)  # [B, H, W]

    valid_mask = labels >= 0

    if not torch.any(valid_mask):
        return images.detach()

    loss = torch.nn.CrossEntropyLoss()

    adv_images = images.clone().detach()
    adv_images = adv_images + torch.empty_like(adv_images).uniform_(-self.eps, self.eps)
    adv_images = torch.clamp(adv_images, min=0, max=1).detach()

    for _ in range(self.iters):
        adv_images.requires_grad = True

        outputs = self.model(adv_images)  # [B, num_classes, H, W]

        # Reshape outputs and labels for loss computation
        # outputs: [B, C, H, W] -> [B*H*W, C]
        # labels: [B, H, W] -> [B*H*W]
        B, C, H, W = outputs.shape
        outputs_flat = outputs.permute(0, 2, 3, 1).reshape(-1, C)  # [B*H*W, C]
        labels_flat = labels.reshape(-1)  # [B*H*W]

        valid_indices = valid_mask.reshape(-1)  # [B*H*W]
        valid_outputs = outputs_flat[valid_indices]  # [N_valid, C]
        valid_labels = labels_flat[valid_indices]  # [N_valid]

        if self.targeted:
            cost = -loss(valid_outputs, valid_labels)
        else:
            cost = loss(valid_outputs, valid_labels)

        self.model.zero_grad()
        grad = torch.autograd.grad(cost, adv_images, retain_graph=False, create_graph=False)[0]

        adv_images = adv_images.detach() + self.alpha * grad.sign()
        delta = torch.clamp(adv_images - images, min=-self.eps, max=self.eps)
        adv_images = torch.clamp(images + delta, min=0, max=1).detach()

        # Memory cleanup
        del outputs, outputs_flat, labels_flat, valid_outputs, valid_labels, cost, grad
        if self.device == "cuda":
            torch.cuda.empty_cache()

    return adv_images
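The last two lines of each iteration project the stepped image back into the allowed region: first into the L-infinity ball of radius `eps` around the clean image, then into the valid pixel range [0, 1]. A small sketch of that projection on hand-picked values (`project` is a hypothetical helper name):

```python
import torch


def project(adv: torch.Tensor, clean: torch.Tensor, eps: float) -> torch.Tensor:
    """Clip the perturbation to [-eps, eps], then clip the image to [0, 1]."""
    delta = torch.clamp(adv - clean, min=-eps, max=eps)
    return torch.clamp(clean + delta, min=0, max=1)


clean = torch.tensor([0.0, 0.5, 1.0])
stepped = torch.tensor([-0.2, 0.9, 1.3])  # values after an oversized gradient step
projected = project(stepped, clean, eps=0.1)
print(projected)  # approximately [0.0, 0.6, 1.0]
```

The first and last pixels show the second clamp at work: even a perturbation within `eps` is cut back when it would leave the [0, 1] range.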

`segmentation_robustness_framework.attacks.rfgsm` ¶

Classes¶

`RFGSM(model: nn.Module, eps: float = 8 / 255, alpha: float = 2 / 255, iters: int = 10, targeted: bool = False)` ¶

Bases: AdversarialAttack

Random Fast Gradient Sign Method (R+FGSM) from the paper "Ensemble Adversarial Training: Attacks and Defenses". Paper: https://arxiv.org/abs/1705.07204

Attributes:

Name	Type	Description
`model`	`Module`	The model that the adversarial attack will be applied to.
`eps`	`float`	Strength of the attack or maximum perturbation.
`alpha`	`float`	Step size.
`iters`	`int`	Number of iterations.
`targeted`	`bool`	Indicates whether the attack is targeted or not.

Initializes R+FGSM attack.

Parameters:

Name	Type	Description	Default
`model`	`Module`	The model that the adversarial attack will be applied to.	required
`eps`	`float`	Strength of the attack or maximum perturbation. Defaults to 8/255.	`8 / 255`
`alpha`	`float`	Step size. Defaults to 2/255.	`2 / 255`
`iters`	`int`	Number of iterations. Defaults to 10.	`10`
`targeted`	`bool`	If True, performs a targeted attack; otherwise, performs an untargeted attack. Defaults to False.	`False`

Source code in segmentation_robustness_framework/attacks/rfgsm.py

def __init__(
    self,
    model: nn.Module,
    eps: float = 8 / 255,
    alpha: float = 2 / 255,
    iters: int = 10,
    targeted: bool = False,
):
    """Initializes R+FGSM attack.

    Args:
        model (nn.Module): The model that the adversarial attack will be applied to.
        eps (float, optional): Strength of the attack or maximum perturbation. Defaults to 8/255.
        alpha (float, optional): Step size. Defaults to 2/255.
        iters (int, optional): Number of iterations. Defaults to 10.
        targeted (bool, optional): If True, performs a targeted attack; otherwise, performs
            an untargeted attack. Defaults to False.
    """
    super().__init__(model)
    self.eps = eps
    self.alpha = alpha
    self.iters = iters
    self.targeted = targeted

Functions¶

`get_params() -> dict[str, float]` ¶

Get attack parameters.

Returns:

Type	Description
`dict[str, float]`	Dictionary containing attack parameters.

Source code in segmentation_robustness_framework/attacks/rfgsm.py

def get_params(self) -> dict[str, float]:
    """Get attack parameters.

    Returns:
        dict[str, float]: Dictionary containing attack parameters.
    """
    return {"epsilon": self.eps, "alpha": self.alpha, "iters": self.iters, "targeted": self.targeted}

`apply(images: torch.Tensor, labels: torch.Tensor) -> torch.Tensor` ¶

Apply R+FGSM attack to a batch of images.

Parameters:

Name	Type	Description	Default
`images`	`Tensor`	Batch of input images [B, C, H, W].	required
`labels`	`Tensor`	Batch of target labels [B, H, W].	required

Returns:

Type	Description
`Tensor`	Batch of adversarial images [B, C, H, W].

Source code in segmentation_robustness_framework/attacks/rfgsm.py

def apply(self, images: torch.Tensor, labels: torch.Tensor) -> torch.Tensor:
    """Apply R+FGSM attack to a batch of images.

    Args:
        images (torch.Tensor): Batch of input images [B, C, H, W].
        labels (torch.Tensor): Batch of target labels [B, H, W].

    Returns:
        torch.Tensor: Batch of adversarial images [B, C, H, W].
    """
    self.model.eval()

    images = images.to(self.device, non_blocking=True)
    labels = labels.to(self.device, non_blocking=True)

    if labels.dim() == 4 and labels.shape[1] == 1:
        labels = labels.squeeze(1)  # [B, H, W]

    valid_mask = labels >= 0

    if not torch.any(valid_mask):
        return images.detach()

    adv_images = images + (self.eps - self.alpha) * torch.randn_like(images).sign()
    adv_images = torch.clamp(adv_images, min=0, max=1).detach()

    loss = torch.nn.CrossEntropyLoss()

    for _ in range(self.iters):
        adv_images.requires_grad = True

        outputs = self.model(adv_images)  # [B, num_classes, H, W]

        # Reshape outputs and labels for loss computation
        # outputs: [B, C, H, W] -> [B*H*W, C]
        # labels: [B, H, W] -> [B*H*W]
        B, C, H, W = outputs.shape
        outputs_flat = outputs.permute(0, 2, 3, 1).reshape(-1, C)  # [B*H*W, C]
        labels_flat = labels.reshape(-1)  # [B*H*W]

        valid_indices = valid_mask.reshape(-1)  # [B*H*W]
        valid_outputs = outputs_flat[valid_indices]  # [N_valid, C]
        valid_labels = labels_flat[valid_indices]  # [N_valid]

        if self.targeted:
            cost = -loss(valid_outputs, valid_labels)
        else:
            cost = loss(valid_outputs, valid_labels)

        self.model.zero_grad()
        grad = torch.autograd.grad(cost, adv_images, retain_graph=False, create_graph=False)[0]

        adv_images = adv_images.detach() + self.alpha * grad.sign()
        delta = torch.clamp(adv_images - images, min=-self.eps, max=self.eps)
        adv_images = torch.clamp(images + delta, min=0, max=1).detach()

        # Memory cleanup
        del outputs, outputs_flat, labels_flat, valid_outputs, valid_labels, cost, grad
        if self.device == "cuda":
            torch.cuda.empty_cache()

    return adv_images
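What separates this `apply` from the PGD one is the starting point: instead of uniform noise, every pixel is moved by exactly `eps - alpha` in a random direction. The sketch below reproduces only that initialisation on a constant dummy image, using the default `eps` and `alpha` from the signature above.

```python
import torch

torch.manual_seed(0)
eps, alpha = 8 / 255, 2 / 255
images = torch.full((1, 3, 4, 4), 0.5)  # dummy mid-gray batch [B, C, H, W]

# Same initialisation as in apply(): a random sign per pixel, scaled by eps - alpha
start = images + (eps - alpha) * torch.randn_like(images).sign()
offset = (start - images).abs()

print(offset.max().item() * 255)  # close to 6 intensity levels
```

With the defaults the start sits 6/255 away from the clean image, so a single step of size `alpha` can reach the `eps` boundary. If `alpha` equals `eps`, the random start vanishes.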

`segmentation_robustness_framework.attacks.tpgd` ¶

Classes¶

`TPGD(model: nn.Module, eps: float = 8 / 255, alpha: float = 2 / 255, iters: int = 10)` ¶

Bases: AdversarialAttack

PGD based on KL-Divergence loss from the paper "Theoretically Principled Trade-off between Robustness and Accuracy". Paper: https://arxiv.org/abs/1901.08573

Attributes:

Name	Type	Description
`model`	`Module`	The model that the adversarial attack will be applied to.
`eps`	`float`	Strength of the attack or maximum perturbation.
`alpha`	`float`	Step size.
`iters`	`int`	Number of iterations.

Initializes TPGD attack.

Parameters:

Name	Type	Description	Default
`model`	`Module`	The model that the adversarial attack will be applied to.	required
`eps`	`float`	Strength of the attack or maximum perturbation.	`8 / 255`
`alpha`	`float`	Step size.	`2 / 255`
`iters`	`int`	Number of iterations.	`10`

Source code in segmentation_robustness_framework/attacks/tpgd.py

def __init__(
    self,
    model: nn.Module,
    eps: float = 8 / 255,
    alpha: float = 2 / 255,
    iters: int = 10,
):
    """Initializes TPGD attack.

    Args:
        model (nn.Module): The model that the adversarial attack will be applied to.
        eps (float): Strength of the attack or maximum perturbation.
        alpha (float): Step size.
        iters (int): Number of iterations.
    """
    super().__init__(model)
    self.eps = eps
    self.alpha = alpha
    self.iters = iters

Functions¶

`apply(images: torch.Tensor, labels: torch.Tensor = None) -> torch.Tensor` ¶

Apply TPGD attack to a batch of images.

Parameters:

Name	Type	Description	Default
`images`	`Tensor`	Batch of input images [B, C, H, W].	required
`labels`	`Tensor`	Batch of target labels [B, H, W]. Not used in TPGD.	`None`

Returns:

Type	Description
`Tensor`	Batch of adversarial images [B, C, H, W].

Source code in segmentation_robustness_framework/attacks/tpgd.py

def apply(self, images: torch.Tensor, labels: torch.Tensor = None) -> torch.Tensor:
    """Apply TPGD attack to a batch of images.

    Args:
        images (torch.Tensor): Batch of input images [B, C, H, W].
        labels (torch.Tensor, optional): Batch of target labels [B, H, W]. Not used in TPGD.

    Returns:
        torch.Tensor: Batch of adversarial images [B, C, H, W].
    """
    self.model.eval()

    images = images.clone().detach().to(self.device, non_blocking=True)

    with torch.no_grad():
        logit_ori = self.model(images).detach()

    adv_images = images + 0.001 * torch.randn_like(images)
    adv_images = torch.clamp(adv_images, min=0, max=1).detach()

    loss = torch.nn.KLDivLoss(reduction="sum")

    for _ in range(self.iters):
        adv_images.requires_grad = True

        logit_adv = self.model(adv_images)  # [B, num_classes, H, W]

        cost = loss(F.log_softmax(logit_adv, dim=1), F.softmax(logit_ori, dim=1))

        self.model.zero_grad()
        grad = torch.autograd.grad(cost, adv_images, retain_graph=False, create_graph=False)[0]

        adv_images = adv_images.detach() + self.alpha * grad.sign()
        delta = torch.clamp(adv_images - images, min=-self.eps, max=self.eps)
        adv_images = torch.clamp(images + delta, min=0, max=1).detach()

        # Memory cleanup
        del logit_adv, cost, grad
        if self.device == "cuda":
            torch.cuda.empty_cache()

    return adv_images
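The objective maximised here needs no labels: it is the KL divergence between the class distributions predicted for the clean and for the perturbed input. A minimal sketch of that loss on random logits follows; `prediction_kl` is a hypothetical helper wrapping the same `log_softmax`/`softmax` arguments used in `apply`.

```python
import torch
import torch.nn.functional as F


def prediction_kl(logit_adv: torch.Tensor, logit_ori: torch.Tensor) -> torch.Tensor:
    """Summed KL(softmax(logit_ori) || softmax(logit_adv)) over all pixels."""
    return F.kl_div(
        F.log_softmax(logit_adv, dim=1),  # log-probabilities of the perturbed input
        F.softmax(logit_ori, dim=1),      # probabilities of the clean input
        reduction="sum",
    )


torch.manual_seed(0)
logit_ori = torch.randn(1, 3, 4, 4)  # stand-in logits [B, num_classes, H, W]
same = prediction_kl(logit_ori, logit_ori)
shifted = prediction_kl(logit_ori + torch.randn_like(logit_ori), logit_ori)
print(same.item(), shifted.item())
```

For identical predictions the divergence is zero and so is its gradient, which is presumably why `apply` starts from a slightly noised copy of the input rather than from the clean image itself.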

Attack Overview¶

The framework provides four gradient-based adversarial attacks (FGSM, PGD, R+FGSM, and TPGD) for testing the robustness of segmentation models. All attacks inherit from the AdversarialAttack base class.

AdversarialAttack Base Class¶

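The base class that every attack extends. Its constructor takes only the model and reads the device from the model's parameters; subclasses store their own hyperparameters such as eps. A condensed view:

from abc import ABC, abstractmethod

import torch
from torch import nn

class AdversarialAttack(ABC):
    """Base class for adversarial attacks."""

    def __init__(self, model: nn.Module):
        self.model = model
        try:
            # Use the device the model's parameters live on
            self.device = next(model.parameters()).device
        except Exception:
            # Fall back to CPU; override with set_device() if needed
            self.device = torch.device("cpu")

    @abstractmethod
    def apply(self, image: torch.Tensor, labels: torch.Tensor) -> torch.Tensor:
        """Return the perturbed version of image given its labels."""
        pass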

Available Attacks¶

FGSM (Fast Gradient Sign Method)¶

A single-step attack that moves every pixel by eps in the direction of the sign of the loss gradient:

from segmentation_robustness_framework.attacks import FGSM

# Create FGSM attack
attack = FGSM(model, eps=0.02)

# Apply attack
adversarial_x = attack.apply(x, y)

Parameters:

- eps: Maximum perturbation magnitude (default: 2/255 ≈ 0.008)
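Because the attacks clamp their output to [0, 1], `eps` is expressed on that normalised scale. As a rough aid for choosing values, the sketch below converts an `eps` to 8-bit intensity levels; `eps_in_levels` is a hypothetical helper, and it assumes images were scaled from the 0-255 range without further normalisation.

```python
def eps_in_levels(eps: float, max_value: int = 255) -> float:
    """Convert an eps on the [0, 1] scale to 8-bit intensity levels."""
    return eps * max_value


for eps in (2 / 255, 8 / 255, 0.02):
    print(f"eps={eps:.4f} -> {eps_in_levels(eps):.1f} intensity levels")
```

The default of 2/255 changes each pixel by two intensity levels, while the `eps=0.02` used in the example above corresponds to about five.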

PGD (Projected Gradient Descent)¶

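An iterative attack that starts from a uniformly random point inside the eps-ball, takes iters signed-gradient steps of size alpha, and projects the result back into the eps-ball after every step: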

from segmentation_robustness_framework.attacks import PGD

# Create PGD attack
attack = PGD(
    model=model,
    eps=0.02,
    alpha=0.02,
    iters=10,
    targeted=False
)

# Apply attack
adversarial_x = attack.apply(x, y)

Parameters:

- eps: Maximum perturbation magnitude (default: 2/255 ≈ 0.008)
- alpha: Step size for each iteration (default: 2/255 ≈ 0.008)
- iters: Number of iterations (default: 10)
- targeted: Whether to perform a targeted attack (default: False)

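RFGSM (Random Fast Gradient Sign Method)¶

An iterative signed-gradient attack that starts from a random sign perturbation of magnitude eps - alpha (no momentum is involved):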

from segmentation_robustness_framework.attacks import RFGSM

# Create RFGSM attack
attack = RFGSM(
    model=model,
    eps=0.1,
    alpha=0.01,
    iters=10,
    targeted=False
)

# Apply attack
adversarial_x = attack.apply(x, y)

Parameters:

- eps: Maximum perturbation magnitude (default: 8/255 ≈ 0.031)
- alpha: Step size for each iteration (default: 2/255 ≈ 0.008)
- iters: Number of iterations (default: 10)
- targeted: Whether to perform a targeted attack (default: False)

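TPGD (PGD with KL-Divergence Loss)¶

PGD driven by the KL divergence between the model's predictions on the clean and the adversarial images. The labels argument is accepted but not used: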

from segmentation_robustness_framework.attacks import TPGD

# Create TPGD attack
attack = TPGD(
    model=model,
    eps=0.1,
    alpha=0.01,
    iters=10
)

# Apply attack
adversarial_x = attack.apply(x, y)

Parameters:

- eps: Maximum perturbation magnitude (default: 8/255 ≈ 0.031)
- alpha: Step size for each iteration (default: 2/255 ≈ 0.008)
- iters: Number of iterations (default: 10)

Attack Configuration¶

Configure attacks in YAML configuration files:

attacks:
  - name: fgsm
    eps: 0.02
  - name: pgd
    eps: 0.02
    alpha: 0.02
    iters: 10
    targeted: false
  - name: rfgsm
    eps: 0.02
    alpha: 0.02
    iters: 10
    targeted: false
  - name: tpgd
    eps: 0.02
    alpha: 0.02
    iters: 10
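How the framework turns these entries into attack objects is not shown on this page. One plausible reading, offered purely as an assumption, is that `name` selects the attack class and the remaining keys are passed to its constructor as keyword arguments. The sketch below illustrates that split on the parsed form of such a list; `split_entry` is a hypothetical helper, not a framework function.

```python
from typing import Any

# Parsed form of a YAML `attacks:` list, as a YAML loader would return it
attack_entries: list[dict[str, Any]] = [
    {"name": "fgsm", "eps": 0.02},
    {"name": "pgd", "eps": 0.02, "alpha": 0.02, "iters": 10, "targeted": False},
]


def split_entry(entry: dict[str, Any]) -> tuple[str, dict[str, Any]]:
    """Separate the attack name from its constructor keyword arguments."""
    kwargs = {key: value for key, value in entry.items() if key != "name"}
    return entry["name"], kwargs


for entry in attack_entries:
    name, kwargs = split_entry(entry)
    print(name, kwargs)
```

Under this reading, a key that the selected constructor does not accept (for example `targeted` on TPGD) would raise a `TypeError` when the attack is created.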

Custom Attacks¶

Create custom attacks by inheriting from AdversarialAttack:

from segmentation_robustness_framework.attacks import AdversarialAttack
import torch

class MyCustomAttack(AdversarialAttack):
    def __init__(self, model, eps=0.1, custom_param=1.0):
        # The base class accepts only the model; store hyperparameters here
        super().__init__(model)
        self.eps = eps
        self.custom_param = custom_param

    def apply(self, x: torch.Tensor, y: torch.Tensor) -> torch.Tensor:
        """Apply your custom attack logic here."""
        self.model.eval()

        # Work on a detached copy on the attack device so the caller's tensor is untouched
        x = x.clone().detach().to(self.device).requires_grad_(True)
        y = y.to(self.device)

        # Forward pass: logits [B, num_classes, H, W], labels [B, H, W]
        logits = self.model(x)
        loss = torch.nn.functional.cross_entropy(logits, y)

        # Backward pass
        loss.backward()

        # Create perturbation: eps is the budget, custom_param an illustrative scale factor
        perturbation = self.eps * self.custom_param * x.grad.sign()

        # Apply perturbation with clipping
        adversarial_x = x + perturbation
        adversarial_x = torch.clamp(adversarial_x, 0, 1)

        return adversarial_x.detach()

# Use custom attack
attack = MyCustomAttack(model, eps=0.1, custom_param=0.5)
adversarial_x = attack.apply(x, y)
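A new attack can be checked against a few invariants that all built-in attacks on this page satisfy: the output keeps the input shape, stays in [0, 1], and differs from the clean input by at most `eps` per pixel. The sketch below runs these checks without the framework installed. `fgsm_step` and `check_attack_output` are hypothetical helpers, and the 1x1 convolution is a toy stand-in for a segmentation model.

```python
import torch
import torch.nn.functional as F
from torch import nn


def fgsm_step(model: nn.Module, x: torch.Tensor, y: torch.Tensor, eps: float) -> torch.Tensor:
    """One signed-gradient step of size eps, clipped to the valid pixel range."""
    x = x.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(x), y)
    grad = torch.autograd.grad(loss, x)[0]
    return torch.clamp(x.detach() + eps * grad.sign(), 0, 1)


def check_attack_output(clean: torch.Tensor, adv: torch.Tensor, eps: float, tol: float = 1e-6) -> bool:
    """Check shape, pixel range, and the L-infinity bound of an attack's output."""
    same_shape = adv.shape == clean.shape
    in_range = bool(adv.min() >= 0) and bool(adv.max() <= 1)
    bounded = bool((adv - clean).abs().max() <= eps + tol)
    return same_shape and in_range and bounded


torch.manual_seed(0)
toy_model = nn.Conv2d(3, 2, kernel_size=1).eval()  # 3 channels in, 2 classes out
x = torch.rand(2, 3, 8, 8)                          # images in [0, 1]
y = torch.randint(0, 2, (2, 8, 8))                  # per-pixel class labels
adv = fgsm_step(toy_model, x, y, eps=2 / 255)
ok = check_attack_output(x, adv, eps=2 / 255)
print(ok)
```

The same `check_attack_output` call can be pointed at the result of any `apply` method, as long as the clean batch lives on the same device as the returned tensor.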

Attack Registration¶

Register custom attacks for automatic discovery:

import torch

from segmentation_robustness_framework.attacks import AdversarialAttack, register_attack

@register_attack("my_custom_attack")
class MyCustomAttack(AdversarialAttack):
    def __init__(self, model, eps=0.1, custom_param=1.0):
        super().__init__(model)  # the base class accepts only the model
        self.eps = eps
        self.custom_param = custom_param

    def apply(self, x: torch.Tensor, y: torch.Tensor) -> torch.Tensor:
        # Your attack implementation
        pass

# Now you can use it in configuration
# attacks:
#   - name: my_custom_attack
#     eps: 0.1
#     custom_param: 0.5

Attack Usage in Pipeline¶

Attacks are automatically used by the pipeline:

from segmentation_robustness_framework.pipeline import SegmentationRobustnessPipeline
from segmentation_robustness_framework.attacks import FGSM, PGD

# Create attacks (model, dataset, and metrics are assumed to be defined earlier)
attacks = [
    FGSM(model, eps=0.1),
    PGD(model, eps=0.1, alpha=0.01, iters=10)
]

# Use in pipeline
pipeline = SegmentationRobustnessPipeline(
    model=model,
    dataset=dataset,
    attacks=attacks,
    metrics=[metrics.mean_iou],
    batch_size=4,
    device="cuda"
)

results = pipeline.run()

Attack Evaluation¶

Evaluate attack effectiveness:

# Compare clean vs adversarial performance
clean_iou = results['clean']['mean_iou']
fgsm_iou = results['attack_fgsm']['mean_iou']
pgd_iou = results['attack_pgd']['mean_iou']

print(f"Clean IoU: {clean_iou:.3f}")
print(f"FGSM IoU: {fgsm_iou:.3f}")
print(f"PGD IoU: {pgd_iou:.3f}")

# Calculate robustness
fgsm_robustness = fgsm_iou / clean_iou
pgd_robustness = pgd_iou / clean_iou

print(f"FGSM Robustness: {fgsm_robustness:.3f}")
print(f"PGD Robustness: {pgd_robustness:.3f}")
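The ratio above is undefined when the clean score is zero. A small guard, sketched here as a hypothetical helper that is not part of the framework, keeps the comparison from raising `ZeroDivisionError` in that case:

```python
import math


def robustness_ratio(clean_score: float, adversarial_score: float) -> float:
    """Fraction of the clean score retained under attack (1.0 means no degradation)."""
    if clean_score == 0:
        return math.nan  # the ratio is undefined when the clean score is zero
    return adversarial_score / clean_score


print(f"{robustness_ratio(0.8, 0.4):.3f}")  # prints 0.500
```

A value near 1.0 means the attack barely changed the metric, while a value near 0.0 means the metric collapsed under attack.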

Performance Considerations¶

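GPU Acceleration: Each attack moves its inputs to self.device before running; the device can be changed with set_device()
Memory Efficiency: Iterative attacks delete their intermediate tensors after every step to limit peak memory use
Gradient Computation: Iterative attacks call torch.autograd.grad with respect to the input only, so no parameter gradients are accumulated
Iterations: Iterative attacks always run the configured number of iterations (iters); there is no early stopping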
