ORIGINAL RESEARCH article

Front. Neurosci., 01 February 2023
Sec. Visual Neuroscience
This article is part of the Research Topic Neuroscience-Inspired Visual Sensing and Understanding.

Improving the adversarial transferability with relational graphs ensemble adversarial attack

Jiatian Pi1†, Chaoyang Luo2†, Fen Xia3, Ning Jiang3*, Haiying Wu3 and Zhiyou Wu2
  • 1National Center for Applied Mathematics in Chongqing, Chongqing, China
  • 2Department of Mathematical Sciences, Chongqing Normal University, Chongqing, China
  • 3Mashang Consumer Finance Co., Ltd., Chongqing, China

In transferable black-box attacks, adversarial samples remain adversarial across multiple models and are therefore more likely to attack unknown models. From this view, acquiring and exploiting multiple models is the key to improving transferability. For exploiting multiple models, existing approaches concentrate on differences among models but ignore the underlying complex dependencies, which exacerbates the problem of unbalanced and inadequate attacks on multiple models. To address this problem, this paper proposes a novel approach, called Relational Graph Ensemble Attack (RGEA), to exploit the dependencies among multiple models. Specifically, we redefine the multi-model ensemble attack as a multi-objective optimization and create a sub-optimization problem to compute the optimal attack direction, which, however, is seriously time-consuming. To address this, we define a vector representation of each model, extract the dependency matrix, and then equivalently simplify the sub-optimization problem by utilizing the dependency matrix. Finally, we theoretically investigate the connection between RGEA and the traditional multiple gradient descent algorithm (MGDA). Notably, combining RGEA further enhances the transferability of existing gradient-based attacks. Experiments using ten normally trained models and ten defensive models on the Labeled Faces in the Wild (LFW) dataset demonstrate that RGEA improves the success rate of white-box attacks and further boosts the transferability of black-box attacks.

1. Introduction

The graph embedding model (Scarselli et al., 2009; Cui et al., 2019) demonstrates the expressive potential of deep learning on graph-structured data, and has shown promise in several applications including the classification of structural roles (Tu et al., 2018), biological analysis (Hamilton et al., 2017), financial monitoring (Paranjape et al., 2017), and the prediction of molecular features (Duvenaud et al., 2015). However, recent research (Dai et al., 2018; Zügner et al., 2018; Bojchevski and Günnemann, 2019) reveals that numerous types of graph embedding techniques, such as Graph Convolutional Networks, DeepWalk, etc., are susceptible to adversarial attacks. Therefore, a lot of attention has been paid to generating adversarial examples, known as adversarial attacks, because they may be used to estimate the robustness of various models (Rauber et al., 2020; Tramer et al., 2020) and boost their robustness through adversarial training (Xu et al., 2017; Kurakin et al., 2018b; Mehrabi et al., 2021).

Additionally, adversarial examples often exhibit good transferability across the models (Liu et al., 2016; Papernot et al., 2016; Dong et al., 2019; Chen et al., 2021), i.e., examples created for one model can still deceive other models. Several attack techniques (Szegedy et al., 2013; Carlini and Wagner, 2017; Kurakin et al., 2018b; Madry et al., 2018) treat the adversarial example generation process as an iterative optimization and exhibit high attack performance in a white-box setting (Goodfellow et al., 2014). Nevertheless, in an unknown black-box environment (Papernot et al., 2016), these approaches suffer from a serious lack of transferability.

Previous studies (Dong et al., 2018, 2019; Lin et al., 2019; Xie et al., 2019) attributed the lack of transferability to overfitting the surrogate models. Therefore, various techniques have been proposed to mitigate overfitting, including advanced gradient methods (Dong et al., 2018; Lin et al., 2019; Wang and He, 2021; Zou et al., 2022), ensemble multi-model attacks (Liu et al., 2016; Dong et al., 2018; Li et al., 2020), input transformations (Dong et al., 2019; Xie et al., 2019; Wang X. et al., 2021), and model-specific methods (Wu et al., 2019; Guo et al., 2020). Most experiments (Dong et al., 2018; Zhao et al., 2021) and top entries in competitions (Kurakin et al., 2018a) show that ensemble multi-model attacks and input transformations (e.g., random resizing and padding, scaling, etc.) are among the most effective methods. Moreover, Lin et al. (2019) suggested that input transformation can be conceived as a model augmentation that attacks more models. Similarly, Li et al. (2020) suggested dynamically eroding certain intermediate structures of the surrogate model to the same end. In conclusion, acquiring and utilizing multiple models is key to achieving better transferability, yet investigations on utilizing multiple models are quite lacking.

For utilizing multiple models, we find that, compared with the pairwise orthogonality among classification models, facial recognition models have more complex relationships among models. Recall that some real-world tasks, such as recommendation (Fan et al., 2019; Tan et al., 2020), urban data mining (Dai et al., 2020; Wang et al., 2020), and multi-task learning (Chen et al., 2019; Cao et al., 2022), enhance the utilization of information by using graphical representations to capture and exploit pairwise dependent relationships. This perspective leads us to consider the following question: can we enhance the utilization of multiple models by capturing and exploiting the dependent relationships among the models?

For this purpose, we propose a novel method, called the Relational Graphs Ensemble Attack (RGEA), to improve transferability. Specifically, 1) to exploit the complex dependencies among models, we redefine the multi-model ensemble attack as a multi-objective optimization and construct Sub-optimization problem 1 to find the optimal attack direction in each iteration; 2) since the high dimensionality of images makes this sub-problem seriously time-consuming, we define the vector representation of each model and the dependency relationships among the models, and build the equivalent Sub-optimization problem 2; 3) furthermore, we theoretically prove the equivalence between Sub-optimization problems 1 and 2, and analyze the association between RGEA and MGDA (Désidéri, 2012); 4) extensive experiments on the LFW facial dataset show that RGEA improves over the benchmark methods in a black-box setting, which indicates that RGEA can effectively exploit the dependencies among models to reliably improve transferability.

The remainder of this paper is organized as follows: Section 2 summarizes the related work. Section 3 describes RGEA in terms of motivation, details, and theoretical analysis. Section 4 reports the experimental results and comparisons. Section 5 gives some concluding remarks.

2. Related work

In this paper, we choose the more challenging targeted attack and consider the widely studied L_\infty-norm perturbation constraint, \|\cdot\|_\infty.

2.1. Optimization model of adversarial attacks

Suppose {F_i(x) | i = 1, ⋯, m} is a set of pre-trained deep facial recognition models with the corresponding loss functions:

\mathrm{Loss}_i(x, x^{target}) = \cos\left(F_i(x), F_i(x^{target})\right),    (1)

where x is the input facial image, x^{target} is the corresponding target facial image, {F_i(x) | i = 1, ⋯, k} are the surrogate models being attacked, and {F_i(x) | i = k+1, ⋯, m} are the unknown black-box models being tested.

For an arbitrary facial example x and a target face x^{target}, we find the corresponding adversarial example x^{adv}, i.e., one that maximizes the objective \sum_{i=1}^{k} \mathrm{Loss}_i(x^{adv}, x^{target}) on the surrogate models while keeping the ε imperceptibility constraint, as the following constrained optimization problem:

x^{adv} = \arg\max_{x^{adv}:\, \|x^{adv} - x\|_\infty \le \epsilon} \ \sum_{i=1}^{k} \mathrm{Loss}_i(x^{adv}, x^{target})    (2)

2.2. The gradient-based methods

In this section, we introduce a series of gradient-based methods that have been developed to improve transferability. The Iterative Fast Gradient Sign Method (I-FGSM) (Kurakin et al., 2018b; Madry et al., 2018) serves as the backbone of the L_\infty-bounded gradient methods. Given an input x and the corresponding target x^{target}, this iterative approach computes the perturbed output x_T by applying T steps of the following update (with x_0 = x):

x_{t+1} = \Pi_{B_\infty(x, \epsilon)}\left(x_t + \alpha s_t\right), \quad s_t = \Pi_{\partial B_\infty(0, 1)}\left(\nabla_{x_t} \mathrm{Loss}(x_t, x^{target})\right),    (3)

where \Pi_S is the projection onto the set S, B_\infty(x, \epsilon) is the L_\infty ball of radius ε around x, α is the step size, \partial U is the boundary of a set U, and s_t is the maximum inner-product projection of the gradient \nabla_{x_t}\mathrm{Loss}(x_t, x^{target}) at x_t onto the unit L_\infty ball.

Since Goodfellow et al. (2014) propose that DNNs have linear properties, s_t can be interpreted as the maximum inner-product projection of \nabla_{x_t}\mathrm{Loss}(x_t, x^{target}) in the local region, which enhances the attack ability of adversarial samples after a finite number of iterations. Note that, in the case of the L_\infty norm, we can convert Equation (3) to the following form:

x_{t+1} = \mathrm{Clip}^{\epsilon}_{x}\left(x_t + \alpha \cdot \mathrm{sign}\left(\nabla_{x_t}\mathrm{Loss}(x_t, x^{target})\right)\right).    (4)
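
To make the update in Equation (4) concrete, the following is a minimal PyTorch-style sketch of one targeted I-FGSM step using the cosine loss of Equation (1); the function and variable names are illustrative, and a pixel range of [0, 255] is assumed rather than taken from any released code.

```python
import torch
import torch.nn.functional as F

def ifgsm_step(model, x_adv, x_orig, x_target, eps=8.0, alpha=0.6):
    """One targeted I-FGSM step under the L_inf constraint (Equation 4)."""
    x_adv = x_adv.clone().detach().requires_grad_(True)
    # Cosine similarity between the embeddings of the adversarial and target faces (Equation 1).
    loss = F.cosine_similarity(model(x_adv), model(x_target)).sum()
    grad = torch.autograd.grad(loss, x_adv)[0]
    # Gradient ascent on the loss, then projection back into the eps-ball around the clean image.
    x_next = x_adv.detach() + alpha * grad.sign()
    x_next = x_orig + (x_next - x_orig).clamp(-eps, eps)
    return x_next.clamp(0, 255)
```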

Momentum Iterative Method (MIM) (Dong et al., 2018) improves the transferability by introducing momentum terms in the attack process, given as:

m_t = \mu \cdot m_{t-1} + \frac{\nabla_{x_t}\mathrm{Loss}(x_t, x^{target})}{\|\nabla_{x_t}\mathrm{Loss}(x_t, x^{target})\|_1}, \quad x_{t+1} = \mathrm{Clip}^{\epsilon}_{x}\left(x_t + \alpha \cdot \mathrm{sign}(m_t)\right),    (5)

where m_t denotes the accumulated gradient at the t-th iteration, and μ is the decay factor.
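
Similarly, a minimal sketch of the MI-FGSM update in Equation (5) is given below, taking the loss gradient (computed as in the I-FGSM sketch above) as input; names and defaults are illustrative.

```python
import torch

def mim_update(grad, momentum, x_adv, x_orig, mu=1.0, alpha=0.6, eps=8.0):
    """One MI-FGSM update (Equation 5): accumulate the L1-normalized gradient, then step."""
    momentum = mu * momentum + grad / grad.abs().sum()
    x_next = x_adv + alpha * momentum.sign()
    x_next = x_orig + (x_next - x_orig).clamp(-eps, eps)
    return x_next.clamp(0, 255), momentum
```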

The Nesterov Iterative Method (NIM) (Lin et al., 2019) integrates the Nesterov accelerated gradient into I-FGSM-based attacks to alleviate the sensitivity of the momentum method when the current gradient deviates greatly from the momentum direction, further increasing the transferability of adversarial examples:

x_t^{nes} = x_t + \alpha \cdot \mu \cdot m_{t-1}, \quad m_t = \mu \cdot m_{t-1} + \frac{\nabla_{x_t^{nes}}\mathrm{Loss}(x_t^{nes}, x^{target})}{\|\nabla_{x_t^{nes}}\mathrm{Loss}(x_t^{nes}, x^{target})\|_1}, \quad x_{t+1} = \mathrm{Clip}^{\epsilon}_{x}\left(x_t + \alpha \cdot \mathrm{sign}(m_t)\right).    (6)

The Scale-Invariant Method (SIM) (Lin et al., 2019) uses scaled copies of the input image to further enhance transferability, although it requires considerably more computation and running time. It is given as:

x_{t+1} = \mathrm{Clip}^{\epsilon}_{x}\left(x_t + \alpha \cdot \mathrm{sign}\left(\frac{1}{m}\sum_{i=1}^{m}\nabla_{x_t}\mathrm{Loss}\left(x_t / 2^i, x^{target}\right)\right)\right).    (7)
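
The scale-invariant average of Equation (7) can be sketched as follows, assuming the scaled copies x_t/2^i and the cosine loss of Equation (1); the returned gradient then replaces the plain gradient in the I-FGSM update.

```python
import torch
import torch.nn.functional as F

def sim_gradient(model, x_adv, x_target, m=4):
    """Average gradient over m scaled copies x_t / 2^i (Equation 7)."""
    x_adv = x_adv.clone().detach().requires_grad_(True)
    loss = sum(F.cosine_similarity(model(x_adv / 2 ** i), model(x_target)).sum()
               for i in range(1, m + 1))
    return torch.autograd.grad(loss, x_adv)[0] / m
```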

The Diversity Input Method (DIM) (Xie et al., 2019) applies random resizing and padding to adversarial examples with probability p in each iteration to further improve the transferability of the adversarial examples, given as:

x_{t+1} = \mathrm{Clip}^{\epsilon}_{x}\left(x_t + \alpha \cdot \mathrm{sign}\left(\nabla_{x_t}\mathrm{Loss}\left(T(x_t, p), x^{target}\right)\right)\right),    (8)

where T(·, p) indicates that the input transformation is performed with probability p.
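
A possible form of the transformation T(·, p) is sketched below: with probability p, the input is randomly resized and then zero-padded to a fixed output size. The concrete sizes are illustrative assumptions; the paper does not specify them for the face setting.

```python
import torch
import torch.nn.functional as F

def diverse_input(x, p=0.8, out_size=130):
    """DIM transformation T(x, p): with probability p, randomly resize then zero-pad (Equation 8)."""
    if torch.rand(1).item() >= p:
        return x
    h = x.shape[-1]
    new_size = int(torch.randint(h, out_size, (1,)))          # random size in [h, out_size)
    resized = F.interpolate(x, size=(new_size, new_size), mode="nearest")
    pad_total = out_size - new_size
    pad_left = int(torch.randint(0, pad_total + 1, (1,)))
    pad_top = int(torch.randint(0, pad_total + 1, (1,)))
    return F.pad(resized, (pad_left, pad_total - pad_left, pad_top, pad_total - pad_top))
```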

Translation-Invariant Method (TIM) (Dong et al., 2019) generates adversarial examples that are insensitive to the discriminative region of the surrogate models by translation invariance and uses predefined convolution kernels instead of translation operations to improve efficiency, given as:

x_{t+1} = \mathrm{Clip}^{\epsilon}_{x}\left(x_t + \alpha \cdot \mathrm{sign}\left(W \otimes \nabla_{x_t}\mathrm{Loss}(x_t, x^{target})\right)\right),    (9)

where W is the pre-defined convolution kernel and ⊗ represents the convolution operation.
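
Equation (9) amounts to smoothing the gradient with a fixed kernel before taking the sign; a sketch using a depthwise Gaussian convolution follows. The 7 × 7 kernel size matches the experimental setting in Section 4.1.5, while the Gaussian width sigma is an illustrative assumption.

```python
import torch
import torch.nn.functional as F

def ti_smooth(grad, kernel_size=7, sigma=3.0):
    """TIM (Equation 9): convolve the gradient with a pre-defined Gaussian kernel W."""
    channels = grad.shape[1]
    ax = torch.arange(kernel_size, dtype=torch.float32) - (kernel_size - 1) / 2
    g1d = torch.exp(-(ax ** 2) / (2 * sigma ** 2))
    kernel = torch.outer(g1d, g1d)
    kernel = (kernel / kernel.sum()).view(1, 1, kernel_size, kernel_size).repeat(channels, 1, 1, 1)
    # Depthwise convolution: each gradient channel is smoothed independently.
    return F.conv2d(grad, kernel, padding=kernel_size // 2, groups=channels)
```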

In conclusion, current approaches analogize transferability to generalizability and transplant strategies for enhancing generalizability into gradient-based attack methods. However, they ignore the complex relationships among models, which limits the transferability of the generated adversarial samples. Inspired by multi-relational graphs, we find optimal descent directions by solving sub-optimization problems based on relational graphs in iterative attacks, further improving transferability.

2.3. Graph-based modeling

Notably, several recent works use graphs to capture and extract dependencies between entities, enhancing the performance of existing models and algorithms. For example, in traffic prediction, ST-GCN (Yu et al., 2018) and ST-MGCN (Geng et al., 2019) propose to model the dependencies among regions by using graph convolutional networks, and then combine these dependencies to further enhance model prediction. In anomaly detection, SCAN (Xu et al., 2007) captures and extracts the dependencies among entities through graphs, and then detects anomalous entities by looking for anomalous dependencies among them. Likewise, in multi-task learning, ML-GCN (Chen et al., 2019) proposes capturing and exploring dependencies among multiple labels based on graphs, and combining these dependencies to improve recognition performance. In addition, RMTL (Cao et al., 2022) proposes capturing data-data and task-task dependencies by building knowledge graphs that connect data points and tasks, and combining these dependencies to achieve more accurate predictions for new tasks. Motivated by these works, we propose the Relational Graphs Ensemble Attack (RGEA), which combines the dependencies among models to facilitate the exploitation of multiple models; this has not been explored in existing attack methods.

3. Methodology

In Section 2.2, we systematically introduced the existing gradient-based methods. Remarkably, combining several methods can further improve the transferability of adversarial samples. Thus, we give the current general attack framework in Figure 1A by combining multi-model ensemble attacks with advanced gradient methods and input transformations. In this framework, RGEA focuses on the second step, as shown in Figure 1B: we first construct a multi-model graph, then extract the model dependency matrix from the constructed graph, and finally find the final descent direction by constructing an optimization problem based on the dependency matrix.

Figure 1. (A) An attack framework that combines multi-model ensemble attacks with existing attack methods which are advanced gradient optimization techniques and input transformation techniques. (B) The overall RGEA calculation process, where the set of surrogate models is {Fi(x)|i = 1, ⋯ , 6}.

Given that our proposed approach is for multi-model ensemble attacks, we first discuss the problems of ensemble attack, then introduce our motivation and elaborate on the proposed RGEA algorithm, and finally provide a comprehensive theoretical analysis of the RGEA algorithm.

3.1. The problem of ensemble attack

Intuitively, an adversarial example is more likely to transfer its attack capability to other models if it remains adversarial to multiple models. Based on this insight, acquiring and utilizing multiple models is the key to obtaining better transferability. For exploiting multiple models, earlier studies (Dong et al., 2018; Che et al., 2020) focused on uniformly fusing the outputs of different layers of DNNs. For example, Liu et al. (2016) pioneered the study of multi-model ensemble attacks and proposed a uniform fusion loss, Dong et al. (2018) proposed two uniform fusion strategies for the logit or probabilistic outputs of models, and Che et al. (2020) proposed uniformly fusing intermediate features. However, these approaches ignore inter-model differences; therefore, He et al. (2022) proposed a gradient-normalized ensemble attack to address inter-model differences in gradient magnitude, and Xiong et al. (2022) proposed a stochastic variance reduced ensemble (SVRE) attack to tackle inter-model gradient variance.

Nevertheless, differences among models reveal the complex dependencies only partially, and we find that the dependencies among deep facial recognition models are more complex than those of classification models. Specifically, Figure 2 shows that the gradients of multiple deep facial recognition models have larger and more complex cosine similarities compared to classification models. In addition, there is a complex inherent similarity between different subsets of surrogate models, as shown in Figure 2B, where the similarity among the top six models is much greater than the others. In conclusion, existing approaches only one-sidedly consider the differences among models and ignore the complex dependencies behind them. This leads to unbalanced and inadequate attacks on multiple models, which limits the transferability of black-box attacks. This inspired us to exploit the complex dependencies among models to improve the transferability of the generated adversarial samples.

Figure 2. The cosine similarity of the gradients of the sampled photos on the different models is depicted in the figure, where (A) is the visualization result of multiple classification models, and (B) is the visualization result of multiple deep facial recognition models.

In essence, existing approaches fail to exploit complex dependencies among models, which stems from their treating multiple models as one complex multivariate model and using single-objective optimization. For this reason, this paper redefines the multi-model ensemble attack as multi-objective optimization, defined as follows:

\arg\max_{x^{adv}} \left\{J_i(x^{adv})\right\}_{i=1,\dots,k}, \quad \text{s.t. } D(x, x^{adv}) \le \epsilon,    (10)

where J_i(x^{adv}) ∈ R is a continuous function measuring the attack strength of the adversarial sample against the model F_i, usually chosen as the loss function \mathrm{Loss}_i(x^{adv}, x^{target}); D(x, x^{adv}) ∈ R is a continuous function that measures the amount of perturbation in the sample, usually chosen as the L_p norm, i.e., \|x - x^{adv}\|_p; and ε is the maximum disturbance distance.

Inspired by MGDA, we perform a theoretical analysis based on the first-order Taylor approximation. Assume that x_n is the adversarial sample at the n-th iteration, g_n^* is the final descent direction, g_n^i (i = 1, ⋯, k) is the gradient of the objective function J_i(x) computed at x_n, and α is the iteration step size. Under these assumptions, we have x_{n+1} = x_n + \alpha g_n^* and write \Delta J_i(x_n) = J_i(x_{n+1}) - J_i(x_n). When the step size is small enough, the following approximate transformation is available:

\begin{pmatrix} \Delta J_1(x_n) \\ \vdots \\ \Delta J_k(x_n) \end{pmatrix} \approx \begin{pmatrix} \alpha \langle g_n^1, g_n^* \rangle \\ \vdots \\ \alpha \langle g_n^k, g_n^* \rangle \end{pmatrix} = \alpha \|g_n^*\|_2 \begin{pmatrix} \|g_n^1\|_2 \cos(g_n^1, g_n^*) \\ \vdots \\ \|g_n^k\|_2 \cos(g_n^k, g_n^*) \end{pmatrix},    (11)

where \|g_n^1\|_2, ⋯, \|g_n^k\|_2 are determined and \|g_n^*\|_2 and α are constants, so the descent effect of each objective in each iteration is primarily governed by \cos(g_n^1, g_n^*), ⋯, \cos(g_n^k, g_n^*).

In summary, we can eliminate the problem of unbalanced and inadequate attacks by finding a final descent direction for which \cos(g_n^1, g_n^*), ⋯, \cos(g_n^k, g_n^*) are as equal and as large as possible. We translate this idea into solving the following optimization problem:

\arg\min_{g} \|g\|_2, \quad \text{s.t. } g \in \left\{g \,\middle|\, \arg\min_{g} \|Gg - e\|,\ e = (1, \dots, 1)^T \right\},    (12)

where G = \left(g_n^1/\|g_n^1\|_2, \dots, g_n^k/\|g_n^k\|_2\right)^T \in R^{k \times N} is the Jacobian matrix of the normalized gradients, k is the number of models, and N is the image dimension. The optimization problem in Equation (12) will be referred to as Optimization problem 1 in the discussion that follows.

3.2. Relational graphs ensemble adversarial attack

Optimization problem 1 is built in the high-dimensional image space, which makes it seriously time-consuming to solve. However, the number of surrogate models employed is much smaller than the image dimension, since loading the models requires a large number of computational resources. This stimulates us to explore building a new optimization problem based on the dependencies among the models.

In this paper, we use a graph to extract the dependencies among models, which is a flexible way to capture the topology of the model space. Specifically, we represent each node of the graph as an embedding vector of the corresponding model. For better theoretical analysis, we simplify the vector representation and define the embedding vector of model F_i at x_n to be g_n^i/\|g_n^i\|_2. The dependencies among the models are then expressed in the following form:

\langle F_i, F_j \rangle_{x_n} = \cos(g_n^i, g_n^j).    (13)

Regarding the characteristics of the solution to Optimization problem 1, we have the following proposition.

Proposition 1. If g_n^* is the solution of Optimization problem 1, then there exists w^* such that g_n^* = G^T w^*.

Proof. Suppose d_1, ⋯, d_r are the orthogonal bases of the subspace spanned by g_n^1, ⋯, g_n^k, and d_1, ⋯, d_r, d_{r+1}, ⋯, d_N together form an orthogonal basis of R^N. Therefore, there exist w_1^* ∈ R^r and w_2^* ∈ R^{N-r} with g_n^* = (d_1, \dots, d_r) w_1^* + (d_{r+1}, \dots, d_N) w_2^*, such that the following equation holds:

G g_n^* = G (d_1, \dots, d_r) w_1^*,    (14)

Furthermore, Optimization problem 1 requires \|g\|_2 to be as small as possible, so it follows that g_n^* = (d_1, \dots, d_r) w_1^*. From this, there exists w^* such that g_n^* = G^T w^*.

By this proposition, we can write g_n^* as a linear combination of the model embedding vectors, i.e., g_n^* = G^T w_n, where w_n is the weight vector of the linear combination, and we can transform Equation (11) as follows:

\begin{pmatrix} \Delta J_1(x_n) \\ \vdots \\ \Delta J_k(x_n) \end{pmatrix} \approx \begin{pmatrix} \alpha \langle g_n^1, g_n^* \rangle \\ \vdots \\ \alpha \langle g_n^k, g_n^* \rangle \end{pmatrix} = \alpha \begin{pmatrix} \langle F_1, F_1 \rangle_{x_n} & \cdots & \langle F_1, F_k \rangle_{x_n} \\ \vdots & \ddots & \vdots \\ \langle F_k, F_1 \rangle_{x_n} & \cdots & \langle F_k, F_k \rangle_{x_n} \end{pmatrix}_{k \times k} \begin{pmatrix} \|g_n^1\|_2 w_n^1 \\ \vdots \\ \|g_n^k\|_2 w_n^k \end{pmatrix},    (15)

further, combining Equations (11) and (15) and simplifying yields the following equation:

\begin{pmatrix} \cos(g_n^1, g_n^*) \\ \vdots \\ \cos(g_n^k, g_n^*) \end{pmatrix} = \frac{1}{\|g_n^*\|_2} \begin{pmatrix} \langle F_1, F_1 \rangle_{x_n} & \cdots & \langle F_1, F_k \rangle_{x_n} \\ \vdots & \ddots & \vdots \\ \langle F_k, F_1 \rangle_{x_n} & \cdots & \langle F_k, F_k \rangle_{x_n} \end{pmatrix}_{k \times k} \begin{pmatrix} w_n^1 \\ \vdots \\ w_n^k \end{pmatrix},    (16)

hence, we establish the connection between \cos(g_n^1, g_n^*), ⋯, \cos(g_n^k, g_n^*) and the dependencies among models. With this connection, we can transform Optimization problem 1 as follows; the proof of the equivalence of this transformation is given in Section 3.3.

\arg\min_{w} \|A w - e\|_2, \quad \text{where } A = G G^T = \begin{pmatrix} \langle F_1, F_1 \rangle_{x_n} & \cdots & \langle F_1, F_k \rangle_{x_n} \\ \vdots & \ddots & \vdots \\ \langle F_k, F_1 \rangle_{x_n} & \cdots & \langle F_k, F_k \rangle_{x_n} \end{pmatrix}_{k \times k}, \quad e = (1, \dots, 1)^T,    (17)

where A is the multi-relation matrix obtained from the multi-model graph, and the final descent direction g_n^* equals G^T w_n^*, where w_n^* is the solution of the optimization problem in Equation (17). The optimization problem in Equation (17) will be referred to as Optimization problem 2 in the discussion that follows.
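
Computationally, Equation (17) reduces to a k × k least-squares problem followed by a projection back through G^T. The following NumPy sketch (illustrative names, not the authors' released code) shows this step; the small check at the end illustrates that the resulting direction forms (near-)equal angles with every normalized model gradient.

```python
import numpy as np

def rgea_direction(grads):
    """Solve Optimization problem 2 (Equation 17) and return the descent direction g* = G^T w*."""
    G = np.stack([g / np.linalg.norm(g) for g in grads])   # k x N, row-normalized gradients
    A = G @ G.T                                            # k x k dependency matrix
    e = np.ones(A.shape[0])
    w, *_ = np.linalg.lstsq(A, e, rcond=None)              # argmin_w ||A w - e||_2
    return G.T @ w

# Toy check: the angles between g* and all normalized gradients are (near) equal.
rng = np.random.default_rng(0)
grads = [rng.standard_normal(512) for _ in range(5)]
g_star = rgea_direction(grads)
print(np.round([np.dot(g / np.linalg.norm(g), g_star) / np.linalg.norm(g_star) for g in grads], 4))
```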

Theoretically, RGEA can be combined with a variety of currently used iterative gradient-based attack strategies; Algorithm 1 describes the integration of RGEA with I-FGSM. Because the descent direction determined by solving Optimization problem 2 already maintains an optimal angle to the multiple objective gradients, in Algorithm 1 we replace the sign operation of the I-FGSM algorithm with the following normalization:

g^* = g^* \cdot \frac{\sqrt{N}}{\|g^*\|_2}, \quad \text{where } g^* \in R^N.    (18)
Algorithm 1. The RGEA-I-FGSM attack algorithm

We perform ablation experiments on this in Section 4.2.2 to investigate the effect of this operation on the transferability of the generated adversarial samples.
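
A minimal PyTorch-style sketch of the RGEA-I-FGSM loop of Algorithm 1 is given below, assuming the cosine loss of Equation (1), pixel values in [0, 255], and illustrative tensor names; it is a sketch of the procedure rather than the exact implementation.

```python
import torch
import torch.nn.functional as F

def rgea_ifgsm(models, x, x_target, eps=8.0, alpha=1.0, steps=20):
    """Sketch of the RGEA-I-FGSM loop (Algorithm 1)."""
    x_adv = x.clone()
    for _ in range(steps):
        x_adv = x_adv.detach().requires_grad_(True)
        grads = []
        for model in models:                                   # per-model gradients of Equation (1)
            loss = F.cosine_similarity(model(x_adv), model(x_target)).sum()
            grads.append(torch.autograd.grad(loss, x_adv)[0].flatten())
        G = torch.stack([g / g.norm() for g in grads])         # k x N
        A = G @ G.T                                            # dependency matrix (Equation 17)
        w = torch.linalg.pinv(A) @ torch.ones_like(A[0])       # Optimization problem 2
        g_star = (G.T @ w).view_as(x_adv)                      # g* = G^T w*
        g_star = g_star * (g_star.numel() ** 0.5) / g_star.norm()   # normalization of Equation (18)
        x_adv = x_adv.detach() + alpha * g_star
        x_adv = (x + (x_adv - x).clamp(-eps, eps)).clamp(0, 255)
    return x_adv.detach()
```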

3.3. Theoretical analysis of RGEA

According to Section 3.2, the core of RGEA is Optimization problem 2, and we next present a detailed theoretical analysis of it. We first demonstrate that Optimization problems 1 and 2 are equivalent, after which we discuss the relationship between MGDA and RGEA.

Continuing with the notation in Section 3.2, we next state the proposition on the equivalence between Optimization problem 2 and Optimization problem 1, and prove it.

Proposition 2. Assume that G ∈ R^{k \times N} is a matrix and A = G G^T ∈ R^{k \times k}. Then, for arbitrary b, Optimization problem 2 is equivalent to Optimization problem 1.

Proof. Let A^* and G^* be the Moore-Penrose generalized inverses of A and G, respectively; then we have the following identities:

G^* G G^* = G^*, \quad G G^* G = G, \quad (G G^*)^T = G G^*, \quad (G^* G)^T = G^* G, \quad A^* = G^{*T} G^*.    (19)

Given that A^* is the Moore-Penrose generalized inverse of A, the general solution to Optimization problem 2 can be formulated as w^* = A^* b + (I - A^* A) y, where y ∈ R^k is arbitrary. The following demonstrates that G^T w^* is a solution to Optimization problem 1, where w^* is the general solution to Optimization problem 2:

G^T w^* = G^T (A^* b + (I - A^* A) y)
        = G^T A^* b + (G^T - G^T A^* A) y
        = G^T G^{*T} G^* b + (G^T - G^T G^{*T} G^* G G^T) y
        = G^* b + (G^T - G^* G G^T) y
        = G^* b + (G^T - G^T G^{*T} G^T) y
        = G^* b + (G^T - G^T) y = G^* b

In summary, for the general solution w^* of Optimization problem 2, we have G^T w^* = G^* b, and G^* b is the unique solution to Optimization problem 1; therefore, solving Optimization problem 1 can be equivalently converted to solving Optimization problem 2.
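
Proposition 2 can also be checked numerically; the NumPy sketch below (illustrative only) compares the minimum-norm solution G^+b of Optimization problem 1 with G^T w^*, where w^* is the least-squares solution of Optimization problem 2.

```python
import numpy as np

rng = np.random.default_rng(1)
k, N = 5, 1000
G = rng.standard_normal((k, N))
b = np.ones(k)

# Optimization problem 1: minimum-norm least-squares solution, i.e., G^+ b.
g_direct = np.linalg.pinv(G) @ b

# Optimization problem 2: solve argmin_w ||A w - b|| with A = G G^T, then map back via G^T.
A = G @ G.T
w = np.linalg.lstsq(A, b, rcond=None)[0]
print(np.allclose(g_direct, G.T @ w))   # True up to numerical precision
```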

To facilitate the discussion of the association between the optimization problem of finding the optimal descent direction in Désidéri (2012) and ours, we first introduce the optimization problem of MGDA in Equation (20) and the definition of a Pareto-stationary point in Definition 1. For the optimization problem of MGDA, we choose standard gradient normalization for better balance, geometric properties, and theoretical analysis. The optimization problem in Equation (20) will be referred to as Optimization problem 3 in the following discussion.

\arg\min_{g} \|g\|_2^2, \quad \text{s.t. } g \in \left\{g \,\middle|\, g = \sum_{i=1}^{k} c_i \frac{g_i}{\|g_i\|_2},\ c_i \ge 0\ (i = 1, \dots, k),\ \sum_{i=1}^{k} c_i = 1 \right\}.    (20)

Definition 1. Let x_0 be the center of an open ball contained in the feasible domain Ω, and let J_i(x), i = 1, ⋯, k, be k smooth objective functions with gradients g_i = \nabla_x J_i(x_0). The point x_0 is said to be Pareto-stationary if there exists a convex combination of the gradient vectors \{g_i\}_{i = 1, \dots, k} that is equal to zero:

\sum_{i=1}^{k} c_i g_i = 0, \quad c_i \ge 0\ (i = 1, \dots, k), \quad \sum_{i=1}^{k} c_i = 1.    (21)
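
For comparison with Optimization problem 2, the sketch below solves Optimization problem 3 with a generic constrained solver (SciPy's SLSQP) rather than the dedicated algorithm of Désidéri (2012); names are illustrative. At a Pareto-stationary point the returned direction vanishes, whereas the least-squares direction of Optimization problem 2 generally does not.

```python
import numpy as np
from scipy.optimize import minimize

def mgda_direction(grads):
    """Optimization problem 3: minimize ||sum_i c_i g_i/||g_i||_2||^2 over the probability simplex."""
    G = np.stack([g / np.linalg.norm(g) for g in grads])   # k x N, row-normalized gradients
    A = G @ G.T
    k = len(grads)
    res = minimize(lambda c: c @ A @ c,                    # ||G^T c||^2 = c^T A c
                   np.full(k, 1.0 / k),
                   bounds=[(0.0, 1.0)] * k,
                   constraints=[{"type": "eq", "fun": lambda c: c.sum() - 1.0}])
    return G.T @ res.x

rng = np.random.default_rng(2)
grads = [rng.standard_normal(512) for _ in range(5)]
print(np.linalg.norm(mgda_direction(grads)))   # vanishes only at a Pareto-stationary point
```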

Under certain conditions, the solution to Optimization problem 3 has the same direction as that of Optimization problem 1; we state and prove this proposition below.

Proposition 3. The solution to Optimization problem 3 has the same direction as the solution to Optimization problem 1, if the solution g^* = \sum_{i=1}^{k} c_i^* \frac{g_i}{\|g_i\|_2} of Optimization problem 3 satisfies c_i^* > 0 (i = 1, \dots, k) and g^* \ne 0.

Proof. Let g^* = \sum_{i=1}^{k} c_i^* \frac{g_i}{\|g_i\|_2} be the solution of Optimization problem 3 with c_i^* > 0 (i = 1, \dots, k) and \sum_{i=1}^{k} c_i^* = 1. Since c_i^* > 0 (i = 1, \dots, k), none of the inequality constraints is saturated. Consequently, g^* is also an optimal solution to the following optimization problem:

\arg\min_{g} \|g\|_2^2, \quad \text{s.t. } g \in \left\{g \,\middle|\, g = \sum_{i=1}^{k} c_i \frac{g_i}{\|g_i\|_2},\ \sum_{i=1}^{k} c_i = 1 \right\}.    (22)

Consequently, we use the vector c ∈ R^k as the finite-dimensional variable and write c^* = (c_1^*, \dots, c_k^*)^T. The Lagrangian is L(c, \lambda) = \left\| \left(\frac{g_1}{\|g_1\|_2}, \dots, \frac{g_k}{\|g_k\|_2}\right) c \right\|_2^2 + \lambda \left(\sum_{i=1}^{k} c_i - 1\right), and for every index i we calculate the partial derivative with respect to c_i:

\frac{\partial L}{\partial c_i} = \frac{\partial \left\| \left(\frac{g_1}{\|g_1\|_2}, \dots, \frac{g_k}{\|g_k\|_2}\right) c \right\|_2^2}{\partial c_i} + \lambda = 2 \left\langle \frac{g_i}{\|g_i\|_2}, \sum_{j=1}^{k} \frac{g_j}{\|g_j\|_2} c_j \right\rangle + \lambda.    (23)

Since the optimality conditions \frac{\partial L}{\partial c_i} = 0 and \frac{\partial L}{\partial \lambda} = 0 are satisfied at the vector c^*, we have \langle g_i, g^* \rangle = -\frac{\lambda}{2} \|g_i\|_2 (i = 1, \dots, k), which gives the following equations:

\begin{pmatrix} \left\langle \frac{g_1}{\|g_1\|_2}, g^* \right\rangle \\ \vdots \\ \left\langle \frac{g_k}{\|g_k\|_2}, g^* \right\rangle \end{pmatrix} = -\frac{\lambda}{2} \begin{pmatrix} 1 \\ \vdots \\ 1 \end{pmatrix}    (24)

From Lemma 2.1 in Désidéri (2012), for all i = 1, ⋯, k we have \left\langle \frac{g_i}{\|g_i\|_2}, g^* \right\rangle \ge \|g^*\|_2^2; therefore -\frac{\lambda}{2} > 0. Since c^{**} = -\frac{2}{\lambda} c^* is an optimal solution to Optimization problem 2, it is proved that the solution to Optimization problem 3 has the same direction as the solution to Optimization problem 1.

Proposition 3 illustrates that Optimization Problem 2 is capable of determining the common descent direction for multiple objectives, effectively resolving the issue of insufficient attacks on multiple models. Additionally, we offer a simple geometric explanation for MGDA.

Previous papers (Chen et al., 2021; Zhao et al., 2021) argue that more iterative attacks and larger perturbations can effectively improve the transferability of adversarial samples. Likewise, studies such as Dong et al. (2018) and Wang and He (2021) propose to increase perturbations by stabilizing the update direction, effectively improving transferability. From the definition of the Pareto-stationary point and Optimization problem 3, the MGDA algorithm stops at a Pareto-stationary point, which makes it inapplicable to transferable black-box attacks. Conversely, at a Pareto-stationary point our approach can still attack along a direction that is jointly effective against multiple objectives.

4. Experiments

4.1. Experimental setup

4.1.1. Dataset

We conducted experiments on the LFW (Huang et al., 2007) dataset, which is the most widely used benchmark for image face verification, contains 13,233 face images from 5,749 different individuals, and includes faces with a variety of poses, expressions, and illuminations. The unrestricted external data protocol with labels includes 6,000 pairs of faces, of which 3,000 pairs have the same identity and 3,000 pairs have different identities. We performed targeted transferability attacks on the 3,000 pairs that have different identities.

4.1.2. Facial recognition models

For our experiments, we used five surrogate models for the ensemble attack, where the pre-trained parameters of the models were obtained from the open-source library of Wang Q. et al. (2021). The number of unknown black-box models tested was twenty, with ten normally trained models and ten defensive models, whose pre-trained parameters were obtained from the open-source library of RobFR (Yang et al., 2020). We tabulate the full model information in Table 1, which includes the name of each model, as well as the best discrimination threshold (Threshold) on the LFW dataset and the corresponding accuracy (Acc). Among all the defense models, the top three adopt defense transform methods (BR, Xu et al., 2017; RP, Xie et al., 2018; JPEG, Dziugaite et al., 2016), and the remaining seven are adversarially trained models.

Table 1. The models' information includes sequence number, name, the best discriminant threshold (threshold), and the accuracy (Acc) corresponding to the threshold on the LFW dataset.

4.1.3. Baselines

The baseline methods for comparison are the extensively studied gradient-based attack methods, including I-FGSM (Madry et al., 2018), TI-FGSM (Dong et al., 2019), DI-FGSM (Xie et al., 2019), DI-TI-MI-FGSM, and SI-DI-FGSM (Lin et al., 2019). For the ensemble strategy, we used SVRE and Ens, where Ens is a uniform fusion of the losses from different models.

4.1.4. Evaluation metric

Given an attack method Aϵ(x,xtarget) under the Linf norm, an adversarial example xadv=Aϵ(x,xtarget) is generated for the input x and the target image xtarget. Following the previous face recognition (Wang Q. et al., 2021) and targeted attack settings (Yang et al., 2020; Zhao et al., 2021), the evaluation metric is the attack success rate (Asr) on the face recognition model Fi(x), defined as follows.

\mathrm{Asr}(F_i, \mathcal{A}_\epsilon) = \frac{1}{N} \sum_{j=1}^{N} \mathbb{I}\left(\cos\left(F_i(x_j^{target}), F_i(x_j^{adv})\right) > \delta_i\right), \quad x_j^{adv} = \mathcal{A}_\epsilon(x_j, x_j^{target}),    (25)

where \{(x_j, x_j^{target})\}_{j=1}^{N} is the paired test set, \mathbb{I}(\cdot) is the indicator function, and \delta_i is the best threshold corresponding to F_i.
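
Computationally, Equation (25) is a thresholded average of cosine similarities; a small sketch with made-up numbers (not results from the paper) follows.

```python
import numpy as np

def attack_success_rate(cos_scores, threshold):
    """Equation (25): fraction of test pairs whose adversarial-target cosine similarity
    exceeds the model's best verification threshold delta_i."""
    return float(np.mean(np.asarray(cos_scores) > threshold))

# Illustrative numbers only, not results from the paper.
print(attack_success_rate([0.41, 0.18, 0.35, 0.02], threshold=0.28))   # 0.5
```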

4.1.5. Hyper-parameters

Following previous work (Dong et al., 2018; Lin et al., 2019; Xie et al., 2019; Yang et al., 2020), we set the maximum perturbation to ε = 8 with pixel values in [0, 255], the number of iterations to 20, and the step size to α = 0.6. For MIM, we set the decay factor μ = 1. For TIM, we used a Gaussian kernel of size 7 × 7. For DIM, the transformation probability p was set to 0.8. For SIM, we set the number of copies m = 4. For SVRE, we set the internal update frequency M to four times the number of ensemble models, the internal step size β to 0.6, and the internal decay factor μ2 to 1.0. For RGEA, we set the step size α = 1, since our method removes the sign function, resulting in much smaller perturbations at each iteration than Ens. For fairness, we also report the perturbation distance l_2(x) = \|x\|_2 / \sqrt{d}, x ∈ R^d, of RobFR (Yang et al., 2020) in Table 2.

Table 2. Perturbation distance (l2) for the adversarial examples and success rate (%) of attacks on the normal model for Ens, SVRE, Ens-l2 and RGEA, where the adversarial examples were all crafted on the surrogate models.

4.2. Ablation study

The ablation study is conducted to verify the benefits of RGEA and to determine the effects of key parameters. Specifically, we first tested the effect of parameters in existing methods on RGEA and then investigated the effectiveness of Optimization Problem 2.

4.2.1. Effect of parameters

In this section, we explore the impact of p in DI and μ in MI for RGEA using five normally trained models (M-6, M-7, M-8, M-9, and M-10).

On the decay factor μ in RGEA-MI-FGSM: we investigate the influence of the decay factor μ of RGEA-MI-FGSM on the success rate. We combine the MI-FGSM attack with RGEA, and the decay factor μ runs from 0 to 1 with a granularity of 0.1. RGEA-MI-FGSM degrades to the RGEA-I-FGSM attack if μ = 0. Figure 3A displays the success rates on several networks and their average values. In contrast to the experimental results in Dong et al. (2018), we observe that the attack success rate of RGEA-MI-FGSM increases as μ rises and reaches a maximum around μ = 0.4, after which the success rate decreases significantly. It is possible that a too-large μ destroys the optimal descent direction of the current iteration, resulting in a significant decrease in transferability. In the following experiments, we set μ = 0.4 for RGEA.

Figure 3. Success rate of RGEA-MI-FGSM (A) and RGEA-DI-FGSM (B) when the corresponding parameters are changed.

On the probability p in RGEA-DI-FGSM: We then study the influence of the transformation probability p on the success rates under black-box settings. The transformation probability p varies from 0 to 1 and RGEA-DI-FGSM degrades to RGEA-I-FGSM if p = 0. We show the success rates on various networks and their average values in Figure 3B. Similar to the experimental results in the paper (Xie et al., 2019), the black-box success rate of RGEA-DI-FGSM is higher as p increases. Furthermore, the black-box success rate can be significantly increased if p is small, i.e., if only a small number of transformed inputs are used. In the following experiments, we will set p = 0.8 for RGEA.

4.2.2. Effect of optimization problem 2

In this section, ablation experiments are conducted to evaluate the efficacy of Optimization problem 2. Specifically, we drop Optimization problem 2 for finding the optimal descent direction in RGEA but keep the normalization of Equation (18), and name this variant Ens-l2. Then, on the normally trained models, we compare the transferability of RGEA and Ens-l2.

The findings in Section 4.3.1 reveal that RGEA outperforms Ens-l2 in all experiments; on DI-FGSM, DI-TI-FGSM, DI-TI-MI-FGSM, and SI-DI-FGSM, the average success rate of RGEA increases by 6.32, 6.06, 6.24, and 3.82%, respectively. The results show that Optimization problem 2 in RGEA can effectively exploit the complex inter-model dependencies and improve transferability.

Furthermore, we found that Ens-l2 outperformed Ens in all experiments. A possible reason is that targeted transferability relies on activating the target's semantic features, which differs from non-targeted transferability, where the activated features are attacked; as a result, targeted transferability attacks need a more precise attack direction in each iteration. Following this observation, we further visualize the adversarial perturbation noise images of Ens and Ens-l2. In Figure 4, we can see that the adversarial perturbation noise image generated by Ens-l2 contains more semantic information about the target face.

Figure 4. The adversarial perturbation noise images for Ens and Ens-l2 with I-FGSM.

4.3. Comparisons with state-of-the-art methods

4.3.1. Attack normally trained models

We first compared the transferability capabilities on normal training models. Specifically, adversarial samples were generated on multiple surrogate models by Ens, SVRE and RGEA combined with various basic methods and then tested for l2 perturbation distance and the attack success on normal training models. The attack performance of the normal training models is displayed in Table 2.

In general, RGEA performed better than Ens and SVRE in almost all experiments. In particular, on DI-FGSM and DI-TI-FGSM, RGEA is higher than Ens by 12.05 and 11.08%, respectively, and it is 1.12% higher than SVRE on TI-DI-MI-FGSM. The disturbances of RGEA are smaller than those of Ens, which indicates that, rather than simply increasing the perturbation, RGEA improves transferability by successfully utilizing the complex dependencies among the models. Furthermore, we find that, compared with Ens, RGEA-I-FGSM combined with DI shows smaller changes in disturbance, which indicates that RGEA can effectively alleviate the instability problems caused by DI.

For SVRE, except for DI-TI-MI-FGSM, it is significantly lower than Ens and RGEA in the targeted transferability black-box attack tests. We attribute this phenomenon to three aspects: 1) SVRE embeds an inner loop in the iterative attack, where the inner loop iteratively selects models at random to compute gradients for internal updating to correct the average gradient of the iterative attack. However, SVRE chooses the inner loop's last update direction as the iterative attack's update direction, and both use the same large update step size. This causes the iterative attack's update direction to lose the local information of the current iteration point, which destroys the effectiveness of the iterative attack method. Meanwhile, with the same normalization operation and step size, the perturbation of SVRE is much smaller than that of Ens, which strongly supports this view. 2) For MI, SVRE also uses the momentum method in the inner loop and updates the iterative attack's momentum using the inner loop's momentum. This skillfully overcomes the ineffective forward-looking of the momentum method and also stabilizes the update direction. This perspective is supported by the fact that, for DI-TI-MI-FGSM, SVRE outperforms Ens by 10.31% with a perturbation of up to 7.25. 3) SVRE only considers non-targeted attacks; however, targeted attacks are more sensitive to the precision of the update direction than non-targeted attacks.

4.3.2. Attack advanced defense models

To more comprehensively verify the effectiveness of RGEA, experiments were conducted on models with defensive capabilities. Specifically, Ens, SVRE and RGEA were combined with various basic methods to generate adversarial samples and then tested on input transformations and adversarially trained defense models.

For the defense models with input transformation, shown in Table 3, the results indicate a significant advantage of RGEA over Ens and SVRE in all tests. In particular, the average attack success rate of RGEA is 11.03% higher than Ens on DI-FGSM and 0.71% higher than SVRE on DI-TI-MI-FGSM. In addition, as the white-box attack results in Table 3 show, RGEA improves the attack performance in most cases. This indicates that RGEA exploits the dependencies among the models without destroying the attack capability.

Table 3. The black-box attack success rates (%) against three models with the defense transform methods (BR, Xu et al., 2017; RP, Xie et al., 2018; and JPEG, Dziugaite et al., 2016).

In Table 4, RGEA exhibits gains in all tests on the adversarially trained models except DI-TI-MI-FGSM. In addition, the transferability success rates of all approaches are below 23%. This suggests that there is still significant room for improvement for RGEA in targeted transferability attacks on adversarially trained models, which deserves further investigation.

Table 4. The black-box attack success rates (%) against seven adversarially trained models.

Additionally, we conduct a comparative analysis referring to Tables 3, 4. The input-transformation defense models exhibited weaker defense than the adversarially trained models. This is probably because input transformation only partially disrupts the adversarial perturbation but does not change the recognition pattern of the model, whereas adversarial training makes the model acquire a recognition pattern completely different from that of a normally trained model.

5. Discussion and conclusion

In this paper, we propose the Relational Graphs Ensemble Attack (RGEA) to enhance the transferability of black-box attacks. Specifically, we find that facial recognition models, compared to classification models, have more complex correlations. This inspired us to exploit the complex dependencies among models to improve the transferability of the generated adversarial samples. To this end, we designed a sub-optimization problem based on a multi-model relationship graph to obtain a more transferable descent direction. Extensive experiments show that RGEA significantly improves the transferability of almost all baseline methods in a black-box environment.

For transferable black-box attacks, we provide a new perspective on enhancing adversarial transferability, i.e., facilitating the transferability of adversarial samples by efficiently extracting the complex dependencies among models with graphs. Additionally, our experimental results show that, for targeted transferability attacks on adversarially trained models, there is still significant room for improvement in RGEA and existing methods. We will continue to develop more effective methods for extracting complex dependencies among models to overcome this challenge in the future.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.

Author contributions

CL: conceptualization, methodology, and software. JP and NJ: validation. JP and CL: formal analysis and writing-original draft preparation. HW: investigation and visualization. ZW and JP: resources, writing-review, editing, and supervision. JP: data curation and project administration. ZW: funding acquisition. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the National Natural Science Foundation of China 11871128, and the Chongqing Education Commission Key Project KJZD-K202114802. Each project has provided relevant equipment and related labor costs for research.

Conflict of interest

NJ and HW were employed by Mashang Consumer Finance Co., Ltd.

The authors declare that this study received funding from Mashang Consumer Finance Co., Ltd. The funder was not involved in the study design, collection, analysis, interpretation of data, the writing of this article or the decision to submit it for publication.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Bojchevski, A., and Günnemann, S. (2019). “Adversarial attacks on node embeddings via graph poisoning,” in Proceedings of the 36th International Conference on Machine Learning, volume 97 of Proceedings of Machine Learning Research, eds K. Chaudhuri and R. Salakhutdinov (New York, NY: PMLR), 695–704

Cao, K., You, J., and Leskovec, J. (2022). “Relational multi-task learning: Modeling relations between data and tasks,” in International Conference on Learning Representations. Available online at: https://openreview.net/

Carlini, N., and Wagner, D. (2017). “Towards evaluating the robustness of neural networks,” in 2017 IEEE Symposium on Security and Privacy (SP) (San Jose, CA: IEEE), 39–57.

Che, Z., Borji, A., Zhai, G., Ling, S., Li, J., and Le Callet, P. (2020). A new ensemble adversarial attack powered by long-term gradient memories. Proc. AAAI Conf. Artif. Intell. 34, 3405–3413. doi: 10.1609/aaai.v34i04.5743

Chen, S., Tao, Q., Ye, Z., and Huang, X. (2021). Going far boosts attack transferability, but do not do it. arXiv preprint arXiv:2102.10343. doi: 10.1109/cvpr.2019.00532

Chen, Z.-M., Wei, X.-S., Wang, P., and Guo, Y. (2019). “Multi-label image recognition with graph convolutional networks,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (Long Beach, CA: IEEE).

Cui, P., Wang, X., Pei, J., and Zhu, W. (2019). A survey on network embedding. IEEE Trans. Knowl. Data Eng. 31, 833–852. doi: 10.1109/TKDE.2018.2849727

Dai, H., Li, H., Tian, T., Huang, X., Wang, L., Zhu, J., et al. (2018). “Adversarial attack on graph structured data,” in Proceedings of the 35th International Conference on Machine Learning, volume 80 of Proceedings of Machine Learning Research, eds J. Dy and A. Krause (New York, NY: PMLR), 1115–1124.

Dai, R., Xu, S., Gu, Q., Ji, C., and Liu, K. (2020). “Hybrid spatio-temporal graph convolutional network: improving traffic prediction with navigation data,” in KDD '20 (New York, NY: Association for Computing Machinery), 3074–3082.

Désidéri, J.-A. (2012). Multiple-gradient descent algorithm (mgda) for multiobjective optimization. Comptes Rendus Mathematique 350, 313–318. doi: 10.1016/j.crma.2012.03.014

Dong, Y., Liao, F., Pang, T., Su, H., Zhu, J., Hu, X., et al. (2018). “Boosting adversarial attacks with momentum,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (Salt Lake City, UT: IEEE).

Dong, Y., Pang, T., Su, H., and Zhu, J. (2019). “Evading defenses to transferable adversarial examples by translation-invariant attacks,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (Piscataway, NJ: IEEE).

Duvenaud, D. K., Maclaurin, D., Iparraguirre, J., Bombarell, R., Hirzel, T., Aspuru-Guzik, A., et al. (2015). “Convolutional networks on graphs for learning molecular fingerprints,” in Advances in Neural Information Processing Systems, Vol. 28, eds C. Cortes, N. Lawrence, D. Lee, M. Sugiyama, and R. Garnett (Cambridge, MA: Curran Associates, Inc.).

Dziugaite, G. K., Ghahramani, Z., and Roy, D. M. (2016). A study of the effect of jpg compression on adversarial images. arXiv preprint arXiv:1608.00853. doi: 10.48550/arXiv.1608.00853

Fan, W., Ma, Y., Li, Q., He, Y., Zhao, E., Tang, J., et al. (2019). “Graph neural networks for social recommendation,” in The World Wide Web Conference, WWW '19 (New York, NY: Association for Computing Machinery), 417–426.

Geng, X., Li, Y., Wang, L., Zhang, L., Yang, Q., Ye, J., et al. (2019). Spatiotemporal multi-graph convolution network for ride-hailing demand forecasting. Proc. AAAI Conf. Artif. Intell. 33, 3656–3663. doi: 10.1609/aaai.v33i01.33013656

Goodfellow, I. J., Shlens, J., and Szegedy, C. (2014). Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572. doi: 10.48550/arXiv.1412.6572

Guo, Y., Li, Q., and Chen, H. (2020). Backpropagating linearly improves transferability of adversarial examples. Adv. Neural Inf. Process. Syst. 33, 85–95. doi: 10.5555/3495724.3495732

Hamilton, W., Ying, Z., and Leskovec, J. (2017). “Inductive representation learning on large graphs,” in Advances in Neural Information Processing Systems, Vol. 30, eds I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett (Cambridge, MA: Curran Associates, Inc.).

He, Z., Wang, W., Dong, J., and Tan, T. (2022). Revisiting ensemble adversarial attack. Signal Process. Image Commun. 107, 116747. doi: 10.1016/j.image.2022.116747

Huang, G. B., Ramesh, M., Berg, T., and Learned-Miller, E. (2007). Labeled faces in the wild: A database for studying face recognition in unconstrained environments. Technical Report 07-49, University of Massachusetts, Amherst.

Kurakin, A., Goodfellow, I., Bengio, S., Dong, Y., Liao, F., Liang, M., et al. (2018a). “Adversarial attacks and defences competition,” in The NIPS'17 Competition: Building Intelligent Systems (Cham: Springer), 195–231.

Kurakin, A., Goodfellow, I. J., and Bengio, S. (2018b). “Adversarial examples in the physical world,” in Artificial Intelligence Safety and Security (London: Chapman and Hall; CRC), 99–112.

Li, Y., Bai, S., Zhou, Y., Xie, C., Zhang, Z., and Yuille, A. (2020). Learning transferable adversarial examples via ghost networks. Proc. AAAI Conf. Artif. Intell. 34, 11458–11465. doi: 10.1609/aaai.v34i07.6810

Lin, J., Song, C., He, K., Wang, L., and Hopcroft, J. E. (2019). “Nesterov accelerated gradient and scale invariance for adversarial attacks,” in International Conference on Learning Representations. Available online at: https://openreview.net/

Liu, Y., Chen, X., Liu, C., and Song, D. (2016). “Delving into transferable adversarial examples and black-box attacks,” in International Conference on Learning Representations. Available online at: https://openreview.net/

Madry, A., Makelov, A., Schmidt, L., Tsipras, D., and Vladu, A. (2018). “Towards deep learning models resistant to adversarial attacks,” in International Conference on Learning Representations. Available online at: https://openreview.net/

Mehrabi, M., Javanmard, A., Rossi, R. A., Rao, A., and Mai, T. (2021). “Fundamental tradeoffs in distributionally adversarial training,” in Proceedings of the 38th International Conference on Machine Learning, volume 139 of Proceedings of Machine Learning Research, eds M. Meila and T. Zhang (New York, NY: PMLR), 7544–7554.

Papernot, N., McDaniel, P., and Goodfellow, I. (2016). Transferability in machine learning: from phenomena to black-box attacks using adversarial samples. arXiv preprint arXiv:1605.07277. doi: 10.48550/arXiv.1605.07277

Paranjape, A., Benson, A. R., and Leskovec, J. (2017). “Motifs in temporal networks,” in Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, WSDM '17 (New York, NY: Association for Computing Machinery), 601–610.

Rauber, J., Zimmermann, R., Bethge, M., and Brendel, W. (2020). Foolbox native: Fast adversarial attacks to benchmark the robustness of machine learning models in pytorch, tensorflow, and jax. J. Open Source Softw. 5, 2607. doi: 10.21105/joss.02607

Scarselli, F., Gori, M., Tsoi, A. C., Hagenbuchner, M., and Monfardini, G. (2009). The graph neural network model. IEEE Trans. Neural Netw. 20, 61–80. doi: 10.1109/TNN.2008.2005605

Szegedy, C., Zaremba, W., Sutskever, I., Bruna, J., Erhan, D., Goodfellow, I., et al. (2013). Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199. doi: 10.48550/arXiv.1312.6199

Tan, Q., Liu, N., Zhao, X., Yang, H., Zhou, J., and Hu, X. (2020). “Learning to hash with graph neural networks for recommender systems,” in Proceedings of the Web Conference 2020, WWW '20 (New York, NY: Association for Computing Machinery), 1988–1998.

Tramer, F., Carlini, N., Brendel, W., and Madry, A. (2020). “On adaptive attacks to adversarial example defenses,” in Advances in Neural Information Processing Systems, Vol. 33, eds H. Larochelle, M. Ranzato, R. Hadsell, M. Balcan, and H. Lin (Curran Associates, Inc.), 1633–1645.

Tu, K., Cui, P., Wang, X., Yu, P. S., and Zhu, W. (2018). “Deep recursive network embedding with regular equivalence,” in Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '18 (New York, NY: Association for Computing Machinery), 2357–2366.

Wang, Q., Zhang, P., Xiong, H., and Zhao, J. (2021). Face.evolve: a high-performance face recognition library. arXiv preprint arXiv:2107.08621. doi: 10.48550/arXiv.2107.08621

Wang, X., and He, K. (2021). “Enhancing the transferability of adversarial attacks through variance tuning,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (Piscataway, NJ: IEEE), 1924–1933.

Wang, X., He, X., Wang, J., and He, K. (2021). “Admix: enhancing the transferability of adversarial attacks,” in Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) (Piscataway, NJ: IEEE), 16158–16167.

Wang, X., Ma, Y., Wang, Y., Jin, W., Wang, X., Tang, J., et al. (2020). “Traffic flow prediction via spatial temporal graph neural network,” in Proceedings of The Web Conference 2020, WWW '20 (New York, NY: Association for Computing Machinery), 1082–1092.

Wu, D., Wang, Y., Xia, S.-T., Bailey, J., and Ma, X. (2019). “Skip connections matter: on the transferability of adversarial examples generated with resnets,” in International Conference on Learning Representations. Available online at: https://openreview.net/

Xie, C., Wang, J., Zhang, Z., Ren, Z., and Yuille, A. (2018). “Mitigating adversarial effects through randomization,” in International Conference on Learning Representations. Available online at: https://openreview.net/

Xie, C., Zhang, Z., Zhou, Y., Bai, S., Wang, J., Ren, Z., et al. (2019). “Improving transferability of adversarial examples with input diversity,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (Piscataway, NJ: IEEE).

Xiong, Y., Lin, J., Zhang, M., Hopcroft, J. E., and He, K. (2022). “Stochastic variance reduced ensemble adversarial attack for boosting the adversarial transferability,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (Piscataway, NJ: IEEE), 14983–14992.

Xu, W., Evans, D., and Qi, Y. (2017). Feature squeezing: Detecting adversarial examples in deep neural networks. arXiv preprint arXiv:1704.01155. doi: 10.14722/ndss.2018.23198

Xu, X., Yuruk, N., Feng, Z., and Schweiger, T. A. J. (2007). “Scan: a structural clustering algorithm for networks,” in Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '07 (New York, NY: Association for Computing Machinery), 824–833.

Yang, X., Yang, D., Dong, Y., Yu, W., Su, H., and Zhu, J. (2020). Delving into the adversarial robustness on face recognition. arXiv preprint arXiv:2007.04118.

Yu, B., Yin, H., and Zhu, Z. (2018). “Spatio-temporal graph convolutional networks: A deep learning framework for traffic forecasting,” in Proceedings of the 27th International Joint Conference on Artificial Intelligence, IJC AI'18 (Palo Alto, CA: AAAI Press), 3634–3640.

Zhao, Z., Liu, Z., and Larson, M. (2021). On success and simplicity: a second look at transferable targeted attacks. Adv. Neural Inf. Process. Syst. 34, 6115–6128. doi: 10.48550/arXiv.2012.11207

Zou, J., Duan, Y., Li, B., Zhang, W., Pan, Y., and Pan, Z. (2022). Making adversarial examples more transferable and indistinguishable. Proc. AAAI Conf. Artif. Intell. 36, 3662–3670. doi: 10.1609/aaai.v36i3.20279

Zügner, D., Akbarnejad, A., and Günnemann, S. (2018). “Adversarial attacks on neural networks for graph data,” in Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2847–2856.

Keywords: multi-model ensemble attack, multi-objective optimization, deep facial recognition, adversarial transferability, graphs

Citation: Pi J, Luo C, Xia F, Jiang N, Wu H and Wu Z (2023) Improving the adversarial transferability with relational graphs ensemble adversarial attack. Front. Neurosci. 16:1094795. doi: 10.3389/fnins.2022.1094795

Received: 10 November 2022; Accepted: 28 December 2022;
Published: 01 February 2023.

Edited by:

Lei Bai, The University of Sydney, Australia

Reviewed by:

Shuhao Shi, PLA Strategy Support Force Information Engineering University, China
Di Yuan, Xidian University, China

Copyright © 2023 Pi, Luo, Xia, Jiang, Wu and Wu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Ning Jiang, ning.jiang02@msxf.com

†These authors have contributed equally to this work and share first authorship
