ORIGINAL RESEARCH article

Front. Artif. Intell., 30 April 2025

Sec. Logic and Reasoning in AI

Volume 8 - 2025 | https://doi.org/10.3389/frai.2025.1535845

Navigating AI ethics: ANN and ANFIS for transparent and accountable project evaluation amidst contesting AI practices and technologies

  • 1Department of Mathematics, Pandit Deendayal Energy University, Gandhinagar, India
  • 2Faculty of Economics and Administrative Sciences, Universidad Católica de la Santísima Concepción, Concepción, Chile
  • 3Unidad Regional Culiacán, Universidad Autónoma de Occidente, Culiacán, Mexico

Introduction: The rapid evolution of Artificial Intelligence (AI) necessitates robust ethical frameworks to ensure responsible project deployment. This study addresses the challenge of quantifying ethical criteria in AI projects amidst contesting communicative practices, organizational structures, and enabling technologies, which shape AI’s societal implications.

Methods: We propose a novel framework integrating Artificial Neural Networks (ANN) and Adaptive Neuro-Fuzzy Inference Systems (ANFIS) to evaluate AI project performance and model ethical uncertainties using Fuzzy logic. A Fuzzy weighted average approach quantifies critical ethical dimensions: transparency, fairness, accountability, privacy, security, explainability, human involvement, and societal impact.

Results: The framework enables a structured assessment of AI projects, enhancing transparency and accountability by mapping ethical criteria to project outcomes. ANN evaluates performance metrics, while ANFIS models uncertainties, providing a comprehensive ethical evaluation under complex conditions.

Discussion: By combining ANN and ANFIS, this study advances the understanding of AI’s ethical dimensions, offering a scalable approach for accountable AI systems. It reframes organizational communication and decision-making, embedding ethics within AI’s technological and structural contexts. This work contributes to responsible AI innovation, fostering trust and societal alignment in AI deployments.

1 Introduction

Artificial Intelligence (AI) increasingly defines our technological landscape, powered by algorithms and machine learning that drive capabilities like image recognition and automated decision-making (Abiodun et al., 2018). From autonomous vehicles to predictive analytics, AI reshapes societal systems (Shrestha et al., 2019), yet its integration sparks ethical scrutiny (Killen et al., 2012). These ethics are contested through communicative practices shaping trust, organizational structures adapting to AI’s influence, and enabling technologies embedding biased values dynamics that challenge responsible deployment (Shrestha et al., 2019; Jobin et al., 2019). Governments globally call for frameworks to navigate this contested terrain, ensuring ethical innovation (Mittelstadt et al., 2016; Floridi et al., 2018).

AI-powered project decision-making raises profound ethical stakes transparency, fairness, accountability, privacy, security, and societal impact (Arifin, 2018; Whittlestone et al., 2019). These criteria are not fixed; they shift amidst contesting AI practices and technologies, complicating alignment with human values (Jobin et al., 2019). For instance, organizational structures can obscure accountability (Shrestha et al., 2019), while enabling technologies amplify fairness debates (Fountaine et al., 2019). This contestability echoed in “Contesting Artificial Intelligence” demands robust evaluation to balance innovation and ethics in AI projects (Killen et al., 2012).

Navigating this complexity requires advanced tools beyond debate. Ethical project evaluation hinges on criteria like explainability, fairness, and human oversight, yet their quantification amidst uncertainty is intricate (Cooper, 2003; Ahmad et al., 2023). Artificial Neural Networks (ANN) and Adaptive Neuro-Fuzzy Inference Systems (ANFIS) address this, with ANN modeling performance (e.g., transparency metrics) and ANFIS handling uncertainty (e.g., fairness trade-offs) via Fuzzy logic (Amirkhani et al., 2015; Jang, 1993). Proven in fields like solar power (Amirkhani et al., 2015) and structural monitoring, these methods enable accountable assessment (Masoudi et al., 2018). Paired with Fuzzy Multi-Criteria Decision Making (Fuzzy MCDM), they integrate contested perspectives cultural, regulatory, and societal offering a nuanced ethical lens (Blanco-Mesa et al., 2017; Hwang and Masud, 2012).

This study leverages ANN and ANFIS to evaluate AI project ethics, focusing on transparency and accountability amidst contested practices (Amirkhani et al., 2015; Abiodunet al., 2018). Building on frameworks from Jobin et al. and Floridi et al., it examines criteria data ethics, algorithmic fairness, explainability, privacy, societal impact across diverse contexts (Cooper, 2003). The research aims to inform ethical guidelines, fostering responsible AI deployment (Broekhuizen et al., 2023). Section 2 reviews AI ethics and ANN/ANFIS roles. Section 3 details the methodology for ethical project evaluation. Section 4 presents findings, and Section 5 explores future directions.

2 Literature review

This section provides a comprehensive review of existing research on AI ethics, focusing on ethical frameworks, privacy and data protection, bias and fairness, transparency, societal impacts, and the role of advanced computational techniques like artificial neural networks (ANNs) and adaptive neuro-fuzzy inference systems (ANFIS) in ethical decision-making. The review establishes the foundation for our methodology and addresses gaps identified in prior studies.

2.1 Ethical AI-powered projects

The rapid proliferation of AI technologies has heightened the need for robust ethical frameworks to guide their development and deployment. Recent studies emphasize privacy, bias, transparency, accountability, and societal impact as critical ethical dimensions (Floridi et al., 2018; Mittelstadt et al., 2016). The IEEE Global Initiative on Ethics of Autonomous and Intelligent Systems (Chatila and Havens, 2019) provides a widely recognized framework, advocating for human-centric design and accountability in AI systems. Similarly, the European Commission’s Ethics Guidelines for Trustworthy AI (Hleg, 2019) outline seven key requirements, including human agency, privacy, and fairness, which have been applied in projects ranging from healthcare diagnostics to autonomous vehicles (Jobin et al., 2019).

Challenges persist across industries. In healthcare, AI-driven diagnostic tools raise concerns about patient consent and data security (Vayena et al., 2018), while in finance, algorithmic trading systems have been criticized for amplifying market biases (Zetzsche et al., 2020). These studies underscore the need for tailored ethical guidelines, a gap our methodology seeks to address by integrating stakeholder input and advanced AI techniques.

2.2 Privacy and data protection in AI

Privacy and data protection are paramount in AI-powered projects, given their reliance on vast datasets. General discussions of privacy (e.g., Dwork, 2008) have evolved into specific algorithmic solutions. Federated Learning (FL), for instance, enables model training across decentralized datasets without sharing raw data, preserving individual privacy. Google’s implementation of FL in Gboard demonstrates its scalability for predictive text while adhering to privacy standards (Hard et al., 2018). Similarly, Differential Privacy (DP) adds noise to datasets to protect individual identities, with applications in census data analysis by the U.S. Census Bureau (Abowd, 2018).

Regulatory frameworks like the General Data Protection Regulation (GDPR) (Sartor and Lagioia, 2020) further shape AI development by mandating data minimization and user consent. A study by Yang et al. (2019) found that GDPR compliance increased the adoption of privacy-preserving techniques in European AI projects by 35% between 2018 and 2020. Our study builds on these approaches, incorporating privacy considerations into the ethical evaluation of AI projects.

2.3 Bias and fairness in AI algorithms

Bias in AI algorithms remains a significant ethical challenge. Research highlights how biased training data can perpetuate discrimination in domains like criminal justice and hiring (Barabas, 2020). Techniques such as fairness-aware machine learning adjust model outputs to minimize disparities across demographic groups (Hard et al., 2018), while algorithmic auditing identifies bias post-deployment. A notable case is IBM’s AI Fairness 360 toolkit, which has been used to audit healthcare models for gender bias (Bellamy et al., 2019).

Our methodology leverages ANFIS to model uncertainty in ethical criteria, potentially enhancing fairness by capturing nuanced stakeholder judgments, an area underexplored in existing fairness studies.

2.4 Transparency and explainability

Transparency and explainability are essential for building trust in AI systems. Explainable AI (XAI) techniques, such as LIME and SHAP (Lundberg and Lee, 2017), provide insights into model decisions, improving user understanding. For example, Darpa’s XAI program has enhanced transparency in military AI applications. Additionally, human-in-the-loop (HITL) systems integrate expert feedback to refine AI outputs, as seen in medical diagnostics (Holzinger et al., 2024).Our use of ANFIS aligns with XAI goals by modeling fuzzy logic-based decisions, offering a transparent alternative to black-box models like deep neural networks.

2.5 Societal impact and stakeholder engagement

Impact assessments predict AI’s societal consequences, guiding ethical decision-making. The AI Now Institute’s cross-sector analysis revealed that participatory approaches, involving citizens and regulators, enhanced equity in urban planning AI projects. Hagendorff (2022) reviewed 20 impact studies, highlighting that cost-benefit analyses often neglect long-term societal costs, such as job displacement. Stakeholder engagement aligns AI with societal values (De Almeida et al., 2021), as demonstrated by a Canadian AI healthcare initiative where patient and clinician involvement reduced deployment resistance by 40% (Morley et al., 2020). Our expert panel approach builds on these insights, systematically integrating diverse perspectives into ethical evaluations.

2.6 Role of ANNs and ANFIS in ethical decision-making

Artificial Neural Networks (ANNs) and Adaptive Neuro-Fuzzy Inference Systems (ANFIS) are powerful tools for ethical decision-making. ANNs excel in pattern recognition and performance estimation. ANFIS, combining neural networks with fuzzy logic, addresses uncertainty, making it ideal for ethical evaluations where criteria are subjective (Jang, 1993). A study applied ANFIS to assess sustainability in construction projects, highlighting its ability to handle qualitative inputs (see Table 1).

Table 1
www.frontiersin.org

Table 1. Compares ANNs and ANFIS with traditional methods like decision trees and rule-based systems.

2.7 Fuzzy logic and MCDM in ethical AI project selection

Fuzzy Logic (FL) and Multi-Criteria Decision-Making (MCDM) provide flexible frameworks for ethical evaluations. FL captures linguistic variables (e.g., “high ethical risk”), as shown in supplier selection studies. MCDM, such as the Analytic Hierarchy Process (AHP), ranks alternatives based on multiple criteria (Saaty, 2008). A hybrid FL-MCDM approach was used by Chen and Huang (2007) to evaluate AI projects in smart cities, integrating expert judgments into quantifiable scores. Our methodology extends this by using fuzzy membership functions to model ethical criteria, enhancing decision transparency.

3 Methodological approach

This study presents a framework where AI-powered project selection is approached as a fuzzy multi-criteria decision-making challenge, incorporating criteria such as Data Collection and Processing, Algorithmic Design and Model Development, Explainability and Interpretability, Fairness and Bias Mitigation, Transparency and Accountability, Privacy and Data Protection, Human Oversight and Intervention, Security and Robustness, Societal Impact and Ethical Considerations, Continuous Improvement and Adaptability, Regulatory Compliance (Floridi et al., 2018; Jobin et al., 2019; Ministry of Electronics and Information Technology, Government of India, 2019). The model comprises four stages. The initial stage involves criteria selection, where expert panels evaluate communicative practices and organizational structures (Fountaine et al., 2019), establishing evaluation criteria (Cooper and Kleinschmidt, 1995). Subsequently, linguistic variables and fuzzy numbers assess each criterion using fuzzy weights and ratings (Dubois and Prade, 1978; Hwang et al., 1981). The next stage consolidates fuzzy numbers using the Fuzzy Weighted Average (FWA) method with a fractional programming approach (Chang and Hung, 2005), with the Decision Index (DI) evaluating project appeal (Relich and Pawlewski, 2017; Poveda-Bautista et al., 2018), involving α-cuts. The final phase prioritizes projects using a fuzzy ranking approach (Chen and Huang, 2007; Killen et al., 2012), categorizing them with Artificial Neural Networks (ANNs) (Abiodun et al., 2018).

In the initial assessment, a panel of experts employs linguistic variables to evaluate the criteria of AI-Powered projects, which can be defined as a Fuzzy set using the Equation 1.

A = x , μ A x | x X     (1)

Here, X represents the universe of discourse, and μ A x denotes the membership function of the Fuzzy set A, which assigns a degree of membership to each element x in the universe X . The membership function μ A x is a process that assigns a specific value from the range to each element of the universe X , where 0 represents complete non-membership and 1 represents complete membership.

A Fuzzy number can be defined as a subset of the real numbers that exhibits fuzziness, serving as an expansion of the notion of a confidence interval. The characteristics of a Fuzzy number A can be precisely outlined using a triangular membership function. Specifically, A is considered convex, and its properties can be defined through the following inequality of Equation 2.

μ A λ x 1 + 1 λ x 2 min μ A x 1 , μ A x 2 where x 1 , x 2 X , λ 0 1     (2)

A Fuzzy number A is considered normal when its height equals 1. The α cut A α of A refers to a specific subset of A that forms a close area, where α ranges from 0 to 1. In the context of Fuzzy numbers, A can be depicted as A = L M U , where L and U represent the lower and upper bounds of the Fuzzy number, respectively, while M signifies the mode or central value of A. Triangular Fuzzy numbers are a specific type of Fuzzy number where the membership function can be expressed as a triangular shape, with values increasing linearly from the left bound to the mode and then decreasing linearly to the right bound using Equation 3.

μ A x = { x L M L , L x M , U x U M , M x U , 0 , otherwise     (3)

This representation allows for a clear visualization of the uncertainty or fuzziness associated with the number. To effectively incorporate the Fuzzy ratings and weights linked with the criteria of the AI-Powered project, the method of Fuzzy Weighted Average (FWA) is utilized. Various techniques have been suggested in research for computing the FWA, with this study opting for the fractional programming approach (Chen and Huang, 2007). The Decision Index (DI) determined according to Equation 4 serves as an informative metric to evaluate the appeal of AI-Powered projects, providing a quantitative measure of their attractiveness within the context of the analysis (Relich and Pawlewski, 2017).

D I = j = 1 N w j r j / j = 1 N w j     (4)

Here, r j represents the Fuzzy rating and w j denotes the Fuzzy weight assigned to each criterion within the project portfolio, where j ranges from 1 to N, covering all the criteria under consideration.

The α cuts of r j and w j can be determined through the following calculation using Equation 5:

r j α = r j α L , r j α U and w j α = w j α L w j α U     (5)

These calculations involve establishing the lower and upper bounds of the α cuts for both the Fuzzy rating and Fuzzy weights. These computations are based on specific assumptions that are considered in the analysis:

Let t = 1 / j = 1 N w j and v j = t w j , by utilizing these transformation, the decision index’s membership function can be defined using the provided Equations 6, 7.

D I α L = min j = 1 N v j r j α L , where t w j α L v n t w j α U , j = 1 N v j = 1 , t 0     (6)

and

D I α U = max j = 1 N v j r j α U where t w j α L v n t w j α U , j = 1 N v j = 1 , t 0     (7)

In order to pinpoint the most promising AI-powered project, employing a ranking method is imperative. This study utilizes the Fuzzy ranking approach following the methodology outlined by Chen and Huang. The calculation for determining the right and left scores for A involves the following steps using Equations 8−11:

S R = sup x μ A R x μ max x     (8)
S L = sup x μ A L x μ min x     (9)

where,

μ A R x = { x , 0 x 1 0 , otherwise     (10)
μ A L x = { 1 x , 0 x 1 0 , otherwise     (11)

To determine the crisp score (CS) of a Fuzzy number A , the following computation is used using Equation 12.

C S = S R + 1 S L / 2     (12)

The final stage of the methodology involves the categorization of AI-powered projects based on their performance across various criteria such as Data Collection and Processing, Algorithmic Design and Model Development, Explainability and Interpretability, Fairness and Bias Mitigation, Transparency and Accountability, Privacy and Data Protection, Human Oversight and Intervention, Security and Robustness, Societal Impact and Ethical Considerations, Continuous Improvement and Adaptability, Regulatory Compliance. These categories are assessed utilizing Artificial Neural Networks (ANNs) to ensure a comprehensive evaluation of project performance.

The adequacy of the linguistic variables selected for assessing the ethical criteria of AI-powered projects is paramount in ensuring the comprehensiveness and effectiveness of the research methodology. These linguistic variables have been meticulously chosen to closely reflect the ethical considerations outlined by the companies under study. Moreover, the inclusion of a spectrum of importance levels, ranging from “Unimportant” to “Very Important,” allows for finespun evaluations and captures the varying degrees of significance associated with each criterion. Importantly, the linguistic variables are designed to be easily interpretable, facilitating consensus-building among project stakeholders and ensuring clarity in decision-making processes. Their adaptability to subjective assessments further enhances the robustness of the research findings, accommodating diverse perspectives and qualitative judgments that may arise in the evaluation of ethical criteria. By adhering to industry standards and best practices, these linguistic variables contribute to the credibility and relevance of the research within the field of ethical assessment methodologies for AI-powered projects (Akinrinola et al., 2024; Deloitte, 2023).

4 Illustrative case

In addition to the comprehensive criteria assessment, contextual insights and industry benchmarks play a pivotal role in evaluating the ethical dimensions of AI-powered projects. Drawing from industry best practices and case studies, the expert panel delves into meticulous considerations to ensure a holistic evaluation. This contextual analysis enriches the understanding of ethical implications within the specific domains of AI application, guiding the subsequent ranking process (Chen and Klein, 1997).

4.1 Utilizing the fuzzy weighted average methodology to prioritize ethical AI-powered projects

Drawing on insights from AI-powered projects, available resources, technological capabilities, organizational strategies, and customer feedback, an expert panel has identified 33 criteria to assess Ethics in AI-Powered projects. These criteria are categorized under Data Collection and Processing (DCP), Algorithmic Design and Model Development (ADMD), Explainability and Interpretability (EI), Fairness and Bias Mitigation (FBM), Transparency and Accountability (TA), Privacy and Data Protection (PDP), Human Oversight and Intervention (HOI), Security and Robustness (SR), Societal Impact and Ethical Considerations (SIEC), Continuous Improvement and Adaptability (CIA), Regulatory Compliance (RC). The corresponding weights for each criterion are detailed in Table 2.

Table 2
www.frontiersin.org

Table 2. Criteria for evaluating ethical AI-powered decision making.

Table 3 includes linguistic terms that correspond to importance weightings, criteria ratings, and their respective Triangular Fuzzy Numbers (TFNs). Evaluating project performance criteria entails referencing analogous past projects and articulating their assessments in linguistic terms. Table 4 delineates linguistic terms tailored to three distinct criteria related to project performance: Human intervention mechanisms (H1), Ethical considerations in decision-making (H2), and Handling morally sensitive situations (H3).

Table 3
www.frontiersin.org

Table 3. Linguistic expressions for relevance weights and criteria ratings.

Table 4
www.frontiersin.org

Table 4. Linguistics terms for project performance criteria.

For each of the 11 prospective AI-powered projects, criteria ratings have been allocated and are outlined in Table 5. Subsequently, the Fuzzy numbers are consolidated utilizing the Fuzzy Weighted Average (FWA) technique to derive the Fuzzy rating for each facet of an AI-Powered project.

Table 5
www.frontiersin.org

Table 5. The fuzzy weighted averages across various facets of AI-powered projects.

In a Fuzzy decision-making model, the weights assigned to criteria are instrumental in assessing their significance within the decision framework. Ranging from “Very Unimportant (VU)” to “Very Important (VI)” in the provided table, these weights signify the varying degrees of importance attributed to each category (Jafarzadeh et al., 2018). Criteria designated as “Very Important (VI)” carry the highest weight, indicating their critical role in shaping the project’s outcome. Following suit, “Important (I)” criteria hold considerable importance, while “Medium (M)” and “Medium Important (MI)” criteria represent moderate significance. Conversely, “Unimportant (U)” and “Medium Unimportant (MU)” criteria have lesser weight, and “Very Unimportant (VU)” criteria are deemed negligible. Through this allocation, each criterion receives appropriate weighting, ensuring a balanced and accurate decision-making process within Fuzzy decision-making models. Additionally, numerical values assigned to linguistic variables delineate the boundaries of triangular Fuzzy numbers (TFNs), aiding in modeling subjective judgments’ uncertainty. Employing the Triangular Fuzzy Number Representation Method, these values adhere to the TFN formula, encapsulating the criteria’s significance through lower bound (L), central value (C), and upper bound (U).

The numerical values assigned to the linguistic variables in the table serve as the boundaries of TFNs, which are employed to model the linguistic terms. These TFNs are instrumental in capturing the inherent uncertainty and imprecision associated with subjective judgments in decision-making processes. In the table, lower bound values range from 0 to 0.167, depending on the linguistic variable. Central Value (C), serving as the midpoint of the TFN, acts as a reference point, representing moderate significance or importance. In the provided table, central values are assigned accordingly, such as 0.333 for “Medium Unimportant (MU)” and 0.5 for “Medium (M).” The upper bound (U) is reflecting the maximum value of the linguistic variable and denotes the highest significance or importance accorded to the criteria. The table assigns upper bound values ranging from 0.5 to 1, depending on the linguistic variable. The TFNs are constructed using these three values according to the formula: TFN = (L, C, U). For instance, consider the linguistic variable “Very Important (VI)” with TFN boundaries (0.833, 1, 1). Here, L = 0.833 (lower bound), C = 1 (central value), U = 1 (upper bound). This TFN indicates that criteria categorized as “Very Important (VI)” possess a high level of significance or importance, with a central value of 1 and a range extending from 0.833 to 1. In essence, this method enables decision-makers to quantitatively represent qualitative judgments using TFNs, facilitating the integration of uncertainty and vagueness into decision-making processes.

The projects under evaluation span across diverse industries benefiting from AI technologies, including healthcare, finance, retail, manufacturing, transportation, energy, education, e-commerce, telecommunications, entertainment, and government. In healthcare, AI enhances patient care through clinical decision support systems, medical imaging analysis, and personalized medicine. In finance, AI aids in fraud detection and risk assessment. The retail benefits from AI are in demand forecasting and personalized recommendations, while manufacturing utilizes it for predictive maintenance and quality control. Across sectors, AI improves efficiency, accuracy, and decision-making. The criteria provided below facilitate ethical evaluation, ensuring adherence to transparency, fairness, privacy, security, and societal impact standards throughout project development and implementation (see Table 6).

Table 6
www.frontiersin.org

Table 6. Criteria rating for AI-powered decision-making company.

For example, the Fuzzy weighted average for “Data Collection and Processing” and “Transparency and Accountability” aspect of AI-Powered project P1 is calculated as follows:

D I D C P , P 1 = j = 1 3 w j r j / j 3 w j
= 0.167 0.333 0.5 0.5 0.667 0.833 0.5 0.667 0.833 0.833 1 1 0.667 0.833 1 0.667 0.833 1 / 0.167 0.333 0.5 0.5 0.667 0.833 0.667 0.833 1
= 0.71 0.86 0.96
D I T A , P 1 = j = 1 11 w j r j / j = 1 11 w j
= 0.5 0.667 0.833 0.5 0.667 0.833 0.667 0.833 1 0.667 0.833 1 0.833 1 1 0.5 0.667 0.833 / 0.5 0.667 0.833 0.667 0.833 1 0.833 1 1
= 0.56 0.72 0.89

Table 7 illustrates the Fuzzy weighted average, depicting the potential of AI-Powered Projects across various dimensions of project assessments.

Table 7
www.frontiersin.org

Table 7. Ranking of AI-powered projects for different scenarios.

The comprehensive evaluation of an AI-powered project D I Pi involves calculating its overall Fuzzy weighted average, which is derived from the aggregation of Fuzzy weighted averages across various project aspects (as presented in Table 7) and their corresponding importance weights. Consequently, the overall Fuzzy weighted average for project P1 is computed by integrating these aspect-wise Fuzzy weighted averages with their respective importance weights.

D I P 1 = j = 1 11 w j r j / j 11 w j
= 0.833 1 1 0708 0.864 0.964 0.667 0.833 1 0.389 0.556 0.726 0.5 0.667 0.833 0.639 0.800 0.902 0.333 0.5 0.667 0.569 0.744 0.882 0.167 0.333 0.5 0.556 0.722 0.892 0 0.167 0.333 0.792 0.955 1 0 0 0.167 0.611 0.778 0.941 0.833 1 1 0.597 0.767 0.882 0.667 0.833 1 0.694 0.856 0.951 0.5 0.667 0.833 0.611 0.778 0.941 0.333 0.5 0.667 0.709 0.878 1 / 0.833 1 1 0.667 0.833 1 0.5 0.667 0.833 0.333 0.5 0.667 0.167 0.333 0.5 0 0.167 0.333 0 0 0.167 0.833 1 1 0.667 0.833 1 0.5 0.667 0.833 0.333 0.5 0.667
= 0.611 0.780 0.906

To determine the ranking of AI-powered projects, the crisp score of a Fuzzy number is calculated using Equations 9–12. The detailed outcomes, including aggregate Fuzzy Weighted Averages (DI), Crisp Scores (CS), and rankings for potential AI-powered projects across different scenarios, are presented comprehensively in Table 7. These scenarios reflect the evaluation team’s stance on Ethics in AI-powered projects and encompass varying degrees of significance attributed to distinct project facets.

Contrasting the proposed approach with other established methods involves ranking Fuzzy numbers for AI-Powered project attractiveness and choosing alternatives in product design. The proposed approach undergoes comparison with the Center of Area defuzzification (CoA) and Euclidean distance methods commonly used for ranking Fuzzy numbers (Herrera and Herrera-Viedma, 2000).

C D I = U L + M L 3 + L     (13)

The crisp rating of decision index (CDI) for CoA can be obtained using Equation 13, where L, M, and U represent the lower, middle, and upper values of DI. The proposed approach demonstrates superior prediction performance under various working conditions compared to existing methods. In the context of ranking Fuzzy numbers, methods based on defuzzification and distance between Fuzzy numbers is widely used. The ranking involves precise numerical distances and Fuzzy distances, with a new method based on Fuzzy distances showing advantages in reliability and effectiveness. This new method considers the novel Fuzzy distance of each Fuzzy number from the ideal Fuzzy number for ranking purposes.

4.2 ANN and ANFIS for forecasting AI-powered performance criteria

Artificial Neural Networks (ANNs) and Adaptive Neuro-Fuzzy Inference Systems (ANFIS) are pivotal in evaluating AI-powered project performance, offering robust tools to quantify ethical and operational criteria amidst uncertainty. Prior research has leveraged ANNs for pattern recognition in project estimation (Akinrinola et al., 2024) and ANFIS for modeling complex, nonlinear systems with interpretability (Chen and Klein, 1997). This study builds on these foundations to assess criteria such as transparency, fairness, and accountability in AI projects.

4.2.1 ANN model description

We employed a Multilayer Feed-Forward Neural Network (MFFNN) with a backpropagation algorithm to predict project performance metrics, including duration and cost. The network architecture, structured as 11-n-1, comprises 11 input nodes representing project variables (e.g., team size, task complexity, resource allocation, scope, and external dependencies), a hidden layer with 20 to 100 neurons (optimized via testing), and one output node for project metrics. Weight optimization utilized the Levenberg–Marquardt (LM) algorithm, enhanced by momentum-driven gradient descent and the adaptive learning rate of the GDX algorithm (Herrera and Herrera-Viedma, 2000). The dataset, sourced from enterprise systems of 11 AI-powered projects, was split into training (P1-P7) and testing (P8-P11) sets. Pre-processing via Principal Component Analysis (PCA) reduced dimensionality while retaining key variances (Abiodun et al., 2018).

Training spanned 100 iterations, with neuron counts varied to minimize Mean Squared Error (MSE). The best configuration (40 hidden neurons) achieved an MSE of 0.0026316 at epoch 1 on validation data, indicating strong predictive accuracy (see Figure 1). Regression analysis (Figure 2) showed correlation coefficients (R) of 0.5485 (training), 0.74811 (validation), and 0.5559 (testing), reflecting moderate to robust fits (e.g., validation: y ≈ 0.68 · target + 0.12).

Figure 1
A three-part training performance plot from a neural network training session. The first plot shows the gradient values decreasing steadily across epochs, ending at 0.0016449 at epoch 7. The second plot shows the mu (learning rate adjustment parameter), starting at 1e-3, decreasing to 1e-5, and then stabilizing at 1e-4. The third plot shows validation checks, increasing from epoch 0 to 7 with a total of 6 validation checks indicating no further improvement was observed.

Figure 1. Validation performance of neural network.

Figure 2
A stacked error histogram chart with 20 bins representing the distribution of prediction errors in training (blue), validation (green), and test (red) datasets. The x-axis shows the error range from approximately -0.62 to 0.37, and the y-axis indicates the number of instances. The histogram shows a mostly centered error distribution with some bias towards positive errors. A vertical orange line marks the zero-error point for reference.

Figure 2. Regression analysis of model predictions.

4.2.2 ANFIS model description

ANFIS integrates neural networks with fuzzy logic to model uncertainties in ethical criteria, such as fairness trade-offs (Mensah, 2023). Our ANFIS model used a Sugeno-type fuzzy inference system with 23 training data pairs derived from expert panel assessments. The architecture included 5 input variables (e.g., transparency, accountability), 10 membership functions (Gaussian), and 10 fuzzy rules, yielding 50 nodes, 20 linear parameters, and 30 nonlinear parameters. Training employed a hybrid algorithm (gradient descent and least squares), completed after 1,000 epochs, achieving a minimal RMSE of 0.000148646 (Figure 3).

Figure 3
A line chart showing the mean squared error (MSE) over 7 epochs for training (blue), validation (green), and test (red) datasets. The best validation performance, with the lowest MSE of 0.026316, occurs at epoch 1. A green dotted circle marks this best point. After epoch 1, the MSE remains stable with minimal variation, indicating convergence.

Figure 3. Progress of iterative process over epochs.

Despite a warning about the training data size (23 pairs) being smaller than the modifiable parameters (50), the model demonstrated effective learning and generalization.

4.2.3 Expert panel contribution

An expert panel of 10 professionals (5 AI ethicists, 3 project managers, 2 data scientists) assessed project data, assigning linguistic variables (e.g., “high transparency”) to criteria, later quantified as fuzzy numbers. Data were collected via structured surveys over 2 months, aggregated into the dataset (Floridi et al., 2018). This process informed both ANN inputs and ANFIS membership functions, ensuring ethical alignment.

4.2.4 Results presentation

Figures 3, 4 illustrate the training progress (RMSE decline) and error distribution (histogram with 20 bins peaking at 0.005112), respectively. Figure 1 shows validation performance, and Figure 2 presents regression plots across datasets. These visualizations replace large numerical tables, offering clear insights into model accuracy and error trends.

Figure 4
A 2x2 grid of scatter plots showing regression analysis results. The top left plot shows training data with R=0.5485 and a linear fit line. The top right shows validation data with R=0.74811. The bottom left shows test data with R=0.5559, and the bottom right shows the combined dataset with R=0.56488. Each plot includes a fit line and a reference line where the target equals the output.

Figure 4. Error distribution centered around zero, showing model accuracy across datasets.

Figure 2 illustrates the regression analysis of an Artificial Neural Network (ANN) model’s predictions compared to actual project performance metrics across three datasets: training, validation, and testing. The training dataset shows a moderate correlation coefficient (R = 0.5485), suggesting the model captures some data patterns effectively. In contrast, the validation dataset demonstrates a stronger correlation (R = 0.74811) with a best-fit line of y ≈ 0.68 × target + 0.12, highlighting the model’s ability to generalize well to new, unseen data. The testing dataset yields a similar correlation (R = 0.5559), reinforcing the model’s consistent performance across different data subsets. These findings emphasize the ANN model’s reasonable accuracy in predicting project performance metrics, with a notable strength during the validation phase. This capability makes it a valuable tool for ethical project evaluation, reliably assessing key performance indicators in AI-powered projects.

5 Conclusion

This study introduces an innovative framework that integrates Artificial Neural Networks (ANN) and Adaptive Neuro-Fuzzy Inference Systems (ANFIS) within a fuzzy multi-criteria decision-making (MCDM) approach to assess the ethical implications of AI-powered projects. Applied across 11 diverse projects, the framework successfully prioritized initiatives based on key ethical principles such as transparency, fairness, and accountability. Notably, projects like P4 consistently ranked high across various ethical scenarios, underscoring the framework’s ability to identify ethically robust initiatives. By merging advanced computational methods with fuzzy logic, this approach effectively navigates the uncertainty and subjectivity inherent in ethical evaluations, providing organizations with a practical tool to ensure AI projects align with societal values. However, the study is not without limitations: its dependence on expert judgments introduces subjectivity, and the small ANFIS training dataset (23 pairs compared to 50 modifiable parameters) raises potential overfitting concerns, despite the model’s strong performance. Looking ahead, future research could enhance the framework by incorporating larger and more diverse datasets, exploring additional ethical criteria, and examining the influence of organizational structures on AI governance. Furthermore, integrating emerging technologies such as blockchain and Explainable AI (XAI) could bolster transparency and accountability (Soori et al., 2023). As AI continues to transform industries and societies, this framework represents a vital step toward responsible innovation, offering a rigorous and adaptable method to evaluate AI projects ethically and fostering the development of AI systems that prioritize human values and societal well-being.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.

Author contributions

SW: Writing – original draft, Writing – review & editing, Conceptualization. MS: Writing – original draft, Writing – review & editing, Methodology. EL-C: Writing – original draft, Writing – review & editing, Formal analysis. MO-L: Supervision, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This work was supported by UCSC 2025.

Acknowledgments

We are grateful for the invaluable resource, data and information provided by Ministry of Electronics & Information Technology, Government of India, Alphabet Inc., Deloitte and Capgemini (n.d.) Research Institute which have been instrumental in facilitating our research and analysis. Also, this research is supported by Red Sistemas Inteligentes y Expertos Modelos Computacionales Iberoamericanos (SIEMCI), project number 522RT0130 in Programa Iberoamericano de Ciencia y Tecnologia para el Desarrollo (CYTED).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The authors declare that no Gen AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Abiodun, O. I., Jantan, A., Omolara, A. E., Dada, K. V., Mohamed, N. A., and Arshad, H. (2018). State-of-the-art in artificial neural network applications: a survey. Heliyon 4,:e00938, doi: 10.1016/j.heliyon.2018.e00938, PMCID: PMC6260436

PubMed Abstract | Crossref Full Text | Google Scholar

Abowd, J. M. (2018). “The US census bureau adopts differential privacy,’’ in Proceedings of the 24th ACM SIGKDD international conference on knowledge discovery & data mining. 2867.

Google Scholar

Ahmad, S. F., Han, H., Alam, M. M., Rehmat, M. K., Irshad, M., Arraño-Muñoz, M., et al. (2023). Impact of artificial intelligence on human loss in decision making, laziness and safety in education. Humanit. Soc. Sci. Commun. 10, 1–14.

Google Scholar

Akinrinola, O., Okoye, C. C., Ofodile, O. C., and Ugochukwu, C. E. (2024). Navigating and reviewing ethical dilemmas in AI development: strategies for transparency, fairness, and accountability. GSC Adv. Res. Rev. 18, 050–058. doi: 10.30574/gscarr.2024.18.3.0088

Crossref Full Text | Google Scholar

Amirkhani, S., Nasirivatan, S., Kasaeian, A., and Hajinezhad, A. (2015). ANN and ANFIS models to predict the performance of solar chimney power plants. Renew. Energy 83, 597–607. doi: 10.1016/j.renene.2015.04.072

Crossref Full Text | Google Scholar

Arifin, S. R. M. (2018). Ethical considerations in qualitative study. Int. J. Care Scholars 1, 30–33. doi: 10.31436/ijcs.v1i2.82

Crossref Full Text | Google Scholar

Barabas, C. (2020). Beyond bias: re-imagining the terms of “ethical AI” in criminal law. Geo. J. L. Mod. Critical Race Persp. 12:83. doi: 10.1007/978-3-031-09367-8_4

Crossref Full Text | Google Scholar

Bellamy, R. K., Dey, K., Hind, M., Hoffman, S. C., Houde, S., Kannan, K., et al. (2019). AI fairness 360: an extensible toolkit for detecting and mitigating algorithmic bias. IBM J. Res. Dev. 63, 4:1–4:15. doi: 10.1147/JRD.2019.2942287

PubMed Abstract | Crossref Full Text | Google Scholar

Blanco-Mesa, F., Merigo, J. M., and Gil-Lafuente, A. M. (2017). Fuzzy decision making: a bibliometric-based review. J. Intell. Fuzz Res. Dev. 32, 2033–2050. doi: 10.3233/JIFS-161640

PubMed Abstract | Crossref Full Text | Google Scholar

Broekhuizen, T., Dekker, H., de Faria, P., Firk, S., Nguyen, D. K., and Sofka, W. (2023). AI for managing open innovation: opportunities, challenges, and a research agenda. J. Bus. Res. 167:114196. doi: 10.1016/j.jbusres.2023.114196

Crossref Full Text | Google Scholar

Capgemini (n.d.). Available online at: https://www.capgemini.com/wp-content/uploads/2020/03/Report-Conversations-1.pdf (Accessed January 29, 2024)

Google Scholar

Chang, P.-T., and Hung, K.-C. (2005). Applying the fuzzy-weighted-average approach to evaluate network security systems. Comput. Math. Appl. 49, 1797–1814. doi: 10.1016/j.camwa.2004.10.042

Crossref Full Text | Google Scholar

Chatila, R., and Havens, J. C. (2019). The IEEE global initiative on ethics of autonomous and intelligent systems. in Robotics and well-being. Intelligent systems, control and automation: science and engineering, vol 95. eds. M. Aldinhas Ferreira, J. Silva Sequeira, G. Singh Virk, M. Tokhi and E. Kadar (Cham: Springer).

Google Scholar

Chen, C.-T., and Huang, S.-F. (2007). Applying fuzzy method for measuring criticality in project network. Inf. Sci. 177, 2448–2458. doi: 10.1016/j.ins.2007.01.035

Crossref Full Text | Google Scholar

Chen, C.-B., and Klein, C. M. (1997). A simple approach to ranking a group of aggregated fuzzy utilities. IEEE Trans. Syst. Man Cybern. B Cybern. 27, 26–35. doi: 10.1109/3477.552183

PubMed Abstract | Crossref Full Text | Google Scholar

Cooper, L. P. (2003). A research agenda to reduce risk in new product development through knowledge management: a practitioner perspective. J. Eng. Technol. Manag. 20, 117–140. doi: 10.1016/S0923-4748(03)00007-9

Crossref Full Text | Google Scholar

Cooper, R. G., and Kleinschmidt, E. J. (1995). Benchmarking the firm’s critical success factors in new product development. J. Prod. Innov. Manage. 12, 374–391. doi: 10.1111/1540-5885.1250374

Crossref Full Text | Google Scholar

De Almeida, P. G. R., dos Santos, C. D., and Farias, J. S. (2021). Artificial intelligence regulation: a framework for governance. Ethics Inf. Technol. 23, 505–525. doi: 10.1007/s10676-021-09593-z

Crossref Full Text | Google Scholar

Dubois, D., and Prade, H. (1978). Operations on fuzzy numbers. Int. J. Syst. Sci. 9, 613–626. doi: 10.1080/00207727808941724

Crossref Full Text | Google Scholar

Dwork, C. (2008). Differential privacy: a survey of results. In International conference on theory and applications of models of computation (Springer), 1–19, doi: 10.1007/978-3-540-79228-4_1

PubMed Abstract | Crossref Full Text | Google Scholar

Floridi, L., Cowls, J., Beltrametti, M., Chatila, R., Chazerand, P., Dignum, V., et al. (2018). AI4People—an ethical framework for a good AI society: opportunities, risks, principles, and recommendations. Mind. Mach. 28, 689–707, doi: 10.1007/s11023-018-9482-5, PMCID: PMC6404626

PubMed Abstract | Crossref Full Text | Google Scholar

Fountaine, T., McCarthy, B., and Saleh, T. (2019). Building the AI-powered organization. Harv. Bus. Rev. 97, 62–73.

Google Scholar

Hagendorff, T. (2022). A virtue-based framework to support putting AI ethics into practice. Philos. Technol. 35:55. doi: 10.1007/s13347-022-00553-z

Crossref Full Text | Google Scholar

Hard, A., Rao, K., Mathews, R., Ramaswamy, S., Beaufays, F., Augenstein, S., et al. (2018). Federated learning for mobile keyboard prediction. arXiv preprint arXiv:1811.03604

Google Scholar

Herrera, F., and Herrera-Viedma, E. (2000). Linguistic decision analysis: steps for solving decision problems under linguistic information. Fuzzy Sets Syst. 115, 67–82. doi: 10.1016/S0165-0114(99)00024-X

Crossref Full Text | Google Scholar

Hleg, A. (2019). European commission’s ethics guidelines for trustworthy artificial intelligence (issue May)

Google Scholar

Holzinger, A., Zatloukal, K., and Müller, H. (2024). Is human oversight to AI systems still possible? New Biotechnol. 85, 59–62. doi: 10.1016/j.nbt.2024.12.003

Crossref Full Text | Google Scholar

Hwang, C.-L., and Masud, A. S. M. (2012). Multiple objective decision making — methods and applications: A state-of-the-art survey, in Lecture notes in economics and mathematical systems vol. 164. eds. S. R. Paidy and K. Yoon (Springer Science & Business Media).

Google Scholar

Hwang, C.-L., Yoon, K., Hwang, C.-L., and Yoon, K. (1981). “Methods for multiple attribute decision making” in Multiple attribute decision making: methods and applications a state-of-the-art survey. (Berlin, Heidelberg: Springer), 58–191.

Google Scholar

Jafarzadeh, H., Akbari, P., and Abedin, B. (2018). A methodology for project portfolio selection under criteria prioritisation, uncertainty and projects interdependency–combination of fuzzy QFD and DEA. Expert Syst. Appl. 110, 237–249. doi: 10.1016/j.eswa.2018.05.028

Crossref Full Text | Google Scholar

Jang, J.-S. (1993). ANFIS: adaptive-network-based fuzzy inference system. IEEE Trans. Syst. Man Cybern. 23, 665–685. doi: 10.1109/21.256541

Crossref Full Text | Google Scholar

Jobin, A., Ienca, M., and Vayena, E. (2019). The global landscape of AI ethics guidelines. Nat. Mach. Intell. 1, 389–399. doi: 10.1038/s42256-019-0088-2

Crossref Full Text | Google Scholar

Killen, C. P., Jugdev, K., Drouin, N., and Petit, Y. (2012). Advancing project and portfolio management research: applying strategic management theories. Int. J. Proj. Manag. 30, 525–538. doi: 10.1016/j.ijproman.2011.12.004

Crossref Full Text | Google Scholar

Lundberg, S. M., and Lee, S.-I. (2017). A unified approach to interpreting model predictions. Adv. Neural Inf. Proces. Syst. 12:30.

Google Scholar

Masoudi, S., Sima, M., and Tolouei-Rad, M. (2018). Comparative study of ANN and ANFIS models for predicting temperature in machining. J. Eng. Sci. Technol. 13, 211–225.

Google Scholar

Mensah, G. B. (2023). Artificial intelligence and ethics: a comprehensive review of bias mitigation, ransparency, and accountability in ai systems. Preprint, November 10.

Google Scholar

Ministry of Electronics and Information Technology, Government of India. (2019). Available online at: https://www.meity.gov.in/static/uploads/2024/02/11ab.pdf (Accessed January 24, 2024)

Google Scholar

Mittelstadt, B. D., Allo, P., Taddeo, M., Wachter, S., and Floridi, L. (2016). The ethics of algorithms: mapping the debate. Big Data Soc. 3:2053951716679679. doi: 10.1177/2053951716679679

Crossref Full Text | Google Scholar

Morley, J., Machado, C. C. V., Burr, C., Cowls, J., Joshi, I., Taddeo, M., et al. (2020). The ethics of AI in health care: a mapping. Ethics, Govern. Policies Artificial Intelligence. 1, 313–346.

Google Scholar

Poveda-Bautista, R., Diego-Mas, J.-A., and Leon-Medina, D. (2018). Measuring the project management complexity: the case of information technology projects. Complexity 2018:6058480. doi: 10.1155/2018/6058480

Crossref Full Text | Google Scholar

Relich, M., and Pawlewski, P. (2017). A fuzzy weighted average approach for selecting portfolio of new product development projects. Neurocomputing 231, 19–27. doi: 10.1016/j.neucom.2016.05.104

Crossref Full Text | Google Scholar

Saaty, T. L. (2008). Decision making with the analytic hierarchy process. Int. J. Serv. Sci. 1, 83–98. doi: 10.1504/IJSSCI.2008.017590

Crossref Full Text | Google Scholar

Sartor, G., and Lagioia, F. (2020). The impact of the general data protection regulation (GDPR) on artificial intelligence. European Parliament.

Google Scholar

Shrestha, Y. R., Ben-Menahem, S. M., and Von Krogh, G. (2019). Organizational decision-making structures in the age of artificial intelligence. Calif. Manag. Rev. 61, 66–83. doi: 10.1177/0008125619862257

Crossref Full Text | Google Scholar

Soori, M., Dastres, R., and Arezoo, B. (2023). Ai-powered blockchain technology in industry 4.0, a review. J. Econ. Technol. 1, 222–241. doi: 10.1016/j.ject.2024.01.001

Crossref Full Text | Google Scholar

Vayena, E., Blasimme, A., and Cohen, I. G. (2018). Machine learning in medicine: addressing ethical challenges. PLoS Med. 15,:e1002689, doi: 10.1371/journal.pmed.1002689, PMCID: PMC6219763

PubMed Abstract | Crossref Full Text | Google Scholar

Whittlestone, J., Nyrup, R., Alexandrova, A., Dihal, K., and Cave, S. (2019). Ethical and societal implications of algorithms, data, and artificial intelligence: A roadmap for research. London: Nuffield Foundation, 1–59.

Google Scholar

Yang, Q., Liu, Y., Chen, T., and Tong, Y. (2019). Federated machine learning: concept and applications. ACM Trans. Intell. Syst. Technol. (TIST) 10, 1–24. doi: 10.1145/3339474

PubMed Abstract | Crossref Full Text | Google Scholar

Zetzsche, D. A., Arner, D. W., and Buckley, R. P. (2020). Decentralized finance. J. Finan. Regul. 6, 172–203. doi: 10.1093/jfr/fjaa010

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: artificial intelligence (AI), artificial neural networks (ANN), adaptive neuro-fuzzy inference systems (ANFIS), accountability, organizational structures

Citation: Wankhade S, Sahni M, León-Castro E and Olazabal-Lugo M (2025) Navigating AI ethics: ANN and ANFIS for transparent and accountable project evaluation amidst contesting AI practices and technologies. Front. Artif. Intell. 8:1535845. doi: 10.3389/frai.2025.1535845

Received: 05 February 2025; Accepted: 02 April 2025;
Published: 30 April 2025.

Edited by:

Cesar Isaza, Polytechnic University of Querétaro, Mexico

Reviewed by:

Jonny Paul Zavala De Paz, Polytechnic University of Querétaro, Mexico
Chao Zhang, Shanxi University, China

Copyright © 2025 Wankhade, Sahni, León-Castro and Olazabal-Lugo. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Manoj Sahni, bWFub2ouc2FobmlAc290LnBkcHUuYWMuaW4=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.