Skip to content
LinkedInX

What Is Responsible AI?

About 10 minutes

Target audience: Those using AI in business or products, those who want to learn the basics of AI ethics
Prerequisites: No prior knowledge required

As AI is used in more and more areas of society, it’s no longer enough for technology to simply “work correctly.” Is it unfairly discriminating against anyone? Can the basis for its decisions be explained? Is personal information protected? Answering these questions is a prerequisite for continuing to use AI safely and fairly. This page explains the concept of Responsible AI and its five core principles.

Responsible AI is a collective term for the ideas and practices of developing, operating, and using AI technology with awareness of ethical and social responsibility.

Alongside pursuing technical performance (accuracy, speed), it aims to ensure “fair treatment for everyone,” “transparency in decision-making,” “protection of privacy,” and “accountability when problems occur.”

As AI’s influence grows, cases of unintended harm have been reported.

Real-world cases that became problems

CaseProblem
Gender bias in hiring AI (Amazon, 2018)A model trained on historical hiring data was reported to rate female applicants lower.[3]
Racial bias in facial recognitionThe MIT Media Lab Gender Shades study found commercial gender-classification systems had error rates up to 34.7% for darker-skinned women.[4]
Bias in healthcare algorithmsA healthcare algorithm using cost as a proxy for need was reported to underestimate the medical needs of Black patients.[5]
Harmful speech from chatbotsMicrosoft took Tay offline after malicious users induced inappropriate outputs within the first 24 hours.[6]

These cases demonstrate that pursuing only technical accuracy is insufficient to prevent social harm.

Regulatory trends

AI regulations are being developed worldwide.

  • EU AI Act (entered into force in 2024): Classifies AI by risk level and imposes obligations on high-risk AI.[2]
  • NIST AI RMF (2023): AI risk management framework from the National Institute of Standards and Technology.[1]
  • G7 Hiroshima AI Process (2023): International guiding principles and a code of conduct for organizations developing advanced AI systems.[7]
graph TD
    A["Responsible AI"] --> B["Fairness"]
    A --> C["Transparency"]
    A --> D["Explainability"]
    A --> E["Privacy"]
    A --> F["Accountability"]

Fairness means that AI does not unjustly discriminate based on attributes such as race, gender, age, or nationality.

Fairness has multiple definitions, and which one to adopt depends on the use case.

Type of FairnessDefinitionApplication Example
Individual fairnessTreat similar individuals similarlyApply the same interest rate to people with the same credit score
Group fairnessEqualize error rates across different groupsEqualize pass rates between genders in hiring AI
Equal opportunityEqualize probability of positive predictions across groupsEqualize loan approval rates across racial groups

Transparency means making the way an AI system operates understandable to users and supervisors.

  • Disclosure that it’s AI (“This chat is powered by AI”)
  • Providing information about the data and algorithms used in decision-making
  • Clearly stating the model’s capabilities and limitations

Transparency is a prerequisite for people using AI to appropriately trust or question the system.

Explainability (XAI: eXplainable AI) means presenting the reasons behind an AI’s predictions or decisions in a form that humans can understand.

Example:
Transparency (bad example): "Your loan application has been denied."
Explainability (good example): "Your loan application has been denied. The main reasons are:
① Two late payments in the past three years, ② a high debt-to-income ratio,
③ and a short credit history."

Fields where explainability is important: medical diagnosis support, credit assessment, hiring decisions, criminal justice

Privacy means that AI handles personal data appropriately and protects it from unintended leaks or misuse.

  • Data minimization: Collect and use only the minimum necessary data
  • Purpose limitation: Do not use collected data for purposes other than what it was collected for
  • Consent: Obtain explicit consent for the use of personal data
  • Right to deletion: Respond to requests to delete personal data

Related regulations: EU GDPR (General Data Protection Regulation), Japan’s Act on the Protection of Personal Information.[8][9]

Accountability means clearly defining responsibility for the behavior and impact of an AI system, and having the capacity to explain and correct issues when they arise.

  • Defining the scope of responsibility for developers, operators, and users
  • Establishing escalation processes when problems occur
  • Mechanisms for challenging or appealing AI decisions

AI bias refers to an AI’s tendency to systematically make incorrect judgments about specific groups or conditions.

graph LR
    A["Sources of Bias"] --> B["Training Data Bias"]
    A --> C["Algorithmic Bias"]
    A --> D["Amplification of Social Bias"]
    B --> E["Imbalanced data collection\nReflection of historical bias"]
    C --> F["Feature selection errors\nOptimization target errors"]
    D --> G["Existing social discrimination\nreproduced and reinforced by AI"]

Training data bias occurs when the data used to train the model is itself skewed.

  • Lack of representation: Insufficient data for certain groups (e.g., insufficient data for minority ethnicities in medical AI)
  • Reflection of historical bias: Learning from past discriminatory decision data reproduces those patterns

Concrete example: When hiring AI learns engineering role hiring data, if men were predominantly hired in the past, the model learns to favor men (the Amazon case).

Algorithmic bias occurs during the model’s design and optimization process.

  • Feature problems: Using features that indirectly indicate race, gender, etc. (zip code, educational background, etc.) causes indirect discrimination
  • Optimization target problems: Maximizing overall accuracy can sacrifice accuracy for minority groups

Amplification of social bias occurs when AI learns social biases contained in training data and then reinforces and spreads them.

  • Language model example: Learning patterns that associate “doctor” with “male” and “nurse” with “female”
  • Image generation example: Generating predominantly white male images for the prompt “CEO”

The EU AI Act is a regulation that classifies AI by risk level and imposes obligations based on that level.[2]

graph TD
    A["EU AI Act\nRisk Classification"] --> B["Unacceptable Risk\n(Prohibited)"]
    A --> C["High Risk\n(Obligations · Review required)"]
    A --> D["Limited Risk\n(Transparency obligations)"]
    A --> E["Minimal Risk\n(Voluntary response)"]
    B --> B1["Mass surveillance via biometrics\nBehavioral manipulation systems\nSocial scoring"]
    C --> C1["Medical diagnosis · Hiring · Credit assessment\nCritical infrastructure management\nLaw enforcement"]
    D --> D1["Chatbots\nImage generation AI"]
    E --> E1["Spam filters\nGame AI"]

The NIST AI RMF is an AI risk management framework published by the National Institute of Standards and Technology (NIST) in 2023.[1]

Four core functions:

  1. GOVERN: Building organizational systems for AI risk management
  2. MAP: Identifying and classifying AI context and risks
  3. MEASURE: Analyzing and evaluating risks
  4. MANAGE: Prioritizing, responding to, and monitoring risks

Constitutional AI is a safety technique published by Anthropic.[10]

  • Defines a set of principles (Constitution) that AI should follow, and the model self-evaluates and improves based on those principles
  • Core principles: “Helpful, Harmless, and Honest”
  • The mechanism underlying Claude’s safety
  • Responsible AI consists of five principles: fairness, transparency, explainability, privacy, and accountability
  • AI bias arises from three sources: training data, algorithms, and social amplification
  • The EU AI Act is risk-based regulation that ranges from prohibition to voluntary response depending on the application
  • It’s important to consider social fairness and safety from the design stage, not just technical accuracy

Q: Is Responsible AI something only large companies need to think about?

A: No. It’s relevant to all organizations and individuals that use AI. Even small organizations — when using AI for hiring or deploying a customer support chatbot — need to consider bias, transparency, and privacy. The EU AI Act applies to all AI systems that affect EU citizens, regardless of company size.

Q: Can AI bias be completely eliminated?

A: Complete elimination is realistically difficult. Training data reflects society’s historical and cultural biases, and completely removing them is a challenging problem. Also, fairness has multiple definitions, and satisfying all of them simultaneously is sometimes mathematically impossible.[11] A realistic approach is to “identify and mitigate serious unfairness” and “continuously monitor to detect problems early.”

Q: Is there a trade-off between explainability and accuracy?

A: Generally, deep learning models are highly accurate but hard to explain (black boxes), while simpler models like decision trees are easier to explain but tend to have lower accuracy. However, explanation techniques like LIME and SHAP have been studied as ways to explain complex model predictions after the fact.[12][13]

Q: Does the EU AI Act apply to Japanese companies?

A: It can. The EU AI Act includes providers placing AI systems on the EU market or putting them into service in the EU, as well as providers and deployers of AI systems whose output is used in the EU.[2] Japanese companies serving EU markets should check whether their AI systems fall within its scope.

  1. NIST, AI Risk Management Framework
  2. European Union, Regulation (EU) 2024/1689 laying down harmonised rules on artificial intelligence, July 12, 2024
  3. Reuters, Amazon scraps secret AI recruiting tool that showed bias against women, October 10, 2018
  4. Joy Buolamwini and Timnit Gebru, Gender Shades: Intersectional Accuracy Disparities in Commercial Gender Classification, Proceedings of Machine Learning Research, 2018
  5. Ziad Obermeyer et al., Dissecting racial bias in an algorithm used to manage the health of populations, Science, 2019
  6. Microsoft, Learning from Tay’s introduction, March 25, 2016
  7. Ministry of Foreign Affairs of Japan, G7 Hiroshima Process International Code of Conduct for Organizations Developing Advanced AI Systems, October 30, 2023
  8. European Union, Regulation (EU) 2016/679 (General Data Protection Regulation), April 27, 2016
  9. Personal Information Protection Commission, Act on the Protection of Personal Information
  10. Anthropic, Constitutional AI: Harmlessness from AI Feedback, 2022
  11. Jon Kleinberg, Sendhil Mullainathan, and Manish Raghavan, Inherent Trade-Offs in the Fair Determination of Risk Scores, 2016
  12. Marco Tulio Ribeiro, Sameer Singh, and Carlos Guestrin, Why Should I Trust You?: Explaining the Predictions of Any Classifier, 2016
  13. Scott M. Lundberg and Su-In Lee, A Unified Approach to Interpreting Model Predictions, 2017