목차
What is Ethics?
The field of ethics involves systematizing, defending, and recommending concepts of right and wrong behavior.
Ethical dilemmas are scenarios where there is no obviously good choice.
Values
Values: What is important to us in life
Features of values:
- Priorities guide Actions
- Beliefs linked to Affect (e.g. emotions)
- Refer to Goals
- Transcend Contexts (values span more than 1 context)
- Standards of criteria
- Ordered by Importance
Preferences: Values vs Interests
A preference means a more positive attitude (leaning) toward one alternative over another alternative
Preferences over values vs Preferences over interests:
- An interest is manifested as sustained attention involving cognition of the interest object, accompanying positive impact
- A value is manifested as affective valuation
- Both values and interests influence behavior
- When judgement in preference is based on liking (i.e. attraction), it is an interest; when the basis is importance (i.e. significance or meaning), the preference is a value
Value-Sensitive Design (VSD)
VSD methods are not standalone but are intended to be integrated with other methods and processes for technical design and development
Focus: Value identification, value preference elicitation (want to elicit value for each stakeholder)
Value Identification
Use an existing model of values
E.g. Schwartz Value Model - 10 universal values
Not all values are relevant to all contexts: Context-Specific Values
The meaning of a value depends on the context
Value Identification:
We need to identify and define values in a context: context-specific values
We use Axies, a hybrid methodology for identifying context-specific values.
To be effective for collaborative agents, the methodology needs to be:
- Democratic → Based on population input
- Hybrid → AI + human intuition
- Unbiased → Performed by a group of annotators
- Simple → On a web platform
Axies Overview
Start with value-laden opinion corpus. (No labels)
Methodology:
- involves value annotators (people who can abstract opinions into values)
Exploration: look at different opinions, see which value is applicable to the context
Consolidation: Put them together, ouput a list of values applicable
AI in Axies: help value annotators to efficeintly explore what's in the corpus
Axies input - Word Embeddings
The input of the methodology is population feedback on a context (e.g. tweets).
Words and sentences are represented as points in a n-dimensional space where semantic similarities are preserved.
Axies Methodology - Exploration
In the exploration phase, each annotator independently develops a value list by analyzing the population input.
All input is embedded through sentence embeddings, the next opinion shown to the annotator is selected with the Farthest First Traversal algorithm:
- Start with a random sample.
- Measure distances with other samples to distance sample and choose the one with largest distance.
Then again, find distances with other samples
→ Try to explore the space
→ Even only small samples, cover large areas
Given V as the set of visited points and N the set of novel points, FFT selects the next point as follows:
5 and 11, so it's assigned to the 5.
Axies Methodology - Consolidation (굳히다)
In the consolidation phase, the annotators in a group collaborate to merge their individual value lists
How to know if I know we covered large proportion of compass: 새로운 anotated value들이 안 생길 때까
Example of Values Identified by Axies
We compare the values we elicited (values identifeid by the annotators) with the Schwarz (x axis).
For self-direction, it can be identified to many values for this specific context (need more fine-grained vocabulary for the context).
Value Elicitation
The result of Axies is a list of values relevant to a context.
A concrete application is the elicitation of value preferences through natural language.
Estimating values from natural language allows:
- Humans to express values naturally
- Agents to have meaningful conservations with us
- Agents to learn a user's value profile by aggregating values estimated from sentences
Value Classification
We can treat value classification as a supervised classification.
We need:
- Labelling schema → Output of Axies
- Data → Tweets, surveys, etc
- Labels → Crowd annotations
Then we can just perform a regular supervised classification → Wrong
Label Subjectivity
The subjectivity of the interpretation of values must be considered, especially in collaborative settings.
Textual inputs must be annotated by multiple annotators.
Then, we select the majority annotation to perform supervised classification. → Wrong
That would result in the tyranny of the majority.
We must consider a plurality of opinions, so we must build uncertainty in the model.
Data Representation
Example: three annotators have annotated one datapoint:
A1: [0, 1, 0, 0]
A2: [0, 1, 1, 0]
A3: [0, 0, 0, 1]
Examples of data representation are:
- Majority agreement aggregation: [0, 1, 0, 0]
- Repeated lables: [0, 1, 0, 0], [0, 1, 1, 0], [0, 0, 0, 1]
- Distribution of labels: [0, 0.66, 0.33, 0.33]
Evaluation of Supervised Classifiers
Supervised classifiers are typically evaluated with the F1 score, assuming that a prediction is either correct or wrong.
P = Precision
R = Recall
Evaluation Metrics
Examples of evaluation metrics are:
- Accuracy and (class-weighted) F1 score
- Disagreement-weighted F1 score (items with a lot of disagreement are weighted less than "easy" items)
- Cross-entropy (to compare annotated and predicted distributions of labels)
Takeaways
Artificial agents must understand values operate among humans
Values represent what humans find important in life
Values are subjective and context dependent
Values can be identified and elicited via systematic AI (NLP) techniques
Social Norms
A social norm is a directed social expectation between principals (stakeholders)
Formal representation: Norm_Type(Subject, Object, Antecedent, Consequent)
Subject and Object: The two principals involved in the norm
Antecedent: The condition that brings a norm into force
Consequent: The outcome determining the satisfaction or violation of a norm
Why social norms?
Accountability - A principal can call another to account for its actions
Flexibility - A principal can violate norms (but bears the consequences)
Explainability - Provides an opportunity for principals to explain their actions
Types of social norms
The norm type defines the semantics of expectations underlying a norm
Types of Norms: Commitment (전념, 헌신)
Within an organizational context,
the subject (i.e. debtor) commits to the object (i.e. creditor) that if the antecedent holds, the debtor will bring about the consequent
E.g in e-commerce STS:
A seller must send the goods to a buyer upon payment
Norm: Commitment(Seller, Buyer, Payment, Goods)
Types of Norms: Prohibition
Within an organizational context,
The object prohibits the subject from bringing about the consequent provided the antecedent holds
E.g. in a healthcare STS:
Patient's personal health information (PHI) should not be published online under any circumstances
Norm: Prohibition(Hospital, Patient, TRUE, Publish_PHI_Online)
Types of Norms: Authorization
Within an organizational context,
The object authorizes (permits) the subject to bring about the consequent when the antecedent holds
E.g. in a healthcare STS:
In emergencies, hospital physicians may share a patient's PHI with outside physicians
Norm: Authorization(Hospital, Patient, Emergency, Share_PHI_Outside)
Types of Norms: Power
Within an organizational context,
when the antecedent holds, the object empowers the subject to bring about the consequent at will
Example in a university STS:
The university empowers the system administer to create accounts
Norm: Power(System_Admin, University, TRUE, Create_Account)
Types of Norms: Sanction
Within an organizational context,
The object would sanction (i.e. reward or punish) the subject by bringing about the consequent provided the antecedent holds
Example in an e-commerce STS:
A buyer can saction a seller by providing a poor rating if the product received is not as advertised
Norm: Sanction(Seller, Buyer, Product_Damaged, Poor_Rating)
Responsible Design of Collaborative AI System
Values
- What does a user find as important in a system?
- General vs Context-specific values
- Axies: A methodology for identifying context-specific values
Norms
- What are the mutual expectations among the users in the system?
- Norms help capture the mutual expectations
- Norms must be modeled as technical constructs so that agents representing human users can act in accordance with the applicable norms
'학교 > CAI' 카테고리의 다른 글
Lecture 10: Automated Negotiation (0) | 2024.03.27 |
---|---|
Lecture 9: Negotiation Formalization (1) | 2024.03.27 |
Lecture 8: Negotiation (1) | 2024.03.26 |
Lecture 7: Computational Coalition Formation (5) | 2024.03.06 |
Lecture 6: Social Choice Part 2 (0) | 2024.03.03 |
댓글