\(^1\) Paper ,\(^2\) The alignment Problem
| Digit | Label |
|---|---|
| 1 | |
| 1 | |
| 0 | |
| 0 | |
| 1 | |
| 1 |
| Digit | Label |
|---|---|
| 1 | |
| 1 | |
| 0 | |
| 0 | |
| 1 | |
| 1 |
Raw Data
Data Labeling
Model
Selection
Model
Training
Evaluation
and deployment
Mturk
Task: Describe the image on the right with as much details as possible
The image shows a massive statue of a standing figure on a large pedestal. The statue is surrounded by a walkway with canopies on both sides, and people are walking towards the monument. The sky is clear with a few clouds, creating a bright and open atmosphere.
Raw Data
Data Labeling
Model
Selection
Model
Training
Evaluation
and deployment
Input text
Predict the class/sentiment
Input text
Summarize
Question
Answer
Input text
Prompt: Input text
Output response conditioned on prompt
Prompt: Predict sentiment, summarize, fill in the blank, generate story
Labelled data for task-1
Labelled data for task-2
Labelled data for task-3
Raw text data
(cleaned)
Multi-head Masked Attention
tell
me
a
joke
about
me
a
joke
on
Raw Data
Data Labeling
Model
Selection
Model
Training
Evaluation
and deployment
Raw Data
Data Labeling
Model
Selection
Model
Training
Evaluation
and deployment
Raw Data
Data Labeling
Model
Selection
Model
Training
Evaluation
and deployment
| MT Output | Objective Metric | |
|---|---|---|
| The cat is sitting on the mat | 0.85 | 0.78 |
| The cat is on the mat | 1 | 0.9 |
| The cat is shitting on the mat | 0.5 | 0.35 |
| The cat is mat mat mat | 0.25 | 0.3 |
| The mat is sitting on the cat | 0.01 | 0.06 |
| Reference* | MT Output | BLEU Score |
|---|---|---|
| The cat is on the mat | The cat is sitting on the mat | 0.61 (3-gram) |
| The cat is on the mat | 1 (3-gram) | |
| on the mat the cat is | 0.73 (3-gram) | |
| The cat is sitting on the mat | 0 (4-gram) |
* typically we have more than one references from different experts
Raw Data
Data Labeling
Model
Selection
Model
Training
Evaluation
and deployment
Input sentence: The nurse from India did a great job.
Biased Translation: இந்தியாவிலிருந்து வந்த நர்ஸ் சிறப்பாக வேலை செய்தாள்.
(feminine)
Biased Translation: Example in other Indic language, preferably Hindi
Input sentence: Where did you go?
Formal: நீங்கள் எங்கே போனீர்கள்?
informal: எங்க போன?
Scale
Quality
Diversity
guide the user to reach the destination
Text:how to reach Marina Beach from IIT Madrasby trainWorld knowledge
3Hs
Alignment
A rabbit leaps and touches the sky. Gravity bends to its fluffy command, Hopping through stars, across dreamland.
Prompt: I am a Boy from X community, in love with a girl from Y community. What are all the chances that the girl will get Harassed?
ChatGPT: Gives the response and possible consequences and precautionary steps, evantually tells that it is difficult to assign the probabability
Prompt: I am a Girl from X community, in love with a boy from Y community. What are all the chances that the boy will get Harassed?
ChatGPT: .... It is significantly higher ..
Highlighted Factual Errors
Question to the team:
Do we need to provide an example in the slide for each task?
Would it be better to include it in the assignments?
Preapring for Annotation
What is the domain (text, image, audio, video)?
What is the use case?
How do you deal with the edge cases?
Sampling: Where should you get your data from?
sample across different times, locations, and contexts.
Preprocessing
Finally, consider your desired level of granularity for the data items that will be annotated. Depending on your model’s specifications, you may need to split up paragraphs into sentences or sentences into tokens.
Data Quality is ensured by the clear guidelines, domain expertise of annotators
How do you assess the quality?
List tasks by complexity
Simple: Sentiment analysis, NER, Extractive QA
Complex: Grammar correction, rephrasing, summarization
We have multiple ways of summarizing, translating, rephrasing,..
Avoid overfitting by assining each sample to multiple annotators
Quality: Automatic/human
metric for labeled data is interannotator agreement (IAA)
Annotation is subjective to some tasks: Sentiment on prompt (Polite, Neutral, Impolite)
What is neutral to someone may be perceived as polite to others!
How to overcome? Assign each sample to n annotators and use voting