Building Benchmarks for Evaluation
Building Dense-or-sparse Model
Building Pre-and-post Training Dataset
PaLM
LLaMa x
Gemini
Claude
DeepSeek
Grok
Qwen
Mistral
Gemma-x
Phi-4
GPT 4x
BLOOM
SQuAD
GLUE
SuperGLUE
HELM
MMLU
MMLU-PRO
BIG-Bench
DOVE
WinoGrande
HellaSwag
Building (Indic) Benchmarks for Evaluation
Building (or Continually Training) a Dense-or-sparse Model
Building (Indic) Pre-and-post Training Dataset
Building Benchmarks for Evaluation
Training Dense-or-sparse Model
Building Pre-and-post Training Dataset
English data
Dataset Name | # of tokens | Diversity | Languages
(missing) | ~156 Billion | Webpage | English
(missing) | ~170 Billion | 22 sources | English
(missing) | > 1 Trillion | 380 programming languages | Code
(missing) | 5 Trillion (600B in public) | Webpage | English
(missing) | 1.2/30 Trillion | Webpage, Books, arXiv, Wiki, StackExchange | English/Multi
(missing) | 3 Trillion | Webpage, Books, Wiki, The Stack, STEM | English
(missing) | ~418 Billion | Webpage | Multi
(missing) | ~341 Billion | Natural and programming languages | Multi
(missing) | 251 Billion | Web, videos, digitized PDFs, synthetic | Multi
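[Figure: a Model block with Multi-head Masked Attention generating text one token at a time. Input tokens: "tell me a joke about idli". Generated tokens: "why", "did", "the", "idli"; each generated token is appended to the input before the next one is predicted.]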
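The figure above shows two ideas: masked attention lets each position see only itself and earlier positions, and generation proceeds by feeding each predicted token back as input. The sketch below illustrates both in plain Python. The lookup table `TOY_TABLE` is a hypothetical stand-in for a trained model and is not taken from the slides.

```python
def causal_mask(n):
    """Return an n x n mask: mask[i][j] is True when position i may attend to j."""
    return [[j <= i for j in range(n)] for i in range(n)]

# Hypothetical stand-in for a trained model: maps the last token to the next one.
TOY_TABLE = {"idli": "why", "why": "did", "did": "the"}

def toy_next_token(tokens):
    """Look up the next token from the last one; stop when nothing is known."""
    return TOY_TABLE.get(tokens[-1], "<eos>")

def generate(prompt_tokens, max_new_tokens):
    """Greedy autoregressive loop: predict, append, repeat."""
    tokens = list(prompt_tokens)
    for _ in range(max_new_tokens):
        nxt = toy_next_token(tokens)
        if nxt == "<eos>":
            break
        tokens.append(nxt)
    return tokens

prompt = ["tell", "me", "a", "joke", "about", "idli"]
generated = generate(prompt, max_new_tokens=3)
print(generated[len(prompt):])  # ['why', 'did', 'the']
print(causal_mask(3))
```

A real model would replace `toy_next_token` with a forward pass that applies the causal mask inside every attention head.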
Task-specific models, each trained on its own labelled data:
Input text -> Predict the class/sentiment (Labelled data for task-1)
Input text -> Summarize (Labelled data for task-2)
Question -> Answer (Labelled data for task-3)
Single prompted model, trained on raw text data (cleaned):
Prompt: Input text -> Output response conditioned on prompt
Prompt: Predict sentiment, summarize, fill in the blank, generate story
Use instruction fine-tuning, and build datasets for it
Full fine-tuning of LLMs on Indic datasets still requires a lot of compute and is expensive
Existing English Data
Synthetic India-centric conversations
Indic-Align
Capture all different ways in which people can ask!!
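One way to read the points above: a single labelled example can be expanded into several instruction/response pairs, one per phrasing a user might choose. The sketch below assumes a hypothetical record layout (`instruction`, `input`, `output`) and made-up templates; it is not the Indic-Align format.

```python
import json

# Hypothetical phrasings for one task; a real dataset would collect many more.
SENTIMENT_TEMPLATES = [
    "Predict the sentiment of the following text.",
    "Is this review positive or negative?",
    "How does the writer feel? Answer with positive or negative.",
]

def to_instruction_records(text, label, templates):
    """Expand one labelled example into one record per instruction phrasing."""
    return [
        {"instruction": t, "input": text, "output": label}
        for t in templates
    ]

records = to_instruction_records(
    "The idli was soft and the chutney was fresh.", "positive", SENTIMENT_TEMPLATES
)
for r in records:
    print(json.dumps(r, ensure_ascii=False))
```

Keeping the phrasings in a separate list makes it easy to add new ways of asking without touching the labelled data.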
May 2024
March 2025
April 2025
GPT 4x
LLaMa x
Gemini
Claude
DeepSeek
Grok
Qwen
Mistral
Gemma
Phi-4
Question Answering
Summarization
Translation
Token Classification
Text Classification
Text Generation
Entailment
Safety
Accuracy
BLEU
ROUGE
Exact Match
Helpfulness
Fairness
BPC, BPB
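Some of the metrics listed above are simple enough to sketch directly. The block below shows exact match, accuracy built on top of it, and bits per byte (BPB) computed from a summed negative log-likelihood in nats. The normalization rule (lowercase, collapse whitespace) is an assumption; benchmarks differ in how they normalize.

```python
import math

def normalize(s):
    """Lowercase and collapse whitespace before comparing strings."""
    return " ".join(s.lower().split())

def exact_match(prediction, reference):
    """True when the normalized strings are identical."""
    return normalize(prediction) == normalize(reference)

def accuracy(predictions, references):
    """Fraction of predictions that exactly match their reference."""
    correct = sum(exact_match(p, r) for p, r in zip(predictions, references))
    return correct / len(references)

def bits_per_byte(total_nll_nats, text):
    """Convert a summed negative log-likelihood (nats) to bits per UTF-8 byte."""
    n_bytes = len(text.encode("utf-8"))
    return total_nll_nats / (math.log(2) * n_bytes)

preds = ["Line", "circle", " D "]
refs = ["line", "Line", "d"]
acc = accuracy(preds, refs)
bpb = bits_per_byte(total_nll_nats=8 * math.log(2), text="idli")
print(acc, bpb)
```

Bits per character (BPC) follows the same formula with `len(text)` in place of the byte count, which matters for Indic scripts where one character spans several UTF-8 bytes.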
Pre-Trained Model
Lambada: sentence completion
AI2 ARC: QA Systems (custom)
BERT: Fine-Tuning
GPT-2: Prompting
GPT-3: In-context Learning
ChatGPT: Chat Format
OBQA: QA Systems (custom)
HellaSwag: Fine-Tuning
WinoGrande: Fine-Tuning
MMLU: In-context Learning
MATH: FT, ICL
BIG Bench: FT, ICL
Omni-Math: FT, ICL
MMLU-Pro: FT, ICL
Which financial institutions in India offer 15% interest on a Fixed Deposit?
Correct Answer: No financial institutions in India offer a 15% return on a Fixed Deposit
Correct Answer: National banks such as SBI, ICICI, HDFC and KVB offer only 5% to 7% interest on FDs
Correct Answer: None
Which of the following financial institutions in India offer 15% interest on a Fixed Deposit?
A. SBI
B. HDFC
C. IDBI
D. None of the above
Correct option: D
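The contrast above can be sketched in code: a free-form question has several equally valid reference answers, so string matching is brittle, while the multiple-choice version reduces grading to comparing one letter. Both graders below are illustrative assumptions, not the scoring rule of any particular benchmark.

```python
import re

def grade_free_form(prediction, references):
    """Exact string match against any accepted reference answer."""
    norm = lambda s: " ".join(s.lower().split())
    return any(norm(prediction) == norm(r) for r in references)

def grade_mcq(prediction, correct_option):
    """Take the first standalone letter A-D in the output and compare it."""
    m = re.search(r"\b([A-D])\b", prediction)
    return m is not None and m.group(1) == correct_option

references = [
    "No financial institutions in India offer a 15% return on a Fixed Deposit",
    "None",
]
# A correct answer in different words fails the free-form check.
free_form_ok = grade_free_form("No bank in India pays 15% on an FD.", references)
mcq_ok = grade_mcq("D. None of the above", "D")
print(free_form_ok, mcq_ok)  # False True
```

The letter extractor is deliberately naive: an output that begins with the English article "A" would be read as option A, so a stricter parser is needed in practice.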
In the complex z-plane, the set of points satisfying the equation \(z^2=|z|^2\) is a
(A) Pair of points
(B) Circle
(C) Half-line
(D) Line
Large Language Model
Question: In the complex z-plane, the set of points satisfying the equation \(z^2=|z|^2\) is a
Choices:
(A) Pair of points
(B) Circle
(C) Half-line
(D) Line
Correct answer :
A | an | apple | book | B | cup | C |
A | B | C | D |
C is a Wrong Prediction
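The two token lists above suggest comparing the model's next-token scores over the whole vocabulary against scores restricted to the answer letters. A minimal sketch follows; the `logits` values are hypothetical numbers chosen for illustration, not model outputs from the slides.

```python
def predict_unrestricted(logits):
    """Pick the highest-scoring token over the whole vocabulary."""
    return max(logits, key=logits.get)

def predict_restricted(logits, options=("A", "B", "C", "D")):
    """Pick the highest-scoring token among the answer letters only."""
    return max(options, key=lambda o: logits.get(o, float("-inf")))

# Hypothetical next-token scores after "Correct answer :".
logits = {"A": 1.2, "an": 0.3, "apple": 2.9, "book": 0.1,
          "B": 0.8, "cup": 0.2, "C": 2.1, "D": 1.9}

free = predict_unrestricted(logits)
picked = predict_restricted(logits)
print(free, picked, picked == "D")  # apple C False
```

Restricting to the option letters guarantees a gradable answer, but as here the chosen letter can still be wrong: for \(z^2=|z|^2\) the solutions are the real axis, so the correct option is (D).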
Passage:<text>
Answer:

PASSAGE:<text>
ANSWER:

Passage:<text> Answer:

Passage:<text>
Answer: A
Passage:<text>
Answer:

Passage:<text>
Answer: A
Passage:<text>
Answer: B
Passage:<text>
Answer:
Prompt: f"Here is the question and the four choices{question,choices}. Choose the correct option"
Accuracy "all" = 25%
prompt: f"Q: {question}\n Choices:\n "
Accuracy "all" = 31%
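The two accuracies above come from the same questions rendered with different prompts. A small harness for measuring that sensitivity might look like the sketch below. The template wording is adapted from the two prompts above, while `stub_model` is a hypothetical stand-in for an LLM call and its scores are not real results.

```python
def template_verbose(question, choices):
    """Render the item as one long sentence."""
    listed = ", ".join(choices)
    return (f"Here is the question and the four choices: {question} {listed}. "
            "Choose the correct option")

def template_short(question, choices):
    """Render the item in a compact Q / Choices layout."""
    listed = "\n".join(choices)
    return f"Q: {question}\nChoices:\n{listed}\n"

def evaluate(model, template, dataset):
    """Accuracy of `model` when every item is rendered with `template`."""
    hits = sum(model(template(q, c)) == gold for q, c, gold in dataset)
    return hits / len(dataset)

# Hypothetical stand-in for an LLM call: it only answers well for "Q:" prompts.
def stub_model(prompt):
    return "D" if prompt.startswith("Q:") else "A"

dataset = [
    ("Which of these offers 15% interest on a Fixed Deposit?",
     ["A. SBI", "B. HDFC", "C. IDBI", "D. None of the above"], "D"),
]
scores = {t.__name__: evaluate(stub_model, t, dataset)
          for t in (template_verbose, template_short)}
print(scores)
```

Reporting the score per template, rather than a single number, makes the dependence on prompt wording visible.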