Concept
A dataset is a collection of datapoints. It can be used for the following purposes:
- Store data for use in future fine-tuning or prompt-tuning.
- Provide inputs and expected outputs for Evaluations.
Format
Every datapoint has two fixed JSON objects, data and target, each with arbitrary keys. target is only used in evaluations.
- data – the actual datapoint data.
- target – data additionally sent to the evaluator function.
- metadata – arbitrary key-value metadata about the datapoint.
For every key inside data and target, the value can be any JSON value.
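To make the format concrete, here is a minimal sketch of one datapoint built in Python and serialized to JSON. The keys inside data, target, and metadata (question, answer, source, and so on) are hypothetical and chosen only for illustration, and the is_valid_datapoint helper is an assumption of this example, not part of any official SDK or the platform's own validation.

```python
import json

# Hypothetical datapoint: the top-level field names follow the format above;
# every nested key and value is made up for illustration.
datapoint = {
    "data": {"question": "What is the capital of France?", "max_words": 5},
    "target": {"answer": "Paris", "aliases": ["Paris, France"]},
    "metadata": {"source": "manual"},
}


def is_valid_datapoint(dp):
    """Illustrative check: data and target must be JSON objects (dicts),
    and the whole datapoint must be JSON-serializable."""
    if not isinstance(dp, dict):
        return False
    if not isinstance(dp.get("data"), dict) or not isinstance(dp.get("target"), dict):
        return False
    try:
        json.dumps(dp)
    except (TypeError, ValueError):
        return False
    return True


# One datapoint serialized as a single JSON line.
line = json.dumps(datapoint)
```

Values under each key can be strings, numbers, booleans, null, arrays, or nested objects, as the example's mix of strings, an integer, and a list shows.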