Collection of data with a common format and goal-relevant content. Ideally, the data selected in this way represents the larger data set or the assumed real characteristics.
Note: Datasets can be used for training, validation and testing of an AI model. In the context of machine learning, datasets serve as the basis for training a machine learning algorithm.
Example 1: Microblogging posts from June 2020 linked to the hashtags #rugby and #football.
Example 2: Macro photos of flowers with size 256x256 pixels.
ISO/IEC DIS 22989 collection of data with a shared format and goal-relevant content.
EXAMPLE 1: Micro-blogging posts from June 2020 associated with hashtags #rugby and #football.
EXAMPLE 2: Macro photographs of flowers in 256x256 pixels.
Note 1 to entry: Datasets can be used for validating or testing an AI model. In a machine learning (3.2.9) context, datasets can also be used to train a machine learning algorithm (3.2.10)
ISTQB - CTAI Syllabus A collection of data used for training, evaluation, testing and prediction in ML