dataset (de.: Datensatz)

Collection of data with a common format and content relevant to a given goal. Ideally, the selected data is representative of the larger body of data it was drawn from, or of the real-world characteristics it is assumed to reflect.

Note: Datasets can be used for training, validation and testing of an AI model. In the context of machine learning, datasets serve as the basis for training a machine learning algorithm.

Example 1: Microblogging posts from June 2020 linked to the hashtags #rugby and #football.

Example 2: Macro photos of flowers with size 256x256 pixels.
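
The note on training, validation and testing can be illustrated with a minimal sketch in Python. It assumes a small, made-up dataset of microblogging posts in the spirit of Example 1. The function name split_dataset, the 70/15/15 percentages and the record fields are illustrative assumptions, not part of any cited definition.

```python
import random


def split_dataset(records, train_pct=70, val_pct=15, seed=0):
    """Shuffle records and split them into training, validation and test subsets.

    The percentages are illustrative defaults; whatever is left after the
    training and validation shares goes to the test subset.
    """
    if train_pct < 0 or val_pct < 0 or train_pct + val_pct > 100:
        raise ValueError("train_pct and val_pct must be non-negative and sum to at most 100")

    shuffled = list(records)               # copy so the caller's data stays untouched
    random.Random(seed).shuffle(shuffled)  # seeded shuffle for a reproducible split

    n_train = len(shuffled) * train_pct // 100
    n_val = len(shuffled) * val_pct // 100

    train = shuffled[:n_train]
    val = shuffled[n_train:n_train + n_val]
    test = shuffled[n_train + n_val:]
    return train, val, test


# Hypothetical dataset: 20 records that share one format (text plus hashtag).
posts = [
    {"text": f"post {i}", "hashtag": "#rugby" if i % 2 else "#football"}
    for i in range(20)
]

train, val, test = split_dataset(posts)
print(len(train), len(val), len(test))
```

Every record lands in exactly one subset, so the three parts together still make up the original dataset. In practice the split ratio, and whether a split should preserve class proportions, depends on the task.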

ISO/IEC DIS 22989: collection of data with a shared format and goal-relevant content.

EXAMPLE 1: Micro-blogging posts from June 2020 associated with hashtags #rugby and #football.

EXAMPLE 2: Macro photographs of flowers in 256x256 pixels.

Note 1 to entry: Datasets can be used for validating or testing an AI model. In a machine learning (3.2.9) context, datasets can also be used to train a machine learning algorithm (3.2.10).

ISTQB - CTAI Syllabus: A collection of data used for training, evaluation, testing and prediction in ML.

Source: AI-Glossary.org (https://www.ai-glossary.org), License of definition text (excl. standard references): CC BY-SA 4.0, accessed: 2024-11-21

BibTeX-Information

@misc{aiglossary_dataset_18mllfa,
author = {{AI-Glossary.org}},
title = {{dataset}},
howpublished = "https://www.ai-glossary.org/index.php?p=18mllfa\&l=en",
year = "2024",
note = "online, accessed: 2024-11-21" }