TY - COMP AU - Thielicke-Witt, Valerian AU - Weiß, Ana-Nzinga AU - Miltzow, Hannah PY - 2025 DA - 2025// TI - Generative Artificial Intelligence, Cultural Context and Discrimination Generative: [research data]: = Künstliche Intelligenzen, kultureller Kontext und Diskriminierung : Datensatz - Textdokumente CY - No place, unknown, or undetermined; Rostock; München AB - The dataset contains data from a qualitative study on texts generated by large language models (Mistral Large Instruct, Gemma 3, DeepSeek R1, Meta Llama 3.1, Llama Sauerkraut, Qwen 3) using various comparable prompts in three different languages (German, English, French) to define diversity in order to identify political and cultural bias in the training material. Each result was generated using a new context window and the same or comparable settings between the LLMs (medium temp, top_p and the same system prompt). The process was repeated at least five times for each prompt in the respective language. In addition, the settings were experimented with in an additional run. In total, the dataset comprises more than 270 comparable documents and more than 50 experimental documents, which are stored as .rtf files and .txt files in the dataset. UR - https://purl.uni-rostock.de/rosdok/id00005015 UR - https://doi.org/10.18453/rosdok_id00005015 DO - 10.18453/rosdok_id00005015 LA - English N1 - Valerian Thielicke-Witt, Ana-Nzinga Weiß, Hannah Miltzow ID - 1945030178 ER -