# IBM-Synthetic-Data-Sets **Repository Path**: mirrors_ibm/IBM-Synthetic-Data-Sets ## Basic Information - **Project Name**: IBM-Synthetic-Data-Sets - **Description**: IBM Synthetic Data Sets - **Primary Language**: Unknown - **License**: Apache-2.0 - **Default Branch**: main - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2025-08-06 - **Last Updated**: 2026-01-04 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # IBM Synthetic Data Sets IBM Synthetic Data Sets is a family of artificially generated, enterprise-grade datasets that enhance predictive artificial intelligence (AI) model training and large language models (LLMs). These pre-built datasets are downloadable and packaged as comma-separated values (CSVs) and data definition language (DDL) files, making them familiar to use, and compatible with everything from databases to spreadsheets to hardware platforms to standard AI tools. The IBM Synthetic Data Sets family contains the following features: - IBM Synthetic Data Sets for Payment Cards - IBM Synthetic Data Sets for Core Banking and Money Laundering - IBM Synthetic Data Sets for Homeowners Insurance This repository includes schema files for the datasets. The folder structure is organized as follows: ``` schemas/ └── / └── / └── ``` This repository contains some example best practices for open source repositories: * [LICENSE](LICENSE) * [README.md](README.md) * [CONTRIBUTING.md](CONTRIBUTING.md) * [MAINTAINERS.md](MAINTAINERS.md) * [CHANGELOG.md](CHANGELOG.md) Additonal Resources: - Product Page - https://www.ibm.com/products/synthetic-data-sets - IBM Redbooks - https://www.redbooks.ibm.com/abstracts/redp5748.html If you have any questions or issues you can contact AI on Z (aionz@us.ibm.com) ## Versioning & Changes Details about version control and updates can be found in the [CHANGELOG.md](./CHANGELOG.md). ## License If you would like to see the detailed LICENSE click [here](LICENSE).