# realworldqa

**Repository Path**: hf-datasets/realworldqa

## Basic Information

- **Project Name**: realworldqa
- **Description**: Mirror of https://huggingface.co/datasets/visheratin/realworldqa
- **Primary Language**: Unknown
- **License**: Not specified
- **Default Branch**: main
- **Homepage**: None
- **GVP Project**: No

## Statistics

- **Stars**: 0
- **Forks**: 0
- **Created**: 2024-04-17
- **Last Updated**: 2024-06-09

## Categories & Tags

**Categories**: Uncategorized

**Tags**: None

## README

---
dataset_info:
  features:
  - name: question
    dtype: string
  - name: answer
    dtype: string
  - name: image
    dtype: image
  splits:
  - name: test
    num_bytes: 678377348
    num_examples: 765
  download_size: 678335644
  dataset_size: 678377348
configs:
- config_name: default
  data_files:
  - split: test
    path: data/test-*
task_categories:
- visual-question-answering
language:
- en
pretty_name: RealWorldQA
---

# RealWorldQA dataset

This is the benchmark dataset released by xAI along with the Grok-1.5 Vision [announcement](https://x.ai/blog/grok-1.5v). 
This benchmark is designed to evaluate basic real-world spatial understanding capabilities of multimodal models. 
While many of the examples in the current benchmark are relatively easy for humans, they often pose a challenge for frontier models.

This release of the RealWorldQA consists of 765 images, with a question and easily verifiable answer for each image. 
The dataset consists of anonymized images taken from vehicles, in addition to other real-world images.

## License

CC BY-ND 4.0