Pages
Label
Annotations
JSONL format compatible with HuggingFace Datasets. Each line is one page, matching the PESP schema.
Fields: filename, journal, year, issue, uri, yolo, yolo_predictions.
To merge with PESP: datasets.concatenate_datasets([pesp, load_dataset('json', data_files='vsesvit.jsonl')])