DocTamper

The DocTamper dataset is now avaliable at BaiduDrive and Google Drive (part1 and part2).

The DocTamper dataset is only available for non-commercial use, you can request a password for it by sending an email with education email to 202221012612@mail.scut.edu.cn explaining the purpose.

To visualize the images and their corresponding ground-truths from the provided .mdb files, you can run this command "python vizlmdb.py --input DocTamperV1-FCD --i 0".

The official implementation of the paper Towards Robust Tampered Text Detection in Document Image: New Dataset and New Solution is in the "models" directory.

I delay the release of training codes as forced by my supervisor and the cooperative enterprise who bought them. My training pipline for DocTamper dataset and the IoU metric heavily brought from a famous project in this area, the results of the paper can be easily re-produced with it, you just need to adjust the loss functions and the learing rate decay curve. I also used its augmentation pipline except for (RandomBrightnessContrast, ShiftScaleRotate, CoarseDropout).

Open Source Scheme:
1、Inference models and codes: June, 2023.
2、Training codes: TBD.
3、Data synthesis code: Within 2024.

Any question about this work please contact 202221012612@mail.scut.edu.cn.

crisps / DocTamper2

DocTamper

简介

发行版

贡献者

近期动态

crisps / DocTamper2 .gitee-modal { width: 500px !important; }

DocTamper

简介

发行版

贡献者

近期动态

搜索帮助

crisps / DocTamper2