# CascadeTabNet
**Repository Path**: debug-huwei/CascadeTabNet
## Basic Information
- **Project Name**: CascadeTabNet
- **Description**: This repository contains the code and implementation details of the CascadeTabNet paper "CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents"
- **Primary Language**: Unknown
- **License**: MIT
- **Default Branch**: master
- **Homepage**: None
- **GVP Project**: No
## Statistics
- **Stars**: 0
- **Forks**: 0
- **Created**: 2021-03-24
- **Last Updated**: 2021-03-24
## Categories & Tags
**Categories**: Uncategorized
**Tags**: None
## README
# CascadeTabNet
[](https://paperswithcode.com/sota/table-detection-on-icdar2013-1?p=cascadetabnet-an-approach-for-end-to-end)
[](https://pytorch.org/)
[](https://github.com/open-mmlab/mmdetection)
> **CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents**
> [Devashish Prasad](https://github.com/DevashishPrasad),
> [Ayan Gadpal](https://github.com/ayangadpal),
> [Kshitij Kapadni](https://github.com/kshitijkapadni),
> [Manish Visave](https://github.com/ManishDV),
>
> [CVPR Link of Paper](http://openaccess.thecvf.com/content_CVPRW_2020/papers/w34/Prasad_CascadeTabNet_An_Approach_for_End_to_End_Table_Detection_and_CVPRW_2020_paper.pdf)
> [arXiv Link of Paper](https://arxiv.org/abs/2004.12629)
> Supplementary file
> The paper was presented (Orals) at [CVPR 2020 Workshop on Text and Documents in the Deep Learning Era](https://cvpr2020text.wordpress.com/)
> Virtual Oral Presentation [YOUTUBE VIDEO](https://www.youtube.com/watch?v=6rovEyWKZw8)
## 1. Introduction
CascadTabNet is an automatic table recognition method for interpretation of tabular data in document images. We present an improved deep learning-based end to end approach for solving both problems of table detection and structure recognition using a single Convolution Neural Network (CNN) model. CascadeTabNet is a Cascade mask Region-based CNN High-Resolution Network (Cascade mask R-CNN HRNet) based model that detects the regions of tables and recognizes the structural body cells from the detected tables at the same time. We evaluate our results on ICDAR 2013, ICDAR 2019 and TableBank public datasets. We achieved 3rd rank in ICDAR 2019 post-competition results for table detection while attaining the best accuracy results for the ICDAR 2013 and TableBank dataset. We also attain the highest accuracy results on the ICDAR 2019 table structure recognition dataset.
## 2. Setup
Models are developed in Pytorch based MMdetection framework (Version 1.2)
pip install -q mmcv terminaltables git clone --branch v1.2.0 'https://github.com/open-mmlab/mmdetection.git' cd "mmdetection" pip install -r "/content/mmdetection/requirements/optional.txt" python setup.py install python setup.py develop pip install -r {"requirements.txt"} pip install pillow==6.2.1 pip install mmcv==0.4.3Code is developed under following library dependencies
pip install torch==1.4.0+cu100 torchvision==0.5.0+cu100 -f https://download.pytorch.org/whl/torch_stable.html**If you are using Google Colaboratory (Colab), Then you need add** ``` from google.colab.patches import cv2_imshow ``` and replace all the `cv2.imshow` with `cv2_imshow` ## 3. Model Architecture
Model Name | Checkpoint File |
---|---|
General Model table detection | Checkpoint |
ICDAR 13 table detection | Checkpoint |
ICDAR 19 (Track A Modern) table detection | Checkpoint |
Table Bank Word table detection | Checkpoint |
Table Bank Latex table detection | Checkpoint |
Table Bank Both table detection | Checkpoint |
ICDAR 19 (Track B2 Modern) table structure recognition | Checkpoint |