# SynthText_Chinese_py3 **Repository Path**: v_wanglei/SynthText_Chinese_py3 ## Basic Information - **Project Name**: SynthText_Chinese_py3 - **Description**: No description available - **Primary Language**: Python - **License**: Not specified - **Default Branch**: master - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 2 - **Created**: 2019-11-04 - **Last Updated**: 2020-12-19 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # SynthText_Chinese_version from JarveeLee Modify from https://github.com/JarveeLee/SynthText_Chinese_version.git to generate chinese character. Due to SynthText_Chinese_version is writen by python2 and opencv2.So, I modify this program by python3 and opencv3. My OS is Ubuntu18.01, python3.6, opencv3.4. But I am not sure whether it can run on other OS. - **dset.h5**: For the "dset.h5" file,I have generated it which containts 99 images and corresponding to depth and seg infomation.If you don't want to genetate by yourself you can download from [dset.h5](https://pan.baidu.com/s/1-s7b_68O-GTK3dH4OJt6zw), password: fxtj Of course,you can genetate the "dset.h5" file by yourself.You must download these files: The 8,000 background images used in the paper, along with their segmentation and depth masks, have been uploaded here: http://www.robots.ox.ac.uk/~vgg/data/scenetext/preproc/filename, where, filename can be: - **imnames.cp [180K]**: names of filtered files, i.e., those files which do not contain text - **bg_img.tar.gz [8.9G]**: compressed image files (more than 8000, so only use the filtered ones in imnames.cp) - **depth.h5 [15G]**: depth maps - **seg.h5 [6.9G]**: segmentation maps You can create a folder,such as "dataset" and put the files into this folder,and copying the "gen_dset.py" into this folder. You also have to unzip the "bg_img.tar.gz" to this folder.You only run: ``` python gen_dset.py ``` The "gen_dset.py" file can generate 99 images infomation,if you want to generate more images infomation,You can modify the 35th line of this file.Modify "i == 100" to "i == n",'n' denotes a number which is you want to generate quantity of image. Then you just copy the generated file "dset.h5" to the folder "data".You only run: ``` python gen.py ``` If You want to visualize these synthtext images,you can run: ``` python gen.py --viz ``` Note: I do not own the copyright to these images. More detail content,you can consult the https://github.com/ankush-me/SynthText.