1 Star 2 Fork 1

WXG / crawler-config-plugin

Create your Gitee Account
Explore and code with more than 5 million developers,Free private repositories !:)
Sign up
Clone or download
Cancel
Notice: Creating folder will generate an empty file .keep, because not support in Git
Loading...
README.en.md

crawler-config-plugin

输入图片说明

Introduction

Visualized Click-Selection Configuration of Chrome Plug-in in Crawler Collection Field Based on Vue

Usage

Download Project:

Git clone https://gitee.com/xgpxg/crawler-config-plugin.git

Add the plugin directory to the Chrome browser as a decompressed plug-in

CTR + Shift + Z opens plug-in interface

Click on the elements you want to collect on the web page.

After configuring a field group, click the "Temporary Field Group" button to complete the configuration of a field group. After configuring all fields, click the "copy" button to copy JSON data to the clipboard.

Sample json:

{
    "group_0": [
        {
            "fieldCode": "title", 
            "fieldName": "标题", 
            "fieldRule": "div#1 > h3:nth-child(1) > a", 
            "sampleValue": "鸡你太美是什么梗?为什么我同学经常开玩笑_百度知道", 
            "_heyui_uuid": "e264db9d-54ba-d856-54df-ae0d511cb113"
        }, 
        {
            "fieldCode": "summary", 
            "fieldName": "简介", 
            "fieldRule": "div.c-abstract.bg-green-color", 
            "sampleValue": "最佳答案: 你们凭什么发这种东西骂人,这叫网络暴力,很容易出人命的,比如我快笑死了😢", 
            "_heyui_uuid": "486aa727-2b60-967c-6ab3-3a32c17ea4b0"
        }, 
        {
            "fieldCode": "from", 
            "fieldName": "来源", 
            "fieldRule": "div#1 > div:nth-child(4) > a:nth-child(1) > span:nth-child(2)", 
            "sampleValue": "百度知道", 
            "_heyui_uuid": "ab26ac8a-577c-366a-832c-213bc30ede30"
        }
    ], 
    "group_1": [
        {
            "fieldCode": "title", 
            "fieldName": "标题", 
            "fieldRule": "div#2 > h3:nth-child(3) > a", 
            "sampleValue": "【蔡徐坤】 鸡你太美!-娱乐-高清正版视频在线观看–爱奇艺", 
            "_heyui_uuid": "2ffb975d-5c2b-3da8-af6c-5bcb41d54571"
        }, 
        {
            "fieldCode": "summary", 
            "fieldName": "简介", 
            "fieldRule": "div#2 > div:nth-child(4) > div:nth-child(2) > font > p:nth-child(2)", 
            "sampleValue": "2019年3月18日 - 【蔡徐坤】鸡你太美!是娱乐类高清视频,画面清晰,播放流畅,发布时间:20190318。节目简介:", 
            "_heyui_uuid": "b7d85413-2039-5e9f-c311-5dac7bb50f35"
        }, 
        {
            "fieldCode": "from", 
            "fieldName": "来源", 
            "fieldRule": "div#2 > div:nth-child(4) > div:nth-child(2) > font > div:nth-child(3) > a:nth-child(2)", 
            "sampleValue": "百度快照", 
            "_heyui_uuid": "39c1d4a8-9e63-0bca-4be8-74e2fd30ff74"
        }
    ]
}

Comments ( 0 )

Sign in for post a comment

About

基于vue的爬虫采集字段可视化点选配置chrome插件 spread retract
JavaScript
Apache-2.0
Cancel

Releases

No release

Contributors

All

Activities

load more
can not load any more
JavaScript
1
https://gitee.com/xgpxg/crawler-config-plugin.git
git@gitee.com:xgpxg/crawler-config-plugin.git
xgpxg
crawler-config-plugin
crawler-config-plugin
master

Search