Sign in
Sign up
Explore
Enterprise
Education
Search
Help
Terms of use
About Us
Explore
Enterprise
Education
Blog
Sign in
Sign up
Categories
Development Tools
Version Management System
Dev/Debug
Wiki/Document
Compile/Build/Deploy
Maven Plugin
IDEA Plugin
Gulp Extension
Testing Tool
Code Scan
Web Development
Web Framework
jQuery Plugin
UI Framework
JavaScript Toolkit
RESTful Projects
Backend Management
Website Theme
Vue.js Components
Web Sipder
OAuth/SSO
Angular Plguin
Bootstrap Plugin
React Compnent
RPC Development Framework
API Gateway
短网址
Mobile Dev
Android Component/ Project
Mobile App
iOS Component
Alipay Applet
PhoneGap/Cordova Plugin
Cross-platform Mobile Development
Baidu Applet
QuickApp
harmony
TV Devel
Development Lib
Chinese/English Segmenter
Payment Dev
Security Dev
Common Toolkit
Excel Toolkit
Barcode/QRCode
Template Engine
Desktop UI
Network Development Package
Audio Process
Network Tool
Network Service
Data Mining
Job/Task Scheduling
Programming Language/Scripting Language
Cache
Markdown Tools
Search Engine
Microservice
Workflow
Chart/Diagram Component
Authority Management
Reporting Tool
Code Generator
IoC/AOP Framework
Image Library
Rule Engine
JSON Toolkit
Log Toolkit
Spring Boot Extension
Verification Code
Algorithm/Mathematical Calculation
Node Extension
Process Engine
Animation Development
3G/4G/5G
Web System
Content Management System
New-Sale/E-Shop
BBS
Blog
Questionnaire
SNS
Teaching Managment
Album/Gallery/Picture
RSS/Atom Tool
Enterprise App
Task/Project Management
Enterprise Application System
Business Intelligence
Financial/Stock Securities
GIS/Map/Navigation/Positioning
Server Development
Distributed Service/Framework
Message Server/Message Queue
Docker
Container/Virtual Machine
Nginx Module
Big Data
Cloud Computing
One-click Installation Package
OpenResty Extension
系统性能优化
Serverless
Application
File Management System
Multimedia
Text Editor
Instant Messaging
Application Software
Visual Studio Code Plugin
RPA-机器人过程自动化
DevOps/Network
DevOps
Network Management Tool
System Monitor
Game/Recreation
Game
Game Development
3D Engine
Database Related
DB Development Package
Database Service
Database Management/Monitor
Plugins/Extension
Chrome Extension
Wordpress Plugin
Eclipse Plugin
Firefox Extension
Safari Extension
Jenkins Plugins
Other
Simulation Project
Handbook/Manual/Tutorial
ACM/OJ Project
Operation System
Tutorial Code
Teaching Managment
RISC-V Development
Bio/Medical
2020公益黑客马拉松
新冠病毒相关开源
AI/ML
Artificial Intelligence
VR/AR
Machine Learning/Deep Learning
Computer Vision/Face Recognition
Natural Language Processing
Blockchain
bitcoin
Wechat Projects
Wechat Development Package
WeChat Applet/Game
WeChat Application
WeChat Game
New Tech
Hardware
IoT/Edge Computing
Car Application
Smart Home
Autopilot
Robots
5G
低代码
科研论文
Web Development
/
Web Sipder
LGPL-3.0
All
MulanPSL-2.0
0BSD
AFL-3.0
AGPL-3.0
Apache-2.0
Artistic-2.0
BSD-2-Clause
BSD-3-Clause
BSD-3-Clause-Clear
BSL-1.0
CC-BY-4.0
CC-BY-SA-4.0
CC0-1.0
ECL-2.0
EPL-1.0
EPL-2.0
EUPL-1.1
EUPL-1.2
GPL-2.0
GPL-3.0
ISC
LGPL-2.1
LPPL-1.3c
MIT
MPL-2.0
MS-PL
MS-RL
MulanPSL-1.0
MulanPubL-1.0
NCSA
OFL-1.1
OSL-3.0
PostgreSQL
UPL-1.0
Unlicense
WTFPL
Zlib
Java
All Languages
JavaScript
PHP
Python
C#
Android
Objective-C
Go
C++
C
HTML
NodeJS
Swift
其他
TypeScript
微信
HTML/CSS
Ruby
Shell
Dart
CSS
C/C++
Kotlin
Docker
Lua
Scala
Matlab
Delphi
Rust
SQL
TeX/LaTeX
Visual Basic
Verilog
ASP
R
Groovy
ActionScript
易语言
Erlang
XML
VimL
Arduino
Pascal
Perl
FORTRAN
Assembly
QML
PowerShell
汇编
Clojure
Emacs Lisp
CoffeeScript
Julia
AutoHotkey
VHDL
Haskell
M
Elixir
Lisp
D
Scheme
XSLT
Racket
Common Lisp
Vala
OCaml
Logos
DOT
Coq
Haxe
Puppet
LiveScript
Smalltalk
Prolog
Nemerle
Pawn
Crystal
Eiffel
Standard ML
Ada
eC
Scilab
Awk
Slash
Zephir
ColdFusion
Stars
Stars
Recommend
Last updated
代码神童/YayCrawler
Java
Web Sipder
LGPL-3.0
1.1K
分布式爬虫系统,简单使用,高级配置。可扩展,减轻开发量,能docker化,适应各种急切需求核心框架:WebMagic, Spring Boot ,MongoDB, ActiveMQ ,Spring + Quartz,Spring Jpa , Druid,Redis, Ehcache ,SLF4J、Log4j2, Bootstrap + Jquery 等,不详细列举了
2 years ago
3 issues
itlabers/CrawlerDemon
Java
Web Sipder
LGPL-3.0
133
分布式爬虫 Crawler
4 years ago
1 issue
Trending Today
Weekly
ssssssss-team/spider-flow
3.1K
新一代爬虫平台,以图形化方式定义爬虫流程,不写代码即可完成爬虫。
黄亿华/webmagic
3.7K
webmagic 是一个无须配置、便于二次开发的爬虫框架,它提供简单灵活的API,只需少量代码即可实现一个爬虫。
MountFuji/easycrawl
98
基于webmagic的通用爬虫抓取应用,核心在于简单易用,搭建好后轻松抓取数据
xtuhcy/Gecco
1.8K
Gecco 是一款用java语言开发的轻量化的易用的网络爬虫,整合了jsoup、httpclient、fastjson、spring、htmlunit、redission等优秀框架。
自风/Spiderman2
1.7K
二代蜘蛛侠,此版本完全重新开发,比上一代更加强大(性能,易用,架构,分布式,简洁,成熟)
鬼画符/templatespider
1.7K
扒网站工具,看好哪个网站,指定好URL,自动扒下来做成模版。所见网站,皆可为我所用!
自风/Spiderman
3.1K
强力 Java 爬虫,列表分页、详细页分页、ajax、微内核高扩展、配置灵活
virjar/vscrawler
205
适合抓取封堵的爬虫框架
CVTeam_CN/FaceSpider
142
目标识别爬虫
程序员薛师兄/AIPa
141
一款小巧、灵活的Java多线程爬虫框架(AiPa)内嵌Jsoup 零成本上手
ssssssss-team/spider-flow
3.1K
新一代爬虫平台,以图形化方式定义爬虫流程,不写代码即可完成爬虫。
黄亿华/webmagic
3.7K
webmagic 是一个无须配置、便于二次开发的爬虫框架,它提供简单灵活的API,只需少量代码即可实现一个爬虫。
鬼画符/templatespider
1.7K
扒网站工具,看好哪个网站,指定好URL,自动扒下来做成模版。所见网站,皆可为我所用!
MountFuji/easycrawl
98
基于webmagic的通用爬虫抓取应用,核心在于简单易用,搭建好后轻松抓取数据
xtuhcy/Gecco
1.8K
Gecco 是一款用java语言开发的轻量化的易用的网络爬虫,整合了jsoup、httpclient、fastjson、spring、htmlunit、redission等优秀框架。
自风/Spiderman2
1.7K
二代蜘蛛侠,此版本完全重新开发,比上一代更加强大(性能,易用,架构,分布式,简洁,成熟)
孤狼/pikachu
77
去吧皮卡丘,为什么取个名字叫皮卡丘,大概是这样萌一些。小哥哥是很可爱的。然后本项目是个爬虫项目,使用时候就像派出小精灵一样,派出皮卡丘,就会为你抓回对应的数据。
logic/QuickCompanyCollect
29
运行于java环境的一个免费开源的企业信息采集器(简单的java网络爬虫)。 信息采集完成后自动导出Excel表格。 基于Jsoup+Poi+Sqlite开发完成。
liinux/ghost-login
484
专门用来解决爬虫采集相关网站数据时模拟自动登录,验证码自动识别的问题;欢迎加入一起开发完善。
virjar/vscrawler
205
适合抓取封堵的爬虫框架
Going to Help Center
Search
Git 命令在线学习
如何在 Gitee 导入 GitHub 仓库
Git 仓库基础操作
企业版和社区版功能对比
SSH 公钥设置
如何处理代码冲突
仓库体积过大,如何减小?
如何找回被删除的仓库数据
Gitee 产品配额说明
GitHub仓库快速导入Gitee及同步更新
什么是 Release(发行版)
See more results
Share to
Back to the top