Sign in
Sign up
Explore
Enterprise
Education
Search
Help
Terms of use
About Us
Explore
Enterprise
Education
Gitee Premium
Blog
I know
View Details
Sign in
Sign up
Categories
New Tech
Hardware
IoT/Edge Computing
Car Application
Smart Home
Autopilot
Robots
5G
低代码
科研论文
quantum
chips
Web 3.0
Privacy Computing
Cloud Native
OpenHarmony
HarmonyOS Button
HarmonyOS EditText
HarmonyOS Layout
HarmonyOS Image
HarmonyOS Progress
HarmonyOS Menu
HarmonyOS Popup
HarmonyOS Selector
HarmonyOS TextView
HarmonyOS ListView
HarmonyOS Loading
HarmonyOS Notification
HarmonyOS View Transition
HarmonyOS Slider
HarmonyOS Chart
HarmonyOS Draw
HarmonyOS Counter
HarmonyOS Animate
HarmonyOS Captcha
HarmonyOS Multimedia
HarmonyOS Barcode
HarmonyOS Advanced
HarmonyOS Map
OpenHarmony Games
HarmonyOS Networking
HarmonyOS Communication
HarmonyOS Payment
HarmonyOS Database
HarmonyOS Drivers
OpenHarmony Guide
OpenHarmony DevTools
OpenHarmony App
HMS
HarmonyOS Permission
HarmonyOS Toolkit
OpenHarmony Components
Gesture
Development Lib
Chinese/English Segmenter
Payment Dev
Security Dev
Common Toolkit
Excel Toolkit
Barcode/QRCode
Template Engine
Desktop UI
Network Development Package
Audio Process
Network Tool
Network Service
Data Mining
Job/Task Scheduling
Programming Language/Scripting Language
Cache
Markdown Tools
Search Engine
Microservice
Workflow
Chart/Diagram Component
Authority Management
Reporting Tool
Code Generator
IoC/AOP Framework
Image Library
Rule Engine
JSON Toolkit
Log Toolkit
Spring Boot Extension
Verification Code
Algorithm/Mathematical Calculation
Node Extension
Process Engine
Animation Development
3G/4G/5G
AI/ML
Artificial Intelligence
VR/AR
Machine Learning/Deep Learning
Computer Vision/Face Recognition
Natural Language Processing
LLM
Blockchain
bitcoin
NFT
Wechat Projects
Wechat Development Package
WeChat Applet/Game
WeChat Application
WeChat Game
Enterprise App
Task/Project Management
Enterprise Application System
Business Intelligence
Financial/Stock Securities
GIS/Map/Navigation/Positioning
Web System
Content Management System
New-Sale/E-Shop
BBS
Blog
Questionnaire
SNS
Teaching Managment
Album/Gallery/Picture
RSS/Atom Tool
Application
File Management System
Multimedia
Text Editor
Instant Messaging
Application Software
RPA-机器人过程自动化
Web Development
Web Framework
jQuery Plugin
UI Framework
JavaScript Toolkit
RESTful Projects
Backend Management
Website Theme
Vue.js Components
Web Sipder
OAuth/SSO
Angular Plugin
Bootstrap Plugin
React Compnent
RPC Development Framework
API Gateway
短网址
layui-components
DevOps/Network
Network Management Tool
System Monitor
DevOps
Mobile Dev
Android Component/ Project
iOS Component
Mobile App
Alipay Applet
Baidu Applet
PhoneGap/Cordova Plugin
Cross-platform Mobile Development
QuickApp
TV Devel
uniapp components
Development Tools
Version Management System
Dev/Debug
Wiki/Document
Compile/Build/Deploy
Maven Plugin
Gulp Extension
Testing Tool
Code Scan
Server Development
Distributed Service/Framework
Message Server/Message Queue
Docker
Container/Virtual Machine
Nginx Module
Big Data
Cloud Computing
One-click Installation Package
OpenResty Extension
系统性能优化
Serverless
storage
Database Related
DB Development Package
Database Service
Database Management/Monitor
Game/Recreation
Game
Game Development
3D Engine
Plugins/Extension
Chrome Extension
Wordpress Plugin
Eclipse Plugin
IDEA Plugin
Firefox Extension
Safari Extension
Visual Studio Code Plugin
Jenkins Plugins
Other
Simulation Project
Handbook/Manual/Tutorial
ACM/OJ Project
Operation System
Teaching Managment
Tutorial Code
RISC-V Development
Bio/Medical
2020公益黑客马拉松
新冠病毒相关开源
Development Lib
/
Chinese/English Segmenter
Licenses
MulanPSL-2.0
0BSD
AFL-3.0
AGPL-3.0
Apache-2.0
Artistic-2.0
BSD-2-Clause
BSD-3-Clause
BSD-3-Clause-Clear
BSL-1.0
CC-BY-4.0
CC-BY-SA-4.0
CC0-1.0
ECL-2.0
EPL-1.0
EPL-2.0
EUPL-1.1
EUPL-1.2
GPL-2.0
GPL-3.0
ISC
LGPL-2.1
LGPL-3.0
LPPL-1.3c
MIT
MPL-2.0
MS-PL
MS-RL
MulanPSL-1.0
MulanPubL-1.0
MulanPubL-2.0
NCSA
OFL-1.1
OSL-3.0
PostgreSQL
UPL-1.0
Unlicense
WTFPL
Zlib
All Languages
Java
JavaScript
HTML
CSS
Python
Shell
C
C++
PHP
C#
TypeScript
Go
Objective-C
Android
Ruby
Kotlin
Assembly
Swift
NodeJS
Perl
Dart
Lua
Matlab
其他
Rust
HTML/CSS
PowerShell
微信
Scala
Groovy
C/C++
XSLT
Verilog
R
Docker
Pascal
CoffeeScript
QML
Erlang
FORTRAN
ActionScript
Emacs Lisp
Smalltalk
SQL
Delphi
ASP
TeX/LaTeX
VHDL
Visual Basic
Clojure
M
Common Lisp
Elixir
Haskell
Awk
LiveScript
Scheme
易语言
Julia
OCaml
Puppet
AutoHotkey
Ada
YAML
Pawn
D
XML
Standard ML
Arduino
VimL
Logos
Prolog
汇编
Haxe
ColdFusion
Vala
Crystal
Scilab
Racket
Coq
Lisp
Slash
Eiffel
eC
DOT
Zephir
Nemerle
Stars
Stars
Recommend
Last updated
狮子的魂/jcseg
GVP
2.1K
Jcseg是基于mmseg算法的一个轻量级Java中文分词器,同时集成了关键字提取,关键短语提取,关键句子提取和文章自动摘要等功能,并且提供了一个基于Jetty的web服务器,方便各大语言直接http调用,同时提供了最新版本的lucene、solr、elasticsearch、opensearch的搜索分词接口
Java
Chinese/English Segmenter
|
14 days ago
林良益/IK Analyzer 2012FF
467
IK Analyzer 是一个开源的,基于java语言开发的轻量级的中文分词工具包
Java
Chinese/English Segmenter
|
over 8 years ago
Yener/Jiagu
407
Jiagu深度学习自然语言处理工具 知识图谱关系抽取 中文分词 词性标注 命名实体识别 情感分析 新词发现 关键词 文本摘要
Python
Chinese/English Segmenter
Machine Learning/Deep Learning
|
over 2 years ago
狮子的魂/friso
GVP
357
Friso 是使用 c 语言开发的一款开源的高性能中文分词器,使用流行的mmseg算法实现。完全基于模块化设计和实现,可以很方便的植入其他程序中, 例如:MySQL,PHP,并且提供了php5, php7, ocaml, lua的插件实现
C
Chinese/English Segmenter
|
14 days ago
sunjunyi/jieba
265
结巴中文分词做最好的Python分词组件
Python
Chinese/English Segmenter
|
over 10 years ago
Rocky/FoolNLTK
241
中文处理工具包,可能不是最快的开源中文分词,但很可能是最准的开源中文分词
Python
Chinese/English Segmenter
|
over 3 years ago
百度开源/lac
213
LAC全称Lexical Analysis of Chinese,是百度自然语言处理部研发的一款联合的词法分析工具,实现中文分词、词性标注、专名识别等功能
Python
Natural Language Processing
Chinese/English Segmenter
|
7 months ago
Indexea/ideaseg
158
基于 NLP 技术实现的中文分词插件,准确度比常用的分词器高太多,同时提供 ElasticSearch 和 OpenSearch 插件。
ElasticSearch
openSearch
hanlp
中文分词
Java
Chinese/English Segmenter
|
2 days ago
Eugen/hanlp-tokenizer
134
基于HanLP自然语言处理包的elasticsearch分词器
Java
Chinese/English Segmenter
|
10 months ago
vz/gse
120
Go 语言高效分词, 支持英文、中文、日文等
Go
Chinese/English Segmenter
|
almost 2 years ago
罗瑶光/快速中文分词分析word segmentation
79
快速中文分词分析word segmentation
Java
Chinese/English Segmenter
|
1 year ago
koth/kcws
68
kcws 是一个基于深度学习的分词系统和语料项目。 Deep Learning Chinese Word Segment
Python
Chinese/English Segmenter
|
6 years ago
Jtyoui/snsg
61
该项目已经更换,在码云上不在更新,请更换地址如下。
Python
Chinese/English Segmenter
|
over 4 years ago
m631521383/IKAnalyzer2017_6_6_0
60
IK中文分词,兼容solr/lucene6.6.0,优化数字和英文搜索
Java
Chinese/English Segmenter
|
6 years ago
震秦/paoding-analysis
54
Paoding分词器基于Lucene4.x
Java
Chinese/English Segmenter
|
over 9 years ago
1
2
3
4
Trending Today
Weekly
Indexea/ideaseg
158
基于 NLP 技术实现的中文分词插件,准确度比常用的分词器高太多,同时提供 ElasticSearch 和 OpenSearch 插件。
百度开源/lac
213
LAC全称Lexical Analysis of Chinese,是百度自然语言处理部研发的一款联合的词法分析工具,实现中文分词、词性标注、专名识别等功能
Indexea/ideaseg
158
基于 NLP 技术实现的中文分词插件,准确度比常用的分词器高太多,同时提供 ElasticSearch 和 OpenSearch 插件。
百度开源/lac
213
LAC全称Lexical Analysis of Chinese,是百度自然语言处理部研发的一款联合的词法分析工具,实现中文分词、词性标注、专名识别等功能
Going to Help Center
Search
Git 命令在线学习
如何在 Gitee 导入 GitHub 仓库
Git 仓库基础操作
企业版和社区版功能对比
SSH 公钥设置
如何处理代码冲突
仓库体积过大,如何减小?
如何找回被删除的仓库数据
Gitee 产品配额说明
GitHub仓库快速导入Gitee及同步更新
什么是 Release(发行版)
将 PHP 项目自动发布到 packagist.org
Back to the top