Sign in
Sign up
Explore
Enterprise
Education
Search
Help
Terms of use
About Us
Explore
Enterprise
Education
Gitee Premium
Gitee AI
I know
View Details
Sign in
Sign up
Categories
New Tech
Hardware
IoT/Edge Computing
Car Application
Smart Home
Autopilot
Robots
5G
低代码
科研论文
quantum
chips
Web 3.0
Privacy Computing
Cloud Native
OpenHarmony
HarmonyOS Button
HarmonyOS EditText
HarmonyOS Layout
HarmonyOS Image
HarmonyOS Progress
HarmonyOS Menu
HarmonyOS Popup
HarmonyOS Selector
HarmonyOS TextView
HarmonyOS ListView
HarmonyOS Loading
HarmonyOS Notification
HarmonyOS View Transition
HarmonyOS Slider
HarmonyOS Chart
HarmonyOS Draw
HarmonyOS Counter
HarmonyOS Animate
HarmonyOS Captcha
HarmonyOS Multimedia
HarmonyOS Barcode
HarmonyOS Advanced
HarmonyOS Map
OpenHarmony Games
HarmonyOS Networking
HarmonyOS Communication
HarmonyOS Payment
HarmonyOS Database
HarmonyOS Drivers
OpenHarmony Guide
OpenHarmony DevTools
OpenHarmony App
HMS
HarmonyOS Permission
HarmonyOS Toolkit
OpenHarmony Components
Gesture
Development Lib
Chinese/English Segmenter
Payment Dev
Security Dev
Common Toolkit
Excel Toolkit
Barcode/QRCode
Template Engine
Desktop UI
Network Development Package
Audio Process
Network Tool
Network Service
Data Mining
Job/Task Scheduling
Programming Language/Scripting Language
Cache
Markdown Tools
Search Engine
Microservice
Workflow
Chart/Diagram Component
Authority Management
Reporting Tool
Code Generator
IoC/AOP Framework
Image Library
Rule Engine
JSON Toolkit
Log Toolkit
Spring Boot Extension
Verification Code
Algorithm/Mathematical Calculation
Node Extension
Process Engine
Animation Development
3G/4G/5G
AI/ML
Artificial Intelligence
VR/AR
Machine Learning/Deep Learning
Computer Vision/Face Recognition
Natural Language Processing
LLM
Blockchain
bitcoin
NFT
Wechat Projects
Wechat Development Package
WeChat Applet/Game
WeChat Application
WeChat Game
Enterprise App
Task/Project Management
Enterprise Application System
Business Intelligence
Financial/Stock Securities
GIS/Map/Navigation/Positioning
Engineering
Web System
Content Management System
New-Sale/E-Shop
BBS
Blog
Questionnaire
SNS
Teaching Managment
Album/Gallery/Picture
RSS/Atom Tool
Application
File Management System
Multimedia
Text Editor
Instant Messaging
Application Software
RPA-机器人过程自动化
Web Development
Web Framework
jQuery Plugin
UI Framework
JavaScript Toolkit
RESTful Projects
Backend Management
Website Theme
Vue.js Components
Web Sipder
OAuth/SSO
Angular Plugin
Bootstrap Plugin
React Compnent
RPC Development Framework
API Gateway
短网址
layui-components
DevOps/Network
Network Management Tool
System Monitor
DevOps
Mobile Dev
Android Component/ Project
iOS Component
Mobile App
Alipay Applet
Baidu Applet
PhoneGap/Cordova Plugin
Cross-platform Mobile Development
QuickApp
TV Devel
uniapp components
Development Tools
Version Management System
Dev/Debug
Wiki/Document
Compile/Build/Deploy
Maven Plugin
Gulp Extension
Testing Tool
Code Scan
Server Development
Distributed Service/Framework
Message Server/Message Queue
Docker
Container/Virtual Machine
Nginx Module
Big Data
Cloud Computing
One-click Installation Package
OpenResty Extension
系统性能优化
Serverless
storage
Database Related
DB Development Package
Database Service
Database Management/Monitor
Game/Recreation
Game
Game Development
3D Engine
Plugins/Extension
Chrome Extension
Wordpress Plugin
Eclipse Plugin
IDEA Plugin
Firefox Extension
Safari Extension
Visual Studio Code Plugin
Jenkins Plugins
Other
Simulation Project
Handbook/Manual/Tutorial
ACM/OJ Project
Operation System
Teaching Managment
Tutorial Code
RISC-V Development
Bio/Medical
2020公益黑客马拉松
新冠病毒相关开源
Development Lib
/
Chinese/English Segmenter
Licenses
MulanPSL-2.0
0BSD
AFL-3.0
AGPL-3.0
Apache-2.0
Artistic-2.0
BSD-2-Clause
BSD-3-Clause
BSD-3-Clause-Clear
BSD-4-Clause
BSL-1.0
CC-BY-4.0
CC-BY-SA-4.0
CC0-1.0
CECILL-2.1
CERN-OHL-P-2.0
CERN-OHL-S-2.0
CERN-OHL-W-2.0
ECL-2.0
EPL-1.0
EPL-2.0
EUPL-1.1
EUPL-1.2
GFDL-1.3
GPL-2.0
GPL-3.0
ISC
LGPL-2.1
LGPL-3.0
LPPL-1.3c
MIT
MIT-0
MPL-2.0
MS-PL
MS-RL
MulanPSL-1.0
MulanPubL-1.0
MulanPubL-2.0
NCSA
ODbL-1.0
OFL-1.1
OSL-3.0
PostgreSQL
UPL-1.0
Unlicense
Vim
WTFPL
Zlib
All Languages
Java
JavaScript
HTML
CSS
Python
C
Shell
C++
PHP
TypeScript
C#
Go
Objective-C
Android
Kotlin
Ruby
Assembly
Swift
NodeJS
Perl
Dart
Lua
Matlab
Rust
其他
PowerShell
HTML/CSS
微信
Scala
Groovy
C/C++
XSLT
Verilog
R
QML
Pascal
Docker
CoffeeScript
FORTRAN
Erlang
Emacs Lisp
ActionScript
SQL
Smalltalk
Delphi
VHDL
M
TeX/LaTeX
ASP
Visual Basic
Clojure
Common Lisp
Awk
LiveScript
Haskell
Scheme
Elixir
Julia
易语言
OCaml
YAML
AutoHotkey
Pawn
Puppet
Ada
D
Standard ML
XML
Logos
Arduino
Prolog
VimL
汇编
Coq
Haxe
Vala
ColdFusion
Crystal
Scilab
Racket
Lisp
Slash
Eiffel
eC
DOT
Zephir
Nemerle
Stars
Stars
Recommend
Last updated
狮子的魂/jcseg
GVP
2.2K
Jcseg是基于mmseg算法的一个轻量级Java中文分词器,同时集成了关键字提取,关键短语提取,关键句子提取和文章自动摘要等功能,并且提供了一个基于Jetty的web服务器,方便各大语言直接http调用,同时提供了最新版本的lucene、solr、elasticsearch、opensearch的搜索分词接口
Java
中英文分词
|
1年前
林良益/IK Analyzer 2012FF
474
IK Analyzer 是一个开源的,基于java语言开发的轻量级的中文分词工具包
Java
中英文分词
|
接近10年前
Yener/Jiagu
466
Jiagu深度学习自然语言处理工具 知识图谱关系抽取 中文分词 词性标注 命名实体识别 情感分析 新词发现 关键词 文本摘要
Python
中英文分词
机器学习/深度学习
|
3年多前
狮子的魂/friso
GVP
371
Friso 是使用 c 语言开发的一款开源的高性能中文分词器,使用流行的mmseg算法实现。完全基于模块化设计和实现,可以很方便的植入其他程序中, 例如:MySQL,PHP,并且提供了php5, php7, ocaml, lua的插件实现
C
中英文分词
|
1年前
sunjunyi/jieba
274
结巴中文分词做最好的Python分词组件
Python
中英文分词
|
11年多前
百度开源/lac
241
LAC全称Lexical Analysis of Chinese,是百度自然语言处理部研发的一款联合的词法分析工具,实现中文分词、词性标注、专名识别等功能
Python
自然语言处理
中英文分词
|
1年多前
Rocky/FoolNLTK
240
中文处理工具包,可能不是最快的开源中文分词,但很可能是最准的开源中文分词
Python
中英文分词
|
接近5年前
Indexea/ideaseg
206
基于 NLP 技术 ( HanLP ) 实现的中文分词插件,准确度比常用的分词器高太多,同时提供 ElasticSearch 和 OpenSearch 插件。
ElasticSearch
openSearch
hanlp
中文分词
Java
Chinese/English Segmenter
|
4 months ago
Eugen/hanlp-tokenizer
139
基于HanLP自然语言处理包的elasticsearch分词器
Java
Chinese/English Segmenter
|
2 years ago
vz/gse
127
Go 语言高效分词, 支持英文、中文、日文等
Go
中英文分词
|
3年前
罗瑶光/快速中文分词分析word segmentation
80
快速中文分词分析word segmentation
Java
Chinese/English Segmenter
|
over 2 years ago
koth/kcws
68
kcws 是一个基于深度学习的分词系统和语料项目。 Deep Learning Chinese Word Segment
Python
中英文分词
|
7年多前
王万宝/Surfing-Segment
62
Surfing-Segment是一个先进的文本分词工具,专门增强ik-analyzer。包含多个自定义词典,动态识别型号、同义词功能、elasticsearch插件等功能。显著的增强了对专业术语及复杂型号的分词精确度。是电商平台优化体验的理想选择。
Java
Chinese/English Segmenter
|
9 months ago
m631521383/IKAnalyzer2017_6_6_0
61
IK中文分词,兼容solr/lucene6.6.0,优化数字和英文搜索
Java
中英文分词
|
7年前
Jtyoui/snsg
61
该项目已经更换,在码云上不在更新,请更换地址如下。
Python
中英文分词
|
5年多前
1
2
3
4
5
Trending Today
Weekly
百度开源/lac
241
LAC全称Lexical Analysis of Chinese,是百度自然语言处理部研发的一款联合的词法分析工具,实现中文分词、词性标注、专名识别等功能
狮子的魂/jcseg
2.2K
Jcseg是基于mmseg算法的一个轻量级Java中文分词器,同时集成了关键字提取,关键短语提取,关键句子提取和文章自动摘要等功能,并且提供了一个基于Jetty的web服务器,方便各大语言直接http调用,同时提供了最新版本的lucene、solr、elasticsearch、opensearch的搜索分词接口
百度开源/lac
241
LAC全称Lexical Analysis of Chinese,是百度自然语言处理部研发的一款联合的词法分析工具,实现中文分词、词性标注、专名识别等功能
Going to Help Center
Search
Git 命令在线学习
如何在 Gitee 导入 GitHub 仓库
Git 仓库基础操作
企业版和社区版功能对比
SSH 公钥设置
如何处理代码冲突
仓库体积过大,如何减小?
如何找回被删除的仓库数据
Gitee 产品配额说明
GitHub仓库快速导入Gitee及同步更新
什么是 Release(发行版)
将 PHP 项目自动发布到 packagist.org
Back to the top