openGauss / openGauss-server
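【Test type: tool function】【Test version: 6.0.0】【UWAL】 After installing a one-primary, two-standby cluster and deploying and configuring uwal, the cluster core dumps on restart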
Status: Accepted
#IA8JV8
Type: Defect
Author: 裴琳倩
Created: 2024-06-26 18:22
【Title】: After installing a one-primary, two-standby cluster and deploying and configuring uwal, the cluster core dumps on restart

【Operating system and hardware】(query commands: cat /etc/system-release, uname -a): openEuler release 20.03 (LTS)

【Test environment】(standalone / 1 primary, x standbys, x cascaded standbys): one primary, two standbys

【Feature under test】: uwal environment deployment and installation bt

【Test type】: functional test

【Database version】(query command: gaussdb -V): gsql (openGauss 6.0.0 build b061d510) compiled at 2024-06-12 10:07:05 commit 0 last mr

【Preconditions】:
1. Install a one-primary, two-standby cluster

【Steps】:
1. Deploy uwal following the official uwal documentation and configure the postgresql.conf parameters
2. Restart the database and query its status

【Expected output】:
1. Deployment following the official uwal documentation and configuration of the postgresql.conf parameters succeed
2. The database restarts and the status query succeeds

【Actual output】:
1. Deployment following the official uwal documentation and configuration of the postgresql.conf parameters succeed
2. The database restarts, but the status query reports an abnormal state and a core dump is generated

【Root cause analysis】: Caused by a code optimization

【Log information】(log files, screenshots, coredump information):

**pg log:**
```
2024-06-25 10:53:32.447 667a30f0.6135 [unknown] 140185696204544 dn_6001 0 dn_6001_6002_6003 00000 0 [DBL_WRT] LOG: [batch flush] DW truncate end: file_head[dwn 0, start 60], total_pages 0
2024-06-25 10:53:32.456 667a30f0.6135 [unknown] 140185696204544 dn_6001 0 dn_6001_6002_6003 00000 0 [BACKEND] LOG: keep all the xlog segments, because current segno = 5, less than wal_keep_segments = 16
2024-06-25 10:53:32.456 667a30f0.6135 [unknown] 140185696204544 dn_6001 0 dn_6001_6002_6003 00000 0 [BACKEND] LOG: slotname: dn_6002, dummy: 0, restartlsn: 0/5001570
2024-06-25 10:53:32.456 667a30f0.6135 [unknown] 140185696204544 dn_6001 0 dn_6001_6002_6003 00000 0 [BACKEND] LOG: slotname: dn_6003, dummy: 0, restartlsn: 0/5001570
2024-06-25 10:53:32.512 667a30ef.6131 [unknown] 140185511655168 dn_6001 0 dn_6001_6002_6003 58P01 0 [BACKEND] PANIC: uwal write failed, retCode = -100000.
2024-06-25 10:53:32.512 667a30ef.6131 [unknown] 140185511655168 dn_6001 0 dn_6001_6002_6003 58P01 0 [BACKEND] BACKTRACELOG: tid[31495]'s backtrace:
/data/relia_app/relia0625/cluster/app/bin/gaussdb(+0xdceadb) [0x563d1ab5dadb]
/data/relia_app/relia0625/cluster/app/bin/gaussdb(_Z9errfinishiz+0x58b) [0x563d1ab5542b]
/data/relia_app/relia0625/cluster/app/bin/gaussdb(+0x1833391) [0x563d1b5c2391]
/data/relia_app/relia0625/cluster/app/bin/gaussdb(_Z19XLogBackgroundFlushv+0x3a5) [0x563d1b5c49a5]
/data/relia_app/relia0625/cluster/app/bin/gaussdb(_Z13WalWriterMainv+0x466) [0x563d1b10de16]
/data/relia_app/relia0625/cluster/app/bin/gaussdb(_Z17GaussDbThreadMainIL15knl_thread_role39EEiP14knl_thread_arg+0x3cf) [0x563d1b0f52bf]
/data/relia_app/relia0625/cluster/app/bin/gaussdb(+0x1340285) [0x563d1b0cf285]
/lib64/libpthread.so.0(+0x7dd5) [0x7f8101f75dd5]
/lib64/libc.so.6(clone+0x6d) [0x7f8101c9eead]
Use addr2line to get pretty function name and line
2024-06-25 10:53:32.530 667a30f0.6137 [unknown] 140185489569536 dn_6001 0 dn_6001_6002_6003 00000 0 [BACKEND] LOG: walrcvwriter find uwal datasize : 4278190080
2024-06-25 10:53:32.530 667a30f0.6137 [unknown] 140185489569536 dn_6001 0 dn_6001_6002_6003 00000 0 [BACKEND] LOG: uwalRcvStateInit: truncate 83886080, write 83892016
```

**uwal log:**
```
2024-06-25 10:53:32.499693 31495 info duwal_client_meta.c 73 DuwalInsertMetaToAvlTree] Insert client meta cache succ, location(0), uwalId(64094258-3-3667803286536193).
2024-06-25 10:53:32.499697 31495 info duwal_client_create.c 376 DuwalLoadVector] No.1: uwalId(64094258-3-3667803286536193).
2024-06-25 10:53:32.510558 31495 error duwal_server_meta.c 1084 DuwalServerFindHeadOffset] Invalid head data, curOffset(262080).
2024-06-25 10:53:32.510587 31495 error duwal_server_meta.c 1154 DuwalServerGetHead] Get head offset failed:-1, dataOffset(0), uwalId(64094258-3-3667803286536193)
2024-06-25 10:53:32.510673 31495 error duwal_server_msg.c 511 DuwalGetHeadReqHandle] Query meta fail, ret(-1).
2024-06-25 10:53:32.510682 31495 error duwal_client_meta.c 742 DuwalGetHeadCb] Get master head failed, ret(-1), dstNid(0).
2024-06-25 10:53:32.510688 31495 error duwal_client_meta.c 805 DuwalGetHead] Get head fail:-1, need retry.
2024-06-25 10:53:32.510693 31495 error duwal_client_append.c 890 DuwalConstructAppendSpecifiedCtxImpl] Get master server head info failed(-1), uwalId(64094258-3-3667803286536193), dataOffset(0)
2024-06-25 10:53:47.069670 31418 error bdm_disk.c 792 BdmDiskEventsThread] disk event epoll_wait, error(Interrupted system call).
2024-06-25 10:53:47.069748 31424 error bdm_disk.c 792 BdmDiskEventsThread] disk event epoll_wait, error(Interrupted system call).
2024-06-25 10:53:47.069777 31420 error bdm_disk.c 792 BdmDiskEventsThread] disk event epoll_wait, error(Interrupted system call).
2024-06-25 10:53:47.069791 31428 error bdm_disk.c 792 BdmDiskEventsThread] disk event epoll_wait, error(Interrupted system call).
2024-06-25 10:53:47.069890 31432 error bdm_disk.c 792 BdmDiskEventsThread] disk event epoll_wait, error(Interrupted system call).
2024-06-25 10:53:47.069802 31422 error bdm_disk.c 792 BdmDiskEventsThread] disk event epoll_wait, error(Interrupted system call).
2024-06-25 10:53:47.071749 31436 error trans_hcom.c 1241 HcomLogHandler] [HCOM sock_wrapper.h:1412] sock 140737489042052 receive header failed, errno 4.
2024-06-25 10:53:47.071789 31436 warning trans_hcom.c 1238 HcomLogHandler] [HCOM net_sock_driver_oob.cpp:1177] sock uwal_tcp received an error request and it is causing ep destroy.
2024-06-25 10:53:47.072161 31430 error bdm_disk.c 792 BdmDiskEventsThread] disk event epoll_wait, error(Interrupted system call).
2024-06-25 10:53:47.072285 31435 error trans_hcom.c 1241 HcomLogHandler] [HCOM sock_wrapper.h:1412] sock 70368744288499 receive header failed, errno 4.
2024-06-25 10:53:47.072310 31435 warning trans_hcom.c 1238 HcomLogHandler] [HCOM net_sock_driver_oob.cpp:1177] sock uwal_tcp received an error request and it is causing ep destroy.
2024-06-25 10:53:47.072359 31435 debug trans_hcom.c 1188 HcomEndPointBroken] bChannel(140184304750656), usrCtx(1), payLoad(1)
2024-06-25 10:53:47.072390 31435 info trans_hcom.c 1235 HcomLogHandler] [HCOM net_sock_driver_oob.cpp:922] Destroy endpoint id 70368744288499.
2024-06-25 10:53:47.072401 31435 error trans_hcom.c 1241 HcomLogHandler] [HCOM sock_wrapper.h:1412] sock 70368744667780 receive header failed, errno 4.
```

Core stack information:

【Test code】:
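The backtrace in the pg log ends with the hint "Use addr2line to get pretty function name and line". A minimal Python sketch of that step follows. It assumes the frames follow the glibc `backtrace_symbols` layout `module(symbol+0xOFF) [0xABSOLUTE]`, and that a frame with an empty symbol carries an offset relative to the module's load base, so that base can be recovered and applied to the other frames of the same module. The helper names `module_relative_offsets` and `addr2line_commands` are hypothetical, and resolving to source lines would still need a `gaussdb` binary with debug symbols matching this build.

```python
import re

# One frame looks like: /path/to/module(symbol+0xOFF) [0xABSOLUTE]
FRAME_RE = re.compile(
    r"(?P<module>/\S+?)\((?P<symbol>[^()+]*)\+0x(?P<off>[0-9a-fA-F]+)\)\s+"
    r"\[0x(?P<addr>[0-9a-fA-F]+)\]"
)


def module_relative_offsets(backtrace):
    """Map each module path to the module-relative offsets of its frames."""
    frames = [m.groupdict() for m in FRAME_RE.finditer(backtrace)]

    # Assumption: for a frame with no symbol, OFF is relative to the load base,
    # so ABSOLUTE - OFF is the base address of that module.
    bases = {}
    for f in frames:
        if not f["symbol"]:
            bases.setdefault(f["module"], int(f["addr"], 16) - int(f["off"], 16))

    offsets = {}
    for f in frames:
        base = bases.get(f["module"])
        if base is None:
            continue  # no symbol-less frame for this module, base unknown
        offsets.setdefault(f["module"], []).append(int(f["addr"], 16) - base)
    return offsets


def addr2line_commands(backtrace):
    """Build one addr2line invocation per module (-f: functions, -C: demangle)."""
    return [
        "addr2line -f -C -e {} {}".format(module, " ".join(hex(o) for o in offs))
        for module, offs in module_relative_offsets(backtrace).items()
    ]


# Two frames copied from the pg log in this issue.
SAMPLE = (
    "/data/relia_app/relia0625/cluster/app/bin/gaussdb(+0xdceadb) [0x563d1ab5dadb] "
    "/data/relia_app/relia0625/cluster/app/bin/gaussdb(_Z19XLogBackgroundFlushv+0x3a5) [0x563d1b5c49a5]"
)

if __name__ == "__main__":
    print("\n".join(addr2line_commands(SAMPLE)))
```

Frames from a module with no symbol-less frame (here `/lib64/libc.so.6`) are skipped rather than guessed, because their load base cannot be derived from the log alone.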
Assignee: luqichao