# learn-spring-cloud

**Repository Path**: lin92/learn-spring-cloud

## Basic Information

- **Project Name**: learn-spring-cloud
- **Description**: No description available
- **Primary Language**: Java
- **License**: Not specified
- **Default Branch**: master
- **Homepage**: None
- **GVP Project**: No

## Statistics

- **Stars**: 0
- **Forks**: 0
- **Created**: 2021-03-22
- **Last Updated**: 2024-10-15

## Categories & Tags

**Categories**: Uncategorized
**Tags**: None

## README

# learn-spring-cloud

Study notes for Spring Cloud, combined with a fairly recent version of spring-cloud-alibaba.

## 1. Create a simple parent project, learn-spring-cloud, and edit the dependencies in pom.xml

```
Version selection for spring-boot and spring-cloud: spring-boot 2.2.2.RELEASE,
spring-cloud Hoxton.SR1, spring.cloud.alibaba 2.1.0.RELEASE
```

### 2. Create a child module, cloud-provider-payment8001

Edit its dependencies and add a configuration file. Create the entity class Payment and the shared response class CommonResult, the DAO layer PaymentDao, the service interface PaymentService with its implementation PaymentServiceImpl, and the controller PaymentController.

Sample controller:

```java
package com.learn.springcloud.controller;

import com.learn.springcloud.entities.CommonResult;
import com.learn.springcloud.entities.Payment;
import com.learn.springcloud.service.PaymentService;
import lombok.extern.slf4j.Slf4j;
import org.springframework.web.bind.annotation.GetMapping;
import org.springframework.web.bind.annotation.PathVariable;
import org.springframework.web.bind.annotation.PostMapping;
import org.springframework.web.bind.annotation.RequestBody;
import org.springframework.web.bind.annotation.RestController;

import javax.annotation.Resource;

/**
 * @ClassName: PaymentController
 * @Description:
 * @Author: lin
 * @Date: 2020/8/15 22:20
 * History:
 * @ 1.0
 */
@RestController
@Slf4j
public class PaymentController {

    @Resource
    private PaymentService paymentService;

    @PostMapping(value = "/payment/create")
    public CommonResult create(@RequestBody Payment payment) {
        int result = paymentService.create(payment);
        log.info("*****insert result:" + result);
        if (result > 0) {
            return new CommonResult(200, "insert succeeded", result);
        } else {
            return new CommonResult(444, "insert failed", null);
        }
    }

    @GetMapping(value = "/payment/get/{id}")
    public CommonResult getPaymentById(@PathVariable("id") Long id) {
        Payment payment = paymentService.getPaymentById(id);
        log.info("*****query result:" + payment);
        if (payment != null) {
            return new CommonResult(200, "query succeeded", payment);
        } else {
            return new CommonResult(444, "no matching record, query id:" + id, null);
        }
    }
}
```

Test request: http://localhost:8001/payment/get/23 returns JSON, as shown in the screenshot.

![img](image/payment-query-test.png)

The insert is tested with Postman.

![img](image/payment-create-test.png)

Enable hot deployment:

```
1: add the devtools dependency
2: add the spring boot maven plugin to the parent project
3: set Enabling automatic build
4: update the value of
5: restart IDEA
```

![img](image/devtools-setting.png)

Settings:

![img](image/devtool-update-setting.png)

So far there is only one service. What happens when another service needs to call it?

### 3. Create a new module, cloud-consumer-order80

Add the relevant configuration and copy the entity classes into this module. The most basic way for two services to call each other is a plain HTTP call; another option is RestTemplate. RestTemplate is used here first.

```
RestTemplate offers several convenient methods for calling remote HTTP services. It is a simple template
class for accessing RESTful services, the client-side template toolkit Spring provides for REST access.
```

Add a config class:

```java
package com.learn.springcloud.config;

import org.springframework.context.annotation.Bean;
import org.springframework.context.annotation.Configuration;
import org.springframework.web.client.RestTemplate;

/**
 * @ClassName: ApplicationContextConfig
 * @Description:
 * @Author: lin
 * @Date: 2020/8/15 23:34
 * History:
 * @ 1.0
 */
@Configuration
public class ApplicationContextConfig {

    @Bean
    public RestTemplate getRestTemplate() {
        return new RestTemplate();
    }
}
```

Service provider: cloud-provider-payment8001. Service consumer: cloud-consumer-order80.

Calling the consumer at http://localhost/consumer/get/23 actually invokes the method in the provider.

![img](image/consumer-order-request-test.png)

When running, use the Run Dashboard component. If it is not shown, open workspace.xml under the project's .idea directory and add the component below. It appears on the next run.

```
```

### 4. Extract the shared classes

The two modules above have a problem: the same classes appear in both. Move the duplicated classes into a shared module by creating a new module, cloud-api-commons.

### 5. Use Eureka for service registration: create a new module, cloud-eureka-server7001

```
Eureka has two components: Eureka Server and Eureka Client.

Eureka Server provides the registration service. Each microservice node registers with Eureka Server
after it starts with the right configuration, so the registry in Eureka Server stores the information
of all available service nodes, and that information can be viewed directly in the UI.

Eureka Client accesses services through the registry. It is a Java client that simplifies interaction
with Eureka Server, and it also has a built-in load balancer that uses a round-robin algorithm.
After the application starts, it sends heartbeats to Eureka Server (default period: 30 seconds).
If Eureka Server receives no heartbeat from a node for several heartbeat periods,
it removes that node from the registry (default: 90s).
```

Edit the yml configuration of the Eureka service and add a startup class:

```java
package com.learn.springcloud;

import org.springframework.boot.SpringApplication;
import org.springframework.boot.autoconfigure.SpringBootApplication;
import org.springframework.cloud.netflix.eureka.server.EnableEurekaServer;

/**
 * @ClassName: EurekaMain7001
 * @Description:
 * @Author: lin
 * @Date: 2020/8/16 16:46
 * History:
 * @ 1.0
 */
@SpringBootApplication
@EnableEurekaServer
public class EurekaMain7001 {
    public static void main(String[] args) {
        SpringApplication.run(EurekaMain7001.class, args);
    }
}
```

Start the Eureka registry. No service has registered yet.

![img](image/eureka-server-test.png)

### 6. Register the service provider with Eureka

Edit the startup classes of the provider and the consumer and add the @EnableEurekaClient annotation.

```java
package com.learn.springcloud;

import org.springframework.boot.SpringApplication;
import org.springframework.boot.autoconfigure.SpringBootApplication;
import org.springframework.cloud.netflix.eureka.EnableEurekaClient;

/**
 * @ClassName: PaymentMain8001
 * @Description:
 * @Author: lin
 * @Date: 2020/8/15 21:29
 * History:
 * @ 1.0
 */
@SpringBootApplication
@EnableEurekaClient
public class PaymentMain8001 {
    public static void main(String[] args) {
        SpringApplication.run(PaymentMain8001.class, args);
    }
}
```

Also edit the pom file and add the Eureka client dependency:

```
<dependency>
    <groupId>org.springframework.cloud</groupId>
    <artifactId>spring-cloud-starter-netflix-eureka-client</artifactId>
</dependency>
```

Edit application.yml so the provider registers with the registry:

```
eureka:
  client:
    # whether to register this application with Eureka Server (default: true)
    register-with-eureka: true
    # whether to fetch existing registry information from Eureka Server (default: true).
    # It does not matter for a single node, but a cluster must set it to true
    # so that Ribbon load balancing works
    fetch-registry: true
    service-url:
      # cluster
      #defaultZone: http://eureka7001.com:7001/eureka/,http://eureka7002.com:7002/eureka/
      # standalone
      # defaultZone: http://eureka7001.com:7001/eureka/
      defaultZone: http://localhost:7001/eureka/
  instance:
    instance-id: payment8001
    prefer-ip-address: true # show the IP in the access path
    # interval at which the Eureka client sends heartbeats to the server, in seconds (default: 30)
    lease-renewal-interval-in-seconds: 1
    # upper limit the Eureka server waits after the last heartbeat, in seconds (default: 90);
    # the instance is evicted when it is exceeded
    lease-expiration-duration-in-seconds: 2
```

Start the provider and refresh the registry page: the provider is now registered with Eureka. The line of red text in the screenshot below comes from Eureka's self-preservation mechanism.

![img](image/eureka-provider-client.png)

### 7. Edit the startup class of the service consumer

```java
package com.learn.springcloud;

import org.springframework.boot.SpringApplication;
import org.springframework.boot.autoconfigure.SpringBootApplication;
import org.springframework.cloud.netflix.eureka.EnableEurekaClient;

/**
 * @ClassName: OrderMain80
 * @Description:
 * @Author: lin
 * @Date: 2020/8/15 23:26
 * History:
 * @ 1.0
 */
@EnableEurekaClient
@SpringBootApplication
public class OrderMain80 {
    public static void main(String[] args) {
        SpringApplication.run(OrderMain80.class, args);
    }
}
```

Add the Eureka client dependency to the pom file and edit the yml file so the consumer registers with Eureka as well. After the change, refresh the registry page: the consumer is registered too.

![img](image/eureka-consumer-client.png)

Note: pay attention to indentation levels and spaces in the yml file, otherwise startup fails with:

```
Failed to bind properties under 'eureka.client.service-url' to java.util.map
```

### 8. Building a Eureka cluster

```
Eureka
Service registration: put service information into the registry
Service discovery: get service information from the registry
In essence: store the service name as the key, fetch the call address as the value

1: start the Eureka registry
2: start the service provider (the payment service)
3: after starting, the payment service registers its own information (such as its address) with Eureka under an alias
4: when the consumer (the order service) needs to call an endpoint, it uses the service alias to get the actual RPC call address from the registry
5: once the consumer has the address, the remote call itself is made with HttpClient under the hood
6: the consumer caches the service addresses in local JVM memory and by default refreshes them every 30 seconds
```

### 9. Create the module cloud-eureka-server7002 to form the Eureka cluster

Add the same dependencies to its pom file as in cloud-eureka-server7001, then edit application.yml. Because this is a cluster, the servers must register with each other. Also edit the hosts file:

```
127.0.0.1 eureka7001.com
127.0.0.1 eureka7002.com
```

The hostname in the application.yml of cloud-eureka-server7001 must differ from the one in server7002 so the two can be told apart.
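Mutual registration comes down to one rule: each Eureka server points at every other node of the cluster, not at itself. The sketch below expresses that rule in plain Java. `EurekaPeerZones` and `peerZones` are hypothetical helper names used only for illustration; they are not part of Eureka, and the hosts and ports are the ones from the hosts file above.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.stream.Collectors;

public class EurekaPeerZones {

    /**
     * Hypothetical helper: builds the comma-separated registry URLs of all
     * OTHER nodes in the cluster, i.e. the peers one server registers with.
     */
    public static String peerZones(String selfHost, Map<String, Integer> nodes) {
        return nodes.entrySet().stream()
                .filter(e -> !e.getKey().equals(selfHost))          // skip ourselves
                .map(e -> "http://" + e.getKey() + ":" + e.getValue() + "/eureka/")
                .collect(Collectors.joining(","));
    }

    public static void main(String[] args) {
        Map<String, Integer> nodes = new LinkedHashMap<>();
        nodes.put("eureka7001.com", 7001);
        nodes.put("eureka7002.com", 7002);

        // 7001 registers with 7002, and 7002 registers with 7001
        System.out.println("7001 -> " + peerZones("eureka7001.com", nodes));
        System.out.println("7002 -> " + peerZones("eureka7002.com", nodes));
    }
}
```

With more than two nodes the same rule yields a comma-separated list, which matches the form of the cluster `defaultZone` value used by the clients.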
The two servers then register with each other: cloud-eureka-server7001 registers with cloud-eureka-server7002, and cloud-eureka-server7002 registers with cloud-eureka-server7001.

```
in cloud-eureka-server7001:
hostname: eureka7001.com # instance name of the Eureka server
in cloud-eureka-server7002:
hostname: eureka7002.com # instance name of the Eureka server
```

### 10. Register the provider and the consumer with the Eureka cluster

Edit application.yml in both the service provider and the service consumer so that they register with the cluster.

```
# cluster
defaultZone: http://eureka7001.com:7001/eureka/,http://eureka7002.com:7002/eureka/
```

Start the eureka7001 and eureka7002 cluster services first, then cloud-provider-payment8001, then cloud-consumer-order80. Opening eureka7001.com:7001 shows that the provider and the consumer are registered with the Eureka cluster.

![img](image/eureka-server7001.png)

### 11. Build a provider cluster

A single provider instance is fine while traffic is low, but the load on it grows with the number of requests, and if that server goes down the whole service becomes unavailable. So build a provider cluster: create a new module, cloud-provider-payment8002, with the same pom dependencies as cloud-provider-payment8001. Only the port number in application.yml needs to change.

![img](image/provider-cluster-8001-8002.png)

However, http://localhost/consumer/get/23 always returns port 8001. The consumer still calls the provider in standalone style, with the provider address hard-coded. With a provider cluster the address must not be hard-coded; it has to be the service name in the cluster, CLOUD-PAYMENT-SERVICE. Then consumer calls no longer always hit port 8001.

```java
package com.learn.springcloud.controller;

import com.learn.springcloud.entities.CommonResult;
import com.learn.springcloud.entities.Payment;
import lombok.extern.slf4j.Slf4j;
import org.springframework.web.bind.annotation.GetMapping;
import org.springframework.web.bind.annotation.PathVariable;
import org.springframework.web.bind.annotation.RestController;
import org.springframework.web.client.RestTemplate;

import javax.annotation.Resource;

/**
 * @ClassName: OrderController
 * @Description:
 * @Author: lin
 * @Date: 2020/8/15 23:28
 * History:
 * @ 1.0
 */
@RestController
@Slf4j
public class OrderController {

    // public static final String PAYMENT_URL = "http://localhost:8001";
    public static final String PAYMENT_URL = "http://CLOUD-PAYMENT-SERVICE";

    @Resource
    private RestTemplate restTemplate;

    @GetMapping("/consumer/payment/create")
    public CommonResult create(Payment payment) {
        // the path needs its leading slash, otherwise it is glued onto the service name
        return restTemplate.postForObject(PAYMENT_URL + "/payment/create", payment,
                CommonResult.class);
    }

    @GetMapping("/consumer/get/{id}")
    public CommonResult getPaymentById(@PathVariable("id") Long id) {
        return restTemplate.getForObject(PAYMENT_URL + "/payment/get/" + id, CommonResult.class);
    }
}
```

Note that after this change the request fails. RestTemplate needs its load-balancing capability switched on, otherwise it does not know which instance to call. Add the @LoadBalanced annotation in ApplicationContextConfig.

```java
package com.learn.springcloud.config;

import org.springframework.cloud.client.loadbalancer.LoadBalanced;
import org.springframework.context.annotation.Bean;
import org.springframework.context.annotation.Configuration;
import org.springframework.web.client.RestTemplate;

/**
 * @ClassName: ApplicationContextConfig
 * @Description:
 * @Author: lin
 * @Date: 2020/8/15 23:34
 * History:
 * @ 1.0
 */
@Configuration
public class ApplicationContextConfig {

    @Bean
    @LoadBalanced
    public RestTemplate getRestTemplate() {
        return new RestTemplate();
    }
}
```

Accessing http://localhost/consumer/get/23 again now picks an instance from the cluster by load balancing, and ports 8001 and 8002 alternate.

### 12. Expose the service name instead of the IP address

Edit the yml configuration and add an instance section. If the IP should be shown when hovering over the service link, add prefer-ip-address: true.

```
instance:
  instance-id: payment8001
  prefer-ip-address: true # show the IP in the access path
  # interval at which the Eureka client sends heartbeats to the server, in seconds (default: 30)
  lease-renewal-interval-in-seconds: 1
  # upper limit the Eureka server waits after the last heartbeat, in seconds (default: 90);
  # the instance is evicted when it is exceeded
  lease-expiration-duration-in-seconds: 2
```

### 13. Service discovery

Inject a DiscoveryClient into the controller of cloud-provider-payment8001 and add the @EnableDiscoveryClient annotation to the startup class. This makes it possible to see which services the registry holds.

```
/**
 * Service discovery: lists the services in the registry
 * and the details of the instances under one service
 * @return
 */
@GetMapping(value = "/payment/discovery")
public Object discovery() {
    List<String> services = discoveryClient.getServices();
    for (String element : services) {
        log.info("*****element:" + element);
    }
    // all instances of one microservice
    List<ServiceInstance> instances = discoveryClient.getInstances("CLOUD-PAYMENT-SERVICE");
    for (ServiceInstance instance : instances) {
        log.info(instance.getServiceId() + "\t" + instance.getHost() + "\t" + instance.getPort() + instance.getUri());
    }
    return this.discoveryClient;
}
```

Calling this endpoint shows which services the registry holds. Two microservices are registered here.
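To see the shape of the data that endpoint walks through without starting a registry, here is a self-contained sketch. `InMemoryRegistry` and its methods are hypothetical stand-ins that only mimic the two lookups used above (list the service names, list the instances of one service); it is not Spring Cloud's `DiscoveryClient`, and `CLOUD-ORDER-SERVICE` is an assumed name for the consumer.

```java
import java.net.URI;
import java.util.ArrayList;
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class InMemoryRegistry {

    // key: service name, value: addresses of the instances registered under it
    private final Map<String, List<URI>> table = new LinkedHashMap<>();

    public void register(String serviceName, String host, int port) {
        table.computeIfAbsent(serviceName, k -> new ArrayList<>())
             .add(URI.create("http://" + host + ":" + port));
    }

    /** All registered service names, in registration order. */
    public List<String> getServices() {
        return new ArrayList<>(table.keySet());
    }

    /** All instances of one service; empty if the name is unknown. */
    public List<URI> getInstances(String serviceName) {
        return table.getOrDefault(serviceName, Collections.emptyList());
    }

    public static void main(String[] args) {
        InMemoryRegistry registry = new InMemoryRegistry();
        registry.register("CLOUD-PAYMENT-SERVICE", "192.168.2.187", 8001);
        registry.register("CLOUD-PAYMENT-SERVICE", "192.168.2.187", 8002);
        registry.register("CLOUD-ORDER-SERVICE", "192.168.2.187", 80); // assumed name

        for (String service : registry.getServices()) {
            for (URI instance : registry.getInstances(service)) {
                System.out.println(service + "\t" + instance.getHost()
                        + "\t" + instance.getPort() + "\t" + instance);
            }
        }
    }
}
```

One service name maps to a list of instances, which is why a consumer that addresses the provider by name can be routed to either port.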
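![img](image/service-discovery.png)

The console output shows that the microservice CLOUD-PAYMENT-SERVICE has two instances.

```
2020-08-17 09:23:06.739 INFO 10928 --- [nio-8001-exec-3] c.l.s.controller.PaymentController : CLOUD-PAYMENT-SERVICE 192.168.2.187 8002http://192.168.2.187:8002
2020-08-17 09:23:06.739 INFO 10928 --- [nio-8001-exec-3] c.l.s.controller.PaymentController : CLOUD-PAYMENT-SERVICE 192.168.2.187 8001http://192.168.2.187:8001
```

## 14. Eureka self-preservation mode

Self-preservation mode protects the registry when a network partition separates a group of clients from Eureka Server. Once the mode is active, Eureka Server tries to preserve the information in its service registry and stops deleting entries, so no microservice is deregistered.

If the Eureka Server home page shows a message containing JUST TO BE SAFE, the server has entered self-preservation mode. Put simply: when a microservice becomes unavailable at some moment, Eureka does not clean it up immediately and keeps its information.

- Why does the self-preservation mechanism exist? It covers the case where an EurekaClient is running normally but cannot reach EurekaServer over the network. In that case EurekaServer does not immediately evict the EurekaClient.
- What is self-preservation mode? By default, if EurekaServer receives no heartbeat from a microservice instance within a certain period, it deregisters that instance (default: 90s). But when a network partition occurs (delays, stalls, congestion), microservices and EurekaServer cannot communicate normally, and this behavior becomes dangerous, because the microservice itself is healthy and should not be deregistered. Eureka solves this with self-preservation mode: when an EurekaServer node loses too many clients within a short time (possibly because of a network partition), the node enters self-preservation mode.

In self-preservation mode, EurekaServer protects the information in the registry and no longer deregisters any service instance. The design philosophy is to keep possibly wrong registration information rather than blindly deregister an instance that may be healthy. In one sentence: a poor life is better than a good death.

In summary, self-preservation mode is a safety measure against network anomalies. Its architectural philosophy is to keep all microservices (healthy and unhealthy alike) rather than blindly deregister any healthy one. It makes a Eureka cluster more robust and stable.

### 14.1 Disable Eureka self-preservation mode

```
In the yml configuration of the Eureka server:
# turn off self-preservation so that unavailable services are removed promptly
enable-self-preservation: false
# shorten the eviction interval
eviction-interval-timer-in-ms: 2000

In the service provider and the service consumer:
# interval at which the Eureka client sends heartbeats to the server, in seconds (default: 30)
lease-renewal-interval-in-seconds: 1
# upper limit the Eureka server waits after the last heartbeat, in seconds (default: 90); the instance is evicted when it is exceeded
lease-expiration-duration-in-seconds: 2
```

## 15. Use ZooKeeper as the registry

ZooKeeper first has to be installed and started on Linux. Version zookeeper-3.4.14 is used here. Then create a new module in the project, cloud-provider-payment8004, add the ZooKeeper dependency and create the startup class. On startup a conflict appears: curator-x-discovery:4.0.1 bundles zookeeper-3.5.3-beta.jar, which does not match the server version.

![img](image/zookeeper-start-conflict.png)

### 15.1 Resolve the ZooKeeper dependency conflict

Edit the project's pom dependencies: exclude the bundled zookeeper, then add a zookeeper dependency whose version matches the server.

```
<dependency>
    <groupId>org.springframework.cloud</groupId>
    <artifactId>spring-cloud-starter-zookeeper-discovery</artifactId>
    <exclusions>
        <exclusion>
            <groupId>org.apache.zookeeper</groupId>
            <artifactId>zookeeper</artifactId>
        </exclusion>
    </exclusions>
</dependency>
<!-- then add zookeeper with the same version as the server -->
<dependency>
    <groupId>org.apache.zookeeper</groupId>
    <artifactId>zookeeper</artifactId>
    <version>3.4.14</version>
</dependency>
```

Note that the zookeeper version added here conflicts with the slf4j version used with Lombok, so the slf4j dependencies inside zookeeper-3.4.14 must be excluded as well.

```
<dependency>
    <groupId>org.apache.zookeeper</groupId>
    <artifactId>zookeeper</artifactId>
    <version>3.4.14</version>
    <exclusions>
        <exclusion>
            <groupId>org.slf4j</groupId>
            <artifactId>slf4j-api</artifactId>
        </exclusion>
        <exclusion>
            <groupId>org.slf4j</groupId>
            <artifactId>slf4j-log4j12</artifactId>
        </exclusion>
    </exclusions>
</dependency>
```

### 15.2 Start payment8004 again

It now registers with ZooKeeper.

![img](image/payment-8004-zookeeper-start.png)

On the ZooKeeper server, cloud-provider-payment8004 is visible as registered.

![img](image/payment8004-zookeeper-server-success.png)

The query path on 8004 also works.

![img](image/payment8004-query-success.png)

The registration details can be inspected in ZooKeeper:

```
[zk: localhost:2181(CONNECTED) 7] get /services/cloud-provider-payment/c803e2ff-c099-42fe-9696-550177c1eb51
{"name":"cloud-provider-payment","id":"c803e2ff-c099-42fe-9696-550177c1eb51","address":"LAPTOP-LQ52K6M2","port":8004,"sslPort":null,"payload":{"@class":"org.springframework.cloud.zookeeper.discovery.ZookeeperInstance","id":"application-1","name":"cloud-provider-payment","metadata":{}},"registrationTimeUTC":1597644201503,"serviceType":"DYNAMIC","uriSpec":{"parts":[{"value":"scheme","variable":true},{"value":"://","variable":false},{"value":"address","variable":true},{"value":":","variable":false},{"value":"port","variable":true}]}}
cZxid = 0x11
ctime = Mon Aug 17 14:03:29 CST 2020
mZxid = 0x11
mtime = Mon Aug 17 14:03:29 CST 2020
pZxid = 0x11
cversion = 0
dataVersion = 0
aclVersion = 0
ephemeralOwner = 0x100001c8e650003
dataLength = 536
numChildren = 0
[zk: localhost:2181(CONNECTED) 8]
```

### 15.3 Add a consumer service, cloud-consumerzk-order80, and register it with ZooKeeper

This module is built the same way as the earlier consumer. Only the ZooKeeper dependency in pom.xml has to be changed, and application.yml has to be edited so the service registers with ZooKeeper. The OrderZkController looks like this:

```java
package com.learn.springcloud.controller;

import com.learn.springcloud.entities.CommonResult;
import com.learn.springcloud.entities.Payment;
import lombok.extern.slf4j.Slf4j;
import org.springframework.web.bind.annotation.GetMapping;
import org.springframework.web.bind.annotation.PathVariable;
import org.springframework.web.bind.annotation.RequestMapping;
import org.springframework.web.bind.annotation.RestController;
import org.springframework.web.client.RestTemplate;

import javax.annotation.Resource;
import java.util.UUID;

/**
 * @ClassName: OrderZkController
 * @Description:
 * @Author: lin
 * @Date: 2020/8/17 14:33
 * History:
 * @ 1.0
 */
@RestController
@Slf4j
public class OrderZkController {

    /**
     * Name of the microservice registered in ZooKeeper
     */
    public static final String INVOKE_URL = "http://cloud-provider-payment";

    @Resource
    private RestTemplate restTemplate;

    @RequestMapping(value = "/consumer/payment/zk")
    public String paymentInfo() {
        return restTemplate.getForObject(INVOKE_URL + "/payment/zk", String.class);
    }
}
```

Start OrderZkMain80. The first start is slow, and the connection goes from Windows to Linux. Afterwards the consumer is visible among the services registered in ZooKeeper.

![img](image/zookeeper-consumer-order80.png)

The endpoint defined in this module, http://localhost/consumer/payment/zk, can also be called successfully.

## 16. Use Consul as the registry

What is Consul?

Consul is an open-source distributed service discovery and configuration management system, developed in Go by HashiCorp. It provides service governance, a configuration center and a control bus for service systems. Each of these features can be used on its own as needed, or together to build a full service mesh. In short, Consul offers a complete service mesh solution. Its advantages include:

- based on the Raft protocol, which is fairly simple
- health checks, with support for both HTTP and DNS
- WAN clusters across data centers
- a graphical UI
- cross-platform: Linux, Mac, Windows

### 16.1 Download and start Consul

Download the Consul build that fits your environment. After unpacking there is a single file, consul.exe. Open cmd in that directory and start it with consul agent -dev. Then open http://localhost:8500/ui/dc1/services to see the UI.

![img](image/consul-ui.png)

### 16.2 Create a module and register the provider with Consul

Create cloud-provide-consul-payment8006, add the Consul dependency to the pom, and edit the configuration file.

```
spring:
  application:
    name: cloud-provider-payment
  cloud:
    consul:
      # address of the Consul registry
      host: localhost
      port: 8500
      discovery:
        hostname: 127.0.0.1
        service-name: ${spring.application.name}
```

Start the main class. The service is now registered with Consul.

![img](image/consul-provider-payment-8006.png)

The endpoint http://localhost:8006/payment/consul can also be called successfully.

### 16.3 Add the consumer module cloud-consumer-consul-order80

Add the pom dependencies and the configuration file so that it registers with Consul too, add a controller, and start the main class. Then check whether the consumer is registered with Consul.

```java
package com.learn.springcloud.controller;

import lombok.extern.slf4j.Slf4j;
import org.springframework.web.bind.annotation.GetMapping;
import org.springframework.web.bind.annotation.RestController;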
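import org.springframework.web.client.RestTemplate;

import javax.annotation.Resource;

/**
 * @ClassName: OrderConsulController
 * @Description:
 * @Author: lin
 * @Date: 2020/8/17 15:56
 * History:
 * @ 1.0
 */
@RestController
@Slf4j
public class OrderConsulController {

    public static final String INVOKE_URL = "http://cloud-provider-payment";

    @Resource
    private RestTemplate restTemplate;

    /**
     * http://localhost/consumer/payment/consul
     *
     * @return
     */
    @GetMapping("/consumer/payment/consul")
    public String paymentInfo() {
        return restTemplate.getForObject(INVOKE_URL + "/payment/consul", String.class);
    }
}
```

The page http://localhost:8500/ui/dc1/services shows that the consumer is registered as well.

![img](image/consul-consumer-order.png)

## 17. Similarities and differences of the three registries

1. Eureka: written in Java; CAP: AP; health checks: supported, configurable; exposed interface: HTTP; integrated with Spring Cloud
2. Consul: written in Go; CAP: CP; health checks: supported; exposed interface: HTTP/DNS; integrated with Spring Cloud
3. Zookeeper: written in Java; CAP: CP; health checks: supported; exposed interface: client; integrated with Spring Cloud

```
CAP (Consistency, Availability, Partition tolerance)
The core of the CAP theorem: a distributed system cannot fully satisfy consistency, availability and
partition tolerance at the same time. Based on CAP, NoSQL data stores are therefore divided into
three classes, satisfying CA, CP or AP:
CA - single-site cluster; satisfies consistency and availability, usually not very strong in scalability.
CP - satisfies consistency and partition tolerance, usually not especially high in performance.
AP - satisfies availability and partition tolerance, usually with lower consistency requirements.
```

## 18. Ribbon load balancing and service calls

Testing still uses EurekaMain7001, EurekaMain7002, PaymentMain8001, PaymentMain8002 and OrderMain80.

First, what is Spring Cloud Ribbon? It is a set of client-side load-balancing algorithms and service-call tools built on Netflix Ribbon.

```
Put simply, Ribbon is an open-source project released by Netflix. Its main job is to provide client-side
software load-balancing algorithms and service calls. The Ribbon client component offers a full set of
configuration options such as connection timeout and retry. In short, you list all the machines behind
the Load Balancer (LB) in the configuration file, and Ribbon automatically connects to them according
to some rule. It is also easy to implement a custom load-balancing algorithm with Ribbon.
```

### 18.1 What is LB (load balancing)?

```
1. Put simply, it spreads user requests evenly across multiple services to achieve system HA (high availability).
   Common load balancers are software such as Nginx and LVS, and hardware such as F5.
2. Ribbon local (client-side) load balancing vs. Nginx server-side load balancing:
   Nginx is server-side load balancing: all client requests go to Nginx, which forwards them.
   The load balancing is done on the server side.
   Ribbon is local load balancing: when calling a microservice endpoint, it fetches the list of registered
   services from the registry and caches it in the local JVM, so the RPC remote call is balanced locally.
3. Centralized LB
   A separate LB facility sits between the consumer and the provider and forwards requests to the provider
   according to some strategy.
4. In-process LB
   The LB logic is integrated into the consumer. The consumer learns from the registry which addresses are
   available and then picks a suitable server from them itself.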
   Ribbon is an in-process LB. It is just a library integrated into the consumer process, and the consumer
   uses it to obtain the provider's address.
```

### 18.2 Where does the load balancing come from?

When calling localhost:80/consumer/payment/get/23 on cloud-consumer-order80 above, the ports appear in round-robin order, although Ribbon was never added to the pom. How is this load balancing implemented? The spring-cloud-starter-netflix-eureka-client dependency already brings in Ribbon, so the client gets load balancing out of the box.

### 18.3 getForObject vs. getForEntity in RestTemplate

getForObject returns the response body converted to an object, which can be thought of as the JSON payload. getForEntity returns a ResponseEntity, which also carries important response information such as the headers, the status code and the body.

```
@GetMapping("/consumer/payment/getForEntity/{id}")
public CommonResult getPaymentById2(@PathVariable("id") Long id) {
    ResponseEntity<CommonResult> entity = restTemplate.getForEntity(PAYMENT_URL + "/payment/get/" + id, CommonResult.class);
    if (entity.getStatusCode().is2xxSuccessful()) {
        return entity.getBody();
    } else {
        return new CommonResult<>(444, "operation failed");
    }
}
```

### 18.4 Ribbon core component: IRule

```
com.netflix.loadbalancer.RoundRobinRule --- round robin
com.netflix.loadbalancer.RandomRule --- random
com.netflix.loadbalancer.RetryRule --- picks a server with the RoundRobinRule strategy first; if that fails, retries within a given time to get an available server
WeightedResponseTimeRule --- an extension of RoundRobinRule; the faster an instance responds, the larger its weight and the more likely it is chosen
BestAvailableRule --- first filters out servers whose circuit breaker has tripped after repeated failures, then picks the server with the lowest concurrency
AvailabilityFilteringRule --- first filters out failed instances, then picks an instance with lower concurrency
ZoneAvoidanceRule --- the default rule; chooses a server by jointly evaluating the performance of the server's zone and the server's availability
```

### 18.5 Replace the rule with your own choice

In cloud-consumer-order80, create a new package that is not under the same path as the main startup class. The official documentation gives an explicit warning about this:

```
The custom configuration class must not be placed in the package scanned by @ComponentScan or in its
sub-packages. Otherwise the configuration class is shared by all Ribbon clients and the per-client
customization is lost.
```

Create a class named MySelfRule and annotate it with @Configuration:

```java
package com.myrule;

import com.netflix.loadbalancer.IRule;
import com.netflix.loadbalancer.RandomRule;
import org.springframework.context.annotation.Bean;
import org.springframework.context.annotation.Configuration;

/**
 * @ClassName: MySelfRule
 * @Description:
 * @Author: lin
 * @Date: 2020/8/17 22:53
 * History:
 * @ 1.0
 */
@Configuration
public class MySelfRule {

    /**
     *
     * @return
     */
    @Bean
    public IRule myRule() {
        return new RandomRule();
    }
}
```

Then add the @RibbonClient annotation to the main startup class and point it at the custom rule. Start the main class and call the endpoint: the instances are now chosen at random.

```
@RibbonClient(name = "CLOUD-PAYMENT-SERVICE", configuration = MySelfRule.class)
```

Round-robin algorithm: (number of the current request to the REST endpoint) % (total number of servers in the cluster) = index of the server actually called. The request counter starts again from 1 after every service restart.

```
For example:
List[0] instances = 127.0.0.1:8002
List[1] instances = 127.0.0.1:8001

8001 + 8002 form the cluster: 2 machines in total, so the cluster size is 2. By the round-robin rule:
request count 1: 1 % 2 = 1, index 1, service address 127.0.0.1:8001
request count 2: 2 % 2 = 0, index 0, service address 127.0.0.1:8002
request count 3: 3 % 2 = 1, index 1, service address 127.0.0.1:8001
request count 4: 4 % 2 = 0, index 0, service address 127.0.0.1:8002
```

### 18.6 The round-robin rule, RoundRobinRule, is computed with a modulo operation

```java
/*
 *
 * Copyright 2013 Netflix, Inc.
 *
 * Licensed under the Apache License, Version 2.0 (the "License");
 * you may not use this file except in compliance with the License.
 * You may obtain a copy of the License at
 *
 * http://www.apache.org/licenses/LICENSE-2.0
 *
 * Unless required by applicable law or agreed to in writing, software
 * distributed under the License is distributed on an "AS IS" BASIS,
 * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
 * See the License for the specific language governing permissions and
 * limitations under the License.
 *
 */
package com.netflix.loadbalancer;

import com.netflix.client.config.IClientConfig;
import org.slf4j.Logger;
import org.slf4j.LoggerFactory;

import java.util.List;
import java.util.concurrent.atomic.AtomicInteger;

/**
 * The most well known and basic load balancing strategy, i.e. Round Robin Rule.
 *
 * @author stonse
 * @author Nikos Michalakis
 *
 */
public class RoundRobinRule extends AbstractLoadBalancerRule {

    private AtomicInteger nextServerCyclicCounter;
    private static final boolean AVAILABLE_ONLY_SERVERS = true;
    private static final boolean ALL_SERVERS = false;

    private static Logger log = LoggerFactory.getLogger(RoundRobinRule.class);

    public RoundRobinRule() {
        nextServerCyclicCounter = new AtomicInteger(0);
    }

    public RoundRobinRule(ILoadBalancer lb) {
        this();
        setLoadBalancer(lb);
    }

    public Server choose(ILoadBalancer lb, Object key) {
        if (lb == null) {
            log.warn("no load balancer");
            return null;
        }

        Server server = null;
        int count = 0;
        while (server == null && count++ < 10) {
            List<Server> reachableServers = lb.getReachableServers();
            List<Server> allServers = lb.getAllServers();
            int upCount = reachableServers.size();
            int serverCount = allServers.size();

            if ((upCount == 0) || (serverCount == 0)) {
                log.warn("No up servers available from load balancer: " + lb);
                return null;
            }

            int nextServerIndex = incrementAndGetModulo(serverCount);
            server = allServers.get(nextServerIndex);

            if (server == null) {
                /* Transient. */
                Thread.yield();
                continue;
            }

            if (server.isAlive() && (server.isReadyToServe())) {
                return (server);
            }

            // Next.
            server = null;
        }

        if (count >= 10) {
            log.warn("No available alive servers after 10 tries from load balancer: "
                    + lb);
        }
        return server;
    }

    /**
     * Inspired by the implementation of {@link AtomicInteger#incrementAndGet()}.
     *
     * @param modulo The modulo to bound the value of the counter.
     * @return The next value.
     */
    private int incrementAndGetModulo(int modulo) {
        // This is a spin (retry) loop, not an endless loop.
        for (;;) {
            int current = nextServerCyclicCounter.get();
            int next = (current + 1) % modulo;
            // Compare-and-set: the counter starts at 0 (set in the constructor above).
            // If the value in memory still equals `current`, i.e. nobody else changed it,
            // the value in memory is updated to `next`; otherwise the loop retries.
            if (nextServerCyclicCounter.compareAndSet(current, next))
                return next;
        }
    }

    @Override
    public Server choose(Object key) {
        return choose(getLoadBalancer(), key);
    }

    @Override
    public void initWithNiwsConfig(IClientConfig clientConfig) {
    }
}
```

### 18.7 Custom load-balancing algorithm

First remove the @LoadBalanced annotation from ApplicationContextConfig: the algorithm is going to be hand-written, so Ribbon's own load balancing is not needed. Start by defining an interface that picks a service instance:

```java
package com.learn.springcloud.lb;

import org.springframework.cloud.client.ServiceInstance;

import java.util.List;

/**
 * Interface for a hand-written load-balancing algorithm
 * @ClassName: LoadBalancer
 * @Description:
 * @Author: lin
 * @Date: 2020/8/18 10:44
 * @History:
 * @ 1.0
 */
public interface LoadBalancer {

    /**
     * Picks one instance from the instances of a microservice
     * @param serviceInstances
     * @return
     */
    ServiceInstance instance(List<ServiceInstance> serviceInstances);
}
```

Implement the LoadBalancer interface and write the algorithm:

```java
package com.learn.springcloud.lb;

import org.springframework.cloud.client.ServiceInstance;
import org.springframework.stereotype.Component;

import java.util.List;
import java.util.concurrent.atomic.AtomicInteger;

/**
 * Implements the interface and writes the Ribbon-style round-robin algorithm in it
 * @ClassName: MyLb
 * @Description:
 * @Author: lin
 * @Date: 2020/8/18 10:49
 * History:
 * @ 1.0
 */
@Component // hand the bean over to the Spring container
public class MyLb implements LoadBalancer {

    /**
     * Counter used by the compare-and-set below
     */
    private AtomicInteger atomicInteger = new AtomicInteger(0);

    /**
     * Returns the number of the current request to the REST endpoint
     * @return
     */
    public final int getAndIncrement() {
        int current = 0;
        int next = 0;
        do {
            // Read the current value. Reading the wrong value here would keep the loop spinning forever.
            current = this.atomicInteger.get();
            // If the counter has reached the int maximum, restart from 0; otherwise add 1.
            int maxSize = 2147483647;
            next = current >= maxSize ?
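                    0 : current + 1;
            // Spin: if the value in memory equals `current`, compareAndSet succeeds,
            // the negated condition is false and the loop ends with `next`.
        } while (!atomicInteger.compareAndSet(current, next));
        System.out.println("****request number, next:" + next);
        return next;
    }

    /**
     * Round-robin algorithm: (number of the current request to the REST endpoint) % (total number of
     * servers in the cluster) = index of the server actually called.
     * The request counter starts again from 1 after every service restart.
     * @param serviceInstances
     * @return
     */
    @Override
    public ServiceInstance instance(List<ServiceInstance> serviceInstances) {
        // Same idea as Ribbon: take the request count modulo the number of instances,
        // then use the result as the index into the instance list.
        int index = getAndIncrement() % serviceInstances.size();
        return serviceInstances.get(index);
    }
}
```

Finally add an endpoint to the controller for testing. Calling http://localhost:80/consumer/payment/lb shows the port changing between requests.

## 19. What is OpenFeign?

```
Official site: https://docs.spring.io/spring-cloud-openfeign/docs/2.2.4.RELEASE/reference/html/
Feign is a declarative web service client that makes writing web service clients very easy:
create an interface and add annotations to it.
```

What can Feign do?

```
Feign aims to make writing Java HTTP clients easier.
Earlier, with Ribbon + RestTemplate, RestTemplate wrapped the HTTP requests into a templated way of calling.
In real development, though, a dependency is usually called from more than one place, and one endpoint is
often called from many places, so teams typically write their own client classes per microservice to wrap
those calls. Feign goes one step further and helps define and implement the interface of the dependent
service. With Feign we only create an interface and configure it with annotations (before, a DAO interface
was marked with a Mapper annotation; now a microservice interface is marked with a Feign annotation).
That binds the interface to the provider and reduces the client code that had to be written by hand
when using Spring Cloud Ribbon.

Feign integrates Ribbon.
Ribbon maintains the list of payment service instances and balances the load on the client side with
round robin. Unlike plain Ribbon, Feign only needs a declared service-binding interface, which makes
the service call simple and elegant.
```

Difference between Feign and OpenFeign:

```
Feign is a lightweight RESTful HTTP client in the Spring Cloud components. It has Ribbon built in for
client-side load balancing when calling services from the registry. Usage: define an interface with
Feign annotations and call it to reach the services in the registry.

OpenFeign is Spring Cloud's extension of Feign with support for Spring MVC annotations such as
@RequestMapping. OpenFeign's @FeignClient can parse interfaces annotated with Spring MVC's
@RequestMapping and generates implementation classes through dynamic proxies. The implementation
does the load balancing and calls the other service.
```

### 19.1 Create the module cloud-consumer-feign-order80 and use Feign

The official documentation states that Feign is used on the client side:

```
Feign is a declarative web service client.
```

After creating the module, add the OpenFeign dependency and edit the configuration file. Because this Feign module is only a client and not a service, it is not registered with the registry. To use Feign, activate it by adding the @EnableFeignClients annotation to the startup class.

```java
package com.learn.springcloud;

import org.springframework.boot.SpringApplication;
import org.springframework.boot.autoconfigure.SpringBootApplication;
import org.springframework.cloud.netflix.eureka.EnableEurekaClient;
import org.springframework.cloud.openfeign.EnableFeignClients;

/**
 * @ClassName: OrderFeignMain80
 * @Description:
 * @Author: lin
 * @Date: 2020/8/18 14:30
 * History:
 * @ 1.0
 */
@SpringBootApplication
@EnableEurekaClient
@EnableFeignClients // Feign has to be activated before use, hence this annotation
public class OrderFeignMain80 {
    public static void main(String[] args) {
        SpringApplication.run(OrderFeignMain80.class, args);
    }
}
```

Create an interface, PaymentFeignService, to call the provider. The annotation names the microservice.

```java
package com.learn.springcloud.service;

import com.learn.springcloud.entities.CommonResult;
import com.learn.springcloud.entities.Payment;
import org.springframework.cloud.openfeign.FeignClient;
import org.springframework.stereotype.Component;
import org.springframework.web.bind.annotation.GetMapping;
import org.springframework.web.bind.annotation.PathVariable;

/**
 * Interface annotated with @FeignClient to call the provider service
 * @ClassName: PaymentFeignService
 * @Description:
 * @Author: lin
 * @Date: 2020/8/18 14:33
 * History:
 * @ 1.0
 */
@Component
@FeignClient(value = "CLOUD-PAYMENT-SERVICE")
public interface PaymentFeignService {

    /**
     * Query by id
     * @param id
     * @return
     */
    @GetMapping(value = "/payment/get/{id}")
    public CommonResult getPaymentById(@PathVariable("id") Long id);
}
```

Call the interface from the controller to reach the provider. Requesting http://localhost/consumer/payment/get/23 several times returns different ports, so load balancing works here too.

![img](image/consumer-feign-order80.png)

### 19.2 Timeouts

The consumer and the provider are two different microservices, so calls can time out. To demonstrate this, the provider gets an endpoint that is deliberately slow. OpenFeign's default timeout is 1s. A call that takes longer fails with an error.

```
@GetMapping(value = "/payment/feign/timeout")
public String paymentFeignTimeout() {
    try {
        // sleep 3 seconds (Thread.sleep takes milliseconds) to exceed the 1s default
        Thread.sleep(3000);
    } catch (InterruptedException e) {
        e.printStackTrace();
    }
    return serverPort;
}
```

In the consumer, add a matching method to the PaymentFeignService interface to call the provider, then test whether the call times out. By default the Feign client waits only one second, but the server needs more than one second, so the Feign client gives up and returns an error. To avoid this, the Feign client's timeout sometimes has to be configured, so change the timeout in the configuration file.

```
/**
 *
 * @return
 */
@GetMapping(value = "/consumer/payment/feign/timeout")
public String paymentFeignTimeout() {
    // openfeign-ribbon: the client waits 1 second by default
    return paymentFeignService.paymentFeignTimeout();
}
```

### 19.3 OpenFeign logging

```
Feign can print logs. The log level can be adjusted through configuration to see the details of Feign's
HTTP requests, i.e. to monitor and print the calls made through Feign interfaces.
Log levels:
NONE: default, no logging
BASIC: logs only the request method, URL, response status code and execution time
HEADERS: in addition to BASIC, logs the request and response headers
FULL: in addition to HEADERS, logs the request and response body and metadata
```

A logging bean is required. FeignConfig sets the log level, and the yml then sets the log level for monitoring the Feign client.

```java
package com.learn.springcloud.config;

import feign.Logger;
import org.springframework.context.annotation.Bean;
import org.springframework.context.annotation.Configuration;

/**
 * @ClassName: FeignConfig
 * @Description:
 * @Author: lin
 * @Date: 2020/8/18 15:36
 * History:
 * @ 1.0
 */
@Configuration
public class FeignConfig {

    @Bean
    public Logger.Level feignLoggerLevel() {
        return Logger.Level.FULL;
    }
}
```

The console now prints the log of the Feign call.

![img](image/consumer-feign-logger-level.png)

## 20. What problems do distributed systems face?

```
Distributed microservices face many problems. An application in a complex distributed architecture has
dozens of dependencies, and each of them will inevitably fail at some point.
```

### 20.1 What is Hystrix?

```
Hystrix is an open-source library for handling latency and fault tolerance in distributed systems.
In a distributed system many dependencies will inevitably fail, for example through timeouts or exceptions.
Hystrix makes sure that a problem in one dependency does not bring down the whole service, avoids
cascading failures, and so improves the resilience of the distributed system.

The "circuit breaker" itself is a switch. When a service unit fails, the breaker's fault monitoring
(similar to a blown fuse) returns an expected, manageable fallback response (FallBack) to the caller
instead of making it wait for a long time or throwing an exception the caller cannot handle. Threads of
the calling service are therefore not held for a long time without need, which stops the fault from
spreading through the distributed system and turning into an avalanche.
```

### 20.2 What can Hystrix do? Service degradation, circuit breaking, near-real-time monitoring and more

```
Service degradation: "The server is busy, please try again later." The client is not kept waiting and
  immediately gets a friendly message.
  What triggers degradation: runtime exceptions, timeouts, a tripped circuit breaker,
  and a full thread pool or semaphore.
Circuit breaking: like a fuse. Once the maximum load is reached, access is refused outright (the power is cut),
  then the degradation method is called and a friendly message is returned.
  Just like a fuse: service degradation -> circuit breaking -> call chain restored.
Rate limiting: for flash sales and other high-concurrency operations. No stampede; requests queue up,
  N per second, in an orderly way.
```

### 20.3 Register with a standalone Eureka

First switch 7001 back to standalone mode. Then create a Hystrix module, cloud-provider-hystrix-payment8001, add the Hystrix dependency and the application.yml configuration. Create a PaymentService class with two methods:

```java
package com.cloud.springcloud.service;

import org.springframework.stereotype.Service;

import java.util.concurrent.TimeUnit;

/**
 * @ClassName: PaymentService
 * @Description:
 * @Author: lin
 * @Date: 2020/8/18 17:43
 * @History:
 * @ 1.0
 */
@Service
public class PaymentService {

    /**
     * Normal access
     *
     * @param id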
     @return */
    public String paymentInfo_OK(Integer id) {
        return "线程池: " + Thread.currentThread().getName() + " paymentInfo_OK,id:" + id + "\t" + "O(∩_∩)O哈哈~";
    }

    /**
     * Slow access: simulates a complex piece of business logic that takes longer to process.
     * @param id
     * @return
     */
    public String paymentInfo_TimeOut(Integer id) {
        int timeNumber = 4;
        try {
            TimeUnit.SECONDS.sleep(timeNumber);
        } catch (InterruptedException e) {
            e.printStackTrace();
        }
        return "线程池：" + Thread.currentThread().getName() + " paymentInfo_Timeout, id: " + id + "\t" + "O(∩_∩)O哈哈~" + "耗时(秒):" + timeNumber;
    }
}
```

Then call both methods from the controller to test them.

```java
package com.cloud.springcloud.controller;

import com.cloud.springcloud.service.PaymentService;
import lombok.extern.slf4j.Slf4j;
import org.springframework.beans.factory.annotation.Value;
import org.springframework.web.bind.annotation.GetMapping;
import org.springframework.web.bind.annotation.PathVariable;
import org.springframework.web.bind.annotation.RestController;

import javax.annotation.Resource;

/**
 * @ClassName: PaymentController
 * @Description:
 * @Author: lin
 * @Date: 2020/8/18 17:48
 * History:
 * @ 1.0
 */
@RestController
@Slf4j
public class PaymentController {

    @Resource
    private PaymentService paymentService;

    @Value("${server.port}")
    private String serverPort;

    @GetMapping(value = "/payment/hystrix/ok/{id}")
    public String paymentInfo_OK(@PathVariable("id") Integer id) {
        String result = paymentService.paymentInfo_OK(id);
        log.info("****result: " + result);
        return result;
    }

    @GetMapping(value = "/payment/hystrix/timeout/{id}")
    public String paymentInfo_TimeOut(@PathVariable("id") Integer id) {
        String result = paymentService.paymentInfo_TimeOut(id);
        log.info("****result: " + result);
        return result;
    }
}
```

Start PaymentHystrixMain8001. A request to localhost:8001/payment/hystrix/ok/23 connects normally. A request to localhost:8001/payment/hystrix/timeout/21 also succeeds without an error, but it takes 4 seconds to respond (the sleep time set in paymentInfo_TimeOut).

### 20.4 Stress testing under high concurrency with JMeter

```
1. When 20000 requests are hitting localhost:8001/payment/hystrix/timeout/21 and we then access localhost:8001/payment/hystrix/ok/23,
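the response is noticeably slower. The flood of requests to the first endpoint takes up all the resources. The cause is that Spring Boot embeds Tomcat, whose container has a worker thread pool (by default at most 200 threads, with 10 kept as the spare minimum). When 20000 requests suddenly arrive, all of those threads are busy with them, so nothing is left for other requests, which slow down too.
Tomcat's default worker threads are exhausted and no spare thread is available to absorb the load and handle other requests, so requests become slow.
```

### 20.5 What degradation and fault tolerance have to cover

```
Timeouts make the server slow: a response must come back within a set time, and it should be a friendly message rather than an error. On timeout, stop waiting.
Server errors (a crash or a runtime error): there must be a fallback.
Solutions:
  The remote service times out: the caller must not hang forever and needs service degradation.
  The remote service is down: the caller must not hang forever and needs service degradation.
  The remote service is fine, but the caller itself fails or has its own requirement (it is willing to wait less time than the provider needs): the caller handles degradation itself.
```

### 20.6 Service degradation: a fallback is needed when there is no response within the set time

If a request to an endpoint gets no response within the set time, or times out, there has to be a fallback that returns a friendly message, so the user knows the service has a problem and stops retrying. Add the @HystrixCommand annotation to the slow test method so that, on timeout or error, the fallback method named by fallbackMethod is called.

```
/**
 * Slow access: simulates a complex piece of business logic that takes longer to process.
 * Fallback handling for timeouts.
 * HystrixCommand: once the service method fails and throws an error,
 * the method named by fallbackMethod in the same class is called automatically.
 * After adding @HystrixCommand, @EnableCircuitBreaker must be added to the main class to activate it.
 *
 * @param id
 * @return
 */
@HystrixCommand(fallbackMethod = "payment_TimeOutHandler", commandProperties = {
        @HystrixProperty(name = "execution.isolation.thread.timeoutInMilliseconds", value = "3000")
})
public String paymentInfo_TimeOut(Integer id) {
    // Deliberately trigger an arithmetic exception.
    // int a = 10/0;
    int timeNumber = 4;
    try {
        TimeUnit.SECONDS.sleep(timeNumber);
    } catch (InterruptedException e) {
        e.printStackTrace();
    }
    return "线程池：" + Thread.currentThread().getName() + " paymentInfo_Timeout, id: " + id + "\t" + "O(∩_∩)O哈哈~" + "耗时(秒):" + timeNumber;
}

/**
 * Fallback for timed-out calls.
 * @param id
 * @return
 */
public String payment_TimeOutHandler(Integer id){
    return "线程池:" + Thread.currentThread().getName() + " 系统繁忙或运行错误,请稍后重试,id:" + id + "\t" + "o(╥﹏╥)o";
}
```

The main class also needs the @EnableCircuitBreaker annotation to activate this. Then test http://localhost:8001/payment/hystrix/timeout/12: when the set time is exceeded, the method named by fallbackMethod is called. An arithmetic exception triggers the same fallback method.

![img](image/hystrix-timeout-fallback.png)

### 20.7 The consumer (port 80) can protect itself too

The consumer microservice can also apply client-side degradation. First edit the configuration file of cloud-consumer-feign-hystrix-order80 to enable hystrix, and add the activation annotation to the main class.

```
feign:
  hystrix:
    # enable Hystrix in feign
    enabled: true
```

Add the same @HystrixCommand annotation and fallbackMethod to the controller.

```
@GetMapping(value =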
"/consumer/payment/hystrix/timeout/{id}") @HystrixCommand(fallbackMethod = "payment_TimeOutHandler", commandProperties = { @HystrixProperty(name = "execution.isolation.thread.timeoutInMilliseconds", value = "1500") }) public String paymentInfo_TimeOut(@PathVariable("id") Integer id) { String result = paymentHystrixService.paymentInfo_TimeOut(id); log.info("****result: " + result); return result; } /** * 处理访问超时，兜底方案 * @param id * @return */ public String payment_TimeOutHandler(Integer id){ return "我是消费者80,对方支付系统繁忙请10秒种后再试或者自己运行出错请检查自己,o(╥﹏╥)o"; } ``` 访问测试http://localhost/consumer/payment/hystrix/timeout/23 同样在规定的时间没有响应，那么就会调用兜底方法 ![img](image/consumer-hystrix-timeout.png) ### 20.8 、采用全局兜底方案，降低代码耦合度从上面的方式来看，每个业务方法对应一个兜底的方法，从而使代码膨胀，如果有多个方法需要进行兜底操作，那么需要写多个兜底方法，并且代码耦合度变高。那么对于大多数的方法采用全局方式来处理，只要特殊地才单独处理。因此在controller中添加注解@DefaultProperties 来指定全局配置，如果没有单独配置的都走全局兜底方案，调整到统一处理结果页面。 ``` /** * global fallback * @return */ public String payment_Global_FallbackMethod(){ return "Global异常处理信息,请稍后重试.o(╥﹏╥)o"; } ``` 如果方法的方法没有单独指定fallbackMethod方法，那么就会调用全局fallback方法。 ![img](image/consumer-hystrix-global-config.png) ### 20.9 、定义一个服务降级处理的类如果客户端去调用服务端，碰上服务端宕机或关闭，上面测试的服务降级是客户端80实现完成的，与服务端 8001没有关系，只需要为Feign客户端定义的接口添加一个服务降级处理的实现类即可实现解耦。 ``` 根据cloud-consumer-feign-hystrix80已经有的PaymentHystrixService接口，重新新建立一个类 PaymentFallbackService 来实现接口，统一为接口里面的方法进行异常处理 ``` 实现接口 ```java package com.learn.springcloud.service; import org.springframework.stereotype.Component; /** * @ClassName: PaymentFallbackService * @Description: * @Author: lin * @Date: 2020/8/19 17:14 * History: * @ 1.0 */ @Component public class PaymentFallbackService implements PaymentHystrixService{ @Override public String paymentInfo_OK(Integer id) { String result ="------PaymentFallbackService fall back-paymentInfo_OK, o(╥﹏╥)o"; return result; } @Override public String paymentInfo_TimeOut(Integer id) { String result ="----PaymentFallbackService fall back-paymentInfo_TimeOut,o(╥﹏╥)o"; return result; } } ``` 
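Add fallback to the interface. Once it is in place, if an error occurs when the client calls the CLOUD-PROVIDER-HYSTRIX-PAYMENT microservice, the class named by fallback handles it in one place.

```
@FeignClient(value = "CLOUD-PROVIDER-HYSTRIX-PAYMENT", fallback = PaymentFallbackService.class)
```

Then test http://localhost/consumer/payment/hystrix/ok/2 . If the provider is down or failing at this point, the next request returns the message from the fallback class named on the interface. With this degradation in place, the client still gets a message when the server is unavailable, instead of hanging and tying up the server.

![img](image/consumer-serviceimpl-fallback.png)

### 20.10 Circuit breaking: the call chain is restored once the node's microservice responds normally again

```
Overview: circuit breaking is a link-protection mechanism for microservices that counters the avalanche effect. When a microservice on a fan-out link fails, becomes unavailable or responds too slowly, the service is degraded, and then calls to that node are cut off and an error response is returned quickly. Once the node's microservice is detected to respond normally again, the call chain is restored.
In Spring Cloud, circuit breaking is implemented by Hystrix. Hystrix monitors the state of calls between microservices. When failed calls reach a threshold (by default, failures among at least 20 calls within 10 seconds), the breaker trips. The annotation for circuit breaking is @HystrixCommand.
```

Add a method to cloud-provider-hystrix-payment8001 to handle circuit breaking. It also carries @HystrixCommand, which enables the breaker and sets the request count. If at least 60% of the requests within the time range fail, the breaker trips. The order is: the service is degraded first (it is unusable) ----> then the circuit is broken (the switch trips) ----> then the call chain is gradually restored.

```
/**
 * If 6 of 10 requests within the 10-second window fail (failure rate reaches 60%), the breaker takes effect.
 * The related configuration parameters can all be found in the HystrixCommandProperties class.
 * @param id
 * @return
 */
@HystrixCommand(
        fallbackMethod = "paymentCircuitBreaker_fallback", commandProperties = {
        @HystrixProperty(name = "circuitBreaker.enabled", value = "true"), // enable the circuit breaker
        @HystrixProperty(name = "circuitBreaker.requestVolumeThreshold", value = "10"), // request count
        @HystrixProperty(name = "circuitBreaker.sleepWindowInMilliseconds", value = "10000"), // time window
        @HystrixProperty(name = "circuitBreaker.errorThresholdPercentage", value = "60") // failure rate at which the breaker trips
}
)
public String paymentCircuitBreaker(@PathVariable("id") Integer id){
    if (id < 0) {
        throw new RuntimeException("*****id不能是负数");
    }
    String serialNumber = IdUtil.simpleUUID();
    return Thread.currentThread().getName() + "\t" + "调用成功,流水号:" + serialNumber;
}

public String paymentCircuitBreaker_fallback(@PathVariable("id") Integer id) {
    return "id 不能负数,请稍后重试,o(╥﹏╥)o id:" + id;
}
```

With a positive number as input, the correct result is returned.

![img](image/payment-circuit-breaker-1.png)

With an invalid (negative) number:

![img](image/payment-circuit-breaker-2.png)

After several invalid inputs in a row, a request with a valid number also returns the error message.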
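The threshold and recovery behaviour described in this section can be illustrated with a small plain-Java sketch. It is not Hystrix's implementation: the class `SimpleCircuitBreaker`, its method names and the explicit `nowMillis` argument are hypothetical, and the counters are a simple running total rather than Hystrix's rolling statistics window. The three constructor arguments only mirror the roles of `requestVolumeThreshold`, `errorThresholdPercentage` and `sleepWindowInMilliseconds`.

```java
import java.util.function.Supplier;

/** A deliberately small circuit breaker; illustrative only, not how Hystrix is implemented. */
class SimpleCircuitBreaker {
    enum State { CLOSED, OPEN, HALF_OPEN }

    private final int requestVolumeThreshold;   // minimum calls before the breaker may trip
    private final int errorThresholdPercentage; // failure percentage that trips the breaker
    private final long sleepWindowMillis;       // how long to stay OPEN before one trial call

    private State state = State.CLOSED;
    private int total;
    private int failures;
    private long openedAt;

    SimpleCircuitBreaker(int requestVolumeThreshold, int errorThresholdPercentage, long sleepWindowMillis) {
        this.requestVolumeThreshold = requestVolumeThreshold;
        this.errorThresholdPercentage = errorThresholdPercentage;
        this.sleepWindowMillis = sleepWindowMillis;
    }

    synchronized State state() {
        return state;
    }

    /** Runs primary unless the breaker is open; the clock is passed in so the sketch is easy to test. */
    synchronized <T> T call(Supplier<T> primary, Supplier<T> fallback, long nowMillis) {
        if (state == State.OPEN) {
            if (nowMillis - openedAt < sleepWindowMillis) {
                return fallback.get();      // short-circuit: the primary logic is not called at all
            }
            state = State.HALF_OPEN;        // sleep window elapsed: let one trial call through
        }
        try {
            T result = primary.get();
            if (state == State.HALF_OPEN) {
                state = State.CLOSED;       // trial call succeeded: restore the call chain
                total = 0;
                failures = 0;
            } else {
                total++;
            }
            return result;
        } catch (RuntimeException e) {
            if (state == State.HALF_OPEN) {
                open(nowMillis);            // trial call failed: open again, restart the sleep window
            } else {
                total++;
                failures++;
                if (total >= requestVolumeThreshold
                        && failures * 100 >= errorThresholdPercentage * total) {
                    open(nowMillis);
                }
            }
            return fallback.get();
        }
    }

    private void open(long nowMillis) {
        state = State.OPEN;
        openedAt = nowMillis;
    }
}

class CircuitBreakerDemo {
    public static void main(String[] args) {
        SimpleCircuitBreaker breaker = new SimpleCircuitBreaker(10, 60, 10_000);
        Supplier<String> failing = () -> { throw new RuntimeException("negative id"); };
        Supplier<String> healthy = () -> "ok";
        Supplier<String> fallback = () -> "fallback";

        for (int i = 0; i < 10; i++) {
            breaker.call(failing, fallback, 0L);
        }
        System.out.println(breaker.state());                           // OPEN
        System.out.println(breaker.call(healthy, fallback, 5_000L));   // fallback
        System.out.println(breaker.call(healthy, fallback, 10_000L));  // ok
        System.out.println(breaker.state());                           // CLOSED
    }
}
```

The demo follows the same sequence as the manual test: repeated failures trip the breaker, a valid request inside the sleep window still gets the fallback, and a valid request after the window closes the breaker again.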
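![img](image/payment-circuit-breaker-3.png)

Once valid numbers are sent again, the call chain is gradually restored.

### 20.11 Circuit breaking: brief summary

```
Article: https://martinfowler.com/bliki/CircuitBreaker.html
open --- half open --- closed
Breaker states:
  Open: requests no longer call the service. An internal timer is set, usually to the MTTR (mean time to repair); when the breaker has been open for that long, it moves to half-open.
  Closed: the service is called normally and nothing is cut off.
  Half-open: some requests are allowed to call the service according to the rules. If they succeed and satisfy the rules, the service is considered recovered and the breaker closes.
```

When does the circuit breaker take effect:

1. Snapshot time window: to decide whether to open, the breaker needs statistics on requests and errors. The statistics cover the snapshot time window, which defaults to the most recent 10 seconds.

2. Request volume threshold: within the snapshot window, the total number of requests must reach the threshold before the breaker is eligible to trip. The default is 20, meaning that if the hystrix command is called fewer than 20 times within 10 seconds, the breaker does not open even if every request times out or fails for some other reason.

3. Error percentage threshold: when the request total in the snapshot window exceeds the threshold, for example 30 calls of which 15 time out, the error percentage reaches 50%. With the default 50% threshold, the breaker opens.

How hystrix restores the original main logic: when the breaker opens and the main logic is cut off, hystrix starts a sleep window. During this window the degradation logic temporarily acts as the main logic. When the sleep window expires, the breaker goes half-open and releases a single request to the original main logic. If that request returns normally, the breaker closes and the main logic is restored. If it still fails, the breaker goes back to open and the sleep window restarts.

## 21、Setting up the Hystrix dashboard

Create a new module, cloud-consumer-hystrix-dashboard9001. Add the dashboard dependency to the pom file, then add configuration to set the port. Create the main class and add the @EnableHystrixDashboard annotation to enable the dashboard. Start the main class and open http://localhost:9001/hystrix to confirm that the setup works.

![img](image/hystrix-dashbord-1.png)

Use the dashboard to monitor the provider cloud-provider-hystrix-payment8001 by entering the address to monitor in the input box on the dashboard page. Below is the monitoring view for requests to the provider under normal conditions.

![img](image/consumer-hystrix-dashboard-provider.png)

When testing the failure case, the view shows that the circuit breaker has opened.

![img](image/consumer-hystrix-dashboard-provider-2.png)

## 22、Service gateways: GateWay and Zuul

```
The gateway is a very important component of the Spring Cloud family. The 1.x releases all used the Zuul gateway, but in the 2.x releases the Zuul upgrade kept slipping, so Spring Cloud eventually developed its own gateway to replace Zuul: SpringCloud GateWay. GateWay is the replacement for the original Zuul 1.x.
```

What is the gateway

GateWay is an API gateway service built on top of the Spring ecosystem, based on Spring 5, Spring Boot 2, Project Reactor and related technologies. GateWay aims to provide a simple and effective way to route APIs, together with powerful filtering features such as circuit breaking, rate limiting and retries.

SpringCloud GateWay

SpringCloud GateWay is a brand-new SpringCloud project, a gateway built on Spring 5.0, Spring Boot 2.0 and Project Reactor. It aims to give microservices a simple, effective and unified way to manage API routing.

As the gateway of the SpringCloud ecosystem, SpringCloud GateWay is meant to replace Zuul. SpringCloud 2.0 and later do not integrate the newer high-performance Zuul 2.0+ releases and still use the old, non-Reactor Zuul 1.x. To improve gateway performance, SpringCloud GateWay is implemented on the WebFlux framework, whose lower layer uses Netty, a high-performance communication framework based on the Reactor pattern.

The goal of Spring Cloud GateWay is to provide a unified way of routing and, through a chain of Filters, the basic gateway features such as security, monitoring/metrics and rate limiting.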
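Differences between SpringCloud GateWay and Zuul. Before the official SpringCloud Finchley release, the gateway recommended by SpringCloud was Netflix's Zuul:

1. Zuul 1.x is an API gateway based on blocking I/O.

2. Zuul 1.x is based on Servlet 2.5 and uses a blocking architecture; it does not support long-lived connections such as WebSocket. Zuul's design resembles Nginx: each I/O operation is executed by a thread picked from the worker threads, and the request thread blocks until the worker finishes. The difference is that Nginx is implemented in C while Zuul is implemented in Java, and the JVM is slow on first load, which makes Zuul's performance comparatively poor.

3. Zuul 2.x has a more advanced design, aiming to be non-blocking on Netty with support for long-lived connections, but SpringCloud has not integrated it yet. Zuul 2.x performs considerably better than Zuul 1.x. In terms of performance, according to the official benchmarks, the RPS (requests per second) of SpringCloud GateWay is 1.6 times that of Zuul.

4. SpringCloud GateWay is built on Spring Framework 5, Project Reactor and Spring Boot 2, and uses non-blocking APIs.

5. SpringCloud GateWay also supports WebSocket and is tightly integrated with Spring, which gives a better development experience.

### 22.1、Why choose SpringCloud GateWay

1. The Zuul version integrated in SpringCloud runs in the Tomcat container and uses the traditional Servlet IO processing model. The servlet lifecycle is managed by the servlet container:

When the container starts, it constructs the servlet object and calls servlet init() to initialize it;
while the container runs, it accepts requests, assigns a thread to each request (usually an idle thread from a thread pool) and calls service();
when the container shuts down, it calls servlet destroy() to destroy the servlet.

![img](image/servlet-container.png)

Drawbacks of this model

The servlet model is a simple network IO model. When a request enters the servlet container, the container binds a thread to it. This model works when concurrency is low. Under high concurrency, however, the number of threads rises, and threads are expensive (context switching, high memory consumption), which seriously affects request processing time. In some simple business scenarios we do not want one thread per request; one or a few threads should be able to cope with very high concurrency, and in such scenarios the servlet model has no advantage.

Zuul 1.x is therefore a blocking processing model built on servlets: Spring implements a single servlet (DispatcherServlet) that handles all requests, and that servlet processes them in a blocking fashion. So SpringCloud Zuul cannot escape the drawbacks of the servlet model.

The GateWay model: what is WebFlux

Traditional web frameworks such as struts2 and springmvc all run on top of the Servlet API and a servlet container.

Since Servlet 3.1, however, asynchronous non-blocking processing has been supported. WebFlux is a typical non-blocking asynchronous framework whose core is implemented on the Reactor APIs. Unlike traditional web frameworks, it can run on Netty, Undertow and containers that support Servlet 3.1. Non-blocking plus functional programming (Spring 5 requires Java 8).

Spring WebFlux is the new reactive framework introduced in Spring 5.0. Unlike Spring MVC, it does not depend on the Servlet API; it is fully asynchronous and non-blocking, and implements the Reactive Streams specification on top of Reactor.

### 22.2、The three core concepts of GateWay

```
Route
Predicate
Filter
```

Route

A route is the basic building block of the gateway. It consists of an ID, a target URI, a set of predicates and a set of filters. If the predicates evaluate to true, the route matches.

Predicate

See java.util.function.Predicate in Java 8. Developers can match on anything in the HTTP request (for example request headers or request parameters). If the request matches the predicate, it is routed.

Filter

This refers to instances of GatewayFilter in the Spring framework. With filters, a request can be modified before or after it is routed.

Overall

![img](image/spring-cloud-gateway-boot-application.png)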
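To make the relationship between route, predicate and target URI more concrete, here is a minimal plain-Java sketch built only on `java.util.function.Predicate`. It is not the Spring Cloud Gateway API: `SimpleRequest`, `SimpleRoute` and `SimpleRouter` are hypothetical names invented for illustration, and filters are left out entirely.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Map;
import java.util.Optional;
import java.util.function.Predicate;

/** Hypothetical stand-in for an HTTP request: just a path and some headers. */
final class SimpleRequest {
    final String path;
    final Map<String, String> headers;

    SimpleRequest(String path, Map<String, String> headers) {
        this.path = path;
        this.headers = headers;
    }
}

/** A route here is an id, a target URI and the predicate that decides whether it matches. */
final class SimpleRoute {
    final String id;
    final String uri;
    final Predicate<SimpleRequest> predicate;

    SimpleRoute(String id, String uri, Predicate<SimpleRequest> predicate) {
        this.id = id;
        this.uri = uri;
        this.predicate = predicate;
    }
}

class SimpleRouter {
    private final List<SimpleRoute> routes = new ArrayList<>();

    SimpleRouter route(String id, String uri, Predicate<SimpleRequest> predicate) {
        routes.add(new SimpleRoute(id, uri, predicate));
        return this;
    }

    /** Returns the target URI of the first route whose predicate accepts the request. */
    Optional<String> resolve(SimpleRequest request) {
        return routes.stream()
                .filter(r -> r.predicate.test(request))
                .map(r -> r.uri)
                .findFirst();
    }
}

class PredicateRoutingDemo {
    public static void main(String[] args) {
        // Two independent conditions, combined with Predicate.and
        Predicate<SimpleRequest> pathMatches = req -> req.path.startsWith("/payment/get/");
        Predicate<SimpleRequest> hasNumericId =
                req -> req.headers.getOrDefault("X-Request-Id", "").matches("\\d+");

        SimpleRouter router = new SimpleRouter()
                .route("payment_route", "http://localhost:8001", pathMatches.and(hasNumericId));

        SimpleRequest withHeader = new SimpleRequest(
                "/payment/get/23", Collections.singletonMap("X-Request-Id", "42"));
        SimpleRequest withoutHeader = new SimpleRequest(
                "/payment/get/23", Collections.<String, String>emptyMap());

        System.out.println(router.resolve(withHeader));    // Optional[http://localhost:8001]
        System.out.println(router.resolve(withoutHeader)); // Optional.empty
    }
}
```

A request is forwarded only when every combined predicate is true, which is the same idea as listing several predicates under one route.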
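A web request is matched against a set of conditions and directed to the real service node, with fine-grained control applied before and after forwarding. The predicate is the matching condition, and the filter can be thought of as an all-purpose interceptor. With these two elements plus a target uri, a concrete route can be built.

### 22.3 How GateWay works

![img](image/springcloud-gateway-works.png)

The client sends a request to SpringCloud GateWay. The Gateway Handler Mapping finds the route that matches the request and passes it to the Gateway Web Handler.

The handler then sends the request through the configured filter chain to the actual service, which runs the business logic and returns. The filters are separated by a dashed line in the diagram because a filter may run logic before the proxied request is sent ("pre") or after it ("post").

"pre" filters can perform parameter validation, permission checks, traffic monitoring, logging, protocol conversion and so on. "post" filters can modify the response body and response headers, write logs, monitor traffic and so on, and play a very important role.

The core logic of GateWay is route forwarding plus executing the filter chain.

### 22.4 、Create a module to test the gateway

Create a module, cloud-gateway-gateway9527, for testing. This module needs no database connection and no business classes. It only needs the gateway dependencies and a modified application.yml. The routes configured here point at port 8001, so cloud-provider-payment8001 has to be started for the test.

```
spring:
  application:
    name: cloud-gateway
  cloud:
    gateway:
      # multiple routes
      routes:
        # route ID: no fixed rule, but it must be unique; matching the service name is recommended
        - id: payment_routh
          uri: http://localhost:8001 # address of the service that handles matched requests
          predicates:
            - Path=/payment/get/** # predicate: requests whose path matches are routed; ** is a wildcard
        - id: payment_routh2
          uri: http://localhost:8001 # address of the service that handles matched requests
          predicates:
            - Path=/payment/lb/** # predicate: requests whose path matches are routed
```

Note when editing the pom: the dependencies listed below must not be included, either directly or through the shared module (the conflict comes from spring-boot-starter-web, which brings in Spring MVC, while the gateway is built on WebFlux). Otherwise startup fails with the following error: Consider defining a bean of type 'org.springframework.http.codec.ServerCodecConfigurer' in your configuration.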
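```
<dependency>
    <groupId>org.springframework.boot</groupId>
    <artifactId>spring-boot-starter-web</artifactId>
</dependency>
<dependency>
    <groupId>org.springframework.boot</groupId>
    <artifactId>spring-boot-starter-actuator</artifactId>
</dependency>
```

With these removed, the gateway starts without the error. Now test http://localhost:9527/payment/get/23 : it returns the same data as http://localhost:8001/payment/get/23 . The data on 8001 is accessed through the gateway. If the gateway route matches the endpoint path on 8001, the request succeeds; if not, it fails. This way the real address is not exposed, and requests are matched through the gateway routes instead.

![img](image/cloud-gateway-9527.png)

### 22.5 Configuring gateway routes in code

```
Official documentation
https://cloud.spring.io/spring-cloud-static/spring-cloud-gateway/2.2.1.RELEASE/reference/html/#spring-cloud-circuitbreaker-filter-factory
```

Create a class, GateWayConfig, which implements route forwarding in code.

```java
package com.learn.springcloud.config;

import org.springframework.cloud.gateway.route.RouteLocator;
import org.springframework.cloud.gateway.route.builder.RouteLocatorBuilder;
import org.springframework.context.annotation.Bean;
import org.springframework.context.annotation.Configuration;

/**
 * @ClassName: GateWayConfig
 * @Description:
 * @Author: lin
 * @Date: 2020/8/20 14:59
 * History:
 * @ 1.0
 */
@Configuration
public class GateWayConfig {

    /**
     * Configures a routing rule with the id path_route_t2.
     * A request to http://localhost:9527/guonei is forwarded to http://news.baidu.com/guonei
     * @param routeLocatorBuilder
     * @return
     */
    @Bean
    public RouteLocator customerRouteLocator(RouteLocatorBuilder routeLocatorBuilder){
        RouteLocatorBuilder.Builder routes = routeLocatorBuilder.routes();
        // the first argument is the route id, as in routes.id in the yml
        routes.route("path_route_t2",
                // /guonei is the path that follows the gateway address in the request
                r -> r.path("/guonei")
                        // target address
                        .uri("http://news.baidu.com/guonei"));
        return routes.build();
    }

    @Bean
    public RouteLocator customerRouteLocator2(RouteLocatorBuilder builder){
        RouteLocatorBuilder.Builder routes = builder.routes();
        routes.route("path_route-t2",
                r -> r.path("/guoji")
                        .uri("http://news.baidu.com/guoji"));
        return routes.build();
    }
}
```

Start and test http://localhost:9527/guonei : the gateway forwards the request to the corresponding baidu page.

![img](image/gateway-route-baidu.png)

### 22.6 、Accessing services with load balancing

The approach above can only reach the two hard-coded service addresses. What if the registry holds several microservice instances, and how can they be accessed with load balancing? The configuration file has to be changed so that the gateway picks microservices from the registry for route forwarding. Modify the yml so that routes are created dynamically from the registry and requests are routed by microservice name. Two instances, 8001 and 8002, are used to test the switching.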
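```
spring:
  application:
    name: cloud-gateway
  cloud:
    gateway:
      discovery:
        locator:
          enabled: true # create routes dynamically from the registry, routing by microservice name
      # multiple routes
      routes:
        # route ID: no fixed rule, but it must be unique; matching the service name is recommended
        - id: payment_route1
          # address of the service that handles matched requests
          # uri: http://localhost:8001
          # the hard-coded address is replaced by the service name
          uri: lb://cloud-payment-service
          predicates:
            # predicate: requests whose path matches are routed; ** is a wildcard
            - Path=/payment/get/**
        # route ID: no fixed rule, but it must be unique; matching the service name is recommended
        - id: payment_route2
          # address of the service that handles matched requests
          # uri: http://localhost:8001
          # the hard-coded address is replaced by the service name
          uri: lb://cloud-payment-service
          predicates:
            # predicate: requests whose path matches are routed
            - Path=/payment/lb/**
```

Test http://localhost:9527/payment/lb : the response switches between different ports, which shows that the gateway routes to different service instances.

![img](image/gateway-payment-lb.png)

![img](image/gateway-payment-lb-02.png)

When gateway 9527 starts, the console prints a lot of information about RoutePredicateFactory. There are many types of it; only Path has been used so far.

![img](image/gateway-start-log.png)

Add configuration to the yml: use After in predicates to specify the time from which access is allowed.

```
- After=2020-08-20T16:17:02.118+08:00[Asia/Shanghai]
```

If that time has not been reached yet, the request fails with an error.

![img](image/gateway-predicates-after.png)

Next, test the Cookie predicate, using the curl command from cmd. First test without a cookie; as the figure below shows, the request fails when no Cookie is sent.

![img](image/gateway-predicates-cookie-curl-test1.png)

With the Cookie added, the request succeeds.

![img](image/gateway-predicates-cookie-curl-test2.png)

Test the Header predicate

```
- Header=X-Request-Id, \d+ # the request must carry an X-Request-Id header whose value consists of digits
```

If the value is a positive integer, the request returns correctly. If it is a negative number, the request fails, because the minus sign does not match \d+.

![img](image/gateway-predicates-header-curl-test1.png)

### 22.7 、Global GlobalFilter

For GateWay filters, a custom global filter is the usual choice. It implements two interfaces, GlobalFilter and Ordered. Filters can handle global logging, unified gateway authentication and so on. Write a global filter, MyLogGateWayFilter.

```java
package com.learn.springcloud.filter;

import lombok.extern.slf4j.Slf4j;
import org.springframework.cloud.gateway.filter.GatewayFilterChain;
import org.springframework.cloud.gateway.filter.GlobalFilter;
import org.springframework.core.Ordered;
import org.springframework.http.HttpStatus;
import org.springframework.http.server.reactive.ServerHttpRequest;
import org.springframework.stereotype.Component;
import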
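org.springframework.web.server.ServerWebExchange;
import reactor.core.publisher.Mono;

/**
 * @ClassName: MyLogGateWayFilter
 * @Description:
 * @Author: lin
 * @Date: 2020/8/20 16:49
 * History:
 * @ 1.0
 */
@Component
@Slf4j
public class MyLogGateWayFilter implements GlobalFilter, Ordered {

    @Override
    public Mono<Void> filter(ServerWebExchange exchange, GatewayFilterChain chain) {
        ServerHttpRequest request = exchange.getRequest();
        String uName = request.getQueryParams().getFirst("uName");
        if(null == uName){
            log.info("*********用户名为null,非法用户o(╥﹏╥)o");
            // reject the request when the uName query parameter is missing
            exchange.getResponse().setStatusCode(HttpStatus.NOT_ACCEPTABLE);
            return exchange.getResponse().setComplete();
        }
        return chain.filter(exchange);
    }

    @Override
    public int getOrder() {
        return 0;
    }
}
```

With the uName parameter present, test http://localhost:9527/payment/lb?uName=2234 : the request succeeds.

![img](image/gateway-myfilter-test1.png)

If the parameter is missing or its name is wrong, the request is rejected.

![img](image/gateway-myfilter-error-paramter-test1.png)

## 23 Spring Cloud Config (configuration center)

```
Problems faced by distributed systems:
As the number of project modules grows, every project gets its own yml configuration and the whole thing bloats; once there is a lot of it, it needs unified management. If only a few modules contain a database connection configuration, editing them by hand is manageable, but what if N projects all contain it? A unified configuration is needed that can be changed once and take effect everywhere. This eases the configuration burden and improves management efficiency.
Microservices mean splitting the business of a monolithic application into individual sub-services. Each service is fairly fine-grained, so the system ends up with a large number of services. Since every service needs its configuration to run, a centralized, dynamic configuration management facility is essential.
Spring Cloud provides ConfigServer to solve this problem.
```

### 23.1 What is Spring Cloud Config

```
SpringCloud Config provides centralized external configuration support for the microservices in a microservice architecture. The config server provides a central external configuration for all environments of the different microservice applications.
```

### 23.2 What Spring Cloud Config can do

```
1. Manage configuration files centrally
2. Different configuration per environment, with dynamic configuration updates and per-environment deployment, e.g. dev/test/prod/beta/release
3. Adjust configuration at runtime. Configuration files no longer have to be written on every machine a service is deployed to; services pull their own configuration from the configuration center
4. When the configuration changes, services pick up the change and apply the new configuration without a restart
5. Expose configuration through a REST interface
```

### 23.3 Spring Cloud Config: server configuration and test

Create a module, cloud-config-center-3344, to test the global configuration. First edit the pom file, then point application.yml at the central configuration files on GitHub.

```
server:
  port: 3344
spring:
  application:
    name: cloud-config-center # microservice name registered with the Eureka server
  cloud:
    config:
      server:
        git:
          uri: https://github.com/liu92/spring-cloud-config # Git repository on GitHub
          search-paths:
            - springcloud-config
      # branch to read
      label: master
eureka:
  instance: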
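    hostname: cloud-config-center3344
  client:
    service-url:
      # cluster
      # defaultZone: http://eureka7001.com:7001/eureka/,http://eureka7002.com:7002/eureka/
      # standalone
      defaultZone: http://eureka7001.com:7001/eureka/
      # defaultZone: http://localhost:7001/eureka/
```

Add the @EnableConfigServer annotation to the main class, then start it and open http://localhost:3344/master/config-dev.yml to see the configuration. Pay attention to spaces in the central configuration file.

![img](image/spring-cloud-config-3344.png)

Recommended form: /{label}/{name}-{profile}.yml

```
label: the branch
name: the name
profile: the environment
```

### 23.4 Spring Cloud Config: client configuration and test

Create the client module cloud-config-client-3355. It uses a bootstrap.yml configuration file.

```
application.yml holds user-level configuration
bootstrap.yml is system-level and has higher priority
Spring Cloud creates a "Bootstrap Context" that acts as the parent context of the Spring application's 'Application Context'. During initialization, the "Bootstrap Context" loads configuration properties from external sources and resolves them. The two contexts share one 'Environment' obtained from outside.
"Bootstrap" properties have high priority; by default they are not overridden by local configuration. The "Bootstrap Context" and the 'Application Context' follow different conventions, so a separate 'bootstrap.yml' file is added to keep their configuration apart.
Renaming the application.yml of the client module to bootstrap.yml is the key step, because bootstrap.yml is loaded before application.yml.
```

### 23.5 Spring Cloud Config: dynamic refresh on the client

```
After a file in the central configuration is changed, the server picks it up immediately, but the client does not.
To avoid restarting client 3355 on every configuration update, the client has to refresh dynamically.
```

First modify module 3355: add the actuator monitoring dependency and expose the monitoring endpoints in the yml.

```
management:
  endpoints:
    web:
      exposure:
        include: "*"
```

Modify ConfigClientController and add the @RefreshScope annotation to enable refreshing.

```java
package com.learn.springcloud.controller;

import org.springframework.beans.factory.annotation.Value;
import org.springframework.cloud.context.config.annotation.RefreshScope;
import org.springframework.web.bind.annotation.GetMapping;
import org.springframework.web.bind.annotation.RestController;

/**
 * @ClassName: ConfigClientController
 * @Description:
 * @Author: lin
 * @Date: 2020/8/20 22:46
 * History:
 * @ 1.0
 */
@RestController
@RefreshScope // re-create this bean with fresh values after a refresh
public class ConfigClientController {

    /**
     * Read from the central configuration.
     */
    @Value("${config.info}")
    private String configInfo;

    @GetMapping("/configInfo")
    public String getConfigInfo(){
        return configInfo;
    }
}
```

Test http://localhost:3355/configInfo ; following the settings in bootstrap.yml, the client fetches the GitHub configuration through 3344.

![img](image/spring-cloud-config-3355.png)

After the central configuration is changed, 3344 reflects it immediately.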
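![img](image/spring-cloud-config-3344-2.png)

However, on 3355 the change has not taken effect, even though @RefreshScope has been added and the configuration file has been updated.

![img](image/spring-cloud-config-3355-2.png)

How should this be handled? Operations staff have to send a POST request to refresh 3355. Run the command below; without -X POST, curl sends a GET request by default.

```
curl -X POST "http://localhost:3355/actuator/refresh"
```

Refreshing with the command above:

![img](image/spring-cloud-config-3355-curl-post.png)

Reload http://localhost:3355/configInfo and 3355 now shows the updated value. This avoids restarting the 3355 service.

![img](image/spring-cloud-config-3355-3.png)

With many microservice clients, each one needs its own POST refresh. Could there be a broadcast, one notification that takes effect everywhere? Or a precisely targeted notification? That is not possible yet, which is where the message bus comes in.

## 23、What is the Spring Cloud Bus message bus?

```
It deepens and extends the above with distributed automatic configuration refresh. Spring Cloud Bus used together with Spring Cloud Config enables dynamic configuration refresh.
Bus supports two message brokers: RabbitMQ and Kafka.
```

Spring Cloud Bus is a framework that links the nodes of a distributed system with a lightweight messaging system. It combines Java's event handling mechanism with the features of message middleware.

### 23.1 What it can do

```
Spring Cloud Bus manages and propagates messages between distributed systems, like a distributed actuator. It can be used to broadcast state changes, push events and so on, and it can also serve as a communication channel between microservices.
```

What is a bus

```
In a microservice system, a lightweight message broker is usually used to build a shared message topic that all microservice instances connect to. Because messages produced on this topic are listened to and consumed by all instances, it is called a message bus. Every instance on the bus can easily broadcast messages that the other instances connected to the topic need to know about.
Basic principle:
All ConfigClient instances listen on the same topic in the MQ (SpringCloudBus by default). When a service refreshes its data, it puts the message on the topic, so the other services listening on the same topic are notified and update their own configuration.
```

### 23.2 Install erlang and RabbitMQ

```
In RabbitMQ's sbin directory, run rabbitmq-plugins enable rabbitmq_management
to add the management UI plugin
```

### 23.3 Spring Cloud Bus: dynamic refresh by global broadcast

```
There are two options:
1. Use the message bus to trigger /bus/refresh on one client, which refreshes the configuration of all clients
2. Use the message bus to trigger the /bus/refresh endpoint of the ConfigServer, which refreshes the configuration of all clients
Option 2 is clearly the better fit. Option 1 is unsuitable for these reasons:
It breaks the single responsibility of the microservice, which is a business module and should not be responsible for refreshing configuration.
It breaks the equality of the microservice nodes.
```

### 23.4、Broadcasting over the message bus

Create a module, cloud-config-client-3366, and edit its pom and application.yml. It is used together with 3355 to test message bus broadcasting. Modify the 3344 configuration center: add the RabbitMQ dependency and the RabbitMQ server address. Add message bus RabbitMQ support to 3355 and 3366 as well.

### 23.5、Test example

First start eureka7001, cloud-config-center-3344, cloud-config-client-3355 and cloud-config-client-3366. Then open http://config-3344.com:3344/master/config-dev.yml ; this configuration has not been modified yet. Now modify it: after changing version to 3, open http://config-3344.com:3344/master/config-dev.yml again and the change is visible. However, http://localhost:3355/configInfo and

![img](image/cloud-config-info-3355-3.png)

http://localhost:3366/configInfo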
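![img](image/cloud-config-info-3366-3.png)

have not changed. Next, use curl to send a POST request and test the refresh. Send curl -X POST "http://localhost:3344/actuator/bus-refresh" to refresh; this achieves "send once, take effect everywhere". After running the command, both clients show the latest configuration.

![img](image/cloud-config-info-refresh-3355.png)

![img](image/cloud-config-info-refresh-3366.png)

In RabbitMQ the exchange list shows SpringCloud Bus, which is in fact a topic.

![img](image/spring-cloud-bus-topic.png)

### 23.6 Targeted notification

If the command curl -X POST "http://localhost:3344/actuator/bus-refresh" does not say whom to notify, the notification is global. If a target is given, the notification is targeted. Adding config-client:3355 below notifies only that instance.

curl -X POST "http://localhost:3344/actuator/bus-refresh/config-client:3355"

That is, append the microservice name and port number.

## 24、Spring Cloud Stream: message-driven microservices

```
If different systems use different message middleware, many problems arise: switching, maintenance and development all require knowing each middleware, which costs a lot of time.
Is there a technology that lets us stop caring about the details of a specific MQ, so that with an adapter-style binding we can switch between MQs automatically?
```

### 24.1、 What is Spring Cloud Stream

```
Official definition: Spring Cloud Stream is a framework for building message-driven microservices.
Applications interact with the binder object of Spring Cloud Stream through inputs or outputs.
Bindings are set up through configuration, and the binder object of Spring Cloud Stream is responsible for interacting with the message middleware.
Spring Integration is used to connect to the message broker and implement message-driven events.
Spring Cloud Stream provides tailored auto-configuration for the message middleware products of several vendors, and introduces three core concepts: publish-subscribe, consumer groups and partitions.
Put simply: it hides the differences between the underlying message middleware, lowers the cost of switching and unifies the programming model for messaging.
```

### 24.2、Design ideas

```
What problems exist before it is introduced, what changes after it is introduced, and does it affect what was already there?
Standard MQ:
  Producers and consumers exchange message content through a message medium
  Messages must travel through a specific channel ------ the message channel, MessageChannel
  How are the messages in the channel consumed, and who is responsible for sending and receiving them?
Why use Spring Cloud Stream:
Say we use RabbitMQ and Kafka. The two have different architectures: RabbitMQ has exchanges, while Kafka has Topics and Partitions.
These differences cause real trouble in project development. If we use one of the two message queues and a later business requirement makes us migrate to the other, it is nothing short of a disaster: a lot has to be torn down and redone, because the middleware is coupled to our system. Spring Cloud Stream offers a way to decouple.
Without the concept of a binder, a SpringBoot application has to interact with the message middleware directly. Because each middleware was built with a different purpose in mind, their implementation details differ considerably.
By defining the binder (Binder) as an intermediate layer, the application is isolated from the details of the message middleware. The application is exposed to a unified Channel and no longer needs to consider the different middleware implementations.
```

### 24.3、 Binder

```
INPUT corresponds to the consumer
OUTPUT corresponds to the producer
```

Messaging in stream follows the publish-subscribe pattern.

### 24.4、The standard Spring Cloud Stream flow

1. Binder: connects to the middleware conveniently and hides the differences.

2. Channel: an abstraction of a queue (Queue). In a messaging system it is the medium that stores and forwards messages; queues are configured through the Channel.

3. Source and Sink: put simply, the point of reference is Spring Cloud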
Stream itself: publishing a message from Stream is output, and receiving a message is input.

![img](image/spring-cloud-stream-application.png)

![img](image/spring-cloud-stream.png)

### 24.5、Create three submodules for testing

cloud-stream-rabbitmq-provider8801, the producer module that sends messages
cloud-stream-rabbitmq-consumer8802, a message receiving module
cloud-stream-rabbitmq-consumer8803, a message receiving module

### 24.6、Message-driven producer

cloud-stream-rabbitmq-provider8801. Add the stream-rabbitmq dependency here. Then edit the yml file to bind the rabbitmq address and to bind the channel to the exchange name. Create an interface for sending messages and implement it.

```java
package com.learn.springcloud.service.impl;

import com.learn.springcloud.service.IMessageProvider;
import org.springframework.cloud.stream.annotation.EnableBinding;
import org.springframework.cloud.stream.messaging.Source;
import org.springframework.integration.support.MessageBuilder;
import org.springframework.messaging.MessageChannel;

import javax.annotation.Resource;
import java.util.UUID;

/**
 * @ClassName: MessageProviderImpl
 * @Description:
 * @Author: lin
 * @Date: 2020/8/21 16:15
 * History:
 * @ 1.0
 *
 * @EnableBinding can be read as defining the sending channel of a message producer
 */
@EnableBinding(Source.class) // defines the channel that messages are pushed to
public class MessageProviderImpl implements IMessageProvider {

    /**
     * Message sending channel.
     */
    @Resource
    private MessageChannel output;

    @Override
    public String send() {
        String serial = UUID.randomUUID().toString();
        output.send(MessageBuilder.withPayload(serial).build());
        System.out.println("**********serial: " + serial);
        return null;
    }
}
```

Then create the SendMessageController controller, start the module and call its endpoint to check whether the message reaches rabbitmq: http://localhost:8801/sendMessage . RabbitMQ now contains an exchange named studyExchange.

![img](image/spring-cloud-stream-studyexchange.png)

RabbitMQ also shows that the messages have arrived.

![img](image/spring-cloud-stream-rabbitmq-message.png)

### 24.7、Message-driven consumer

Create the cloud-stream-rabbitmq-consumer8802 module, add the dependency and edit application.yml. The consumer side receives, so the binding here is input: change output to input.

```
spring:
  application:
    name: cloud-stream-consumer
  cloud:
    stream:
      binders: # RabbitMQ service information to bind to
        defaultRabbit: # name of this definition, used when wiring up the binding
          type: rabbit # message middleware type
          environment: # RabbitMQ environment settings
            spring:
              rabbitmq:
                host: localhost
                port: 5672
                username:
                  guest
                password: guest
      bindings: # wiring of the service
        input: # name of a channel
          destination: studyExchange # name of the exchange to use
          content-type: application/json # message type, json here; use text/plain for text
          binder: defaultRabbit # the message service definition to bind to
```

Create a listener to consume messages, and add @EnableBinding(Sink.class) to bind the sink. The theory above covers both source and sink. Note the difference from the earlier controllers: ReceiveMessageListenerController is a component here.

```java
package com.learn.springcloud.controller;

import org.springframework.beans.factory.annotation.Value;
import org.springframework.cloud.stream.annotation.EnableBinding;
import org.springframework.cloud.stream.annotation.StreamListener;
import org.springframework.cloud.stream.messaging.Sink;
import org.springframework.messaging.Message;
import org.springframework.stereotype.Component;

/**
 * @ClassName: ReceiveMessageListenerController
 * @Description:
 * @Author: lin
 * @Date: 2020/8/21 17:27
 * History:
 * @ 1.0
 */
@Component
@EnableBinding(Sink.class)
public class ReceiveMessageListenerController {

    @Value("${server.port}")
    private String serverPort;

    /**
     * @StreamListener(Sink.INPUT) receives messages on the given channel;
     * it listens on the Sink.INPUT input source.
     * @param message
     */
    @StreamListener(Sink.INPUT)
    public void input(Message<String> message){
        System.out.println("消费者1号，----->接收到的消息：" + message.getPayload() + "\t port:" + serverPort);
    }
}
```

Start 8802 and request http://localhost:8801/sendMessage again. The producer has sent the messages:

![img](image/send-message-8801.png)

On the consumer side, 8802 has received them:

![img](image/receive-message-8802.png)

### 24.8 Consumer groups and persistence: the duplicate consumption problem

```
Consider this scenario: the order system is deployed as a cluster, and every instance reads order messages from RabbitMQ. If one order is picked up by two services at the same time, the data becomes inconsistent, which must be avoided.
This can be solved with consumer groups in Stream.
Note that in Stream, multiple consumers in the same group compete with each other, which guarantees that a message is consumed only once, by one of them.
Different groups each receive every message (so across groups the message is consumed repeatedly).
```

### 24.9 Using group in stream to solve duplicate consumption

```
Symptom: duplicate consumption
Cause: the default groups are different. Each instance gets its own group serial number, so they are treated as different groups and each one consumes the message.
Configure the group explicitly and put the instances in the same group to solve duplicate consumption.
```

Add group to the configuration files of 8802 and 8803, using the same group name for both. This avoids duplicate consumption. Start and test by sending messages through http://localhost:8801/sendMessage ; send 6 messages.
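The delivery rule behind consumer groups can be shown with a small in-memory simulation. This is only a sketch, not Spring Cloud Stream or RabbitMQ code: the class `GroupDispatchSketch`, the group names and the use of strict round-robin inside a group are all assumptions made for illustration. Each published message goes to every group once, and within a group only one member receives it.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** In-memory illustration of group semantics: broadcast across groups, compete within a group. */
class GroupDispatchSketch {
    // group name -> consumers in that group
    private final Map<String, List<String>> groups = new LinkedHashMap<>();
    // group name -> number of messages already delivered to that group
    private final Map<String, Integer> delivered = new HashMap<>();
    // consumer name -> messages it received
    private final Map<String, List<String>> received = new LinkedHashMap<>();

    void subscribe(String group, String consumer) {
        groups.computeIfAbsent(group, g -> new ArrayList<>()).add(consumer);
        received.put(consumer, new ArrayList<>());
    }

    void publish(String message) {
        for (Map.Entry<String, List<String>> entry : groups.entrySet()) {
            List<String> members = entry.getValue();
            // pick the next member of this group in turn (round-robin is an assumption of this sketch)
            int index = delivered.merge(entry.getKey(), 1, Integer::sum) - 1;
            String target = members.get(index % members.size());
            received.get(target).add(message);
        }
    }

    List<String> receivedBy(String consumer) {
        return received.get(consumer);
    }
}

class GroupDispatchDemo {
    public static void main(String[] args) {
        // 8802 and 8803 share one group: they compete for messages
        GroupDispatchSketch sameGroup = new GroupDispatchSketch();
        sameGroup.subscribe("groupA", "consumer8802");
        sameGroup.subscribe("groupA", "consumer8803");

        // 8802 and 8803 in different groups: each gets every message
        GroupDispatchSketch differentGroups = new GroupDispatchSketch();
        differentGroups.subscribe("groupA", "consumer8802");
        differentGroups.subscribe("groupB", "consumer8803");

        for (int i = 1; i <= 6; i++) {
            sameGroup.publish("message-" + i);
            differentGroups.publish("message-" + i);
        }

        System.out.println(sameGroup.receivedBy("consumer8802").size());       // 3
        System.out.println(sameGroup.receivedBy("consumer8803").size());       // 3
        System.out.println(differentGroups.receivedBy("consumer8802").size()); // 6
        System.out.println(differentGroups.receivedBy("consumer8803").size()); // 6
    }
}
```

With both consumers in one group, six messages are split three and three. With the consumers in different groups, both receive all six, which is the duplicate consumption the group setting is meant to prevent.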
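![img](image/send-message-8801-1.png)

Then check 8802

![img](image/receive-message-8802-2.png)

and 8803: each received 3 messages. No receiver gets all of the messages, so nothing is consumed twice.

![img](image/receive-message-8803-2.png)

### 24.10 Message persistence in stream

```
If the group is removed from 8802 while 8803 keeps its group configuration, and 8801 then sends messages again, 8802 cannot retrieve
the messages 8801 already sent once it restarts. Those messages are lost to it. 8803, on the other hand, does receive the messages that 8801 already sent to
the MQ after it restarts, so no message is lost.
```

## 25、Sleuth: distributed tracing

### 1、Why this technology exists

In a microservice framework, a request from a client passes through several different service nodes in the backend, which together produce the final result. Every front-end request forms a complex distributed call chain, and high latency or an error in any link of that chain causes the whole request to fail.

To download zipkin, just download the jar, start it locally with java -jar, and open http://localhost:9411/zipkin/ to see zipkin's web UI.

![img](image/zipkin-start.png)

```
https://dl.bintray.com/openzipkin/maven/io/zipkin/java/zipkin-server/2.12.9/
```

![img](image/zipkin-request.png)

Simplified principle

![img](image/zipkin-request-1.png)

```
Trace: a tree-like collection of Spans that represents one call chain and has a unique identifier
span: identifies the origin of a call in the chain; put simply, a span is the information about one request
```

### 25.1、Testing the request chain

Add the sleuth dependency to cloud-provider-payment8001, set the zipkin monitoring address in application.yml, and add a method to the controller to test the request chain.

```
/**
 * Tracing
 *
 * @return
 */
@GetMapping(value = "/payment/zipkin")
public String paymentZipkin() {
    return "hi,i'am paymentZipkin server fall back,welcome to atguigu,O(∩_∩)O哈哈~";
}
```

Then modify cloud-consumer-order80: likewise add the sleuth dependency and edit the yml file, and add a test method to its controller.

```
/**
 * Tracing with zipkin + sleuth
 * http://localhost/consumer/payment/zipkin
 *
 * @return
 */
@GetMapping("/consumer/payment/zipkin")
public String paymentZipkin() {
    // the provider cloud-provider-payment8001 listens on port 8001
    return restTemplate.getForObject("http://localhost:8001/payment/zipkin/", String.class);
}
```

Call 8001 from 80 so that there is a call chain.

![img](image/provider-consumer-zipkin-80.png)

In zipkin the complete chain of 80 calling 8001 is visible.

![img](image/consumer80-transfer-provider8001-zipkin.png)

## 26、Getting started with Spring Cloud Alibaba

```
Spring Cloud Alibaba aims to provide a one-stop solution for microservice development. The project contains the components needed to develop distributed application services, so that developers can easily use them through the Spring Cloud programming model.
With Spring Cloud Alibaba, a few annotations and a little configuration are enough to connect a Spring Cloud application to Alibaba's distributed application solution and to build a distributed application system quickly on Alibaba middleware.
```

### 26.1、What Spring Cloud Alibaba can do

```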
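Flow control and degradation: supports Servlet, Feign, RestTemplate, Dubbo and RocketMQ out of the box. Flow control and degradation rules can be changed at runtime through the console, and the related Metrics can be viewed as well.
Service registration and discovery: conforms to the Spring Cloud service registration and discovery standard and integrates Ribbon support by default.
Distributed configuration management: supports externalized configuration in distributed systems, with automatic refresh when the configuration changes.
Message-driven capability: builds message-driven capability for microservice applications on top of Spring Cloud Stream.
Alibaba Cloud Object Storage: massive, secure, low-cost and highly reliable cloud storage provided by Alibaba Cloud. Data of any type can be stored and accessed from any application, at any time, from anywhere.
Distributed task scheduling: provides second-level, precise, highly reliable and highly available scheduled task scheduling (based on Cron expressions). It also provides distributed task execution models such as grid tasks, which distribute a huge number of subtasks evenly over all Workers (schedulerx-client).
```

### 26.2、 Nacos: service registry and configuration center

```
Why the name nacos: the first four letters are the first two letters of Naming and of Configuration, and the final s stands for Service.
```

### 26.3、 What is Nacos

```
A platform for dynamic service discovery, configuration management and service management that makes it easier to build cloud-native applications.
Nacos: Dynamic Naming and Configuration Service
Nacos is a registry and a configuration center combined
```

Download and install nacos; version 1.1.4 is used here. On Windows, download the archive, unpack it and double-click startup.cmd. Open localhost:8848/nacos to see the UI. The default password is nacos.

![img](image/nacos-start-1.png)

### 26.3、Create a new module for testing

Create a new module that uses Spring Cloud Alibaba, named cloudalibaba-provider-payment9001. Add the spring cloud alibaba dependency, edit the configuration file and create the main class PaymentMain9001; the eureka annotation is no longer needed here. Then create a PaymentController for testing.

```java
package com.learn.springcloud.controller;

import org.springframework.beans.factory.annotation.Value;
import org.springframework.web.bind.annotation.GetMapping;
import org.springframework.web.bind.annotation.PathVariable;
import org.springframework.web.bind.annotation.RestController;

/**
 * @ClassName: PaymentController
 * @Description:
 * @Author: lin
 * @Date: 2020/8/22 22:53
 * History:
 * @ 1.0
 */
@RestController
public class PaymentController {

    @Value("${server.port}")
    private String serverPort;

    @GetMapping("/payment/nacos/{id}")
    public String getPayment(@PathVariable("id") Integer id){
        return "nacos register, serverport=" + serverPort + "\t id:" + id;
    }
}
```

Start the main class. In nacos the microservice now shows up as registered.

![img](image/payment-nacos-9001.png)

### 26.4 Create a second provider module

The module is cloudalibaba-provider-payment9002. Add the same dependencies and change only the port number in the configuration file. After starting its main class, nacos shows two instances under the same microservice name.

![img](image/provider-nacos-2.png)

### 26.5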
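Create the service consumer cloudalibaba-consumer-nacos-order83 and modify its pom and yml files. Then create the main class and a controller that calls the provider's endpoint. Note that, unlike before, the provider is not named in a variable in the code; its name is set in the configuration instead, which makes it easier to change. The screenshot below shows that the consumer is registered in nacos.

![img](image/consumer-nacos-01.png)

Because nacos integrates ribbon, it supports load balancing. Test with http://localhost:83/consumer/payment/nacos/23 and you can see the calls being balanced between 9001 and 9002.

![img](image/consumer-payment-nacos-02.png)

Nacos compared with other registries

![img](image/nacos-compare-other.png)

### 26.6 Nacos supports switching between AP and CP mode

```
C means that all nodes see the same data at the same time; A means that every request receives a response.
When to choose which mode?
In general, if you do not need to store service-level information, and service instances are registered through nacos-client and can keep reporting heartbeats, choose AP mode. Today's mainstream services such as Spring Cloud and Dubbo services fit AP mode. AP mode weakens consistency in favor of availability, so AP mode supports registering ephemeral instances.
If you need to edit or store configuration at the service level, CP is required; K8S services and DNS services fit CP mode.
CP mode supports registering persistent instances. The cluster then runs on the Raft protocol, and a service must be registered before its instances; if the service does not exist, an error is returned.
Switching:
curl -X PUT '$NACOS_SERVER:8848/nacos/v1/ns/operator/switches?entry=serverMode&value=CP'
```

## 27、 Nacos as the configuration center

Create the module cloudalibaba-config-nacos-client3377. It has two configuration files: bootstrap.yml and application.yml. Like SpringCloud-config, nacos requires the project to pull its configuration from the configuration center during initialization; only after that can the project start normally. bootstrap has higher priority than application. Create ConfigClientController for testing.

```java
package com.learn.springcloud.controller;

import org.springframework.beans.factory.annotation.Value;
import org.springframework.cloud.context.config.annotation.RefreshScope;
import org.springframework.web.bind.annotation.GetMapping;
import org.springframework.web.bind.annotation.RestController;

/**
 * @ClassName: ConfigClientController
 * @Description:
 * @Author: lin
 * @Date: 2020/8/23 9:30
 * History:
 * @ 1.0
 */
@RestController
@RefreshScope // enables Nacos dynamic refresh
public class ConfigClientController {

    @Value("${config.info}")
    private String configInfo;

    @GetMapping("/config/info")
    public String getConfigInfo(){
        return configInfo;
    }
}
```

The naming in the configuration file must match the rule used in nacos.

```
# Matching rule from the official docs
# ${spring.application.name}-${spring.profiles.active}.${spring.cloud.nacos.config.file-extension}
# nacos-config-client-dev.yml
```

![img](image/nacos-config-11.png)

Explanation of the rule

![img](image/nacos-config-009.png)

Now start 3377,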
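then visit http://localhost:3377/config/info to view the configuration.

![img](image/nacos-config-info-1.png)

If the file in the configuration center is modified, the change is picked up immediately.

![img](image/nacos-config-info-2.png)

### 27.1、Multi-environment management in nacos with Namespace, Group and DataID

![img](image/nacos-namespace-group-service.png)

The outermost namespace can be used to separate deployment environments; Group and DataID logically separate two target objects.

Defaults: Namespace=public, Group=DEFAULT_GROUP, and the default cluster is DEFAULT.

The default Nacos namespace is public. A Namespace is mainly used for isolation. For example, with three environments (development, test, production) we can create three Namespaces, and they are isolated from each other.

Group defaults to DEFAULT_GROUP. A Group can put different microservices into the same group.

A Service is a microservice. One Service can contain several Clusters; the default Nacos Cluster is DEFAULT. A Cluster is a virtual partition of a given microservice. For example, for disaster recovery a service may be deployed in both a Hangzhou and a Guangzhou data center. The instances in Hangzhou can be given the cluster name HZ and those in Guangzhou the cluster name GZ, and services in the same data center can be made to call each other where possible to improve performance.

### 27.2、Nacos DataID configuration

```
Set spring.profiles.active and the DataID of the configuration file so that different environments read different configurations.
Default namespace + default group + two new DataIDs, dev and test.
The spring.profiles.active property is enough to read configuration files for multiple environments.
```

Add a configuration named nacos-config-client-test.yaml in nacos, then change active in the application.yml of 3377 to point to the desired environment. This is how you switch between environments.

![img](image/nacos-config-test-01.png)

Restart 3377 and visit http://localhost:3377/config/info: it has switched to the test environment.

![img](image/nacos-config-test-02.png)

### 27.3、Nacos Group configuration

Create configurations in nacos that share the same Data ID but belong to different groups.

![img](image/nacos-config-group-01.png)

Then modify the project's configuration file: add the group property and set it to the desired group. Test with http://localhost:3377/config/info

![img](image/nacos-config-group-02.png)

### 27.4、Namespace

The default namespace is public. Here we create two namespaces, dev and test.

![img](image/nacos-namespace-01.png)

Then add namespace to bootstrap, pointing to the environment just created. Create configurations in different groups under the dev namespace.

![img](image/nacos-namespace-dev-group-01.png)

Restart and test: http://localhost:3377/config/info now returns the DEV_GROUP configuration from the dev namespace.

nacos-namespace-dev-001.png

### 27.5、Nacos cluster and persistence configuration

```
By default Nacos stores its data in an embedded database. Therefore, if several Nacos nodes are started with the default configuration, their stored data is not consistent.
To solve this, Nacos uses centralized storage to support cluster deployment; currently only MySQL is supported.
Nacos supports three deployment modes
1、Standalone mode: for testing and single-machine trials
2、Cluster mode: for production, to ensure high availability.
3、Multi-cluster mode: for multi-data-center scenarios
```

### 27.6 Switch nacos from the default embedded derby database to mysql

Test on Windows first. In the downloaded nacos files, run nacos-mysql.sql from the conf directory against the database. Then edit application.properties and add the following settings to switch.

```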
spring.datasource.platform=mysql
db.num=1
db.url.0=jdbc:mysql://127.0.0.1:3306/nacos?characterEncoding=utf8&connectTimeout=1000&socketTimeout=3000&autoReconnect=true&useUnicode=true&useSSL=false&serverTimezone=UTC
db.user=root
db.password=123
```

Restart nacos and create a new configuration.

![img](image/nacos-config-mysql-01.png)

The new configuration now shows up in the database.

![img](image/nacos-config-mysql-info-1.png)

### 27.7、Configure a nacos cluster on linux

Copy the example cluster file and change the IPs.

![img](image/nacos-cluster-01.png)

If the nacos cluster runs on a single machine with three different ports, modify startup.sh in the nacos bin directory as follows.

```shell script
# m: which mode to run in
# f: the FUNCTION_MODE
# p: newly added option for the port
while getopts ":m:f:s:p:" opt
do
    case $opt in
        m)
            MODE=$OPTARG;;
        f)
            FUNCTION_MODE=$OPTARG;;
        s)
            SERVER=$OPTARG;;
        p)
            PORT=$OPTARG;;
        ?)
        echo "Unknown parameter"
        exit 1;;
    esac
done

# At the end, add -Dserver.port=${PORT}
# start
echo "$JAVA ${JAVA_OPT}" > ${BASE_DIR}/logs/start.out 2>&1 &
nohup $JAVA -Dserver.port=${PORT} ${JAVA_OPT} nacos.nacos >> ${BASE_DIR}/logs/start.out 2>&1 &
echo "nacos is starting，you can check the ${BASE_DIR}/logs/start.out"
```

Download and unpack Nginx

```
Enter the unpacked directory and set the install path: cd /usr/local/nginx
./configure --prefix=/usr/local/nginx --conf-path=/usr/local/nginx/nginx.conf
Note: if prefix is not given, executables go to /usr/local/bin, libraries to /usr/local/lib, and configuration files to /usr/local/etc by default
```

![img](image/nginx-config-01.png)

Compile: run make in /usr/local/nginx

![img](image/nginx-make.png)

Install: run make install in /usr/local/nginx

![img](image/nginx-make-install.png)

Files under nginx

![img](image/nginx-config-02.png)

If the following error appears, ./configure was run with a wrong path.

```
"conf/koi-win" 与"/usr/local/nginx-1.18.0/conf/koi-win" 为同一文件
```

Directories such as sbin only appear after nginx has been compiled. Configure the Nginx proxy:

```shell script
upstream cluster{
    server 192.168.199.201:8848;
    server 192.168.199.202:8848;
    server 192.168.199.203:8848;
}

server {
    # The default port is 80; it is changed to 1111 here, so all incoming requests hit port 1111 first
    listen       1111;
    server_name  localhost;

    #charset koi8-r;

    #access_log  logs/host.access.log  main;

    location / {
        # root   html;
        # index  index.html index.htm;
        # use our own proxy
        proxy_pass http://cluster;
    }

    #error_page  404              /404.html;

    # redirect
    # server error pages to the static page /50x.html
    # ......
# }
```

In the nginx directory, enter sbin and run ./nginx. After nginx starts, visit the configured IP http://192.168.199.201/ to confirm that nginx is running.

![img](image/nginx-start-0001.png)

Start nacos, as shown below.

![img](image/nacos-cluster-linux-01.png)

Check that the cluster has 3 nodes.

```shell script
[root@cc1 bin]# ps -ef|grep nacos|grep -v grep|wc -l
3

# open the ports
[root@cc1 bin]# firewall-cmd --zone=public --add-port=3306/tcp --permanent
Warning: ALREADY_ENABLED: 3306:tcp
success
[root@cc1 bin]# firewall-cmd --zone=public --add-port=1111/tcp --permanent
success
```

## 28、Spring Cloud Alibaba Sentinel: circuit breaking and flow control

1. What is it?

A high-availability flow control and protection component for cloud-native microservices. It protects your microservices. As microservices become popular, the stability between services matters more and more. Sentinel takes traffic as its entry point and protects service stability along several dimensions: flow control, circuit breaking and degradation, and system load protection.

### 28.1、Download and install Sentinel

The download from github is a single jar file; start it directly with java -jar.

![img](image/sentinel-start-01.png)

### 28.2、Testing circuit breaking, flow control and more

Create the module cloudalibaba-sentinel-service8401, which works together with nacos8848 to test circuit breaking and flow control. Add the sentinel dependencies and the nacos dependency to the pom, then modify the yml configuration file.

```yaml
server:
  port: 8401

spring:
  application:
    name: cloud-alibaba-sentinel-service
  cloud:
    nacos:
      # Nacos service registry address
      discovery:
        server-addr: localhost:8848
    sentinel:
      transport:
        # Sentinel dashboard address
        dashboard: localhost:8080
        # Default port is 8719; if it is taken, ports are scanned upward from 8719 one by one until a free one is found
        port: 8719

management:
  endpoints:
    web:
      exposure:
        include: "*"
```

Create the main class MainApp8401, then create FlowLimitController for testing.

```java
package controller;

import com.alibaba.csp.sentinel.annotation.SentinelResource;
import com.alibaba.csp.sentinel.slots.block.BlockException;
import lombok.extern.slf4j.Slf4j;
import org.springframework.web.bind.annotation.GetMapping;
import org.springframework.web.bind.annotation.RequestParam;
import org.springframework.web.bind.annotation.RestController;

import java.util.concurrent.TimeUnit;

/**
 * @ClassName: FlowLimitController
 * @Description:
 * @Author: lin
 * @Date: 2020/8/24 22:30
 * History:
 * @ 1.0
 */
@RestController
@Slf4j
public class FlowLimitController {

    /**
     * Method testA
     * @return a fixed string
     */
    @GetMapping("/testA")
    public String testA(){
        return "testA-----";
    }

    @GetMapping("/testB")
    public String
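    testB(){
        return "testB -----";
    }
}
```

After starting nacos and sentinel, open the sentinel console: it is empty. Sentinel loads lazily, so a service only shows up in the console after it has been accessed at least once.

![img](image/cloud-alibab-sentinel-8401-01.png)

Refresh the sentinel console again and the microservice cloud-alibaba-sentinel-service appears.

![img](image/cloud-alibaba-sentinel-02.png)

### 28.3、Flow control, configured under cluster node links: QPS (queries per second)

![img](image/cloud-alibaba-sentinel-03.png)

The test rule uses a threshold of 1 per second. After adding it, open the flow rule list. If the threshold is exceeded within 1s, requests fail fast.

![img](image/cloud-alibaba-sentinel-04.png)

Then test http://localhost:8401/testA: keep refreshing and the requests get limited. This is fast fail (the default error).

![img](image/cloud-alibaba-sentinel-flow-limiting-01.png)

### 28.4、Flow control: thread count

To test the thread count, change testA so that it sleeps for 1000 milliseconds, then open testA in two browser tabs. The requests are limited as well.

![img](image/cloud-alibaba-sentinel-thread-01.png)

Difference between QPS and thread count

```
QPS: think of a bank handling customers; QPS keeps people outside the door.
Thread count: people are already inside the bank, but only one counter clerk can serve them, so everyone else is limited.
```

### 28.4、Flow control: association

```
When the associated resource reaches its threshold, the resource itself is limited.
When resource B, which is associated with A, reaches its threshold, A itself is limited.
```

![img](image/cloud-alibaba-sentinel-05.png)

Test with postman

![img](image/cloud-alibaba-sentinel-postman-test-01.png)

In postman, set up 200 threads in total, with one request sent every 0.3 seconds.

![img](image/cloud-alibaba-sentinel-postman-test-02.png)

Request testA in the browser again: testA is limited. The heavy traffic on testB makes testA unavailable.

```
Fast fail, source: com.alibaba.csp.sentinel.slots.block.flow.controller.DefaultController
```

### 28.4、Flow control: warm up

```
The default coldFactor (cold load factor) is 3: the allowed QPS starts at threshold/3 and rises gradually to the configured QPS threshold over the warm-up period.
Source: com.alibaba.csp.sentinel.slots.block.flow.controller.WarmUpController
```

Test case

![img](image/cloud-alibaba-sentinel-warm-up-01.png)

```
Meaning: threshold 10 with a warm-up period of 5 seconds.
The initial threshold is 10/3, about 3, so the threshold starts at 3; over 5 seconds it slowly rises back to 10.
```

Visit http://localhost:8401/testB: during the first 5 seconds Blocked by Sentinel(flow limiting) appears. After that the limit slowly rises back to the configured value.

### 28.5、Flow control: queueing

```
Uniform queueing (RuleConstant.CONTROL_BEHAVIOR_RATE_LIMITER) strictly controls the interval between passing requests, so requests pass at a uniform rate. It corresponds to the leaky bucket algorithm.
com.alibaba.csp.sentinel.slots.block.flow.controller.RateLimiterController
```

Configure uniform queueing

![img](image/cloud-alibaba-sentinel-queue-01.png)

```
Uniform queueing lets requests pass at a uniform rate. The threshold type must be QPS, otherwise it has no effect.
Meaning of the setting: /testA allows 1 request per second; requests beyond that wait in a queue, with a timeout of 20000 milliseconds.
```

### 28.6 Sentinel service degradation

Sentinel provides the following circuit-breaking strategies:

1、Slow call ratio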
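(SLOW_REQUEST_RATIO): the slow call ratio is used as the threshold. You set the allowed slow-call RT (the maximum response time); a request whose response time exceeds it counts as a slow call. When the number of requests within the statistics interval (statIntervalMs) exceeds the configured minimum and the slow call ratio exceeds the threshold, requests are automatically broken for the following break duration. After the break duration the circuit breaker enters the probing recovery state (HALF-OPEN): if the next request responds faster than the slow-call RT, the break ends; otherwise the circuit is broken again.

2、Exception ratio (ERROR_RATIO): when the number of requests within the statistics interval (statIntervalMs) exceeds the configured minimum and the exception ratio exceeds the threshold, requests are automatically broken for the following break duration. After the break duration the circuit breaker enters the probing recovery state (HALF-OPEN): if the next request completes successfully (without error), the break ends; otherwise the circuit is broken again. The exception ratio threshold ranges over [0.0, 1.0], meaning 0% to 100%.

3、Exception count (ERROR_COUNT): when the number of exceptions within the statistics interval exceeds the threshold, the circuit is broken automatically. After the break duration the circuit breaker enters the probing recovery state (HALF-OPEN), as with the two strategies above.

The degradation settings are available in the sentinel console.

![img](image/sentinel-fuse-01.png)

```
RT (average response time, per second)
    Degradation is triggered when both conditions hold: the average response time exceeds the threshold, and at least 5 requests passed within the time window.
    The circuit breaker closes after the window has passed.
    RT is capped at 4900 (a larger value only takes effect via -Dcsp.sentinel.statistic.max.rt=xxxx)
Exception ratio (per second)
    Degradation is triggered when QPS >= 5 and the exception ratio (per-second statistics) exceeds the threshold; degradation ends when the time window ends.
Exception count (per minute)
    Degradation is triggered when the exception count (per-minute statistics) exceeds the threshold; degradation ends when the time window ends.
```

When a resource in the call chain becomes unstable (for example, calls time out or the exception ratio rises), Sentinel restricts calls to that resource and makes requests fail fast, so that other resources are not affected and cascading errors are avoided. Once a resource is degraded, all calls to it within the following degradation time window are broken automatically (the default behavior is to throw DegradeException).

Add the test endpoint testD.

```
@GetMapping("/testD")
public String testD(){
    try {
        TimeUnit.SECONDS.sleep(1);
    } catch (InterruptedException e) {
        e.printStackTrace();
    }
    log.info("testD 测试RT");
    return "testD -----";
}
```

Then configure the degradation rule.

![img](image/sentinel-fuse-02.png)

```
Average response time (DEGRADE_GRADE_RT): when 5 requests enter continuously within 1s and the average response time (per second) at that moment exceeds the threshold (count, in ms), then within the following time window (timeWindow in DegradeRule, in s) calls to this method are broken automatically (a DegradeException is thrown).
Note that Sentinel caps the measured RT at 4900ms by default; anything above is counted as 4900ms. To change the cap, use the startup option -Dcsp.sentinel.statistic.max.rt=xx
```

The rule above expects a response within 200 milliseconds, but the test method testD sleeps for 1s. Test with jmeter using 10 threads; each request takes 1 second, which is far from meeting the 200ms requirement.

![img](image/sentinel-rt-jmeter-01.png)

Requesting http://localhost:8401/testD now returns Blocked by Sentinel(flow limiting). With the settings above, 10 threads (more than 5) call testD every second. We expect each call to finish within 200 milliseconds; if it does not, the circuit breaker opens (the fuse trips) for the next 1-second time window and the microservice is unavailable. Once jmeter is stopped and the heavy traffic ends, the circuit breaker closes (the fuse recovers) and the microservice works again.

### 28.7 Sentinel service degradation: exception ratio

```
Exception ratio: when a resource receives >= 5 requests per second and the ratio of exceptions per second to passed requests exceeds the threshold, the resource enters the degraded state, that is, calls to this method return automatically within the following time window. The exception ratio threshold ranges from 0% to 100%.
```

Below, the exception ratio is set to 0.2 and the time window to 3 seconds: if the error ratio exceeds 0.2, the service is degraded and unavailable for the next 3s.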
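The trigger condition of the exception-ratio rule described above can be written out as plain arithmetic. The following is only a minimal sketch, not Sentinel's implementation: the class `ErrorRatioCheck` and the method `shouldDegrade` are hypothetical names, and the minimum of 5 requests per second is taken from the description in this section.

```java
// Hypothetical illustration of the exception-ratio trigger; this is not Sentinel source code.
public class ErrorRatioCheck {

    // Minimum number of requests per second before the ratio is evaluated (from the text above).
    public static final int MIN_REQUESTS = 5;

    /**
     * Returns true when the resource should enter the degraded state:
     * at least MIN_REQUESTS requests in the second and errors / total above the threshold.
     */
    public static boolean shouldDegrade(int totalRequests, int errorCount, double ratioThreshold) {
        if (totalRequests < MIN_REQUESTS) {
            return false; // too few requests to judge the ratio
        }
        double ratio = (double) errorCount / totalRequests;
        return ratio > ratioThreshold;
    }

    public static void main(String[] args) {
        System.out.println(shouldDegrade(10, 10, 0.2)); // every request fails: true
        System.out.println(shouldDegrade(10, 1, 0.2));  // 10% errors: false
        System.out.println(shouldDegrade(3, 3, 0.2));   // below the minimum request count: false
    }
}
```

With a threshold of 0.2, an endpoint that always throws is degraded as soon as it receives enough traffic, while occasional manual clicks never reach the minimum request count.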
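![img](image/sentinel-error-ratio-01.png)

Test method with an error ratio of 100%

```
@GetMapping("/testException")
public String testException(){
    log.info("testException 异常比例");
    int age = 10 /0 ;
    return "testException -----";
}
```

While jmeter is sending requests, a request to this endpoint shows that the service is unavailable.

![img](image/sentinel-error-ratio-02.png)

After jmeter is stopped, requesting http://localhost:8401/testException again shows an exception. The service is not degraded at this point; the error comes from the code itself, not from degradation.

### 28.8 Sentinel service degradation: exception count

```
When the number of exceptions for a resource in the last minute exceeds the threshold, the circuit is broken. Note that the time window is at the minute level: if timeWindow is less than 60s, the resource may enter the broken state again right after leaving it.
```

![img](image/sentinel-exception-numer-1.png)

Use the test method testE, which fails on purpose.

```
@GetMapping("/testE")
public String testExceptionCount(){
    log.info("testExceptionCount 异常数");
    int age = 10 /0 ;
    return "testExceptionCount -----";
}
```

Set the exception count to 5 and the time window to 70s.

![img](image/sentinel-exception-numer-rule-01.png)

Test http://localhost:8401/testE: once more than 5 exceptions have occurred, further requests are degraded and the service is unavailable.

![img](image/sentinel-exception-numer-2.png)

```
Visit http://localhost:8401/testE: the first request always fails, because division by zero is not allowed, and the error page is shown. After 5 errors, the circuit is broken and the service is degraded.
```

### 28.9 、Sentinel hot keys

```
What is a hot spot? A hot spot is frequently accessed data. Often we want to find the Top K most frequently accessed items among some hot data and limit access to them. For example:
With the product ID as the parameter, count the most frequently purchased product IDs over a period of time and limit them
With the user ID as the parameter, limit user IDs that access frequently over a period of time
Hot-parameter flow control counts the hot parameters among the passed arguments and, according to the configured threshold and mode, limits resource calls that contain the hot parameter. It can be seen as a special kind of flow control that only applies to resource calls containing the hot parameter.
```

![img](image/sentinel-hot-param-overview-1.png)

Add the method testHotKey to test hot keys. Note the @SentinelResource annotation: value is the resource name and must be unique, and blockHandler names the fallback method, similar to the HystrixCommand annotation.

```
@GetMapping("/testHotKey")
@SentinelResource(value = "testHotKey", blockHandler = "dealTestHotKey")
public String testHotKey(@RequestParam(value = "p1", required = false) String p1,
                         @RequestParam(value = "p2", required = false) String p2){
    return "testHotKey -----";
}

public String dealTestHotKey(String p1, String p2, BlockException blockException){
    return "dealTestHotKey---------";
}
```

First visit http://localhost:8401/testHotKey?p1=a&p2=b, which works normally.

![img](image/sentinel-hot-param-testHotKey-01.png)

Then add a rule in sentinel and test.

![img](image/sentinel-hot-param-rule-01.png)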
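To make the idea of hot-parameter limiting more concrete, here is a minimal sketch that counts calls per parameter value in one-second windows. It only illustrates the concept; Sentinel's real implementation is different. The class `HotParamLimiter` and the method `tryPass` are hypothetical names, and the time is passed in explicitly so that the behavior is easy to follow.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch of hot-parameter limiting; Sentinel's real implementation differs.
public class HotParamLimiter {

    private final int thresholdPerSecond;

    // For each parameter value: [0] = the one-second window last seen, [1] = calls counted in it.
    private final Map<String, long[]> windows = new HashMap<>();

    public HotParamLimiter(int thresholdPerSecond) {
        this.thresholdPerSecond = thresholdPerSecond;
    }

    /** Returns true if a call carrying this parameter value may pass at time nowMillis. */
    public synchronized boolean tryPass(String paramValue, long nowMillis) {
        if (paramValue == null) {
            return true; // calls without the hot parameter are not limited by this rule
        }
        long window = nowMillis / 1000;
        long[] state = windows.computeIfAbsent(paramValue, k -> new long[] {window, 0});
        if (state[0] != window) { // a new second has started: reset the counter
            state[0] = window;
            state[1] = 0;
        }
        if (state[1] >= thresholdPerSecond) {
            return false; // threshold reached for this value in the current second
        }
        state[1]++;
        return true;
    }

    public static void main(String[] args) {
        HotParamLimiter limiter = new HotParamLimiter(1);
        System.out.println(limiter.tryPass("a", 0));    // first call with p1=a: true
        System.out.println(limiter.tryPass("a", 100));  // second call in the same second: false
        System.out.println(limiter.tryPass("b", 100));  // different value, separate counter: true
        System.out.println(limiter.tryPass("a", 1000)); // next second: true
    }
}
```

Each distinct value gets its own counter, which is why one heavily requested value can be blocked while other values of the same parameter still pass.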
Once the hot key rule is in place, request http://localhost:8401/testHotKey?p1=a&p2=b again. One request per second works normally, but clicking rapidly returns the fallback content shown below.

![img](image/sentinel-hot-param-testHotKey-02.png)

Conclusion

```
@SentinelResource(value = "testHotKey", blockHandler = "dealTestHotKey")
With this annotation, when a rule configured in sentinel is violated, the request is routed to the method named by blockHandler for fallback handling. Without a fallback method the page shows the error page, which is not user friendly.
With the rule above, as soon as the QPS for the first parameter of testHotKey exceeds 1 per second, the request is degraded immediately.
```

### 28.10、 Sentinel hot key: parameter exception items

```
The test above showed the first parameter p1: once the QPS exceeds 1 click per second, it is limited immediately.
Special case:
Normal: more than one request per second reaches the threshold of 1 and is limited immediately
Now we want a different limit when p1 has a particular value
For example, when p1 equals 5, its threshold can reach 200
(in other words, when p1=5 its threshold can be much higher, depending on the rule configuration)
```

Add a parameter exception item. When p1 is not 5, the threshold is 1. When p1 equals 5, the threshold is 200.

![img](image/sentinel-hot-param-rule-special-case-01.png)

Requesting http://localhost:8401/testHotKey?p1=5 and refreshing continuously causes no problem; this is the special parameter value.

![img](image/sentinel-hot-param-special-case-02.png)

Note that the parameter must be a primitive type or String.

```
@SentinelResource handles violations of the rules configured in the Sentinel console, with the method configured as blockHandler as the fallback.
A RuntimeException such as int a = 10/0; is a runtime exception raised by Java at run time, and the blockHandler of @SentinelResource does not handle this kind of error.
Summary: @SentinelResource takes care of rule violations; runtime errors still go through the normal exception path.
```

## 28.8 Sentinel system rules

### 28.9 The @SentinelResource annotation: limiting by resource name and follow-up handling

Modify the pom of module 8401 to add cloud-api-commons. Add the new class RateLimitController for testing.

```java
package com.learn.springcloud.controller;

import cn.hutool.core.util.IdUtil;
import com.alibaba.csp.sentinel.annotation.SentinelResource;
import com.alibaba.csp.sentinel.slots.block.BlockException;
import com.learn.springcloud.entities.CommonResult;
import com.learn.springcloud.entities.Payment;
import com.learn.springcloud.handler.CustomerBlockHandler;
import org.springframework.web.bind.annotation.GetMapping;
import org.springframework.web.bind.annotation.RestController;

/**
 * @ClassName: RateLimitController
 * @Description:
 * @Author: lin
 * @Date: 2020/8/25 15:55
 * History:
 * @ 1.0
 */
@RestController
public class RateLimitController {

    @GetMapping("/byResource")
    @SentinelResource(value = "byResource", blockHandler = "handleException")
    public CommonResult byResource(){
        return new CommonResult(200,
                "按资源名称限流测试OK", new Payment(2020L, IdUtil.simpleUUID()));
    }

    // Fallback invoked when the byResource rule is violated
    public CommonResult handleException(BlockException blockException){
        return new CommonResult<>(444, blockException.getClass().getCanonicalName()+"\t服务不可用" );
    }
}
```

The code above tests limiting by resource name. Now configure a rule for the resource name with a threshold of 1; if the QPS exceeds it within 1s, requests are limited.

![img](image/sentinel-source-name-set-rule-01.png)

Test http://localhost:8401/byResource and then refresh rapidly: the requests are limited.

![img](image/sentinel-source-name-test-01.png)

### 28.9.2 Limiting by URL

When limiting by the requested URL, sentinel returns its own default flow-limiting message. Add a method limited by url to RateLimitController.

```
@GetMapping("/rateLimit/byUrl")
@SentinelResource(value = "byUrl")
public CommonResult byUrl(){
    return new CommonResult(200, "by url限流测试OK", new Payment(2020L, IdUtil.simpleUUID()));
}
```

http://localhost:8401/rateLimit/byUrl can be accessed normally.

![img](image/sentinel-rate-limit-url-01.png)

Add a flow rule for the url.

![img](image/sentinel-rate-limit-url-02.png)

Visit http://localhost:8401/rateLimit/byUrl again and refresh several times: the requests are limited. If no custom blockHandler is defined, the default one is used.

![img](image/sentinel-rate-limit-url-03.png)

Problems with the fallback approach above

```
1、The system default does not reflect our own business requirements
2、As it stands, our custom handler method is coupled with the business code, which is hard to read.
3、Adding a fallback to every business method makes the code bloat.
4、There is no unified, global handler.
```

### 28.9.3 Custom flow-limiting logic

Create the class CustomerBlockHandler for custom flow-limiting logic.

```java
package com.learn.springcloud.handler;

import com.alibaba.csp.sentinel.slots.block.BlockException;
import com.learn.springcloud.entities.CommonResult;

/**
 * @ClassName: CustomerBlockHandler
 * @Description:
 * @Author: lin
 * @Date: 2020/8/25 15:56
 * History:
 * @ 1.0
 */
public class CustomerBlockHandler {

    public static CommonResult handlerException(BlockException exception) {
        return new CommonResult(444, "客户自定义，global handlerException---1");
    }

    public static CommonResult handlerException2(BlockException exception) {
        return new CommonResult(444, "客户自定义，global handlerException---2");
    }
}
```

Then use CustomerBlockHandler in RateLimitController. Add a method whose blockHandlerClass names the shared flow-limiting and degradation class, and whose blockHandler names the method of that class to use.

```
@GetMapping("/rateLimit/customerBlockHandler")
@SentinelResource(value = "customerBlockHandler",
        blockHandlerClass = CustomerBlockHandler.class,
        blockHandler = "handlerException2")
public CommonResult customerBlockHandler(){
    return new CommonResult(200, "客户自定义限流测试OK", new Payment(2020L, IdUtil.simpleUUID()));
}
```

Test this method with http://localhost:8401/rateLimit/customerBlockHandler. The request works normally.

![img](image/sentinel-customer-block-handler-01.png)

Configure the rule in sentinel.

![img](image/sentinel-customer-block-handler-rule-01.png)

Request http://localhost:8401/rateLimit/customerBlockHandler again and click rapidly: the requests are limited.

![img](image/sentinel-customer-block-handler-flow-limiting-01.png)

## 29、Sentinel circuit breaking: sentinel with ribbon + openFeign + fallback

Create two modules, 9003 and 9004. Add the pom dependencies, modify application.yml, and add the main class and controller. Start them and test http://localhost:9003/paymentSQL/1 and http://localhost:9004/paymentSQL/1; both work normally.

![img](image/sentinel-payment-9003-01.png)

![img](image/sentinel-payment-9004-01.png)

Create the consumer module cloudalibaba-consumer-nacos-order84 and modify its pom and yml files. Then visit http://localhost:84/consumer/fallback/1 to check that requests are load balanced across 9003 and 9004. They are.

![img](image/sentinel-customer-ribbon-order84-01.png)

Round robin reaches 9004

![img](image/sentinel-customer-ribbon-order84-02.png)

If nothing is configured in the CircleBreakerController class of cloudalibaba-consumer-nacos-order84, neither circuit breaking nor degradation, the client gets the error page on failure, which is not user friendly.

### 29.1 Configure fallback on the fallback method of CircleBreakerController; this requires a fallback handler method

```
@RequestMapping("/consumer/fallback/{id}")
@SentinelResource(value = "fallback",fallback = "handlerFallback") // only fallback is configured; fallback handles business exceptions only
public CommonResult fallback(@PathVariable("id") Long id){
    CommonResult commonResult = restTemplate.getForObject(SERVICE_URL + "/paymentSQL/" + id, CommonResult.class);

    if(id == 4){
        throw new IllegalArgumentException("IllegalArgumentException,非法参数异常");
    }else if(commonResult.getData() == null){
        throw new NullPointerException("NullPointerException,该ID没有记录，空指针异常");
    }
    return commonResult;
}

// fallback for this example
public CommonResult handlerFallback(Long id, Throwable e){
    Payment payment = new Payment(id, null);
    return new CommonResult(444,
            "兜底异常handler，exception内容"+e.getMessage(), payment);
}
```

Now a request to http://localhost:84/consumer/fallback/4 no longer shows the error page on failure, but a friendlier message.

![img](image/sentinel-customer-fallback-order84-01.png)

With id=5, http://localhost:84/consumer/fallback/5 also returns a friendly message, this time for the null pointer exception.

![img](image/sentinel-customer-fallback-nullpointerexception-order84-01.png)

### 29.3 Configure only blockHandler in CircleBreakerController

```
@RequestMapping("/consumer/fallback/{id}")
@SentinelResource(value = "fallback",blockHandler = "blockHandler") // only blockHandler is configured; it handles violations of rules configured in the Sentinel console only
public CommonResult fallback(@PathVariable("id") Long id){
    CommonResult commonResult = restTemplate.getForObject(SERVICE_URL + "/paymentSQL/" + id, CommonResult.class);

    if(id == 4){
        throw new IllegalArgumentException("IllegalArgumentException,非法参数异常");
    }else if(commonResult.getData() == null){
        throw new NullPointerException("NullPointerException,该ID没有记录，空指针异常");
    }
    return commonResult;
}

public CommonResult blockHandler(Long id, BlockException exception){
    Payment payment = new Payment(id, null);
    return new CommonResult<>(445, "blockHandler-sentinel 限流，无此流水号：blockException" + exception.getMessage(), payment);
}
```

With only blockHandler configured, a degradation rule must be added in sentinel. The rule uses the exception count: after two exceptions, the service is degraded.

![img](image/sentinel-customer-block-handler-order84-01.png)

When testing http://localhost:84/consumer/fallback/4, the first two requests return the error page; degradation only starts once the error count exceeds two.

![img](image/sentinel-customer-block-handler-02.png)

### 29.3 Configure both blockHandler and fallback in CircleBreakerController

```
@RequestMapping("/consumer/fallback/{id}")
@SentinelResource(value = "fallback",fallback = "handlerFallback", blockHandler = "blockHandler")
public CommonResult fallback(@PathVariable("id") Long id){
    CommonResult commonResult = restTemplate.getForObject(SERVICE_URL + "/paymentSQL/" + id, CommonResult.class);

    if(id == 4){
        throw new IllegalArgumentException("IllegalArgumentException,非法参数异常");
    }else if(commonResult.getData() == null){
        throw new
                NullPointerException("NullPointerException,该ID没有记录，空指针异常");
    }
    return commonResult;
}

// fallback for this example
public CommonResult handlerFallback(Long id, Throwable e){
    Payment payment = new Payment(id, null);
    return new CommonResult(444, "兜底异常handler，exception内容"+e.getMessage(), payment);
}

public CommonResult blockHandler(Long id, BlockException exception){
    Payment payment = new Payment(id, null);
    return new CommonResult<>(445, "blockHandler-sentinel 限流，无此流水号：blockException" + exception.getMessage(), payment);
}
```

Then configure a flow rule in sentinel: type QPS, threshold 1.

![img](image/sentinel-customer-fallback-block-handler-order84-01.png)

Request http://localhost:84/consumer/fallback/2. One click per second works normally, but clicking rapidly shows that even valid requests are limited.

![img](image/sentinel-customer-fallback-order84-04.png)

Request http://localhost:84/consumer/fallback/4: at a normal rate the response is the exception message.

![img](image/sentinel-customer-fallback-exception-01.png)

But when http://localhost:84/consumer/fallback/4 is requested rapidly, the flow rule in sentinel is violated and the requests are limited.

![img](image/sentinel-customer-fallback-block-exception-01.png)

Conclusion

```
If both blockHandler and fallback are configured, a BlockException thrown because of flow limiting or degradation is handled by blockHandler only.
```

### 29.4 Ignoring exceptions in CircleBreakerController

With exceptionsToIgnore = {IllegalArgumentException.class}, if this exception is thrown, the fallback method no longer handles it and there is no degradation effect.

```
@RequestMapping("/consumer/fallback/{id}")
@SentinelResource(value = "fallback",fallback = "handlerFallback", blockHandler = "blockHandler",
        exceptionsToIgnore = {IllegalArgumentException.class})
public CommonResult fallback(@PathVariable("id") Long id){
    CommonResult commonResult = restTemplate.getForObject(SERVICE_URL + "/paymentSQL/" + id, CommonResult.class);

    if(id == 4){
        throw new IllegalArgumentException("IllegalArgumentException,非法参数异常");
    }else if(commonResult.getData() == null){
        throw new NullPointerException("NullPointerException,该ID没有记录，空指针异常");
    }
    return commonResult;
}
```

The request then shows the error page again.

### 29.5 Sentinel
Circuit breaking with OpenFeign. Modify module 84: add the OpenFeign dependency to the pom and enable Sentinel support for feign in the yml. Add the @EnableFeignClients annotation to the main class to activate Feign, and add the @FeignClient annotation to the PaymentService interface. The controller then no longer uses RestTemplate; the call is resolved through the service name given in @FeignClient.

```java
package com.learn.springcloud.service;

import com.learn.springcloud.entities.CommonResult;
import com.learn.springcloud.entities.Payment;
import org.springframework.cloud.openfeign.FeignClient;
import org.springframework.web.bind.annotation.GetMapping;
import org.springframework.web.bind.annotation.PathVariable;

/**
 * @ClassName: PaymentService
 * @Description:
 * @Author: lin
 * @Date: 2020/8/25 17:38
 * History:
 * @ 1.0
 */
@FeignClient(value = "nacos-payment-provider", fallback = PaymentFallbackService.class)
public interface PaymentService {

    @GetMapping("/paymentSQL/{id}")
    CommonResult paymentSQL(@PathVariable("id") Long id);
}
```

Then implement the PaymentService interface in the class PaymentFallbackService, which provides the fallback when an error occurs. The fallback attribute of the annotation on the interface names this class.

```java
package com.learn.springcloud.service;

import com.learn.springcloud.entities.CommonResult;
import com.learn.springcloud.entities.Payment;
import org.springframework.stereotype.Component;

/**
 * @ClassName: PaymentFallbackService
 * @Description:
 * @Author: lin
 * @Date: 2020/8/25 17:39
 * History:
 * @ 1.0
 */
@Component
public class PaymentFallbackService implements PaymentService{

    @Override
    public CommonResult paymentSQL(Long id) {
        return new CommonResult<>(444, "服务降级返回，----PaymentFallbackService", new Payment(id, "errorSerial"));
    }
}
```

Add an endpoint to CircleBreakerController for testing.

```java
package com.learn.springcloud.controller;

import com.alibaba.csp.sentinel.annotation.SentinelResource;
import com.alibaba.csp.sentinel.slots.block.BlockException;
import com.learn.springcloud.entities.CommonResult;
import com.learn.springcloud.entities.Payment;
import com.learn.springcloud.service.PaymentService;
import lombok.extern.slf4j.Slf4j;
import org.springframework.web.bind.annotation.GetMapping;
import org.springframework.web.bind.annotation.PathVariable;
import
       org.springframework.web.bind.annotation.RequestMapping;
import org.springframework.web.bind.annotation.RestController;
import org.springframework.web.client.RestTemplate;

import javax.annotation.Resource;

/**
 * @ClassName: CircleBreakerController
 * @Description:
 * @Author: lin
 * @Date: 2020/8/25 17:37
 * History:
 * @ 1.0
 */
@RestController
@Slf4j
public class CircleBreakerController {

    // --------------- open feign---------
    @Resource
    private PaymentService paymentService;

    @GetMapping("/consumer/paymentSQL/{id}")
    public CommonResult paymentSQL(@PathVariable("id") Long id){
        return paymentService.paymentSQL(id);
    }
}
```

A request to http://localhost:84/consumer/paymentSQL/2 now returns normally.

![img](image/sentinel-customer-payment-feign-order84-01.png)

Now shut down the provider 9003 on purpose to see whether consumer 84 degrades automatically or gets stuck waiting. After 9003 is shut down, requesting http://localhost:84/consumer/paymentSQL/2 again shows that the service is degraded, which is a form of self-protection.

![img](image/sentinel-customer-payment-feign-order84-02.png)

### 29.6 Persisting sentinel rules

```
Rules configured in sentinel disappear when sentinel is restarted, so in production the rule configuration must be persisted.
The flow rules are persisted into Nacos. As soon as any rest endpoint of 8401 is requested, the flow rules show up in the sentinel console. As long as the configuration in Nacos is not deleted, the flow rules for 8401 in sentinel stay in effect.
```

Add a dependency to the module cloudalibaba-sentinel-service8401 to persist the sentinel rules into nacos. Then add the nacos data source configuration to the yml.

```yaml
# placed under spring.cloud.sentinel
datasource:
  dsl:
    nacos:
      server-addr: localhost:8848
      dataId: cloud-alibaba-sentinel-service
      groupId: DEFAULT_GROUP
      data-type: json
      rule-type: flow
```

Then add a configuration in nacos. Pay close attention to its content.

```
[
    {
        "resource":"/rateLimit/byUrl",
        "limitApp": "default",
        "grade":1,
        "count":1,
        "strategy":0,
        "controlBehavior":0,
        "clusterMode":false
    }
]

#resource: resource name;
#limitApp: source application;
#grade: threshold type, 0 means thread count, 1 means QPS;
#count: single-machine threshold;
#strategy: flow control mode, 0 means direct, 1 means associated, 2 means chain;
#controlBehavior: flow control effect, 0 means fast fail, 1 means warm up, 2 means queueing;
#clusterMode: whether cluster mode is used.
```

The configuration is JSON, not a yml file.

![img](image/snetinel-data-source-naocs-payment8401.png)

Then start 8401. After requesting http://localhost:8401/rateLimit/byUrl, the rule is visible in sentinel.

![img](image/sentinel-flow-rule-payment8401.png)

Requesting http://localhost:8401/rateLimit/byUrl again shows that it is limited.

![img](image/cloud-alibab-sentinel-8401-02.png)
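The numeric codes in the flow rule JSON above (grade, strategy, controlBehavior) are easy to mix up. The small helper below decodes them according to the field list given with the rule. It is only a reading aid under that assumption: `FlowRuleCodes` and its methods are hypothetical names and are not part of Sentinel.

```java
// Hypothetical helper, not part of Sentinel: decodes the numeric codes used in the rule JSON above.
public class FlowRuleCodes {

    /** grade: threshold type (0 = thread count, 1 = QPS). */
    public static String grade(int code) {
        switch (code) {
            case 0: return "thread count";
            case 1: return "QPS";
            default: return "unknown";
        }
    }

    /** strategy: flow control mode (0 = direct, 1 = associated, 2 = chain). */
    public static String strategy(int code) {
        switch (code) {
            case 0: return "direct";
            case 1: return "associated";
            case 2: return "chain";
            default: return "unknown";
        }
    }

    /** controlBehavior: flow control effect (0 = fast fail, 1 = warm up, 2 = queueing). */
    public static String controlBehavior(int code) {
        switch (code) {
            case 0: return "fast fail";
            case 1: return "warm up";
            case 2: return "queueing";
            default: return "unknown";
        }
    }

    /** Builds a one-line, human-readable summary of a flow rule. */
    public static String describe(String resource, int grade, int count, int strategy, int controlBehavior) {
        return resource + ": " + grade(grade) + " threshold " + count
                + ", mode " + strategy(strategy)
                + ", effect " + controlBehavior(controlBehavior);
    }

    public static void main(String[] args) {
        // The rule from the JSON above: grade=1, count=1, strategy=0, controlBehavior=0
        System.out.println(describe("/rateLimit/byUrl", 1, 1, 0, 0));
    }
}
```

For the rule shown above, the summary reads as a QPS threshold of 1 in direct mode with fast fail, which matches the limiting behavior seen when the URL is refreshed quickly.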
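If 8401 is shut down now and sentinel is refreshed, the flow rule is gone. After 8401 is restarted and sentinel is refreshed again, the rule is back, which shows that the sentinel rule has been persisted.

![img](image/sentinel-data-source-nacos-payment8401-03.png)

## 30、Where the distributed transaction problem comes from

```
Take a microservice system with an order module, an inventory module and a payment module. If each module has its own database, possibly in different data centers, we are dealing with multiple data sources and cross-database calls across centers.
For example, placing an order adds a record, and only after the inventory has been deducted successfully is the money deducted from the payment account. These three operations form one unit: physically they may be stored in different databases, but logically they should be handled as a single transaction.
Coordinating multiple data sources in such a global, cross-database operation is what distributed transactions are about.
```

When a monolith is split into microservices, the original three modules become three independent applications with three independent data sources, and a business operation has to call three services. Data consistency inside each service is guaranteed by local transactions, but global data consistency cannot be guaranteed.

```
The business logic of a user buying a product is supported by 3 microservices:
Storage service: deducts the stock quantity for a given product.
Order service: creates an order based on the purchase request.
Account service: deducts the balance from the user's account.
```

So whenever one business operation spans multiple data sources or makes remote calls across multiple systems, a distributed transaction problem arises.

![img](image/seata-business-01.png)

### 30.1 What is seata? What can it do? What problem does it solve?

```
What seata is:
  Seata is an open-source distributed transaction solution that aims to provide high-performance, easy-to-use distributed transaction services in a microservice architecture.
What it can do:
  A typical distributed transaction is handled with an ID + three-component model.
  Transaction ID XID (globally unique transaction id)
  The three components:
  TC (Transaction Coordinator)
  Maintains the state of global and branch transactions and drives the global commit or rollback.
  TM (Transaction Manager)
  Defines the scope of a global transaction: begins it, commits it or rolls it back.
  RM (Resource Manager)
  Manages the resources of branch transactions, talks to the TC to register branch transactions and report their state, and drives branch commit or rollback.
```

The request flow is shown below.

![img](image/seata-transaction-01.png)

### 30.2、Download seata

Then modify file.conf and registry.conf: change the storage to mysql and register seata in nacos. Start nacos first, then seata; nacos then shows that the seata service is registered.

![img](image/seata-registry-nacos-01.png)

### 30.3、Preparing the order/storage/account business databases

Description of the distributed transaction scenario:

```
Three services are created here: an order service, a storage service and an account service.
When a user places an order, the order service creates an order, then calls the storage service remotely to deduct the stock of the ordered product,
then calls the account service remotely to deduct the balance from the user's account,
and finally sets the order status to completed in the order service.
The operation spans three databases and involves two remote calls, so it clearly has a distributed transaction problem.
In short: place order ----- deduct stock --- deduct account (balance)
```

Create the business databases and tables.

```
seata_order: database that stores orders;
seata_storage: database that stores inventory;
seata_account: database that stores account information;

CREATE DATABASE seata_order;
CREATE DATABASE seata_storage;
CREATE DATABASE seata_account;

Then create the tables t_account, t_order and t_storage in the corresponding databases, and create an undo_log rollback-log table in each database
```

### 30.4、Preparing the order/storage/account microservices

```
1、Create the order module, seata-order-service2001
2、Create the storage module, seata-storage-service2002
3、Create the account module, seata-account-service2003
```

Then create the classes needed for testing in 2001: the entity classes Order and CommonResult, the corresponding OrderDao and the business class OrderService. Seata proxies the data source, and Feign is used to call the other two services. The StorageService class:

```java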
package com.learn.springcloud.service;

import com.learn.springcloud.domain.CommonResult;
import org.springframework.cloud.openfeign.FeignClient;
import org.springframework.web.bind.annotation.PostMapping;
import org.springframework.web.bind.annotation.RequestParam;

/**
 * Storage service. It is a microservice and is called through feign.
 * @ClassName: StorageService
 * @Description:
 * @Author: lin
 * @Date: 2020/8/26 15:46
 * @History:
 * @ 1.0
 */
@FeignClient(value = "seata-storage-service")
public interface StorageService {

    /**
     * Feign looks up the microservice seata-storage-service and calls this
     * endpoint on it to deduct stock.
     * @param productId id of the product
     * @param count quantity to deduct
     * @return the result of the call
     */
    @PostMapping("/storage/decrease")
    CommonResult decrease(@RequestParam("productId") Long productId, @RequestParam("count") Integer count);
}
```

The AccountService class

```java
package com.learn.springcloud.service;

import com.learn.springcloud.domain.CommonResult;
import org.springframework.cloud.openfeign.FeignClient;
import org.springframework.web.bind.annotation.PostMapping;
import org.springframework.web.bind.annotation.RequestMapping;
import org.springframework.web.bind.annotation.RequestParam;

import java.math.BigDecimal;

/**
 * Account service, called through Feign.
 * @ClassName: AccountService
 * @Description:
 * @Author: lin
 * @Date: 2020/8/26 15:46
 * @History:
 * @ 1.0
 */
@FeignClient(value = "seata-account-service")
public interface AccountService {

    /**
     * Feign looks up the microservice seata-account-service and sends a post request
     * to this endpoint to deduct the account balance.
     * @param userId user id
     * @param money amount to deduct
     * @return the result of the call
     */
    @PostMapping("/account/decrease")
    CommonResult decrease(@RequestParam("userId") Long userId, @RequestParam("money")BigDecimal money);
}
```

Then add the dependencies to modules 2002 and 2003 and write the related business code. 2002 is the storage module and 2003 is the account balance module. Note: when starting the modules, the transaction group in the project's yml must match the one in the seata-server configuration file; otherwise startup fails with

```
no available server to connect
```

seata-server-0.9.0 is used here. In file.conf, change vgroup_mapping.prex_tx_group="default", where prex_tx_group is a name you define yourself. If db is used for storage, also change the database user and password in the db section.

```
transport {
  # tcp udt unix-domain-socket
  type = "TCP"
  #NIO NATIVE
  server = "NIO"
  #enable heartbeat
  heartbeat = true
  #thread factory for netty
  thread-factory {
    boss-thread-prefix = "NettyBoss"
    worker-thread-prefix = "NettyServerNIOWorker"
    server-executor-thread-prefix = "NettyServerBizHandler"
    share-boss-worker = false
    client-selector-thread-prefix = "NettyClientSelector"
    client-selector-thread-size = 1
    client-worker-thread-prefix = "NettyClientWorkerThread"
    # netty boss thread size,will not be used for UDT
    boss-thread-size = 1
    #auto default pin or 8
    worker-thread-size = 8
  }
  shutdown {
    # when destroy server, wait seconds
    wait = 3
  }
  serialization = "seata"
  compressor = "none"
}

service {
  #vgroup->rgroup
  # transaction group name
  vgroup_mapping.prex_tx_group="default"
  #only support single node
  default.grouplist = "127.0.0.1:8091"
  #degrade current not support
  enableDegrade = false
  #disable
  disable = false
  #unit ms,s,m,h,d represents milliseconds, seconds, minutes, hours, days, default permanent
  max.commit.retry.timeout = "-1"
  max.rollback.retry.timeout = "-1"
}

client {
  async.commit.buffer.limit = 10000
  lock {
    retry.internal = 10
    retry.times = 30
  }
  report.retry.count = 5
  tm.commit.retry.count = 1
  tm.rollback.retry.count = 1
}

## transaction log store
store {
  ## store mode: file、db
  mode = "db"

  ## file store
  file {
    dir = "sessionStore"

    # branch session size , if exceeded first try compress lockkey, still exceeded throws exceptions
    max-branch-session-size = 16384
    # globe session size , if exceeded throws exceptions
    max-global-session-size = 512
    # file buffer size , if exceeded allocate new buffer
    file-write-buffer-cache-size = 16384
    # when recover batch read size
    session.reload.read_size = 100
    # async, sync
    flush-disk-mode = async
  }

  ## database store
  db {
    ## the implement of javax.sql.DataSource, such as DruidDataSource(druid)/BasicDataSource(dbcp) etc.
    datasource = "dbcp"
    ## mysql/oracle/h2/oceanbase etc.
    db-type = "mysql"
    driver-class-name = "com.mysql.jdbc.Driver"
    url = "jdbc:mysql://127.0.0.1:3306/seata"
    user = "root"
    password = "123"
    min-conn = 1
    max-conn = 3
    global.table = "global_table"
    branch.table = "branch_table"
    lock-table = "lock_table"
    query-limit = 100
  }
}

lock {
  ## the lock store mode: local, remote
  mode = "remote"

  local {
    ## store locks in user's database
  }

  remote {
    ## store locks in the seata's server
  }
}

recovery {
  #schedule committing retry period in milliseconds
  committing-retry-period = 1000
  #schedule asyn committing retry period in milliseconds
  asyn-committing-retry-period = 1000
  #schedule rollbacking retry period in milliseconds
  rollbacking-retry-period = 1000
  #schedule timeout retry period in milliseconds
  timeout-retry-period = 1000
}

transaction {
  undo.data.validation = true
  undo.log.serialization = "jackson"
  undo.log.save.days = 7
  #schedule delete expired undo_log in milliseconds
  undo.log.delete.period = 86400000
  undo.log.table = "undo_log"
}

## metrics settings
metrics {
  enabled = false
  registry-type = "compact"
  # multi exporters use comma divided
  exporter-list = "prometheus"
  exporter-prometheus-port = 9898
}

support {
  ## spring
  spring {
    # auto proxy the DataSource bean
    datasource.autoproxy = false
  }
}
```

Edit the registry.conf file; nacos is used as the registry here.

```
registry {
  # file, nacos, eureka, redis, zk, consul, etcd3, sofa
  type = "nacos"

  nacos {
    serverAddr = "localhost:8848"
    namespace = ""
    cluster = "default"
  }
  eureka {
    serviceUrl = "http://localhost:8761/eureka"
    application = "default"
    weight = "1"
  }
  redis {
    serverAddr = "localhost:6379"
    db = "0"
  }
  zk {
    cluster = "default"
    serverAddr = "127.0.0.1:2181"
    session.timeout = 6000
    connect.timeout = 2000
  }
  consul {
    cluster = "default"
    serverAddr = "127.0.0.1:8500"
  }
  etcd3 {
    cluster = "default"
    serverAddr = "http://localhost:2379"
  }
  sofa {
    serverAddr = "127.0.0.1:9603"
    application = "default"
    region = "DEFAULT_ZONE"
    datacenter = "DefaultDataCenter"
    cluster = "default"
    group = "SEATA_GROUP"
    addressWaitTime = "3000"
  }
  file {
    name = "file.conf"
  }
}

config {
  # file, nacos, apollo, zk, consul, etcd3
  type = "file"

  nacos {
    serverAddr = "nacos"
    namespace = ""
  }
  consul {
    serverAddr = "127.0.0.1:8500"
  }
  apollo {
    app.id = "seata-server"
    apollo.meta = "http://192.168.1.204:8801"
  }
  zk {
    serverAddr = "127.0.0.1:2181"
    session.timeout = 6000
    connect.timeout = 2000
  }
  etcd3 {
    serverAddr = "http://localhost:2379"
  }
  file {
    name = "file.conf"
  }
}
```

The matching application.yml in the project is shown below. The value tx-service-group: prex_tx_group must match the one configured in seata-server, and all three projects use the same tx-service-group.

```yaml
server:
  port: 2001

spring:
  application:
    name: seata-order-service
  cloud:
    alibaba:
      seata:
        # the custom transaction group name must match the one in seata-server
        tx-service-group: prex_tx_group
    nacos:
      discovery:
        server-addr: 127.0.0.1:8848
  datasource:
    # data source type
    type: com.alibaba.druid.pool.DruidDataSource
    # MySQL driver class
    driver-class-name: com.mysql.cj.jdbc.Driver
    url: jdbc:mysql://localhost:3306/seata_order?useUnicode=true&characterEncoding=UTF-8&useSSL=false&serverTimezone=GMT%2B8
    username: root
    password: 123

feign:
  hystrix:
    enabled: false

logging:
  level:
    io:
      seata: info

mybatis:
  mapper-locations: classpath*:mapper/*.xml
```

The nacos-config.txt file can be left unchanged here; modifying it also works.

![img](image/seata-nacos-config-txt-01.png)

After the project starts, the registration information is printed in the console.

![img](image/seata-tx-group-registry-order2001-01.png)

In the nacos registry's service list, all three test modules are registered. The additional entry named serverAddr is the seata server registered in nacos.

![img](image/seata-registry-nacos-service-01.png)

Send the test request http://localhost:2001/order/create?userId=1&productId=1&count=10&money=100 to create an order.

![img](image/seata-nacos-order-create-01.png)

The order row has been created in the database.

![img](image/seata-order-create-02.png)

The stock has been deducted.

![img](image/seata-storage-decrease-create-02.png)

The account balance has been reduced as well.

![img](image/seata-account-decrease-create-02.png)

### 30.5、Simulating a timeout without the @GlobalTransactional annotation; the timeout is simulated in the Account module.

```java
package com.learn.springcloud.service.impl;

import com.learn.springcloud.dao.AccountDao;
import com.learn.springcloud.service.AccountService;
import org.slf4j.Logger;
import
org.slf4j.LoggerFactory;
import org.springframework.stereotype.Service;

import javax.annotation.Resource;
import java.math.BigDecimal;
import java.util.concurrent.TimeUnit;

/**
 * @ClassName: AccountServiceImpl
 * @Description:
 * @Author: lin
 * @Date: 2020/8/26 17:17
 * History:
 * @ 1.0
 */
@Service
public class AccountServiceImpl implements AccountService {

    private static final Logger LOGGER = LoggerFactory.getLogger(AccountServiceImpl.class);

    @Resource
    private AccountDao accountDao;

    @Override
    public void decrease(Long userId, BigDecimal money) {
        LOGGER.info("------>account-service: balance deduction started");
        // Simulate a timeout exception so that the global transaction rolls back
        try {
            // Pause for 20 seconds
            TimeUnit.SECONDS.sleep(20);
        } catch (InterruptedException e) {
            // Restore the interrupt flag instead of swallowing the interruption
            Thread.currentThread().interrupt();
            LOGGER.warn("sleep interrupted", e);
        }
        accountDao.decrease(userId, money);
        LOGGER.info("------>account-service: balance deduction finished");
    }
}
```

After restarting the account module, request http://localhost:2001/order/create?userId=1&productId=1&count=10&money=100 again.

![img](image/seata-test-account-request-timeout-01.png)

In the order database the order row has been inserted, but its status is 0; 0 means unpaid and 1 means paid.

![img](image/seata-test-order-timeout-01.png)

The stock, however, has already been deducted, so the data no longer matches. Because the payment did not succeed, everything should have been rolled back and the stock should not have been reduced.

![img](image/seata-test-storage-timeout-01.png)

The account balance has been deducted as well.

![img](image/seata-test-accout-timeout-01.png)

The problems in this situation:

```
1、Stock and account balance were both deducted, but the order status was never set to completed (it was not changed from 0 to 1).
2、Because of Feign's timeout retry mechanism, the account balance may even be deducted more than once.
```

For requests like this you need to understand the call chain between the services; that makes problems much faster to locate and fix.

### 30.6、Timeout exception with the @GlobalTransactional annotation added to the service implementation method in the order module.

```java
package com.learn.springcloud.service.impl;

import com.learn.springcloud.dao.OrderDao;
import com.learn.springcloud.domain.Order;
import com.learn.springcloud.service.AccountService;
import com.learn.springcloud.service.OrderService;
import com.learn.springcloud.service.StorageService;
import io.seata.spring.annotation.GlobalTransactional;
import lombok.extern.slf4j.Slf4j;
import org.springframework.stereotype.Service;

import javax.annotation.Resource;

/**
 * @ClassName: OrderServiceImpl
 * @Description:
 * @Author: lin
 * @Date: 2020/8/26 15:47
 * History:
 * @ 1.0
 */
@Service
@Slf4j
public class
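OrderServiceImpl implements OrderService {

    @Resource
    private OrderDao orderDao;

    @Resource
    private AccountService accountService;

    @Resource
    private StorageService storageService;

    /**
     * Create order -> call the storage service to deduct stock -> call the account service to deduct the balance -> update the order status.
     * In short: place order -> deduct stock -> deduct balance -> change status.
     * GlobalTransactional makes seata start a distributed transaction and roll back on exceptions; the name only needs to be unique.
     * @param order the order object
     */
    @Override
    @GlobalTransactional(name = "pre_order_service", rollbackFor = Exception.class)
    public void create(Order order) {
        // 1. Create the order
        log.info("------->start creating the order");
        orderDao.create(order);

        // 2. Deduct stock
        log.info("------->order service calls storage service to deduct count");
        storageService.decrease(order.getProductId(), order.getCount());
        log.info("------->order service call to storage service finished");

        // 3. Deduct the account balance
        log.info("------->order service calls account service to deduct money");
        accountService.decrease(order.getUserId(), order.getMoney());
        log.info("------->order service call to account service finished");

        // 4. Change the order status from 0 to 1 (1 means completed). The 0 passed here is the
        //    current status used, together with the user id, as the update condition.
        log.info("------->start updating the order status");
        orderDao.update(order.getUserId(), 0);
        log.info("------->finished updating the order status");

        log.info("----->order placed");
    }
}
```

Restart and check whether the seata global transaction takes effect. Send the same failing request as before, http://localhost:2001/order/create?userId=1&productId=1&count=10&money=100, and see whether seata rolls the transaction back. After the request, check the order table: no row was inserted at all, because the transaction was rolled back and the write was never committed.

![img](image/seata-test-global-transcation-order-01.png)

With a manually triggered exception you can see the difference: in the normal case the log shows phaseTwo_Committed, while an exception causes a rollback in phase two.

![image](image/seata-rollback-01.png)

### Summary: seata

1、TC: the global transaction coordinator (the Seata server).
2、TM: the transaction initiator, that is, the method annotated with @GlobalTransactional.
3、RM: the transaction participant; it can be thought of as a database (for example the order, storage and account databases above).

The TC coordinates across the different databases by means of the global transaction id.

Execution flow of a distributed transaction:

1、The TM begins the distributed transaction (the TM registers a global transaction record with the TC).
2、Following the business scenario, the resources inside the transaction (databases, services and so on) are orchestrated (each RM reports its resource preparation status to the TC).
3、The TM ends the distributed transaction and phase one is over (the TM tells the TC to commit or roll back the distributed transaction).
4、The TC aggregates the transaction information and decides whether the distributed transaction commits or rolls back.
5、The TC tells all RMs to commit or roll back their resources and phase two is over.

How AT mode avoids intruding on business code

AT mode is an evolution of the two-phase commit protocol.
Phase one: the business data and the rollback log record are committed in the same local transaction, and local locks and connection resources are released.
Phase two: commit is asynchronous and completes very quickly; rollback performs reverse compensation using the rollback log written in phase one.

Phase-one loading: in phase one, Seata intercepts the "business SQL" and

1、parses the SQL to find the business data that the "business SQL" will update, and saves that data as the "before image" before it is updated;
2、executes the "business SQL" to update the business data;
3、after the update, saves the data as the "after image" and finally generates a row lock.

All of these operations are done inside one database transaction, which guarantees the atomicity of phase one. Phase-one loading is illustrated in the figure below.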
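The phase-one steps described above can be sketched with a small in-memory model. This is only an illustration under stated assumptions: `AtPhaseOneSketch`, `UndoRecord`, `insertRow`, `decrease` and `isLocked` are hypothetical names, the "table" is a plain map, and none of this is Seata's real API or implementation.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

/** Hypothetical in-memory sketch of AT-mode phase one; not Seata's real API. */
public class AtPhaseOneSketch {

    /** Undo record holding the row snapshots taken around the business update. */
    public static final class UndoRecord {
        public final long rowId;
        public final Map<String, Integer> beforeImage;
        public final Map<String, Integer> afterImage;

        UndoRecord(long rowId, Map<String, Integer> beforeImage, Map<String, Integer> afterImage) {
            this.rowId = rowId;
            this.beforeImage = beforeImage;
            this.afterImage = afterImage;
        }
    }

    // Stand-ins for a business table and the row-lock table (assumptions for illustration).
    private final Map<Long, Map<String, Integer>> table = new HashMap<>();
    private final Set<Long> lockedRows = new HashSet<>();

    public void insertRow(long id, int used, int residue) {
        Map<String, Integer> row = new HashMap<>();
        row.put("used", used);
        row.put("residue", residue);
        table.put(id, row);
    }

    /** Phase one for an update that moves `count` units from residue to used. */
    public UndoRecord decrease(long id, int count) {
        Map<String, Integer> row = table.get(id);
        // 1. Snapshot the row before the business update ("before image").
        Map<String, Integer> before = new HashMap<>(row);
        // 2. Apply the business update.
        row.put("used", row.get("used") + count);
        row.put("residue", row.get("residue") - count);
        // 3. Snapshot the row after the update ("after image") and take the row lock.
        Map<String, Integer> after = new HashMap<>(row);
        lockedRows.add(id);
        return new UndoRecord(id, before, after);
    }

    public boolean isLocked(long id) {
        return lockedRows.contains(id);
    }

    public static void main(String[] args) {
        AtPhaseOneSketch db = new AtPhaseOneSketch();
        db.insertRow(1L, 30, 70);
        UndoRecord undo = db.decrease(1L, 10);
        System.out.println("before=" + undo.beforeImage + " after=" + undo.afterImage
                + " locked=" + db.isLocked(1L));
    }
}
```

In a real database all three steps run inside one local transaction; the sketch only shows the order in which the snapshots and the lock are produced.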
![img](image/seata-at-first-load-01.png)

If phase two is a normal commit, the "business SQL" was already committed to the database in phase one, so the seata framework only has to delete the snapshot data and row locks saved in phase one to finish the cleanup.

Phase-two commit:

![img](image/seata-at-second-commit-01.png)

Phase-two rollback: if phase two is a rollback, seata has to undo the "business SQL" that was executed in phase one and restore the business data (reverse compensation). It restores the data from the "before image", but it must first check for dirty writes by comparing the current business data in the database with the "after image". If the two are identical there is no dirty write and the data can be restored; if they differ there is a dirty write, which has to be handled manually.

Phase-two rollback:

![img](image/seata-at-second-rollback-01.png)

The seata database tables: global_table, branch_table and lock_table.

![img](image/seata-data-table-01.png)

Now remove the simulated timeout from the AccountServiceImpl class and inspect the data in the database while debugging.

![img](image/seata-debug-request-account-01.png)

Of the three tables in the seata database, global_table contains the following data:

```
xid:192.168.199.116:8091:2052305277
transaction_id:2052305277
application_id:seata-order-service
transaction_service_group:prex_tx_group
transaction_name:pre_order_service
```

![img](image/seata-global-table-data-01.png)

branch_table uses the same global transaction id (xid) as global_table, and every branch has its own branch_id.

```
Each branch id corresponds to a different service (order, storage, account), one to one.
branch_id:
2052305279
2052305282
2052305285
```

![img](image/seata-branch-table-data-01.png)

Rows were inserted into lock_table as well, which is the row-level locking.

![img](image/seata-lock-table-data-01.png)

The undo_log table can be inspected in the order, storage and account databases. Its rollback_info column holds the before image and after image. In MySQL this column is a blob, so it has to be converted to be readable. This is the undo_log in the order database:

select CONVERT(rollback_info USING utf8 ) from undo_log;

```json
{
  "@class": "io.seata.rm.datasource.undo.BranchUndoLog",
  "xid": "192.168.199.116:8091:2052305277",
  "branchId": 2052305279,
  "sqlUndoLogs": [
    "java.util.ArrayList",
    [
      {
        "@class": "io.seata.rm.datasource.undo.SQLUndoLog",
        "sqlType": "INSERT",
        "tableName": "t_order",
        "beforeImage": {
          "@class": "io.seata.rm.datasource.sql.struct.TableRecords$EmptyTableRecords",
          "tableName": "t_order",
          "rows": ["java.util.ArrayList", []]
        },
        "afterImage": {
          "@class": "io.seata.rm.datasource.sql.struct.TableRecords",
          "tableName": "t_order",
          "rows": [
            "java.util.ArrayList",
            [
              {
                "@class": "io.seata.rm.datasource.sql.struct.Row",
                "fields": [
                  "java.util.ArrayList",
                  [
                    {
                      "@class":
                      "io.seata.rm.datasource.sql.struct.Field",
                      "name": "id", "keyType": "PrimaryKey", "type": -5, "value": ["java.lang.Long", 6]
                    },
                    {"@class": "io.seata.rm.datasource.sql.struct.Field", "name": "user_id", "keyType": "NULL", "type": -5, "value": ["java.lang.Long", 1]},
                    {"@class": "io.seata.rm.datasource.sql.struct.Field", "name": "product_id", "keyType": "NULL", "type": -5, "value": ["java.lang.Long", 1]},
                    {"@class": "io.seata.rm.datasource.sql.struct.Field", "name": "count", "keyType": "NULL", "type": 4, "value": 10},
                    {"@class": "io.seata.rm.datasource.sql.struct.Field", "name": "money", "keyType": "NULL", "type": 3, "value": ["java.math.BigDecimal", 100]},
                    {"@class": "io.seata.rm.datasource.sql.struct.Field", "name": "status", "keyType": "NULL", "type": 4, "value": 0}
                  ]
                ]
              }
            ]
          ]
        }
      }
    ]
  ]
}
```

![img](image/seata-order-undo-log-table-data-01.png)

Now the undo_log data in the storage database:

````json
{
  "@class": "io.seata.rm.datasource.undo.BranchUndoLog",
  "xid": "192.168.199.116:8091:2052305277",
  "branchId": 2052305282,
  "sqlUndoLogs": [
    "java.util.ArrayList",
    [
      {
        "@class": "io.seata.rm.datasource.undo.SQLUndoLog",
        "sqlType": "UPDATE",
        "tableName": "t_storage",
        "beforeImage": {
          "@class": "io.seata.rm.datasource.sql.struct.TableRecords",
          "tableName": "t_storage",
          "rows": [
            "java.util.ArrayList",
            [
              {
                "@class": "io.seata.rm.datasource.sql.struct.Row",
                "fields": [
                  "java.util.ArrayList",
                  [
                    {"@class": "io.seata.rm.datasource.sql.struct.Field", "name": "id", "keyType": "PrimaryKey", "type": -5, "value": ["java.lang.Long", 1]},
                    {"@class": "io.seata.rm.datasource.sql.struct.Field", "name": "used", "keyType": "NULL", "type": 4, "value": 30},
                    {"@class": "io.seata.rm.datasource.sql.struct.Field", "name": "residue", "keyType": "NULL", "type": 4, "value": 70}
                  ]
                ]
              }
            ]
          ]
        },
        "afterImage": {
          "@class": "io.seata.rm.datasource.sql.struct.TableRecords",
          "tableName": "t_storage",
          "rows": [
            "java.util.ArrayList",
            [
              {
                "@class": "io.seata.rm.datasource.sql.struct.Row",
                "fields": [
                  "java.util.ArrayList",
                  [
                    {"@class": "io.seata.rm.datasource.sql.struct.Field", "name": "id", "keyType": "PrimaryKey", "type": -5, "value": ["java.lang.Long", 1]},
                    {"@class": "io.seata.rm.datasource.sql.struct.Field", "name": "used", "keyType": "NULL", "type": 4, "value": 40},
                    {"@class": "io.seata.rm.datasource.sql.struct.Field", "name": "residue", "keyType": "NULL", "type": 4, "value": 60}
                  ]
                ]
              }
            ]
          ]
        }
      }
    ]
  ]
}
````

The undo_log data in the account database:

```json
{
  "@class": "io.seata.rm.datasource.undo.BranchUndoLog",
  "xid": "192.168.199.116:8091:2052305277",
  "branchId": 2052305285,
  "sqlUndoLogs": [
    "java.util.ArrayList",
    [
      {
        "@class": "io.seata.rm.datasource.undo.SQLUndoLog",
        "sqlType": "UPDATE",
        "tableName": "t_account",
        "beforeImage": {
          "@class": "io.seata.rm.datasource.sql.struct.TableRecords",
          "tableName": "t_account",
          "rows": [
            "java.util.ArrayList",
            [
              {
                "@class": "io.seata.rm.datasource.sql.struct.Row",
                "fields": [
                  "java.util.ArrayList",
                  [
                    {"@class": "io.seata.rm.datasource.sql.struct.Field", "name": "id", "keyType": "PrimaryKey", "type": -5, "value": ["java.lang.Long", 1]},
                    {"@class": "io.seata.rm.datasource.sql.struct.Field", "name": "used", "keyType": "NULL", "type": 3, "value": ["java.math.BigDecimal", 300]},
                    {"@class": "io.seata.rm.datasource.sql.struct.Field", "name": "residue", "keyType": "NULL", "type": 3, "value": ["java.math.BigDecimal", 700]}
                  ]
                ]
              }
            ]
          ]
        },
        "afterImage": {
          "@class": "io.seata.rm.datasource.sql.struct.TableRecords",
          "tableName": "t_account",
          "rows": [
            "java.util.ArrayList",
            [
              {
                "@class": "io.seata.rm.datasource.sql.struct.Row",
                "fields": [
                  "java.util.ArrayList",
                  [
                    {"@class": "io.seata.rm.datasource.sql.struct.Field", "name": "id", "keyType": "PrimaryKey", "type": -5, "value": ["java.lang.Long", 1]},
                    {"@class": "io.seata.rm.datasource.sql.struct.Field", "name": "used", "keyType": "NULL", "type": 3, "value": ["java.math.BigDecimal", 400]},
                    {
                      "@class": "io.seata.rm.datasource.sql.struct.Field", "name": "residue", "keyType":
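                      "NULL", "type": 3, "value": ["java.math.BigDecimal", 600]
                    }
                  ]
                ]
              }
            ]
          ]
        }
      }
    ]
  ]
}
```

The data above shows that the phase-two commit is asynchronous. If an error occurs, the phase-one log is used for reverse compensation: the current data is checked against the after image for dirty writes and then restored from the before image. Once the code continues running, the rows in the three tables of the seata database have been deleted.

![img](image/seata-branch-table-data-02.png)

The undo_log rows in the business databases have been deleted as well; only the undo_log in the account database still exists. According to the official description of phase two, branch commit requests are processed by an asynchronous task that deletes the corresponding UNDO LOG records asynchronously and in batches, which presumably explains why one record can still be present for a short while.

![img](image/seata-account-database-undo-log-01.png)

In summary, the operations either all succeed or all fail.

## 31、Globally unique distributed IDs and the business requirements for distributed IDs

Hard requirements for ID generation rules:

```
1、Globally unique: no duplicate IDs may occur. Since the ID is a unique identifier, this is the most basic requirement.
2、Trending upward: MySQL's InnoDB engine uses a clustered index, and most RDBMSs store data in a B-tree structure, so an ordered primary key should be preferred to keep write performance high.
3、Monotonically increasing: the next ID is guaranteed to be greater than the previous one. Needed for special cases such as transaction version numbers, incremental IM messages and sorting.
4、Information security: if IDs are consecutive, scraping is trivial for a malicious user, who can simply download the URLs in sequence. For order numbers it is even more dangerous, because a competitor can read off our daily order volume directly. Some scenarios therefore need irregular IDs that are hard to guess.
5、Contains a timestamp: this makes it easy to see during development when a distributed ID was generated.
```

Availability requirements for an ID generation system:

High availability: when a request for a distributed ID is sent, the server must return a unique distributed ID in 99.999% of cases.
Low latency: when a request for a distributed ID is sent, the server must respond very quickly.
High QPS: if 100,000 requests for distributed IDs arrive at once, the server must withstand the load and create all 100,000 IDs within a very short time.

Evolution of distributed ID generation

1、Using UUID

Considering uniqueness only: the standard form of a UUID (Universally Unique Identifier) consists of 32 hexadecimal digits split by hyphens into 5 groups, 36 characters in the form 8-4-4-4-12. The advantage is that it is generated locally with very high performance and no network cost. However, unordered UUIDs degrade insert performance. Why?

a、Unordered: the generation order cannot be predicted and no increasing, ordered numbers can be produced. A distributed ID is usually used as the primary key, and MySQL officially recommends keeping primary keys as short as possible. Every UUID is long, so it is not recommended.
b、Primary key: using the ID as a primary key causes problems in certain environments. As a DB primary key a UUID is a poor fit, because MySQL explicitly recommends short primary keys and a 36-character UUID does not meet that.
c、Index: B+ tree page splits. The distributed ID is the primary key, the primary key carries an index, and MySQL implements indexes with B+ trees. Every inserted row modifies the B+ tree underneath the index. Because UUIDs are unordered, each insert causes large changes to the primary key's B+ tree. Completely unordered inserts not only split intermediate nodes but also create many partially filled nodes, which greatly reduces insert performance.

2、Using database auto-increment primary keys

a、Single machine: the database auto-increment ID mechanism is implemented with MySQL's REPLACE INTO. REPLACE INTO works like INSERT, with one difference: it first tries to insert the row; if a row with the same primary key or unique index already exists, that row is deleted and the new one is inserted, otherwise the new row is inserted directly. In other words, REPLACE INTO inserts a record and replaces the old data when a unique index value conflicts.
b、Distributed cluster: is the database auto-increment mechanism suitable for distributed IDs? Not really.
a、Horizontal scaling is difficult. Once the step size and the number of machines are fixed, what happens when a machine has to be added? Suppose a single machine issues 1,2,3,4,5 (step 1) and one more machine is needed. The second machine's initial value could be set far ahead of the first, which seems fine, but now imagine 100 machines in production that need to be scaled out. That is a nightmare, so horizontal scaling is complex and hard to implement.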
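b、The load on the database is still high. Every ID requires a read and a write against the database, which hurts performance badly and violates the low-latency and high-QPS requirements for distributed IDs (under high concurrency, fetching every ID from the database is very costly).

3、Redis-based distributed IDs

a、In a redis cluster, different increment steps have to be configured just as with MySQL, and the key must be given an expiry time. A redis cluster can be used to get higher throughput. Suppose a cluster has 5 redis instances. Initialize them with 1, 2, 3, 4 and 5 and use a step of 5 for each. The IDs generated by each instance are:

1,6,11,16,21
2,7,12,17,22
3,8,13,18,23
4,9,14,19,24
5,10,15,20,25

The drawback is that it is troublesome to maintain: a redis single point of failure has to be avoided, sentinels have to be set up, and you have to decide what to do when one machine goes down.

4、Using Twitter's distributed auto-increment ID algorithm, snowflake

In tests, Twitter's SnowFlake algorithm generates about 260,000 sortable, increasing IDs per second.

1、IDs generated by SnowFlake are ordered by time.
2、The result is a 64-bit integer, one Long (at most 19 characters when converted to a string).
3、No ID collisions occur inside the distributed system (datacenterId and workerId tell the nodes apart), and it is efficient.

Distributed systems have a number of scenarios that need unique IDs. The basic requirements are:

a、The ID must be globally unique in the distributed environment.
b、It usually has to be monotonically increasing, because the ID is normally stored in a database. InnoDB stores the row data in the leaf nodes of the primary key index tree, so for database performance the generated ID should preferably be increasing.

A 36-character UUID could be used to prevent ID conflicts, but UUID has drawbacks, first of all that it is unordered.

Structure:

![img](image/snowflake-structure-01.png)

【1】Part one: a 1-bit flag that is always 0. It is the sign bit (0 for positive, 1 for negative); IDs are positive, so it has no practical use here. In binary the highest bit is the sign bit.
【2】Part two: the timestamp takes 41 bits with millisecond precision and covers about 69 years. 41 bits can represent the values 0 to 2^41-1 (non-negative integers include 0, which is why the upper bound is 2^41-1). So 41 bits hold up to 2^41-1 milliseconds, which in years is (2^41-1) / (1000*60*60*24*365) ≈ 69 years.
【3】Part three: the worker machine id takes 10 bits. The high 5 bits are the data center ID (datacenterId) and the low 5 bits are the worker node ID (workerId), for at most 1024 nodes. If the machines need to be split by IDC, the 10 bits can be divided into 5 bits for the IDC and 5 bits for the worker, giving 32 IDCs with 32 machines each; this can be adapted to your own needs. The largest value of a 5-bit field is 2^5-1=31, so the 32 numbers 0, 1, 2, 3...31 identify the different datacenterId or workerId values.
【4】Part four: the sequence number takes 12 bits. It counts up from 0 within the same millisecond on the same node and can reach 4095, so a 12-bit sequence gives 2^12 = 4096 IDs.

How many globally unique IDs can SnowFlake generate in one millisecond? A simple multiplication: IDs per millisecond = 1024 X 4096 = 4194304 across all nodes. This is enough for the vast majority of concurrency scenarios.

Add a class to test the snowflake algorithm; this implementation is open source.

```java
/**
 * Twitter_Snowflake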
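 * The SnowFlake structure is as follows (parts separated by -):
 * 0 - 0000000000 0000000000 0000000000 0000000000 0 - 00000 - 00000 - 000000000000
 * 1-bit flag: long is a signed type in Java and the highest bit is the sign bit (0 for positive, 1 for negative); ids are positive, so the highest bit is 0.
 * 41-bit timestamp (milliseconds). It does not store the current timestamp but the difference
 * (current timestamp - start timestamp). The start timestamp is usually the moment the id generator
 * went into use and is set by the program (the startTimestamp field of this class).
 * A 41-bit timestamp lasts 69 years: T = (1L << 41) / (1000L * 60 * 60 * 24 * 365) = 69
 * 10-bit machine part: supports deployment on 1024 nodes, made up of a 5-bit datacenterId and a 5-bit workerId.
 * 12-bit sequence: a counter within the millisecond; it allows each node to produce 4096 ids per millisecond (same machine, same timestamp).
 * Together this is exactly 64 bits, one Long.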
 * The advantages of SnowFlake: ids are sorted by time overall, no id collisions occur across the distributed system
 * (data center ID and machine ID tell the nodes apart), and it is efficient. In tests SnowFlake generates about 260,000 ids per second.
 * @ClassName: SnowFlakeIdWorker
 * @Description:
 * @Author: lin
 * @Date: 2020/8/29 12:54
 * History:
 * @ 1.0
 */
public class SnowFlakeIdWorker {

    /** Worker machine ID (0~31) */
    private final long workerId;

    /** Data center ID (0~31) */
    private final long dataCenterId;

    /** Sequence within the millisecond (0~4095) */
    private long sequence = 0L;

    /** Timestamp of the last generated id */
    private long lastTimeStamp = -1L;

    /** Start timestamp (2020-08-29) */
    private final static long startTimestamp = 1598676988000L;

    //*************** Number of bits used by each part ***************

    /** Number of bits for the machine id */
    private final static long workerIdBits = 5L;

    /** Number of bits for the data center id */
    private final static long dataCenterIdBits = 5L;

    /** Number of bits for the sequence */
    private final static long sequenceBit = 12L;

    //*************** Maximum value of each part ***************

    /** Largest supported machine id, 31 (this shift trick quickly computes the largest value that fits in n bits) */
    private final long maxWorkerId = ~(-1L << workerIdBits);

    /** Largest supported data center id, 31 */
    private final long maxDataCenterId = ~(-1L << dataCenterIdBits);

    /** Mask for the sequence, 4095 here (0b111111111111=0xfff=4095) */
    private final long sequenceMask = ~(-1L << sequenceBit);

    //*************** Left shift of each part ***************

    /** The machine ID is shifted left by 12 bits */
    private final long workerIdShift = sequenceBit;

    /** The data center id is shifted left by 17 bits (12+5) */
    private final long dataCenterIdShift = sequenceBit + workerIdBits;

    /** The timestamp is shifted left by 22 bits (5+5+12) */
    private final long timeStampLeftShift = sequenceBit + workerIdBits + dataCenterIdBits;

    /**
     * Constructor
     * @param workerId worker ID (0~31)
     * @param dataCenterId data center ID (0~31)
     */
    public SnowFlakeIdWorker(long workerId, long dataCenterId) {
        if (workerId > maxWorkerId || workerId < 0) {
            throw new IllegalArgumentException(String.format("worker Id " + "can't be greater than %d or less than 0", maxWorkerId));
        }
        if (dataCenterId > maxDataCenterId || dataCenterId < 0) {
            throw new IllegalArgumentException(String.format("dataCenter Id" + " can't be greater than %d or less than 0",
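                    maxDataCenterId));
        }
        this.workerId = workerId;
        this.dataCenterId = dataCenterId;
    }

    /**
     * Returns the next ID (this method is thread-safe).
     * @return the next snowflake id
     */
    public synchronized long nextId() {
        long timeStamp = timeGen();

        // If the current time is earlier than the timestamp of the last id, the system clock moved backwards: throw
        if (timeStamp < lastTimeStamp) {
            throw new RuntimeException(
                    String.format("Clock moved backwards. Refusing to generate id for %d milliseconds", lastTimeStamp - timeStamp));
        }

        if (lastTimeStamp == timeStamp) {
            // Same millisecond: advance the sequence
            sequence = (sequence + 1) & sequenceMask;
            if (sequence == 0) {
                // Sequence exhausted: block until the next millisecond and take the new timestamp
                timeStamp = tilNextMillis(lastTimeStamp);
            }
        } else {
            // Timestamp changed: restart the sequence within the millisecond
            sequence = 0L;
        }

        // Remember the timestamp of this id
        lastTimeStamp = timeStamp;

        // Shift the parts and OR them together into the 64-bit id
        return ((timeStamp - startTimestamp) << timeStampLeftShift)
                | (dataCenterId << dataCenterIdShift)
                | (workerId << workerIdShift)
                | sequence;
    }

    /**
     * Blocks until the next millisecond, that is, until a new timestamp is obtained.
     *
     * @param lastTimestamp timestamp of the last generated id
     * @return current timestamp
     */
    protected long tilNextMillis(long lastTimestamp) {
        long timestamp = timeGen();
        while (timestamp <= lastTimestamp) {
            timestamp = timeGen();
        }
        return timestamp;
    }

    /**
     * Returns the current time in milliseconds.
     *
     * @return current time (milliseconds)
     */
    protected long timeGen() {
        return System.currentTimeMillis();
    }

    //==============================Test=============================================

    /** Test */
    public static void main(String[] args) {
        SnowFlakeIdWorker idWorker = new SnowFlakeIdWorker(0, 0);
        for (int i = 0; i < 1000; i++) {
            long id = idWorker.nextId();
            System.out.println(Long.toBinaryString(id) + "\t" + Long.toBinaryString(id).length());
            System.out.println(id);
        }
    }
}
```

Pros and cons of the snowflake algorithm

Pros: the milliseconds are in the high bits and the auto-increment sequence is in the low bits, so the whole ID trends upward. It does not depend on a database or any other third-party system; it is deployed as a service, which makes it more stable, and ID generation is very fast. The bits can be allocated according to your own business needs, which is very flexible.

Cons: it depends on the machine clock. If the clock is set back, duplicate IDs can be generated. On a single machine the IDs increase, but in a distributed environment the clocks of the machines cannot be perfectly synchronized, so the IDs are sometimes not globally increasing. (This drawback can usually be ignored: distributed IDs generally only need to trend upward, strict ordering is not required, and 90% of use cases only need the upward trend.)

Solutions to the clock problem include UidGenerator, Baidu's open-source distributed unique ID generator, and Leaf, the distributed ID generation system from Meituan-Dianping.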
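Because every field sits at a fixed bit position, an id can be split back into its parts by shifting and masking, which is handy when checking the timestamp requirement from the list at the start of this section. The following is a minimal sketch, assuming the 41/5/5/12 layout and the start timestamp 1598676988000L used by SnowFlakeIdWorker above; the class `SnowflakeIdDecoder` and its methods are hypothetical helpers for illustration and are not part of the original project.

```java
/** Hypothetical helper that splits a snowflake id into its fields (assumes the 41/5/5/12 layout). */
public class SnowflakeIdDecoder {

    private static final long SEQUENCE_BITS = 12L;
    private static final long WORKER_ID_BITS = 5L;
    private static final long DATA_CENTER_ID_BITS = 5L;

    private static final long WORKER_ID_SHIFT = SEQUENCE_BITS;
    private static final long DATA_CENTER_ID_SHIFT = SEQUENCE_BITS + WORKER_ID_BITS;
    private static final long TIMESTAMP_SHIFT = SEQUENCE_BITS + WORKER_ID_BITS + DATA_CENTER_ID_BITS;

    // Assumed to be the same epoch as startTimestamp in SnowFlakeIdWorker.
    private static final long START_TIMESTAMP = 1598676988000L;

    /** Builds an id from explicit parts; useful for checking the decoder. */
    public static long compose(long timestampMillis, long dataCenterId, long workerId, long sequence) {
        return ((timestampMillis - START_TIMESTAMP) << TIMESTAMP_SHIFT)
                | (dataCenterId << DATA_CENTER_ID_SHIFT)
                | (workerId << WORKER_ID_SHIFT)
                | sequence;
    }

    /** Absolute generation time in milliseconds since the Unix epoch. */
    public static long timestampOf(long id) {
        return (id >>> TIMESTAMP_SHIFT) + START_TIMESTAMP;
    }

    public static long dataCenterIdOf(long id) {
        return (id >>> DATA_CENTER_ID_SHIFT) & ~(-1L << DATA_CENTER_ID_BITS);
    }

    public static long workerIdOf(long id) {
        return (id >>> WORKER_ID_SHIFT) & ~(-1L << WORKER_ID_BITS);
    }

    public static long sequenceOf(long id) {
        return id & ~(-1L << SEQUENCE_BITS);
    }

    public static void main(String[] args) {
        // One second after the start timestamp, data center 3, worker 7, sequence 42.
        long id = compose(START_TIMESTAMP + 1000L, 3L, 7L, 42L);
        System.out.println("id=" + id
                + " timestamp=" + timestampOf(id)
                + " dataCenterId=" + dataCenterIdOf(id)
                + " workerId=" + workerIdOf(id)
                + " sequence=" + sequenceOf(id));
    }
}
```

Decoding only works if the decoder uses the same bit widths and start timestamp as the generator, so in a real project these constants would be shared rather than duplicated.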