果然好记性不如烂笔头，再简单的东西不记录下来总是会忘的！

本文首先会分析eureka中的缓存架构。并在此基础上优化服务之间的感知

Eureka-Client获取注册信息

eureka-client获取注册信息可分为两种，分别是全量获取和增量获取。

Eureka-Client 启动时，首先执行一次全量获取进行本地缓存注册信息，代码如下：

@Inject

    DiscoveryClient(ApplicationInfoManager applicationInfoManager, EurekaClientConfig config, AbstractDiscoveryClientOptionalArgs args,

                    Provider<BackupRegistry> backupRegistryProvider) {

                    if (clientConfig.shouldFetchRegistry() && !fetchRegistry(false)) {

            fetchRegistryFromBackup();

        }

     }

项目中配置

eureka.client.fetch-registry=true

便可以调用fetchRegistry方法，从eureka-server全量获取注册信息

Eureka-Client 启动时，还会初始化一个缓存刷新定时任务

private void initScheduledTasks() {

        if (clientConfig.shouldFetchRegistry()) {

            // registry cache refresh timer

            int registryFetchIntervalSeconds = clientConfig.getRegistryFetchIntervalSeconds();

            int expBackOffBound = clientConfig.getCacheRefreshExecutorExponentialBackOffBound();

            scheduler.schedule(

                    new TimedSupervisorTask(

                            "cacheRefresh",

                            scheduler,

                            cacheRefreshExecutor,

                            registryFetchIntervalSeconds,

                            TimeUnit.SECONDS,

                            expBackOffBound,

                            new CacheRefreshThread()

                    ),

                    registryFetchIntervalSeconds, TimeUnit.SECONDS);

        }

    }

每间隔 registryFetchIntervalSeconds(默认值是30) 秒执行一次CacheRefreshThread任务。CacheRefreshThread最终还是执行了fetchRegistry方法。

private boolean fetchRegistry(boolean forceFullRegistryFetch) {

        try {

            Applications applications = getApplications();

            if (clientConfig.shouldDisableDelta()

                    || (!Strings.isNullOrEmpty(clientConfig.getRegistryRefreshSingleVipAddress()))

                    || forceFullRegistryFetch

                    || (applications == null)

                    || (applications.getRegisteredApplications().size() == 0)

                    || (applications.getVersion() == -1)) //Client application does not have latest library supporting delta

            {

                getAndStoreFullRegistry();

            } else {

                getAndUpdateDelta(applications);

            }

            applications.setAppsHashCode(applications.getReconcileHashCode());

        } catch (Throwable e) {

            logger.error(PREFIX + appPathIdentifier + " - was unable to refresh its cache! status = " + e.getMessage(), e);

            return false;

        } finally {

            if (tracer != null) {

                tracer.stop();

            }

        }

        // Notify about cache refresh before updating the instance remote status

        onCacheRefreshed();

        // Update remote status based on refreshed data held in the cache

        updateInstanceRemoteStatus();

        // registry was fetched successfully, so return true

        return true;

    }

fetchRegistry首先判断是全量获取还是增量获取，然后请求server端获取注册信息，成功后更新注册信息。再触发CacheRefreshed事件

Eureka-Server管理注册信息

客户端的请求到Server端后，通过ResponseCache返回服务信息

@GET

    public Response getContainers(@PathParam("version") String version,

                                  @HeaderParam(HEADER_ACCEPT) String acceptHeader,

                                  @HeaderParam(HEADER_ACCEPT_ENCODING) String acceptEncoding,

                                  @HeaderParam(EurekaAccept.HTTP_X_EUREKA_ACCEPT) String eurekaAccept,

                                  @Context UriInfo uriInfo,

                                  @Nullable @QueryParam("regions") String regionsStr) {

        boolean isRemoteRegionRequested = null != regionsStr && !regionsStr.isEmpty();

        String[] regions = null;

        if (!isRemoteRegionRequested) {

            EurekaMonitors.GET_ALL.increment();

        } else {

            regions = regionsStr.toLowerCase().split(",");

            Arrays.sort(regions); // So we don't have different caches for same regions queried in different order.

            EurekaMonitors.GET_ALL_WITH_REMOTE_REGIONS.increment();

        }

         // 判断是否可以访问

        if (!registry.shouldAllowAccess(isRemoteRegionRequested)) {

            return Response.status(Status.FORBIDDEN).build();

        }

        CurrentRequestVersion.set(Version.toEnum(version));

        // 返回数据格式

        KeyType keyType = Key.KeyType.JSON;

        String returnMediaType = MediaType.APPLICATION_JSON;

        if (acceptHeader == null || !acceptHeader.contains(HEADER_JSON_VALUE)) {

            keyType = Key.KeyType.XML;

            returnMediaType = MediaType.APPLICATION_XML;

        }

        // 响应缓存键( KEY )

        Key cacheKey = new Key(Key.EntityType.Application,

                ResponseCacheImpl.ALL_APPS,

                keyType, CurrentRequestVersion.get(), EurekaAccept.fromString(eurekaAccept), regions

        );

        Response response;

        if (acceptEncoding != null && acceptEncoding.contains(HEADER_GZIP_VALUE)) {

        // 根据cacheKey返回注册信息

            response = Response.ok(responseCache.getGZIP(cacheKey))

                    .header(HEADER_CONTENT_ENCODING, HEADER_GZIP_VALUE)

                    .header(HEADER_CONTENT_TYPE, returnMediaType)

                    .build();

        } else {

            response = Response.ok(responseCache.get(cacheKey))

                    .build();

        }

        return response;

    }

重点就是在responseCache中的get方法了了

String get(final Key key, boolean useReadOnlyCache) {

        Value payload = getValue(key, useReadOnlyCache);

        if (payload == null || payload.getPayload().equals(EMPTY_PAYLOAD)) {

            return null;

        } else {

            return payload.getPayload();

        }

    }

private final ConcurrentMap<Key, Value> readOnlyCacheMap = new ConcurrentHashMap<Key, Value>();

private final LoadingCache<Key, Value> readWriteCacheMap;

this.readWriteCacheMap =

                CacheBuilder.newBuilder().initialCapacity(1000)

                        .expireAfterWrite(serverConfig.getResponseCacheAutoExpirationInSeconds(), TimeUnit.SECONDS)

                        .removalListener(new RemovalListener<Key, Value>() {

                            @Override

                            public void onRemoval(RemovalNotification<Key, Value> notification) {

                                Key removedKey = notification.getKey();

                                if (removedKey.hasRegions()) {

                                    Key cloneWithNoRegions = removedKey.cloneWithoutRegions();

                                    regionSpecificKeys.remove(cloneWithNoRegions, removedKey);

                                }

                            }

                        })

                        .build(new CacheLoader<Key, Value>() {

                            @Override

                            public Value load(Key key) throws Exception {

                                if (key.hasRegions()) {

                                    Key cloneWithNoRegions = key.cloneWithoutRegions();

                                    regionSpecificKeys.put(cloneWithNoRegions, key);

                                }

                                Value value = generatePayload(key);

                                return value;

                            }

                        });

Value getValue(final Key key, boolean useReadOnlyCache) {

        Value payload = null;

        try {

            if (useReadOnlyCache) {

            //从只读缓存中获取注册信息

                final Value currentPayload = readOnlyCacheMap.get(key);

                if (currentPayload != null) {

                    payload = currentPayload;

                } else {

                //只读缓存不存在便从读写缓存中获取信息

                    payload = readWriteCacheMap.get(key);

                    readOnlyCacheMap.put(key, payload);

                }

            } else {

                payload = readWriteCacheMap.get(key);

            }

        } catch (Throwable t) {

            logger.error("Cannot get value for key :" + key, t);

        }

        return payload;

    }

这里采用了双层缓存的结构首先从readOnlyCacheMap读取数据，如果readOnlyCacheMap读取不到则从readWriteCacheMap读取数据。readOnlyCacheMap是个ConcurrentMap结构，而readWriteCacheMap则是一个guava cache，最大容量1000，180s后自动过期。

两个map之间的数据是如何交互的呢。这里有个定时任务每隔30秒去对比一次两个缓存中的数据，如果发现两者不一致，则用readWriteCacheMap的值覆盖readOnlyCacheMap的值

if (shouldUseReadOnlyResponseCache) {

            timer.schedule(getCacheUpdateTask(),

                    new Date(((System.currentTimeMillis() / responseCacheUpdateIntervalMs) * responseCacheUpdateIntervalMs)

                            + responseCacheUpdateIntervalMs),

                    responseCacheUpdateIntervalMs);

        }

private TimerTask getCacheUpdateTask() {

        return new TimerTask() {

            @Override

            public void run() {

                logger.debug("Updating the client cache from response cache");

                for (Key key : readOnlyCacheMap.keySet()) {

                    try {

                        CurrentRequestVersion.set(key.getVersion());

                        Value cacheValue = readWriteCacheMap.get(key);

                        Value currentCacheValue = readOnlyCacheMap.get(key);

                        //对比两个缓存的值

                        if (cacheValue != currentCacheValue) {

                            readOnlyCacheMap.put(key, cacheValue);

                        }

                    } catch (Throwable th) {

                        logger.error("Error while updating the client cache from response cache", th);

                    }

                }

            }

        };

    }

现在我们知道了readOnlyCacheMap中的数据是从readWriteCacheMap获得的，并且每隔30s同步一次。那么还有一个问题就是readWriteCacheMap中的数据是从哪里来的呢？

在readWriteCacheMap变量上find usages无法找到明确的信息，便在build方法中添加断点

this.readWriteCacheMap =

                CacheBuilder.newBuilder().initialCapacity(1000)

                        .expireAfterWrite(serverConfig.getResponseCacheAutoExpirationInSeconds(), TimeUnit.SECONDS)

                        .removalListener(new RemovalListener<Key, Value>() {

                            @Override

                            public void onRemoval(RemovalNotification<Key, Value> notification) {

                                Key removedKey = notification.getKey();

                                if (removedKey.hasRegions()) {

                                    Key cloneWithNoRegions = removedKey.cloneWithoutRegions();

                                    regionSpecificKeys.remove(cloneWithNoRegions, removedKey);

                                }

                            }

                        })

                        .build(new CacheLoader<Key, Value>() {

                            @Override

                            public Value load(Key key) throws Exception {

                                if (key.hasRegions()) {

                                    Key cloneWithNoRegions = key.cloneWithoutRegions();

                                    regionSpecificKeys.put(cloneWithNoRegions, key);

                                }

                                //添加断点

                                Value value = generatePayload(key);

                                return value;

                            }

                        });

最终发现readWriteCacheMap的值是在同步任务中添加的

private TimerTask getCacheUpdateTask() {

        return new TimerTask() {

            @Override

            public void run() {

                logger.debug("Updating the client cache from response cache");

                for (Key key : readOnlyCacheMap.keySet()) {

                    try {

                        CurrentRequestVersion.set(key.getVersion());

                        Value cacheValue = readWriteCacheMap.get(key);

                        //触发load方法加载Value

                        Value currentCacheValue = readOnlyCacheMap.get(key);

                        //对比两个缓存的值

                        if (cacheValue != currentCacheValue) {

                            readOnlyCacheMap.put(key, cacheValue);

                        }

                    } catch (Throwable th) {

                        logger.error("Error while updating the client cache from response cache", th);

                    }

                }

            }

        };

    }

好，触发时机我们现在也知道了，我们再看下数据时怎么产生的。大致我们可以了解到readWriteCacheMap中的value是通过AbstractInstanceRegistry中的registry变量得到的

private final AbstractInstanceRegistry registry;

private Value generatePayload(Key key) {

        Stopwatch tracer = null;

        try {

            String payload;

            switch (key.getEntityType()) {

                case Application:

                    boolean isRemoteRegionRequested = key.hasRegions();

                    if (ALL_APPS.equals(key.getName())) {

                        if (isRemoteRegionRequested) {

                            tracer = serializeAllAppsWithRemoteRegionTimer.start();

                            payload = getPayLoad(key, registry.getApplicationsFromMultipleRegions(key.getRegions()));

                        } else {

                            tracer = serializeAllAppsTimer.start();

                            payload = getPayLoad(key, registry.getApplications());

                        }

                    } else if (ALL_APPS_DELTA.equals(key.getName())) {

                        if (isRemoteRegionRequested) {

                            tracer = serializeDeltaAppsWithRemoteRegionTimer.start();

                            versionDeltaWithRegions.incrementAndGet();

                            versionDeltaWithRegionsLegacy.incrementAndGet();

                            payload = getPayLoad(key,

                                    registry.getApplicationDeltasFromMultipleRegions(key.getRegions()));

                        } else {

                            tracer = serializeDeltaAppsTimer.start();

                            versionDelta.incrementAndGet();

                            versionDeltaLegacy.incrementAndGet();

                            payload = getPayLoad(key, registry.getApplicationDeltas());

                        }

                    } else {

                        tracer = serializeOneApptimer.start();

                        payload = getPayLoad(key, registry.getApplication(key.getName()));

                    }

                    break;

                case VIP:

                case SVIP:

                    tracer = serializeViptimer.start();

                    payload = getPayLoad(key, getApplicationsForVip(key, registry));

                    break;

                default:

                    logger.error("Unidentified entity type: " + key.getEntityType() + " found in the cache key.");

                    payload = "";

                    break;

            }

            return new Value(payload);

        } finally {

            if (tracer != null) {

                tracer.stop();

            }

        }

    }

AbstractInstanceRegistry中的registry是一个多层缓存结构。client注册，续约，下线的数据都是通过registry进行保存

private final ConcurrentHashMap<String, Map<String, Lease<InstanceInfo>>> registry

            = new ConcurrentHashMap<String, Map<String, Lease<InstanceInfo>>>();

registry有一个定时任务每隔60s去剔除过期的数据

evictionTimer.schedule(evictionTaskRef.get(),

                //60*1000

                serverConfig.getEvictionIntervalTimerInMs(),

                serverConfig.getEvictionIntervalTimerInMs());

@Override

        public void run() {

            try {

                long compensationTimeMs = getCompensationTimeMs();

                logger.info("Running the evict task with compensationTime {}ms", compensationTimeMs);

                evict(compensationTimeMs);

            } catch (Throwable e) {

                logger.error("Could not run the evict task", e);

            }

        }

总结下

eureka客户端注册，续约，下线都会请求到server端，server端把数据保存在registry这个双层map中。每隔60s会有定时任务去检查registry中保存的租约是否已经过期(租约有效期是90s)，然后每隔30s会有定时任务更新readWriteCacheMap的值以及同步readWriteCacheMap和readOnlyCacheMap的值

服务感知优化

基于上述流程，想象下，假如一个服务异常下线server端没有接受到下线请求，那么会有以下情况

0s 时服务未通知 Eureka Client 直接下线；
29s 时第一次过期检查 evict 未超过 90s；
89s 时第二次过期检查 evict 未超过 90s；
149s 时第三次过期检查 evict 未续约时间超过了 90s，故将该服务实例从 registry 中删除；
179s 时定时任务更新readWriteCacheMap以及从 readWriteCacheMap 更新至 readOnlyCacheMap;
209s 时 Eureka Client 从 Eureka Server 的 readOnlyCacheMap 更新；
239s 时 Ribbon 从 Eureka Client 更新。

(ribbon同样也有缓存更新策略，默认30s)

因此，极限情况下服务消费者最长感知时间将无限趋近 240s。

怎么优化呢

server端：

减少registry服务剔除任务时间

减少两个缓存同步定时任务时间

小型系统可以直接去掉readOnlyCacheMap

服务提供端

减少心跳时间

减少租约过期时间

服务消费端

减少ribbon更新时间

减少fetchRegist时间

EurekaServer修改如下配置：

#eureka server刷新readCacheMap的时间，注意，client读取的是readCacheMap，这个时间决定了多久会把readWriteCacheMap的缓存更新到readCacheMap上

#默认30s

eureka.server.responseCacheUpdateIntervalMs=3000

#eureka server缓存readWriteCacheMap失效时间，这个只有在这个时间过去后缓存才会失效，失效前不会更新，过期后从registry重新读取注册服务信息，registry是一个ConcurrentHashMap。

#由于启用了evict其实就用不太上改这个配置了

#默认180s

eureka.server.responseCacheAutoExpirationInSeconds=180

#启用主动失效，并且每次主动失效检测间隔为3s

Eureka Server会定时（间隔值是eureka.server.eviction-interval-timer-in-ms，默认值为0，默认情况不删除实例）进行检查，

如果发现实例在在一定时间（此值由客户端设置的eureka.instance.lease-expiration-duration-in-seconds定义，默认值为90s）

内没有收到心跳，则会注销此实例。

eureka.server.eviction-interval-timer-in-ms=3000

Eureka服务提供方修改如下配置：

#服务过期时间配置,超过这个时间没有接收到心跳EurekaServer就会将这个实例剔除

#注意，EurekaServer一定要设置eureka.server.eviction-interval-timer-in-ms否则这个配置无效，这个配置一般为服务刷新时间配置的三倍

#默认90s

eureka.instance.lease-expiration-duration-in-seconds=15

#服务刷新时间配置，每隔这个时间会主动心跳一次

#默认30s

eureka.instance.lease-renewal-interval-in-seconds=5

Eureka服务调用方修改如下配置：

#eureka client刷新本地缓存时间

#默认30s

eureka.client.registryFetchIntervalSeconds=5

#eureka客户端ribbon刷新时间

#默认30s

ribbon.ServerListRefreshInterval=5000

Eureka 缓存结构以及服务感知优化的更多相关文章

Mysql服务配置优化
mysql服务器优化包含硬件优化.操作系统配置优化(cpu调度.网络.内存.虚拟内存.磁盘io).Mysql服务配置优化(最大连接数.表缓存等.存储引擎).表结构优化.索引优化总共5个方面. 本片 ...
程序员笔记|详解Eureka 缓存机制
引言 Eureka是Netflix开源的.用于实现服务注册和发现的服务.Spring Cloud Eureka基于Eureka进行二次封装,增加了更人性化的UI,使用更为方便.但是由于Eureka本身 ...
详解Eureka 缓存机制
原文:https://www.cnblogs.com/yixinjishu/p/10871243.html 引言 Eureka是Netflix开源的.用于实现服务注册和发现的服务.Spring Clo ...
Java后台服务慢优化杂谈
Java后台服务慢优化杂谈前言你是否遇到过这样的场景,当我们点击页面某个按钮后,页面一直loading,要等待好几分钟才出结果的画面,有时直接502或504,作为一个后台开发,看到自己开发的系统是 ...
一次漫长的服务CPU优化过程
从师父那里接了个服务,每天单机的流量并不大,峰值tips也并不高,但是CPU却高的异常.由于,服务十分重要,这个服务最高时占用了100个docker节点在跑,被逼无奈开始了异常曲折的查因和优化过程. ...
eureka快速剔除失效服务
eureka服务端配置 #eureka server刷新readCacheMap的时间,注意,client读取的是readCacheMap,这个时间决定了多久会把readWriteCacheMap的缓 ...
spring-eureka 源码解读----为什么一个服务最多两分钟被其他服务感知
Eureka的wiki上有一句话,大意是一个服务启动后最长可能需要2分钟时间才能被其它服务感知到,但是文档并没有解释为什么会有这2分钟.其实这是由三处缓存 + 一处延迟造成的. 首先,Eureka对H ...
Spring Cloud Eureka 分布式开发之服务注册中心、负载均衡、声明式服务调用实现
介绍本示例主要介绍 Spring Cloud 系列中的 Eureka,使你能快速上手负载均衡.声明式服务.服务注册中心等 Eureka Server Eureka 是 Netflix 的子模块,它是 ...
Spring Cloud Eureka源码分析---服务注册
本篇我们着重分析Eureka服务端的逻辑实现,主要涉及到服务的注册流程分析. 在Eureka的服务治理中,会涉及到下面一些概念: 服务注册:Eureka Client会通过发送REST请求的方式向Eu ...

随机推荐

JAVA基础-基础类型
学习JAVA的同学都知道,数据类型是基础中的基础,而JAVA本身是强类型语言,他对变量的类型有这魔一般的执著,所以学好JAVA的重心就是要学好数据类型.既然有强类型语言,就会有弱类型语言如PHP.Ja ...
WinForm控件之【ComboBox】
基本介绍下拉文本框应用较为广泛,在winfrom控件当中使用设置也是相对的简单,主要用于存在多种选择的单选操作场景. 常设置属性.事件 DataSource:绑定加载项的数据源,设置属性Displa ...
C# .net Ueditor实现图片上传到阿里云OSS 对象存储
在学习的时候,项目中需要实现在Ueditor编辑器中将图片上传到云储存中,老师演示的是上传到又拍云存储,既然看了一遍,直接照搬不算本事,咱们可以依葫芦画瓢自己来动手玩玩其它的云存储服务. 现在云计算产 ...
CF1027D Mouse Hunt题解
题目: 伯兰州立大学的医学部刚刚结束了招生活动.和以往一样,约80%的申请人都是女生并且她们中的大多数人将在未来4年(真希望如此)住在大学宿舍里. 宿舍楼里有nn个房间和一只老鼠!女孩们决定在一些房间 ...
个人永久性免费-Excel催化剂功能第88波-批量提取pdf文件信息（图片、表格、文本等）
日常办公场合中,除了常规的Excel.Word.PPT等文档外,还有一个不可忽略的文件格式是pdf格式,而对于想从pdf文件中获取信息时,常规方法将变得非常痛苦和麻烦.此篇给大家送一pdf文件提取信息 ...
[leetcode] 147. Insertion Sort List (Medium)
原题别人的思路非常简洁 function ListNode(val) { this.val = val; this.next = null; } /** * @param {ListNode} h ...
Vue的基本使用(一)
<!DOCTYPE html> <html lang="en"> <head> <meta charset="UTF-8&quo ...
今天来聊Java ClassLoader
背景类加载机制作为一个高频的面试题经常会在面试中被问到,前几天一个电话面试就问到,之前有了解过,但是没有梳理成自己的体系,所以说的有点凌乱,今天花点时间整理一下,分享给大家同时自己也好好梳理一下,顺 ...
activiti工作流委托功能的设计和实现
最近公司开发一个项目,客户提出了一个需求,用户在出差的时候,可以将自己的工作进行委托.可以指定委托时间.委托工作内容.指定委托人等等内容. 然后我就上网查询资料,发现activiti工作流本身并不支持 ...
c++容器简单总结
数据结构描述实现头文件向量(vector) 连续存储的元素 <vector> 列表(list) 由节点组成的双向链表,每个结点包含着一个元素 <list> 双队列(deq ...

Eureka 缓存结构以及服务感知优化

Eureka-Client获取注册信息

Eureka-Server管理注册信息

服务感知优化

Eureka 缓存结构以及服务感知优化的更多相关文章

随机推荐

热门专题