千家信息网

Redis Cluster 添加/删除 完整折腾步骤

发表于:2025-01-23 作者:千家信息网编辑
千家信息网最后更新 2025年01月23日,Redis还是挺好玩的,今天测试了集群的添加、删除节点、重分配slot等。更深入的理解redis的游戏规则。步骤繁多,但是详细。环境解释:我是在一台Centos 6.9上测试的,各个redis节点以端
千家信息网最后更新 2025年01月23日Redis Cluster 添加/删除 完整折腾步骤

Redis还是挺好玩的,今天测试了集群的添加、删除节点、重分配slot等。更深入的理解redis的游戏规则。步骤繁多,但是详细。


环境解释:

我是在一台Centos 6.9上测试的,各个redis节点以端口号区分。文中针对各个redis,我只是以端口号代表。

~~~~Master Node~~~~~172.16.32.116:7000172.16.32.116:7001172.16.32.116:7002~~~~Slave Node~~~~~172.16.32.116:8000172.16.32.116:8001172.16.32.116:8002~~~~用来折腾的Node~~~~~172.16.32.116:9000172.16.32.116:9001

1. 创建redis集群

注:更多redis集群创建,请参阅

Redis Cluster集群部署搭建

# ./redis-trib.rb create --replicas 1 172.16.32.116:7000 172.16.32.116:7001 172.16.32.116:7002 172.16.32.116:8000 172.16.32.116:8001 172.16.32.116:8002>>> Creating cluster>>> Performing hash slots allocation on 6 nodes...Using 3 masters:172.16.32.116:7000172.16.32.116:7001172.16.32.116:7002Adding replica 172.16.32.116:8000 to 172.16.32.116:7000Adding replica 172.16.32.116:8001 to 172.16.32.116:7001Adding replica 172.16.32.116:8002 to 172.16.32.116:7002M: a0b91f48e933c1f1d427c54917ce970bd25d29f8 172.16.32.116:7000   slots:0-5460 (5461 slots) masterM: 273107e5ac994d675749be0979556e761274bb93 172.16.32.116:7001   slots:5461-10922 (5462 slots) masterM: 88fe075375295b59eabe69fa1438ed7c7c314f43 172.16.32.116:7002   slots:10923-16383 (5461 slots) masterS: aeb684429d220c0fd1392574d193cc1ae7577782 172.16.32.116:8000   replicates a0b91f48e933c1f1d427c54917ce970bd25d29f8S: a96cad95dca2a8e1e0302bff4f835260d92e3d31 172.16.32.116:8001   replicates 273107e5ac994d675749be0979556e761274bb93S: 3d27f60a1cc4d9c8f09aca928b03f0e083722d3b 172.16.32.116:8002   replicates 88fe075375295b59eabe69fa1438ed7c7c314f43Can I set the above configuration? (type 'yes' to accept): yes>>> Nodes configuration updated>>> Assign a different config epoch to each node>>> Sending CLUSTER MEET messages to join the clusterWaiting for the cluster to join...>>> Performing Cluster Check (using node 172.16.32.116:7000)M: a0b91f48e933c1f1d427c54917ce970bd25d29f8 172.16.32.116:7000   slots:0-5460 (5461 slots) masterM: 273107e5ac994d675749be0979556e761274bb93 172.16.32.116:7001   slots:5461-10922 (5462 slots) masterM: 88fe075375295b59eabe69fa1438ed7c7c314f43 172.16.32.116:7002   slots:10923-16383 (5461 slots) masterM: aeb684429d220c0fd1392574d193cc1ae7577782 172.16.32.116:8000   slots: (0 slots) master   replicates a0b91f48e933c1f1d427c54917ce970bd25d29f8M: a96cad95dca2a8e1e0302bff4f835260d92e3d31 172.16.32.116:8001   slots: (0 slots) master   replicates 273107e5ac994d675749be0979556e761274bb93M: 3d27f60a1cc4d9c8f09aca928b03f0e083722d3b 172.16.32.116:8002   slots: (0 slots) master   replicates 88fe075375295b59eabe69fa1438ed7c7c314f43[OK] All nodes agree about slots configuration.>>> Check for open slots...>>> Check slots coverage...[OK] All 16384 slots covered.

2. 检查集群状态

# ./redis-trib.rb check 172.16.32.116:7000>>> Performing Cluster Check (using node 172.16.32.116:7000)M: a0b91f48e933c1f1d427c54917ce970bd25d29f8 172.16.32.116:7000   slots:0-5460 (5461 slots) master   1 additional replica(s)M: 88fe075375295b59eabe69fa1438ed7c7c314f43 172.16.32.116:7002   slots:10923-16383 (5461 slots) master   1 additional replica(s)M: 273107e5ac994d675749be0979556e761274bb93 172.16.32.116:7001   slots:5461-10922 (5462 slots) master   1 additional replica(s)S: 3d27f60a1cc4d9c8f09aca928b03f0e083722d3b 172.16.32.116:8002   slots: (0 slots) slave   replicates 88fe075375295b59eabe69fa1438ed7c7c314f43S: a96cad95dca2a8e1e0302bff4f835260d92e3d31 172.16.32.116:8001   slots: (0 slots) slave   replicates 273107e5ac994d675749be0979556e761274bb93S: aeb684429d220c0fd1392574d193cc1ae7577782 172.16.32.116:8000   slots: (0 slots) slave   replicates a0b91f48e933c1f1d427c54917ce970bd25d29f8[OK] All nodes agree about slots configuration.>>> Check for open slots...>>> Check slots coverage...[OK] All 16384 slots covered.

~~~~~~~~~~~~~~~~~~~~~~~~~~~添加节点~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

3. 添加新节点redis-trib.rb add-node 新增节点名 原集群节点名

# ./redis-trib.rb add-node 172.16.32.116:9000 172.16.32.116:7000>>> Adding node 172.16.32.116:9000 to cluster 172.16.32.116:7000>>> Performing Cluster Check (using node 172.16.32.116:7000)M: a0b91f48e933c1f1d427c54917ce970bd25d29f8 172.16.32.116:7000   slots:0-5460 (5461 slots) master   1 additional replica(s)M: 88fe075375295b59eabe69fa1438ed7c7c314f43 172.16.32.116:7002   slots:10923-16383 (5461 slots) master   1 additional replica(s)M: 273107e5ac994d675749be0979556e761274bb93 172.16.32.116:7001   slots:5461-10922 (5462 slots) master   1 additional replica(s)S: 3d27f60a1cc4d9c8f09aca928b03f0e083722d3b 172.16.32.116:8002   slots: (0 slots) slave   replicates 88fe075375295b59eabe69fa1438ed7c7c314f43S: a96cad95dca2a8e1e0302bff4f835260d92e3d31 172.16.32.116:8001   slots: (0 slots) slave   replicates 273107e5ac994d675749be0979556e761274bb93S: aeb684429d220c0fd1392574d193cc1ae7577782 172.16.32.116:8000   slots: (0 slots) slave   replicates a0b91f48e933c1f1d427c54917ce970bd25d29f8[OK] All nodes agree about slots configuration.>>> Check for open slots...>>> Check slots coverage...

4. 查看当前集群状态,9000是一个空的Master

# ./redis-cli  -p 9000 cluster nodesa0b91f48e933c1f1d427c54917ce970bd25d29f8 172.16.32.116:7000 master - 0 1505321254767 1 connected 0-5460273107e5ac994d675749be0979556e761274bb93 172.16.32.116:7001 master - 0 1505321250759 2 connected 5461-1092288fe075375295b59eabe69fa1438ed7c7c314f43 172.16.32.116:7002 master - 0 1505321251761 3 connected 10923-163833d27f60a1cc4d9c8f09aca928b03f0e083722d3b 172.16.32.116:8002 slave 88fe075375295b59eabe69fa1438ed7c7c314f43 0 1505321255769 3 connectedaeb684429d220c0fd1392574d193cc1ae7577782 172.16.32.116:8000 slave a0b91f48e933c1f1d427c54917ce970bd25d29f8 0 1505321253765 1 connecteda96cad95dca2a8e1e0302bff4f835260d92e3d31 172.16.32.116:8001 slave 273107e5ac994d675749be0979556e761274bb93 0 1505321256771 2 connected

5. 为9000分配slot, redis的solt是固定的,就16384个,只能从其他节点获取slot,然后分配到9000

# ./redis-trib.rb reshard  172.16.32.116:9000>>> Performing Cluster Check (using node 172.16.32.116:9000)M: 364ae8322ab2627e25b05d45b702448c74afad10 172.16.32.116:9000   slots: (0 slots) master   0 additional replica(s)M: a0b91f48e933c1f1d427c54917ce970bd25d29f8 172.16.32.116:7000   slots:0-5460 (5461 slots) master   1 additional replica(s)M: 273107e5ac994d675749be0979556e761274bb93 172.16.32.116:7001   slots:5461-10922 (5462 slots) master   1 additional replica(s)S: 3d27f60a1cc4d9c8f09aca928b03f0e083722d3b 172.16.32.116:8002   slots: (0 slots) slave   replicates 88fe075375295b59eabe69fa1438ed7c7c314f43S: aeb684429d220c0fd1392574d193cc1ae7577782 172.16.32.116:8000   slots: (0 slots) slave   replicates a0b91f48e933c1f1d427c54917ce970bd25d29f8M: 88fe075375295b59eabe69fa1438ed7c7c314f43 172.16.32.116:7002   slots:10923-16383 (5461 slots) master   1 additional replica(s)S: a96cad95dca2a8e1e0302bff4f835260d92e3d31 172.16.32.116:8001   slots: (0 slots) slave   replicates 273107e5ac994d675749be0979556e761274bb93[OK] All nodes agree about slots configuration.>>> Check for open slots...>>> Check slots coverage...[OK] All 16384 slots covered.How many slots do you want to move (from 1 to 16384)? What is the receiving node ID? 364ae8322ab2627e25b05d45b702448c74afad10 Please enter all the source node IDs.  Type 'all' to use all the nodes as source nodes for the hash slots.  Type 'done' once you entered all the source nodes IDs.Source node #1:all      Ready to move 300 slots.  Source nodes:    M: a0b91f48e933c1f1d427c54917ce970bd25d29f8 172.16.32.116:7000   slots:0-5460 (5461 slots) master   1 additional replica(s)    M: 273107e5ac994d675749be0979556e761274bb93 172.16.32.116:7001   slots:5461-10922 (5462 slots) master   1 additional replica(s)    M: 88fe075375295b59eabe69fa1438ed7c7c314f43 172.16.32.116:7002   slots:10923-16383 (5461 slots) master   1 additional replica(s)  Destination node:    M: 364ae8322ab2627e25b05d45b702448c74afad10 172.16.32.116:9000   slots: (0 slots) master   0 additional replica(s)  Resharding plan:    Moving slot 5461 from 273107e5ac994d675749be0979556e761274bb93    Moving slot 5469 from 273107e5ac994d675749be0979556e761274bb93Do you want to proceed with the proposed reshard plan (yes/no)? yesMoving slot 5461 from 172.16.32.116:7001 to 172.16.32.116:9000:

6. 可以看到,9000已经分配到的slot是0-98 5461-5561 10923-11021

# ./redis-cli  -p 9000 cluster nodesa0b91f48e933c1f1d427c54917ce970bd25d29f8 172.16.32.116:7000 master - 0 1505324905062 1 connected 99-5460273107e5ac994d675749be0979556e761274bb93 172.16.32.116:7001 master - 0 1505324910075 2 connected 5562-10922364ae8322ab2627e25b05d45b702448c74afad10 172.16.32.116:9000 myself,master - 0 0 7 connected 0-98 5461-5561 10923-110213d27f60a1cc4d9c8f09aca928b03f0e083722d3b 172.16.32.116:8002 slave 88fe075375295b59eabe69fa1438ed7c7c314f43 0 1505324908070 3 connectedaeb684429d220c0fd1392574d193cc1ae7577782 172.16.32.116:8000 slave a0b91f48e933c1f1d427c54917ce970bd25d29f8 0 1505324911077 1 connected88fe075375295b59eabe69fa1438ed7c7c314f43 172.16.32.116:7002 master - 0 1505324902057 3 connected 11022-16383a96cad95dca2a8e1e0302bff4f835260d92e3d31 172.16.32.116:8001 slave 273107e5ac994d675749be0979556e761274bb93 0 1505324909073 2 connected

~~~~~~~~~~~~~~~~~~~~~~~~~~~将9000变为slave~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

7. 希望将9000变成7000的slave,但是由于有slot,执行失败,需要先转移slot

# redis-cli -c -p 9000 cluster replicate a0b91f48e933c1f1d427c54917ce970bd25d29f8(error) ERR To set a master the node must be empty and without assigned slots.

8. 删除节点也是不可以的,总之,只要上面有slot。redis是不会让你删除的,而且需要人工介入,rebalance这些slot之后才行

# ./redis-trib.rb del-node 172.16.32.116:9000 364ae8322ab2627e25b05d45b702448c74afad10>>> Removing node 364ae8322ab2627e25b05d45b702448c74afad10 from cluster 172.16.32.116:9000[ERR] Node 172.16.32.116:9000 is not empty! Reshard data away and try again.

9.重新分配9000的slot到7000上

# ./redis-trib.rb reshard  172.16.32.116:9000   <<<<<<重新分配slot>>> Performing Cluster Check (using node 172.16.32.116:9000)M: 364ae8322ab2627e25b05d45b702448c74afad10 172.16.32.116:9000   slots:0-98,5461-5561,10923-11021 (299 slots) master   0 additional replica(s)M: a0b91f48e933c1f1d427c54917ce970bd25d29f8 172.16.32.116:7000   slots:99-5460 (5362 slots) master   1 additional replica(s)M: 273107e5ac994d675749be0979556e761274bb93 172.16.32.116:7001   slots:5562-10922 (5361 slots) master   1 additional replica(s)S: 3d27f60a1cc4d9c8f09aca928b03f0e083722d3b 172.16.32.116:8002   slots: (0 slots) slave   replicates 88fe075375295b59eabe69fa1438ed7c7c314f43S: aeb684429d220c0fd1392574d193cc1ae7577782 172.16.32.116:8000   slots: (0 slots) slave   replicates a0b91f48e933c1f1d427c54917ce970bd25d29f8M: 88fe075375295b59eabe69fa1438ed7c7c314f43 172.16.32.116:7002   slots:11022-16383 (5362 slots) master   1 additional replica(s)S: a96cad95dca2a8e1e0302bff4f835260d92e3d31 172.16.32.116:8001   slots: (0 slots) slave   replicates 273107e5ac994d675749be0979556e761274bb93[OK] All nodes agree about slots configuration.>>> Check for open slots...>>> Check slots coverage...[OK] All 16384 slots covered.How many slots do you want to move (from 1 to 16384)? 300     <<<9000节点全部需要迁移的节点What is the receiving node ID? a0b91f48e933c1f1d427c54917ce970bd25d29f8  <<<<<<7000的IDPlease enter all the source node IDs.  Type 'all' to use all the nodes as source nodes for the hash slots.  Type 'done' once you entered all the source nodes IDs.Source node #1:364ae8322ab2627e25b05d45b702448c74afad10    <<<<<<9000的IDSource node #2:doneReady to move 300 slots.  Source nodes:    M: 364ae8322ab2627e25b05d45b702448c74afad10 172.16.32.116:9000   <<<<<

10. 查询,可以看到9000已经没有slot了

# ./redis-cli  -p 9000 cluster nodesa0b91f48e933c1f1d427c54917ce970bd25d29f8 172.16.32.116:7000 master - 0 1505328938056 8 connected 0-5561 10923-11021273107e5ac994d675749be0979556e761274bb93 172.16.32.116:7001 master - 0 1505328939059 2 connected 5562-10922364ae8322ab2627e25b05d45b702448c74afad10 172.16.32.116:9000 myself,master - 0 0 7 connected3d27f60a1cc4d9c8f09aca928b03f0e083722d3b 172.16.32.116:8002 slave 88fe075375295b59eabe69fa1438ed7c7c314f43 0 1505328936053 3 connectedaeb684429d220c0fd1392574d193cc1ae7577782 172.16.32.116:8000 slave a0b91f48e933c1f1d427c54917ce970bd25d29f8 0 1505328933046 8 connected88fe075375295b59eabe69fa1438ed7c7c314f43 172.16.32.116:7002 master - 0 1505328937054 3 connected 11022-16383a96cad95dca2a8e1e0302bff4f835260d92e3d31 172.16.32.116:8001 slave 273107e5ac994d675749be0979556e761274bb93 0 1505328934049 2 connected

11. 再次执行命令,将9000变成7000的slave,成功

# redis-cli -c -p 9000 cluster replicate a0b91f48e933c1f1d427c54917ce970bd25d29f8OK

12. 查看状态,9000已经成为7000的slave

# ./redis-cli  -p 9000 cluster nodesa0b91f48e933c1f1d427c54917ce970bd25d29f8 172.16.32.116:7000 master - 0 1505329564286 8 connected 0-5561 10923-11021273107e5ac994d675749be0979556e761274bb93 172.16.32.116:7001 master - 0 1505329561281 2 connected 5562-10922364ae8322ab2627e25b05d45b702448c74afad10 172.16.32.116:9000 myself,slave a0b91f48e933c1f1d427c54917ce970bd25d29f8 0 0 7 connected    <<<<<<<<<<<<<<3d27f60a1cc4d9c8f09aca928b03f0e083722d3b 172.16.32.116:8002 slave 88fe075375295b59eabe69fa1438ed7c7c314f43 0 1505329558274 3 connectedaeb684429d220c0fd1392574d193cc1ae7577782 172.16.32.116:8000 slave a0b91f48e933c1f1d427c54917ce970bd25d29f8 0 1505329554266 8 connected88fe075375295b59eabe69fa1438ed7c7c314f43 172.16.32.116:7002 master - 0 1505329563285 3 connected 11022-16383a96cad95dca2a8e1e0302bff4f835260d92e3d31 172.16.32.116:8001 slave 273107e5ac994d675749be0979556e761274bb93 0 1505329562283 2 connected

13. 删除节点9000,成功删除

# ./redis-trib.rb del-node 172.16.32.116:9000 364ae8322ab2627e25b05d45b702448c74afad10>>> Removing node 364ae8322ab2627e25b05d45b702448c74afad10 from cluster 172.16.32.116:9000>>> Sending CLUSTER FORGET messages to the cluster...>>> SHUTDOWN the node.

14. 连接9000,发现已经shutdown,无法连接

# ./redis-cli  -p 9000 cluster nodesCould not connect to Redis at 127.0.0.1:9000: Connection refusedCould not connect to Redis at 127.0.0.1:9000: Connection refused

15. 查看集群状态,9000已经不见了

# ./redis-cli  -p 7000 cluster nodes88fe075375295b59eabe69fa1438ed7c7c314f43 172.16.32.116:7002 master - 0 1505329693835 3 connected 11022-16383273107e5ac994d675749be0979556e761274bb93 172.16.32.116:7001 master - 0 1505329694837 2 connected 5562-10922a0b91f48e933c1f1d427c54917ce970bd25d29f8 172.16.32.116:7000 myself,master - 0 0 8 connected 0-5561 10923-110213d27f60a1cc4d9c8f09aca928b03f0e083722d3b 172.16.32.116:8002 slave 88fe075375295b59eabe69fa1438ed7c7c314f43 0 1505329696841 6 connecteda96cad95dca2a8e1e0302bff4f835260d92e3d31 172.16.32.116:8001 slave 273107e5ac994d675749be0979556e761274bb93 0 1505329695840 5 connectedaeb684429d220c0fd1392574d193cc1ae7577782 172.16.32.116:8000 slave a0b91f48e933c1f1d427c54917ce970bd25d29f8 0 1505329692833 8 connected

~~~~~~~~~~~~~~~~~~~~~~~~~~~再次启动9000,发现不同~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

16. 再次启动9000,发现一个有意思的事情。上面检查7000,集群已经没有9000了

# ./redis-cli  -p 7000 cluster nodes88fe075375295b59eabe69fa1438ed7c7c314f43 172.16.32.116:7002 master - 0 1505329898241 3 connected 11022-16383273107e5ac994d675749be0979556e761274bb93 172.16.32.116:7001 master - 0 1505329899242 2 connected 5562-10922a0b91f48e933c1f1d427c54917ce970bd25d29f8 172.16.32.116:7000 myself,master - 0 0 8 connected 0-5561 10923-110213d27f60a1cc4d9c8f09aca928b03f0e083722d3b 172.16.32.116:8002 slave 88fe075375295b59eabe69fa1438ed7c7c314f43 0 1505329902249 6 connecteda96cad95dca2a8e1e0302bff4f835260d92e3d31 172.16.32.116:8001 slave 273107e5ac994d675749be0979556e761274bb93 0 1505329901246 5 connectedaeb684429d220c0fd1392574d193cc1ae7577782 172.16.32.116:8000 slave a0b91f48e933c1f1d427c54917ce970bd25d29f8 0 1505329900244 8 connected

17. 但是查看9000,确仍然能看到整个集群的信息。说明,在删除节点的过程,只是在原有集群中删除9000的信息。但是9000自身的信息并没有被删除,依然保留全部的信息,只是9000实例被关闭而已。

# ./redis-cli  -p 9000 cluster nodesa0b91f48e933c1f1d427c54917ce970bd25d29f8 172.16.32.116:7000 master - 0 1505329902003 8 connected 0-5561 10923-11021273107e5ac994d675749be0979556e761274bb93 172.16.32.116:7001 master - 0 1505329903006 2 connected 5562-109223d27f60a1cc4d9c8f09aca928b03f0e083722d3b 172.16.32.116:8002 slave 88fe075375295b59eabe69fa1438ed7c7c314f43 0 1505329906013 3 connecteda96cad95dca2a8e1e0302bff4f835260d92e3d31 172.16.32.116:8001 slave 273107e5ac994d675749be0979556e761274bb93 0 1505329908019 2 connectedaeb684429d220c0fd1392574d193cc1ae7577782 172.16.32.116:8000 slave a0b91f48e933c1f1d427c54917ce970bd25d29f8 0 1505329904008 8 connected364ae8322ab2627e25b05d45b702448c74afad10 172.16.32.116:9000 myself,slave a0b91f48e933c1f1d427c54917ce970bd25d29f8 0 0 7 connected88fe075375295b59eabe69fa1438ed7c7c314f43 172.16.32.116:7002 master - 0 1505329907016 3 connected 11022-16383

18. 而9000的全部信息,是记录在自身目录的nodes.conf中

# more nodes.confa0b91f48e933c1f1d427c54917ce970bd25d29f8 172.16.32.116:7000 master - 0 1505329544244 8 connected 0-5561 10923-11021273107e5ac994d675749be0979556e761274bb93 172.16.32.116:7001 master - 0 1505329542241 2 connected 5562-10922364ae8322ab2627e25b05d45b702448c74afad10 172.16.32.116:9000 myself,slave a0b91f48e933c1f1d427c54917ce970bd25d29f8 0 0 7 connected3d27f60a1cc4d9c8f09aca928b03f0e083722d3b 172.16.32.116:8002 slave 88fe075375295b59eabe69fa1438ed7c7c314f43 0 1505329541239 3 connectedaeb684429d220c0fd1392574d193cc1ae7577782 172.16.32.116:8000 slave a0b91f48e933c1f1d427c54917ce970bd25d29f8 0 1505329545246 8 connected88fe075375295b59eabe69fa1438ed7c7c314f43 172.16.32.116:7002 master - 0 1505329543242 3 connected 11022-16383a96cad95dca2a8e1e0302bff4f835260d92e3d31 172.16.32.116:8001 slave 273107e5ac994d675749be0979556e761274bb93 0 1505329546248 2 connectedvars currentEpoch 8 lastVoteEpoch 0

19. 到7001等其他节点中查看,nodes.conf已经没有9000的信息

# more nodes.conf273107e5ac994d675749be0979556e761274bb93 172.16.32.116:7001 myself,master - 0 0 2 connected 5562-10922a0b91f48e933c1f1d427c54917ce970bd25d29f8 172.16.32.116:7000 master - 0 1505329680312 8 connected 0-5561 10923-11021aeb684429d220c0fd1392574d193cc1ae7577782 172.16.32.116:8000 slave a0b91f48e933c1f1d427c54917ce970bd25d29f8 0 1505329684319 8 connected3d27f60a1cc4d9c8f09aca928b03f0e083722d3b 172.16.32.116:8002 slave 88fe075375295b59eabe69fa1438ed7c7c314f43 0 1505329686321 6 connecteda96cad95dca2a8e1e0302bff4f835260d92e3d31 172.16.32.116:8001 slave 273107e5ac994d675749be0979556e761274bb93 0 1505329685318 5 connected88fe075375295b59eabe69fa1438ed7c7c314f43 172.16.32.116:7002 master - 0 1505329683317 3 connected 11022-16383vars currentEpoch 8 lastVoteEpoch 0

~~~~~~~~~~~~~~~~~~~~~~~~~~~再次添加9000,以及9001~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

20. 将9000和9001再次加入集群,开始下面的折腾。

注意:需要将9000和9001下的redis.conf外的文件清除,重启。然后才能再次加入集群。不然会遇到错误:

[ERR] Node 172.16.32.116:9001 is not empty. Either the node already knows other nodes (check with CLUSTER NODES) or contains some key in database 0.

21. 加入集群

./redis-trib.rb add-node 172.16.32.116:9000 172.16.32.116:7002./redis-trib.rb add-node 172.16.32.116:9001 172.16.32.116:7002

22. 现在是两个空的Master节点

# ./redis-trib.rb check 172.16.32.116:9001>>> Performing Cluster Check (using node 172.16.32.116:9001)M: c4ba7a1f537ac66076791461d6af9012741fee74 172.16.32.116:9001   slots: (0 slots) master   0 additional replica(s)M: dbf78b73f2ab9e37cbf31abbc2beb3d5413d5516 172.16.32.116:9000   slots: (0 slots) master   0 additional replica(s) [OK] All nodes agree about slots configuration.>>> Check for open slots...>>> Check slots coverage...[OK] All 16384 slots covered.

23. 重分配100个slot到9000上,其实redis很聪明的,我连接的是9001,但是在分配的时候,它会问你,receiving node是谁,Source node 是谁。

# ./redis-trib.rb reshard 172.16.32.116:9001>>> Performing Cluster Check (using node 172.16.32.116:9001)......[OK] All nodes agree about slots configuration.>>> Check for open slots...>>> Check slots coverage...[OK] All 16384 slots covered.How many slots do you want to move (from 1 to 16384)? 100What is the receiving node ID? dbf78b73f2ab9e37cbf31abbc2beb3d5413d5516  <<<<<<<<

24. 查看分配情况,redis还是很聪明灵活的。

# redis-cli -p 7001 cluster nodes273107e5ac994d675749be0979556e761274bb93 172.16.32.116:7001 myself,master - 0 0 2 connected 5662-10922a0b91f48e933c1f1d427c54917ce970bd25d29f8 172.16.32.116:7000 master - 0 1505330856605 8 connected 0-5561 10923-11021aeb684429d220c0fd1392574d193cc1ae7577782 172.16.32.116:8000 slave a0b91f48e933c1f1d427c54917ce970bd25d29f8 0 1505330853598 8 connected3d27f60a1cc4d9c8f09aca928b03f0e083722d3b 172.16.32.116:8002 slave 88fe075375295b59eabe69fa1438ed7c7c314f43 0 1505330860611 6 connecteda96cad95dca2a8e1e0302bff4f835260d92e3d31 172.16.32.116:8001 slave 273107e5ac994d675749be0979556e761274bb93 0 1505330859608 5 connectedc4ba7a1f537ac66076791461d6af9012741fee74 172.16.32.116:9001 master - 0 1505330862615 9 connected                         <<<<<<<<<<<<<<<<<<<<<依然是空的88fe075375295b59eabe69fa1438ed7c7c314f43 172.16.32.116:7002 master - 0 1505330861612 3 connected 11022-16383dbf78b73f2ab9e37cbf31abbc2beb3d5413d5516 172.16.32.116:9000 master - 0 1505330858607 10 connected 5562-5661              <<<<<<<<<<<<<<<<<<<<<从7001要来100个slot

25. 将9001添加为9000的slave节点 redis-cli -p cluster nodes

# redis-cli -p 9001 cluster nodes dbf78b73f2ab9e37cbf31abbc2beb3d5413d5516273107e5ac994d675749be0979556e761274bb93 172.16.32.116:7001 myself,master - 0 0 2 connected 5662-10922a0b91f48e933c1f1d427c54917ce970bd25d29f8 172.16.32.116:7000 master - 0 1505331457798 8 connected 0-5561 10923-11021aeb684429d220c0fd1392574d193cc1ae7577782 172.16.32.116:8000 slave a0b91f48e933c1f1d427c54917ce970bd25d29f8 0 1505331454791 8 connected3d27f60a1cc4d9c8f09aca928b03f0e083722d3b 172.16.32.116:8002 slave 88fe075375295b59eabe69fa1438ed7c7c314f43 0 1505331456795 6 connecteda96cad95dca2a8e1e0302bff4f835260d92e3d31 172.16.32.116:8001 slave 273107e5ac994d675749be0979556e761274bb93 0 1505331458799 10 connectedc4ba7a1f537ac66076791461d6af9012741fee74 172.16.32.116:9001 slave dbf78b73f2ab9e37cbf31abbc2beb3d5413d5516 0 1505331459801 10 connected88fe075375295b59eabe69fa1438ed7c7c314f43 172.16.32.116:7002 master - 0 1505331455793 3 connected 11022-16383dbf78b73f2ab9e37cbf31abbc2beb3d5413d5516 172.16.32.116:9000 master - 0 1505331453788 10 connected 5562-5661


经过各种折腾,redis添加,删除,重分配slot等操作,都测试完了。

中间有很多命令输出部门,视乎有点重复。但是为了更好的阅读理解,观察每一步操作的变化。后面查阅也更容易一些。

毕竟,年纪大了,记性不好。好多自己写过的blog,回头翻阅的时候,发现某些步骤,不是很好理解了。虽然我的blog,都是基于自己测试的结果,但是依然会忘记。


0