千家信息网

Common Elasticsearch Operations: Indices

Published: 2024-11-17. Author: 千家信息网 editorial staff. Last updated: 2024-11-17.



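0 Notes

This article is based on ES 5.4 and 5.6 and draws on two sources: the book 《从Lucene到Elasticsearch全文检索实战》 and the official documentation:

https://www.elastic.co/guide/en/elasticsearch/reference/5.4/indices.html (the official documentation is excellent and well worth reading).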

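1 Creating an index

PUT my_index

Note1: An index name must not contain uppercase letters.

Note2: By default, ES 5.x creates an index with 5 primary shards and 1 replica.

Note3: The number of primary shards cannot be changed once the index has been created, but the number of replicas can be changed at any time with a settings request.

Settings can be supplied when the index is created:

PUT my_index
{
  "settings": {
    "number_of_shards": 3,
    "number_of_replicas": 1
  }
}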
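The notes above can be turned into a small request builder. The following is a minimal Python sketch and not part of the original article: the function name `create_index_request` is hypothetical, the defaults mirror the 5 shards / 1 replica mentioned in Note2, and the only name check is the lowercase rule from Note1 (Elasticsearch enforces further naming rules that are not modelled here). It only assembles the request; sending it with an HTTP client is left out.

```python
import json


def create_index_request(name, shards=5, replicas=1):
    """Build (method, path, body) for a create-index call.

    Hypothetical helper: it assembles the request but does not send it.
    """
    # Note1: index names must not contain uppercase letters.
    if name != name.lower():
        raise ValueError("index name must be lowercase: %r" % name)
    if shards < 1 or replicas < 0:
        raise ValueError("need shards >= 1 and replicas >= 0")
    body = {
        "settings": {
            "number_of_shards": shards,
            "number_of_replicas": replicas,
        }
    }
    return "PUT", "/" + name, json.dumps(body, indent=2)


if __name__ == "__main__":
    method, path, body = create_index_request("my_index", shards=3)
    print(method, path)
    print(body)
```

Rejecting an uppercase name on the client side simply surfaces the mistake before the request is sent; the server would reject it as well.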

2 Updating the number of replicas

PUT my_index/_settings
{
  "number_of_replicas": 2
}

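3 Read and write blocks

The following settings control read and write access to an index:

| Setting | Value | Description |
| --- | --- | --- |
| blocks.read_only | true | The index is read-only: reads are allowed, writes and updates are rejected |
| blocks.read | true | Read operations on the index are blocked |
| blocks.write | true | Write operations on the index are blocked |

For example, to block write operations:

PUT my_index/_settings
{
  "blocks.write": true
}

Any subsequent attempt to write data returns a 403 error.

To allow writes again:

PUT my_index/_settings
{
  "blocks.write": false
}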
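The block settings listed above all share the same request shape, so the body can be generated. This is a minimal Python sketch under stated assumptions: `block_settings` and `BLOCK_KEYS` are hypothetical names, and the keys use the short `blocks.*` form that appears in this article. The function only produces the JSON body for `PUT <index>/_settings`.

```python
import json

# The three block settings described in this section.
BLOCK_KEYS = ("read_only", "read", "write")


def block_settings(**flags):
    """Return the JSON body that sets one or more index blocks.

    Hypothetical helper, e.g. block_settings(write=True).
    """
    if not flags:
        raise ValueError("at least one block must be given")
    body = {}
    for key, value in flags.items():
        if key not in BLOCK_KEYS:
            raise ValueError("unknown block: %r" % key)
        if not isinstance(value, bool):
            raise TypeError("block values must be True or False")
        body["blocks." + key] = value
    return json.dumps(body)


if __name__ == "__main__":
    print(block_settings(write=True))   # block writes
    print(block_settings(write=False))  # allow writes again
```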

4 Viewing an index

GET my_index/_mapping

Response:

{
  "my_index": {
    "mappings": {
      "my_type": {
        "properties": {
          "title": {
            "type": "text",
            "fields": {
              "keyword": {
                "type": "keyword",
                "ignore_above": 256
              }
            }
          }
        }
      }
    }
  }
}

To view the mappings of several indices at once:

GET my_index,my_index2/_mapping

To view the settings of every index in the cluster:

GET _all/_settings

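5 Deleting an index

DELETE my_index

If the index does not exist, the request fails with an index-not-found exception (index_not_found_exception).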

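6 Opening and closing an index

A closed index uses almost no system resources, and it cannot be read from or written to until it is opened again.

POST my_index/_close

A closed index can be reopened with:

POST my_index/_open

Closing several indices:

POST my_index,my_index2/_close

With the ignore_unavailable parameter:

POST my_index,my_index2,my_index3/_close?ignore_unavailable=true

Here my_index3 does not exist. Without ignore_unavailable, the request fails with an index-not-found error.

Closing every index in the cluster:

POST _all/_close

Closing indices by wildcard, here all indices whose names start with test:

POST test*/_close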
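The close requests above differ only in the index list and the optional `ignore_unavailable` flag. A minimal Python sketch of that pattern follows; `close_request` is a hypothetical helper, not an Elasticsearch client call, and it only assembles the request line.

```python
def close_request(indices, ignore_unavailable=False):
    """Build (method, path) for closing one or more indices.

    `indices` is a list of index names or wildcard patterns such as "test*".
    Hypothetical helper: it does not contact a cluster.
    """
    if not indices:
        raise ValueError("at least one index name or pattern is required")
    path = "/" + ",".join(indices) + "/_close"
    if ignore_unavailable:
        # Skip names that do not exist instead of failing the whole request.
        path += "?ignore_unavailable=true"
    return "POST", path


if __name__ == "__main__":
    print(close_request(["my_index", "my_index2", "my_index3"], ignore_unavailable=True))
    print(close_request(["test*"]))
```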

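7 Copying an index

POST _reindex
{
  "source": {"index": "my_index"},
  "dest": {"index": "my_index3"}
}

Note1: _reindex does not copy the settings of the source index. Configure the destination index (number of shards, number of replicas, and so on) before running _reindex. If you do not, that is, if my_index3 does not exist yet, a new index is created with the default settings.

Note2: _reindex copies documents, so if my_index contains no documents, my_index3 is not created.

To restrict which documents are copied, add type and query to source:

POST _reindex
{
  "source": {
    "index": "my_index",
    "type": "my_type",
    "query": {
      "term": {"title": "elasticsearch"}
    }
  },
  "dest": {"index": "my_index3"}
}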
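The two `_reindex` bodies above differ only in the optional `type` and `query` entries under `source`. A minimal Python sketch that builds either form is given here; `reindex_body` is a hypothetical helper name, and the function only serialises the body rather than calling Elasticsearch.

```python
import json


def reindex_body(source_index, dest_index, doc_type=None, query=None):
    """Return the JSON body for POST _reindex.

    `doc_type` and `query` are optional filters on the source documents.
    Hypothetical helper: it builds the body only.
    """
    source = {"index": source_index}
    if doc_type is not None:
        source["type"] = doc_type
    if query is not None:
        source["query"] = query
    return json.dumps({"source": source, "dest": {"index": dest_index}}, indent=2)


if __name__ == "__main__":
    # Copy only documents of my_type whose title contains the term "elasticsearch".
    print(reindex_body(
        "my_index",
        "my_index3",
        doc_type="my_type",
        query={"term": {"title": "elasticsearch"}},
    ))
```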

8 Shrinking an index

See the official documentation, which covers this in detail: https://www.elastic.co/guide/en/elasticsearch/reference/5.4/indices-shrink-index.html

The shrink index API allows you to shrink an existing index into a new index with fewer primary shards. The requested number of primary shards in the target index must be a factor of the number of shards in the source index. For example an index with 8 primary shards can be shrunk into 4, 2 or 1 primary shards or an index with 15 primary shards can be shrunk into 5, 3 or 1. If the number of shards in the index is a prime number it can only be shrunk into a single primary shard. Before shrinking, a (primary or replica) copy of every shard in the index must be present on the same node.

Shrinking works as follows:

  • First, it creates a new target index with the same definition as the source index, but with a smaller number of primary shards.
  • Then it hard-links segments from the source index into the target index. (If the file system doesn't support hard-linking, then all segments are copied into the new index, which is a much more time consuming process.)
  • Finally, it recovers the target index as though it were a closed index which had just been re-opened.
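The factor rule quoted above is easy to check before issuing a shrink request. The following minimal Python sketch lists the allowed target shard counts for a given source index; `valid_shrink_targets` is a hypothetical helper written for this illustration, not an Elasticsearch API.

```python
def valid_shrink_targets(source_shards):
    """Return the primary-shard counts an index can be shrunk to, largest first.

    A target is valid when it is a factor of `source_shards` and smaller than it.
    """
    if source_shards < 1:
        raise ValueError("an index has at least one primary shard")
    return [n for n in range(source_shards - 1, 0, -1) if source_shards % n == 0]


if __name__ == "__main__":
    for shards in (8, 15, 7):
        print(shards, "->", valid_shrink_targets(shards))
    # 8 -> [4, 2, 1]
    # 15 -> [5, 3, 1]
    # 7 -> [1]
```

The output matches the examples in the quoted documentation: 8 shards can become 4, 2 or 1; 15 can become 5, 3 or 1; a prime count such as 7 can only become 1.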

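Preparing the source index for shrinking (this relocates a copy of every shard to the node named shrink_node_name and blocks writes to the index):

PUT /my_source_index/_settings
{
  "settings": {
    "index.routing.allocation.require._name": "shrink_node_name",
    "index.blocks.write": true
  }
}

Running the shrink:

POST my_source_index/_shrink/my_target_index

Additional settings and aliases can be supplied for the target index:

POST my_source_index/_shrink/my_target_index
{
  "settings": {
    "index.number_of_replicas": 1,
    "index.number_of_shards": 1,
    "index.codec": "best_compression"
  },
  "aliases": {
    "my_search_indices": {}
  }
}

If any of this is unclear, read the official documentation linked above carefully.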

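9 Index aliases

Adding an alias:

POST _aliases
{
  "actions": [
    {
      "add": {
        "index": "test1",
        "alias": "alias1"
      }
    }
  ]
}

Removing an alias:

POST _aliases
{
  "actions": [
    {
      "remove": {
        "index": "test1",
        "alias": "alias1"
      }
    }
  ]
}

Note1: An index can have several aliases (add each one in turn), and one alias can point to several indices (add the same alias to each index).

Note2: When an alias maps to exactly one index, indexing a document or getting a document by ID through the alias works. When an alias maps to several indices, those operations fail, because Elasticsearch cannot tell which index to write the document to or read it from.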
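Because `actions` is an array, several add and remove operations can be sent in one `_aliases` request. The following is a minimal Python sketch that builds such a body from (index, alias) pairs; `alias_actions` is a hypothetical helper, and the index name `test2` in the usage example is made up for illustration.

```python
import json


def alias_actions(add=(), remove=()):
    """Build the JSON body for POST _aliases.

    `add` and `remove` are iterables of (index, alias) tuples.
    Hypothetical helper: it serialises the body only.
    """
    actions = []
    for index, alias in add:
        actions.append({"add": {"index": index, "alias": alias}})
    for index, alias in remove:
        actions.append({"remove": {"index": index, "alias": alias}})
    if not actions:
        raise ValueError("at least one add or remove action is required")
    return json.dumps({"actions": actions}, indent=2)


if __name__ == "__main__":
    # Point alias1 at two indices in a single request (test2 is a made-up name).
    print(alias_actions(add=[("test1", "alias1"), ("test2", "alias1")]))
```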

Viewing the aliases of an index:

GET my_index3/_aliases

Response:

{
  "my_index3": {
    "aliases": {
      "alias_test": {},
      "alias_test2": {}
    }
  }
}

Viewing the indices behind an alias:

GET alias_test/_aliases

Response:

{
  "my_index3": {
    "aliases": {
      "alias_test": {},
      "alias_test2": {}
    }
  },
  "my_index2": {
    "aliases": {
      "alias_test": {}
    }
  },
  "my_index": {
    "aliases": {
      "alias_test": {}
    }
  }
}
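The response above is keyed by index, while the question behind Note2 (does this alias point to exactly one index?) is easier to answer when the data is keyed by alias. The following minimal Python sketch inverts a response of that shape; `indices_by_alias` and `is_unambiguous` are hypothetical helper names, and the sample data is the response shown above.

```python
def indices_by_alias(aliases_response):
    """Invert an index-keyed aliases response into {alias: sorted index names}."""
    result = {}
    for index_name, info in aliases_response.items():
        for alias in info.get("aliases", {}):
            result.setdefault(alias, []).append(index_name)
    return {alias: sorted(names) for alias, names in result.items()}


def is_unambiguous(alias, mapping):
    """True when the alias points at exactly one index."""
    return len(mapping.get(alias, [])) == 1


if __name__ == "__main__":
    response = {
        "my_index3": {"aliases": {"alias_test": {}, "alias_test2": {}}},
        "my_index2": {"aliases": {"alias_test": {}}},
        "my_index": {"aliases": {"alias_test": {}}},
    }
    mapping = indices_by_alias(response)
    for alias in sorted(mapping):
        print(alias, mapping[alias], is_unambiguous(alias, mapping))
```

With the sample data, alias_test maps to three indices and alias_test2 maps to one, so only alias_test2 is a one-to-one alias.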

Viewing all aliases in the cluster:

GET _all/_aliases

or

GET _aliases