Update only specific field values in Elasticsearch

Is it possible to update the values of specific fields in Elasticsearch without overwriting the other fields?

Yes, Elasticsearch supports partial updates. That means that you can submit:

  • a partial document, which will be merged with the existing one
  • a script that will be executed on top of the existing document

Have a look at the Update API. In both cases, because of how the underlying Lucene library works, the document to be updated is retrieved, the changes are applied to it, and the old document is replaced by the new one. In the end it is a complete rewrite of the document, but you don't have to submit the whole document yourself. The exception is when you have disabled the _source field (it is enabled by default): that field is what lets Elasticsearch retrieve the whole document in order to apply changes to it, so partial updates do not work without it.
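To make the "partial document is merged with the existing one" step concrete, here is a minimal Python model of the merge. It is only a sketch of the documented semantics (inner objects are merged recursively, any other value, including arrays, is replaced), not Elasticsearch's actual code; the function name `merge_partial` and the sample documents are made up for illustration.

```python
import copy


def merge_partial(source, partial):
    """Return a new dict with `partial` merged into `source`.

    Models a partial-document update: nested objects are merged
    recursively, everything else (scalars, arrays) is replaced.
    """
    merged = copy.deepcopy(source)
    for key, value in partial.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            # Both sides are objects: merge field by field.
            merged[key] = merge_partial(merged[key], value)
        else:
            # Scalars and arrays simply overwrite the old value.
            merged[key] = copy.deepcopy(value)
    return merged


# Hypothetical stored _source and partial "doc" payload.
existing = {"name": "foo", "price": 5, "tags": ["a", "b"],
            "owner": {"id": 1, "city": "Pune"}}
partial = {"price": 10000, "tags": ["c"], "owner": {"city": "Delhi"}}

updated = merge_partial(existing, partial)
print(updated)
```

Fields that are absent from the partial document (`name`, `owner.id`) survive untouched, which is the behavior the question asks for.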

As a code-based contribution to this answer, the following request can be used:

POST /index/type/100100471/_update
{
  "doc": {
    "yourProperty": 10000
  }
}

This request updates only the yourProperty field.

The response looks like this:

{
  "_index": "index",
  "_type": "type",
  "_id": "100100471",
  "_version": 1,
  "_shards": {
    "total": 0,
    "successful": 1,
    "failed": 0
  }
}

If you only want to update the value of a field that already exists, try this (replace 1 in the _id list with your document ID):

POST IndexName/_update_by_query
{
  "script": {
    "source": """
      // Overwrite the value only when the field is already present.
      if (ctx._source?.Field != null) {
        ctx._source.put('Field', 'Value');
      }
    """,
    "lang": "painless"
  },
  "query": {
    "terms": {
      "_id": [
        "1"
      ]
    }
  }
}

If you want to add a new field with a value, try this (replace 1 in the _id list with your document ID):

POST IndexName/_update_by_query
{
  "script": {
    "source": """
      // Add the field only when it is not present yet.
      if (ctx._source?.NewField == null) {
        ctx._source.put('NewField', 'Value');
      }
    """,
    "lang": "painless"
  },
  "query": {
    "terms": {
      "_id": [
        "1"
      ]
    }
  }
}

In ES 7.3, where the mapping type is no longer part of the URL, the format is:

POST /myindex/_update/mydocid
{
  "doc": {
    "myfield": "new value of my field"
  }
}
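For anyone calling this from code rather than a console, the same typeless request can be sketched in Python with only the standard library. The helper name `build_partial_update` and the `http://localhost:9200` address are assumptions for illustration; the function only builds the request, so nothing is sent until you pass it to `urlopen`.

```python
import json
import urllib.parse
import urllib.request


def build_partial_update(base_url, index, doc_id, fields):
    """Build (but do not send) a POST /<index>/_update/<id> request."""
    # Percent-encode the ID so IDs with spaces or slashes stay in one path segment.
    quoted_id = urllib.parse.quote(str(doc_id), safe="")
    url = f"{base_url.rstrip('/')}/{index}/_update/{quoted_id}"
    # Only the fields under "doc" are changed; the rest of the document is kept.
    body = json.dumps({"doc": fields}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={"Content-Type": "application/json"},
    )


# Assumed local cluster address; index, ID and field come from the example above.
req = build_partial_update(
    "http://localhost:9200", "myindex", "mydocid",
    {"myfield": "new value of my field"},
)
print(req.get_method(), req.full_url)

# To actually send it against a running cluster:
# with urllib.request.urlopen(req) as resp:
#     print(json.load(resp))
```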

With curl, the same kind of request looks like this:

curl -XPOST --header 'Content-Type: application/json' https://search-testarticle-2cwv6oh7wtz3hxowgxv3rprnsa.ap-south-1.es.amazonaws.com/test/_update/4gI4knQB7d5sAgV24YPy -d '{
"doc":  { "name" : "Ankit Singh Raj" }
}'

You can also use a bulk update to partially update multiple documents:

POST /foo_v1/_bulk?pretty=true&error_trace=true
{"update":{"_index":"foo_v1","_type":"footype","_id":"397"}}
{"doc":{"Name":"foo","EmailId":"foo@test.com"}}
{"update":{"_index":"foo_v1","_type":"footype","_id":"398"}}
{"doc":{"Name":"foo1","EmailId":"foo1@test.com"}}

If you are not sure whether the document already exists, that is, whether the operation will be an update or an insert (an upsert), you can do this:

POST /foo_v1/_bulk?pretty=true&error_trace=true
{"update":{"_index":"foo_v1","_type":"footype","_id":"397"}}
{"doc":{"Name":"foo","EmailId":"foo@test.com"}, "doc_as_upsert" : true}

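Setting "doc_as_upsert" : true tells Elasticsearch to index the contents of doc as a new document when the target document does not exist yet.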
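The bulk body is newline-delimited JSON: one action line followed by one payload line per document, with a newline after the last line. Building it by hand is error-prone, so here is a small Python sketch. The helper name `build_bulk_update_body` is hypothetical, and it assumes the typeless form (no "_type" entry); add the type back if your cluster version still needs it.

```python
import json


def build_bulk_update_body(index, docs_by_id, upsert=False):
    """Build an NDJSON body of partial updates for the _bulk endpoint."""
    lines = []
    for doc_id, fields in docs_by_id.items():
        # Action line: which document to update.
        lines.append(json.dumps({"update": {"_index": index, "_id": doc_id}}))
        # Payload line: the partial document, optionally inserted if missing.
        payload = {"doc": fields}
        if upsert:
            payload["doc_as_upsert"] = True
        lines.append(json.dumps(payload))
    # Every line, including the last one, must end with a newline.
    return "".join(line + "\n" for line in lines)


body = build_bulk_update_body(
    "foo_v1",
    {
        "397": {"Name": "foo", "EmailId": "foo@test.com"},
        "398": {"Name": "foo1", "EmailId": "foo1@test.com"},
    },
    upsert=True,
)
print(body, end="")
```

Send the resulting string as the body of POST /foo_v1/_bulk with the Content-Type header set to application/x-ndjson.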

Update By Query API

Updates documents that match the specified query. If no query is specified, performs an update on every document in the data stream or index without modifying the source, which is useful for picking up mapping changes.

POST http://localhost:9200/INDEX_NAME/_update_by_query
{
  "script": {
    "source": "ctx._source.userName = 'new_user_name'",
    "lang": "painless"
  },
  "query": {
    "term": {
      "userName": "old_user_name"
    }
  }
}

In the above example, the userName field is set to new_user_name in every document whose userName is old_user_name.
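When the new value comes from user input, it is usually safer to pass it through the script's "params" object than to paste it into the script source, because that sidesteps quoting problems. Below is a Python sketch that builds such a body; the helper name `build_rename_body` and the parameter name `new_name` are made up for illustration. Note that a term query matches exact values, so this assumes userName is mapped as a keyword field.

```python
import json


def build_rename_body(old_value, new_value):
    """Build an _update_by_query body that rewrites userName via script params."""
    return {
        "script": {
            "lang": "painless",
            # The value is read from params instead of being embedded in the source.
            "source": "ctx._source.userName = params.new_name",
            "params": {"new_name": new_value},
        },
        # Only documents whose userName equals old_value are touched.
        "query": {"term": {"userName": old_value}},
    }


body = build_rename_body("old_user_name", "new_user_name")
print(json.dumps(body, indent=2))
```

The printed JSON can be sent as the body of POST /INDEX_NAME/_update_by_query.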

Make a POST call to {{host}}/{{index}}/_doc/{{id}}/_update

with the following body:

{
  "doc": {
    "field": "value"
  }
}

Output would be similar to

{
  "_index": "{{index}}",
  "_type": "_doc",
  "_id": "{{id}}",
  "_version": 3,
  "result": "updated",
  "_shards": {
    "total": 2,
    "successful": 1,
    "failed": 0
  },
  "_seq_no": 27,
  "_primary_term": 2
}