白话Elasticsearch11-深度探秘搜索技术之基于tie_breaker参数优化dis_max搜索效果
生活随笔
收集整理的這篇文章主要介紹了
白话Elasticsearch11-深度探秘搜索技术之基于tie_breaker参数优化dis_max搜索效果
小編覺得挺不錯的,現在分享給大家,幫大家做個參考.
文章目錄
- 概述
- 官方文檔
- 例子
- tie_breaker
概述
繼續跟中華石杉老師學習ES,第十一篇
課程地址: https://www.roncoo.com/view/55
官方文檔
https://www.elastic.co/guide/en/elasticsearch/guide/current/_tuning_best_fields_queries.html
https://www.elastic.co/guide/en/elasticsearch/reference/7.2/query-dsl-dis-max-query.html
例子
數據同 上篇博文 構造索引的DSL
這次我們使用dis_max查詢 java beginner , DSL如下
GET /forum/article/_search {"query": {"dis_max": {"queries": [{"match": {"title": "java beginner"}},{"match": {"content": "java beginner"}}]}} }返回
{"took": 2,"timed_out": false,"_shards": {"total": 1,"successful": 1,"skipped": 0,"failed": 0},"hits": {"total": 5,"max_score": 1.0341108,"hits": [{"_index": "forum","_type": "article","_id": "3","_score": 1.0341108,"_source": {"articleID": "JODL-X-1937-#pV7","userID": 2,"hidden": false,"postDate": "2017-01-01","tag": ["hadoop"],"tag_cnt": 1,"view_cnt": 100,"title": "this is elasticsearch blog","content": "i am only an elasticsearch beginner"}},{"_index": "forum","_type": "article","_id": "2","_score": 0.93952733,"_source": {"articleID": "KDKE-B-9947-#kL5","userID": 1,"hidden": false,"postDate": "2017-01-02","tag": ["java"],"tag_cnt": 1,"view_cnt": 50,"title": "this is java blog","content": "i think java is the best programming language"}},{"_index": "forum","_type": "article","_id": "4","_score": 0.79423964,"_source": {"articleID": "QQPX-R-3956-#aD8","userID": 2,"hidden": true,"postDate": "2017-01-02","tag": ["java","elasticsearch"],"tag_cnt": 2,"view_cnt": 80,"title": "this is java, elasticsearch, hadoop blog","content": "elasticsearch and hadoop are all very good solution, i am a beginner"}},{"_index": "forum","_type": "article","_id": "5","_score": 0.7116974,"_source": {"articleID": "DHJK-B-1395-#Ky5","userID": 3,"hidden": false,"postDate": "2019-05-01","tag": ["elasticsearch"],"tag_cnt": 1,"view_cnt": 10,"title": "this is spark blog","content": "spark is best big data solution based on scala ,an programming language similar to java"}},{"_index": "forum","_type": "article","_id": "1","_score": 0.4889865,"_source": {"articleID": "XHDK-A-1293-#fJ3","userID": 1,"hidden": false,"postDate": "2017-01-01","tag": ["java","hadoop"],"tag_cnt": 2,"view_cnt": 30,"title": "this is java and elasticsearch blog","content": "i like to write best elasticsearch article"}}]} }不知道為啥id=3的相關度是最高的… 如果有知道的,煩請不吝賜教。
dis_max只取某一個query最大的分數,完全不考慮其他query的分數
tie_breaker
使用tie_breaker將其他query的分數也考慮進去
tie_breaker參數的意義,在于說,將其他query的分數,乘以tie_breaker,然后綜合與最高分數的那個query的分數,綜合在一起進行計算,除了取最高分以外,還會考慮其他的query的分數。
tie_breaker的值,在0~1之間,是個小數。
GET /forum/article/_search {"query": {"dis_max": {"queries": [{"match": {"title": "java beginner"}},{"match": {"content": "java beginner"}}],"tie_breaker": 0.7}} }返回結果
{"took": 2,"timed_out": false,"_shards": {"total": 1,"successful": 1,"skipped": 0,"failed": 0},"hits": {"total": 5,"max_score": 1.344432,"hits": [{"_index": "forum","_type": "article","_id": "2","_score": 1.344432,"_source": {"articleID": "KDKE-B-9947-#kL5","userID": 1,"hidden": false,"postDate": "2017-01-02","tag": ["java"],"tag_cnt": 1,"view_cnt": 50,"title": "this is java blog","content": "i think java is the best programming language"}},{"_index": "forum","_type": "article","_id": "4","_score": 1.1365302,"_source": {"articleID": "QQPX-R-3956-#aD8","userID": 2,"hidden": true,"postDate": "2017-01-02","tag": ["java","elasticsearch"],"tag_cnt": 2,"view_cnt": 80,"title": "this is java, elasticsearch, hadoop blog","content": "elasticsearch and hadoop are all very good solution, i am a beginner"}},{"_index": "forum","_type": "article","_id": "3","_score": 1.0341108,"_source": {"articleID": "JODL-X-1937-#pV7","userID": 2,"hidden": false,"postDate": "2017-01-01","tag": ["hadoop"],"tag_cnt": 1,"view_cnt": 100,"title": "this is elasticsearch blog","content": "i am only an elasticsearch beginner"}},{"_index": "forum","_type": "article","_id": "5","_score": 0.7116974,"_source": {"articleID": "DHJK-B-1395-#Ky5","userID": 3,"hidden": false,"postDate": "2019-05-01","tag": ["elasticsearch"],"tag_cnt": 1,"view_cnt": 10,"title": "this is spark blog","content": "spark is best big data solution based on scala ,an programming language similar to java"}},{"_index": "forum","_type": "article","_id": "1","_score": 0.4889865,"_source": {"articleID": "XHDK-A-1293-#fJ3","userID": 1,"hidden": false,"postDate": "2017-01-01","tag": ["java","hadoop"],"tag_cnt": 2,"view_cnt": 30,"title": "this is java and elasticsearch blog","content": "i like to write best elasticsearch article"}}]} }總結
以上是生活随笔為你收集整理的白话Elasticsearch11-深度探秘搜索技术之基于tie_breaker参数优化dis_max搜索效果的全部內容,希望文章能夠幫你解決所遇到的問題。
- 上一篇: linux中exp命令详解_exp/im
- 下一篇: u检验、t检验、F检验、卡方检验详细分析