Customizing an analyzer filter in Elasticsearch using a Python library

Posted by vatpfxk5 on 2023-03-01 in ElasticSearch

I want to create an index for Persian-language text and set up a stemming analyzer for it. This is the English stemming setup I currently have for the description field:

PUT my_index
{
    "mappings": {
      "properties": {
        "description": {
          "type": "text",
          "analyzer": "english"
        }
      }
    }, 
    "settings": {
      "analysis":{
        "filter": {
          "english_stemmer": {
            "type":       "stemmer",
            "language":   "english"
          }
        }
      }
    }
}

Now I would like to know how to use the PersianStemmer Python library in an Elasticsearch analyzer.

mbzjlibv #1

You need to create a custom analyzer for this:

PUT my_index
{
  "settings": {
    "analysis": {
      "filter": {
        "persian_stemmer": {
          "type": "stemmer",
          "language": "persian"
        }
      },
      "analyzer": {
        "persian_analyzer": {
          "tokenizer": "standard",
          "filter": [
            "lowercase",
            "persian_stemmer"
          ]
        }
      }
    }
  },
  "mappings": {
    "properties": {
      "description": {
        "type": "text",
        "analyzer": "persian_analyzer"
      }
    }
  }
}
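
Elasticsearch analyzers run inside the cluster, so a Python library cannot be plugged in directly as a token filter. If the `stemmer` filter in your Elasticsearch version does not accept `persian`, or you specifically want the output of the Python library, one alternative is to stem the text in Python before indexing and apply the same stemming to query text. The following is only a minimal sketch of that idea: `stem_text`, `prepare_document`, and the `description_stemmed` field are hypothetical names, and `toy_stem` is a stand-in that strips a single plural suffix so the example runs on its own. It is not the PersianStemmer library's API; you would pass that library's stemming call in place of `toy_stem`.

```python
from typing import Callable, Dict


def stem_text(text: str, stem: Callable[[str], str]) -> str:
    """Stem each whitespace-separated token and rejoin with single spaces."""
    return " ".join(stem(token) for token in text.split())


def prepare_document(
    doc: Dict[str, str],
    stem: Callable[[str], str],
    field: str = "description",
) -> Dict[str, str]:
    """Return a copy of doc with an extra '<field>_stemmed' entry.

    The original field is left untouched so it can still be displayed;
    the stemmed copy is the one intended for searching.
    """
    prepared = dict(doc)
    prepared[f"{field}_stemmed"] = stem_text(doc[field], stem)
    return prepared


def toy_stem(token: str) -> str:
    """Placeholder stemmer (assumption): strips only the plural suffix 'ها'."""
    suffix = "ها"
    if token.endswith(suffix) and len(token) > len(suffix):
        return token[: -len(suffix)]
    return token


if __name__ == "__main__":
    doc = {"description": "کتابها خوب"}
    print(prepare_document(doc, toy_stem))
```

With this approach the stemmed field would typically be mapped with a simple analyzer such as `whitespace`, so that Elasticsearch does not stem the tokens a second time, and the query string has to go through the same `stem_text` call before it is sent.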
