2016-08-24 113 views
0

因此,我有一個定製的分析器,它添加了本體中的附加術語。另外,我想在索引之前對術語進行干擾。以下是取自elasticsearch頭插件分析在ElasticSearch中索引的術語

{ 
    "state": "open", 
    "settings": { 
     "index": { 
      "refresh_interval": "1000s", 
      "number_of_shards": "5", 
      "creation_date": "1471931611750", 
      "analysis": { 
       "filter": { 
        "owlfilter": { 
         "type": "owl", 
         "indexName": "ontoowl", 
         "expansionType": "RDFSLABEL", 
         "owlFile": "/home/tannys/elasticsearch-2.3.0/ontologyWorkTrial/myownowl.owl" 
        } 
       }, 
       "analyzer": { 
        "owlanalyzer": { 
         "filter": ["owlfilter","porter_stem"], 
         "type": "custom", 
         "tokenizer": "standard" 
        } 
       } 
      }, 
      "number_of_replicas": "1", 
      "uuid": "d8Ub8A0eSm65geMK_bpdvw", 
      "version": {"created": "2030099"} 
     } 
    }, 
    "mappings": { 
     "mytype": { 
      "properties": { 
       "nameortitle": { 
        "search_analyzer": "standard", 
        "analyzer": "owlanalyzer", 
        "store": true, 
        "type": "string" 
       }, 
       "description": { 
        "search_analyzer": "standard", 
        "analyzer": "owlanalyzer", 
        "store": true, 
        "type": "string" 
       } 
      }, 
      "aliases": [ ] 
     } 
    } 
} 

問題的諷刺意味的是,直到我曾用porter_stem過濾器,效果比較好。所以我不太確定,出了什麼問題。我想看看索引的條款。我怎樣才能看到分析器的表現如何,比如說luke對Lucene的作用? 任何指導。

回答

1

您可以使用Term Vectors API here。這將爲您提供文檔中字段的條款,或者您也可以使用multi-term API以同樣的方式查看多個文檔中的條款。