How to use a custom analyzer for the search term in Hibernate Search
@Analyzer(definition = "edgeNgram")
@Field(index = Index.YES,analyze = Analyze.YES,store = Store.YES)
@Lob
String value;
This is the analyzer defined on my class:
@AnalyzerDef(name = "edgeNgram",tokenizer = @TokenizerDef(factory = WhitespaceTokenizerFactory.class),filters = {
@TokenFilterDef(factory = ASCIIFoldingFilterFactory.class),// Replace accented characters by their simpler counterpart (è => e,etc.)
@TokenFilterDef(factory = LowerCaseFilterFactory.class),// Lowercase all characters
@TokenFilterDef(
factory = EdgeNGramFilterFactory.class,// Generate prefix tokens
params = {
@org.hibernate.search.annotations.Parameter(name = "minGramSize",value = "4"),@org.hibernate.search.annotations.Parameter(name = "maxGramSize",value = "10")
}
)
})
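As a rough illustration of what the edgeNgram definition above puts into the index, the following plain-Java sketch mimics the chain by hand: split on whitespace, strip accents, lowercase, then emit prefixes of 4 to 10 characters. It does not use Lucene. The class EdgeNgramSketch and its methods are hypothetical names, java.text.Normalizer is only an approximation of ASCIIFoldingFilterFactory, and dropping words shorter than minGramSize is an assumption of this sketch.

```java
import java.text.Normalizer;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical helper, not part of Hibernate Search or Lucene.
final class EdgeNgramSketch {

    // Approximates accent folding: decompose characters, then drop combining marks.
    static String fold(String s) {
        return Normalizer.normalize(s, Normalizer.Form.NFD).replaceAll("\\p{M}+", "");
    }

    // Whitespace split -> fold -> lowercase -> prefixes of minGram..maxGram characters.
    static List<String> analyze(String text, int minGram, int maxGram) {
        List<String> tokens = new ArrayList<>();
        for (String word : text.trim().split("\\s+")) {
            if (word.isEmpty()) {
                continue;
            }
            String normalized = fold(word).toLowerCase(Locale.ROOT);
            int longest = Math.min(maxGram, normalized.length());
            // Assumption: words shorter than minGram produce no token at all.
            for (int len = minGram; len <= longest; len++) {
                tokens.add(normalized.substring(0, len));
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        // "Bou\u00e9es" is "Bouées"; the escape keeps the source file ASCII-only.
        System.out.println(analyze("Bou\u00e9es de sauvetage", 4, 10));
        // prints [boue, bouee, bouees, sauv, sauve, sauvet, sauveta, sauvetag, sauvetage]
    }
}
```

The point to notice is that every indexed token is accent-free and lowercase, so a query term has to go through the same folding before it can match.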
query = queryBuilder
.simpleQueryString()
.boostedTo(3f) // This whole query is boosted so exact matches will obtain a better score
.onFields("title.value","keyword.values.value")
.boostedTo(2f)
.andField("description.values.value")
//.withAndAsDefaultOperator()
.matching(Arrays.stream(searchTerm.split(" ")).map(e -> e + "*").collect(Collectors.joining(" ")).toLowerCase())
.createQuery();
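I don't know how to set an analyzer for the search term searchTerm, and I couldn't find it in the Hibernate Search documentation. As a stopgap, I started splitting the term manually and lowercasing it in Java, but that doesn't seem right.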
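To make the limitation of that manual preprocessing concrete, here is a small sketch that extracts the expression passed to matching(...) in the query above into a method and prints its result. The class name ManualQuerySketch and the sample input are assumptions made for illustration only.

```java
import java.util.Arrays;
import java.util.stream.Collectors;

// Hypothetical wrapper around the expression used in the query above.
final class ManualQuerySketch {

    // Appends "*" to each space-separated word, then lowercases the whole string.
    static String toPrefixQuery(String searchTerm) {
        return Arrays.stream(searchTerm.split(" "))
                .map(e -> e + "*")
                .collect(Collectors.joining(" "))
                .toLowerCase();
    }

    public static void main(String[] args) {
        // "Bou\u00e9es" is "Bouées"; the escape keeps the source file ASCII-only.
        System.out.println(toPrefixQuery("Bou\u00e9es Sauvetage"));
    }
}
```

The result is lowercased and each word gets a trailing wildcard, but the accented character is left in place: the manual code reproduces the lowercase step of the analyzer and not the accent-folding step.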
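@AnalyzerDef(name = "edgeNGram_query",tokenizer = @TokenizerDef(factory = WhitespaceTokenizerFactory.class),filters = {
@TokenFilterDef(factory = ASCIIFoldingFilterFactory.class),// Replace accented characters by their simpler counterpart (è => e,etc.)
@TokenFilterDef(factory = LowerCaseFilterFactory.class) // Lowercase all characters
})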
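Do you know how to set a custom analyzer for the query terms, and why it is not applied by default? If I search for "bouees" it works, but if I search for "bouées" it does not.
Thanks!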
Solution:
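My problem was that I was running a keyword query when I should have been running a simpleQueryString query. simpleQueryString does not seem to run the analyzer on the search term! After that, I only had to follow @yrodiere's suggestion, .overridesForField( "description.values.value","edgeNGram_query" ), to get the correct analyzer applied to the search term.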
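The effect of a search term that is not analyzed can be sketched in plain Java, without Hibernate Search. The example assumes the index holds accent-folded, lowercase prefixes such as boue, bouee and bouees (which is what the edgeNgram definition is meant to produce) and compares a raw term against a folded one. AccentMismatchSketch and its methods are hypothetical names, and java.text.Normalizer only approximates ASCIIFoldingFilterFactory.

```java
import java.text.Normalizer;
import java.util.Locale;
import java.util.Set;

// Hypothetical illustration; a real index lookup is far more involved.
final class AccentMismatchSketch {

    // Rough stand-in for the edgeNGram_query analyzer: fold accents, then lowercase.
    static String foldAndLowercase(String term) {
        String decomposed = Normalizer.normalize(term, Normalizer.Form.NFD);
        return decomposed.replaceAll("\\p{M}+", "").toLowerCase(Locale.ROOT);
    }

    // A term matches only if the exact same string was indexed.
    static boolean matches(Set<String> indexedTokens, String term) {
        return indexedTokens.contains(term);
    }

    public static void main(String[] args) {
        Set<String> indexedTokens = Set.of("boue", "bouee", "bouees");
        String accented = "bou\u00e9es"; // "bouées"

        System.out.println(matches(indexedTokens, "bouees"));                   // true
        System.out.println(matches(indexedTokens, accented));                   // false
        System.out.println(matches(indexedTokens, foldAndLowercase(accented))); // true
    }
}
```

This mirrors the symptom in the question: "bouees" is found, "bouées" is not, and the accented term only matches once it goes through the same folding as the indexed text.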
Answer
In Hibernate Search 5, you must call overridesForField when creating the query builder, to override the analyzer for each field:
QueryBuilder queryBuilder = fullTextEntityManager.getSearchFactory().buildQueryBuilder().forEntity(Hospital.class)
.overridesForField( "title.value","edgeNGram_query" )
.overridesForField( "keyword.values.value","edgeNGram_query" )
.overridesForField( "description.values.value","edgeNGram_query" )
.get();
// Then it's business as usual
Query query = queryBuilder
.simpleQueryString()
.boostedTo(3f) // This whole query is boosted so exact matches will obtain a better score
.onFields("title.value","keyword.values.value")
.boostedTo(2f)
.andField("description.values.value")
//.withAndAsDefaultOperator()
.matching(searchTerm)
.createQuery();
See also the end of this answer, which may be where you got your code from in the first place? :)
If you ever upgrade to Hibernate Search 6 (in Beta, with a different API), you will find this much simpler: you can override the analyzer when building the predicate. For example:
List<MyEntity> hits = searchSession.search( MyEntity.class )
.where( f -> f.simpleQueryString()
.fields( "title.value","keyword.values.value" ).boost( 3f )
.fields( "description.values.value" )
.matching( searchTerm )
//.defaultOperator( BooleanOperator.AND )
.analyzer( "edgeNGram_query" ) ) // <= HERE
.fetchHits( 20 );