如何获取SOLR文档的字数？

s2j5cfk0 于 2022-11-05 发布在 Solr

关注(0)|答案(1)|浏览(183)

我有一个pdf文件的二进制内容，我想将其上传到SOLR并为其内容编制索引：

ContentStreamUpdateRequest up = new ContentStreamUpdateRequest('/update/extract')
    up.setParam("literal.id", map.id)
    def tmpFile = null
    tmpFile = File.createTempFile(map.id, ".tmp")
    tmpFile.append(binary)
    up.addFile(tmpFile, ".pdf")
    // Do the SOLR stuff here
    def solr = getSolrServer()       
    up.setAction(AbstractUpdateRequest.ACTION.COMMIT, true, true)
    def response = solr.request(up)
    if (tmpFile) {
        tmpFile.delete()
    }
    return response

当我查询SOLR时，我可以检索SOLR文档。我如何才能得到文件的实际内容呢？基本上我需要找到我上传的文档的字数，所以我打算对返回的字符串执行size（）（如果可能的话）......
我对SOLR很陌生，所以我可能走错了路......任何帮助都非常感谢：）

solr

来源：https://stackoverflow.com/questions/30909795/how-to-get-word-count-of-solr-document