Document (#42821)

Author
Xiong, C.
Title
Knowledge based text representations for information retrieval
Imprint
Pittsburgh, PA : Carnegie Mellon University, School of Computer Science, Language Technologies Institute
Year
2016
Pages
iii, 82 S
Abstract
The successes of information retrieval (IR) in recent decades were built upon bag-of-words representations. Effective as it is, bag-of-words is only a shallow text understanding; there is a limited amount of information for document ranking in the word space. This dissertation goes beyond words and builds knowledge based text representations, which embed the external and carefully curated information from knowledge bases, and provide richer and structured evidence for more advanced information retrieval systems. This thesis research first builds query representations with entities associated with the query. Entities' descriptions are used by query expansion techniques that enrich the query with explanation terms. Then we present a general framework that represents a query with entities that appear in the query, are retrieved by the query, or frequently show up in the top retrieved documents. A latent space model is developed to jointly learn the connections from query to entities and the ranking of documents, modeling the external evidence from knowledge bases and internal ranking features cooperatively. To further improve the quality of relevant entities, a defining factor of our query representations, we introduce learning to rank to entity search and retrieve better entities from knowledge bases. In the document representation part, this thesis research also moves one step forward with a bag-of-entities model, in which documents are represented by their automatic entity annotations, and the ranking is performed in the entity space.
This proposal includes plans to improve the quality of relevant entities with a co-learning framework that learns from both entity labels and document labels. We also plan to develop a hybrid ranking system that combines word based and entity based representations together with their uncertainties considered. At last, we plan to enrich the text representations with connections between entities. We propose several ways to infer entity graph representations for texts, and to rank documents using their structure representations. This dissertation overcomes the limitation of word based representations with external and carefully curated information from knowledge bases. We believe this thesis research is a solid start towards the new generation of intelligent, semantic, and structured information retrieval.
Content
Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Language and Information Technologies. Vgl.: https%3A%2F%2Fwww.cs.cmu.edu%2F~cx%2Fpapers%2Fknowledge_based_text_representation.pdf&usg=AOvVaw0SaTSvhWLTh__Uz_HtOtl3.
Theme
Wissensrepräsentation

Similar documents (author)

  1. Xiong, L.J.: On the compiling of cataloguing rules for Chinese document(s) (1997) 6.19
    6.190705 = sum of:
      6.190705 = weight(author_txt:xiong in 3199) [ClassicSimilarity], result of:
        6.190705 = fieldWeight in 3199, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.625 = fieldNorm(doc=3199)
    
  2. Xiong, S.; Ji, D.: Query-focused multi-document summarization using hypergraph-based ranking (2016) 4.95
    4.952564 = sum of:
      4.952564 = weight(author_txt:xiong in 2972) [ClassicSimilarity], result of:
        4.952564 = fieldWeight in 2972, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.5 = fieldNorm(doc=2972)
    
  3. Zeng, Q.; Yu, M.; Yu, W.; Xiong, J.; Shi, Y.; Jiang, M.: Faceted hierarchy : a new graph type to organize scientific concepts and a construction method (2019) 2.48
    2.476282 = sum of:
      2.476282 = weight(author_txt:xiong in 400) [ClassicSimilarity], result of:
        2.476282 = fieldWeight in 400, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.25 = fieldNorm(doc=400)
    
  4. Luo, L.; Ju, J.; Li, Y.-F.; Haffari, G.; Xiong, B.; Pan, S.: ChatRule: mining logical rules with large language models for knowledge graph reasoning (2023) 2.48
    2.476282 = sum of:
      2.476282 = weight(author_txt:xiong in 1171) [ClassicSimilarity], result of:
        2.476282 = fieldWeight in 1171, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.25 = fieldNorm(doc=1171)
    

Similar documents (content)

  1. Han, B.; Chen, L.; Tian, X.: Knowledge based collection selection for distributed information retrieval (2018) 0.43
    0.4320767 = sum of:
      0.4320767 = product of:
        0.9819925 = sum of:
          0.0071793813 = weight(abstract_txt:this in 3289) [ClassicSimilarity], result of:
            0.0071793813 = score(doc=3289,freq=1.0), product of:
              0.047604337 = queryWeight, product of:
                1.02221 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.019299494 = queryNorm
              0.1508136 = fieldWeight in 3289, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0625 = fieldNorm(doc=3289)
          0.06844589 = weight(abstract_txt:enrich in 3289) [ClassicSimilarity], result of:
            0.06844589 = score(doc=3289,freq=1.0), product of:
              0.14840426 = queryWeight, product of:
                1.0420281 = boost
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.019299494 = queryNorm
              0.46121246 = fieldWeight in 3289, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.0625 = fieldNorm(doc=3289)
          0.023895182 = weight(abstract_txt:based in 3289) [ClassicSimilarity], result of:
            0.023895182 = score(doc=3289,freq=3.0), product of:
              0.06924067 = queryWeight, product of:
                1.1253998 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.019299494 = queryNorm
              0.3451033 = fieldWeight in 3289, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=3289)
          0.0678781 = weight(abstract_txt:words in 3289) [ClassicSimilarity], result of:
            0.0678781 = score(doc=3289,freq=3.0), product of:
              0.11713622 = queryWeight, product of:
                1.1338288 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.019299494 = queryNorm
              0.57948 = fieldWeight in 3289, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.0625 = fieldNorm(doc=3289)
          0.033723515 = weight(abstract_txt:documents in 3289) [ClassicSimilarity], result of:
            0.033723515 = score(doc=3289,freq=2.0), product of:
              0.09257704 = queryWeight, product of:
                1.1639193 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.019299494 = queryNorm
              0.36427513 = fieldWeight in 3289, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=3289)
          0.015257359 = weight(abstract_txt:from in 3289) [ClassicSimilarity], result of:
            0.015257359 = score(doc=3289,freq=2.0), product of:
              0.062454645 = queryWeight, product of:
                1.1708446 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.019299494 = queryNorm
              0.24429502 = fieldWeight in 3289, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=3289)
          0.032406244 = weight(abstract_txt:knowledge in 3289) [ClassicSimilarity], result of:
            0.032406244 = score(doc=3289,freq=2.0), product of:
              0.103196345 = queryWeight, product of:
                1.5050434 = boost
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.019299494 = queryNorm
              0.31402513 = fieldWeight in 3289, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.0625 = fieldNorm(doc=3289)
          0.17866626 = weight(abstract_txt:entity in 3289) [ClassicSimilarity], result of:
            0.17866626 = score(doc=3289,freq=2.0), product of:
              0.32206255 = queryWeight, product of:
                2.658808 = boost
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.019299494 = queryNorm
              0.5547564 = fieldWeight in 3289, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.0625 = fieldNorm(doc=3289)
          0.21785067 = weight(abstract_txt:query in 3289) [ClassicSimilarity], result of:
            0.21785067 = score(doc=3289,freq=7.0), product of:
              0.27713552 = queryWeight, product of:
                3.020707 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.019299494 = queryNorm
              0.78607994 = fieldWeight in 3289, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.0625 = fieldNorm(doc=3289)
          0.1521353 = weight(abstract_txt:entities in 3289) [ClassicSimilarity], result of:
            0.1521353 = score(doc=3289,freq=1.0), product of:
              0.4172909 = queryWeight, product of:
                3.706653 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.019299494 = queryNorm
              0.36457852 = fieldWeight in 3289, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.0625 = fieldNorm(doc=3289)
          0.18455458 = weight(abstract_txt:representations in 3289) [ClassicSimilarity], result of:
            0.18455458 = score(doc=3289,freq=1.0), product of:
              0.49161068 = queryWeight, product of:
                4.24084 = boost
                6.006528 = idf(docFreq=295, maxDocs=44218)
                0.019299494 = queryNorm
              0.375408 = fieldWeight in 3289, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.006528 = idf(docFreq=295, maxDocs=44218)
                0.0625 = fieldNorm(doc=3289)
        0.44 = coord(11/25)
    
  2. Vechtomova, O.; Robertson, S.E.: ¬A domain-independent approach to finding related entities (2012) 0.33
    0.3334483 = sum of:
      0.3334483 = product of:
        1.1908867 = sum of:
          0.008974226 = weight(abstract_txt:this in 2733) [ClassicSimilarity], result of:
            0.008974226 = score(doc=2733,freq=1.0), product of:
              0.047604337 = queryWeight, product of:
                1.02221 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.019299494 = queryNorm
              0.18851699 = fieldWeight in 2733, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.078125 = fieldNorm(doc=2733)
          0.029807657 = weight(abstract_txt:documents in 2733) [ClassicSimilarity], result of:
            0.029807657 = score(doc=2733,freq=1.0), product of:
              0.09257704 = queryWeight, product of:
                1.1639193 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.019299494 = queryNorm
              0.32197678 = fieldWeight in 2733, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.078125 = fieldNorm(doc=2733)
          0.023357965 = weight(abstract_txt:from in 2733) [ClassicSimilarity], result of:
            0.023357965 = score(doc=2733,freq=3.0), product of:
              0.062454645 = queryWeight, product of:
                1.1708446 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.019299494 = queryNorm
              0.37399885 = fieldWeight in 2733, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.078125 = fieldNorm(doc=2733)
          0.014965387 = weight(abstract_txt:with in 2733) [ClassicSimilarity], result of:
            0.014965387 = score(doc=2733,freq=1.0), product of:
              0.07663096 = queryWeight, product of:
                1.5884173 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.019299494 = queryNorm
              0.19529167 = fieldWeight in 2733, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.078125 = fieldNorm(doc=2733)
          0.41781747 = weight(abstract_txt:entity in 2733) [ClassicSimilarity], result of:
            0.41781747 = score(doc=2733,freq=7.0), product of:
              0.32206255 = queryWeight, product of:
                2.658808 = boost
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.019299494 = queryNorm
              1.2973177 = fieldWeight in 2733, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.078125 = fieldNorm(doc=2733)
          0.23014678 = weight(abstract_txt:query in 2733) [ClassicSimilarity], result of:
            0.23014678 = score(doc=2733,freq=5.0), product of:
              0.27713552 = queryWeight, product of:
                3.020707 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.019299494 = queryNorm
              0.8304485 = fieldWeight in 2733, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.078125 = fieldNorm(doc=2733)
          0.4658173 = weight(abstract_txt:entities in 2733) [ClassicSimilarity], result of:
            0.4658173 = score(doc=2733,freq=6.0), product of:
              0.4172909 = queryWeight, product of:
                3.706653 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.019299494 = queryNorm
              1.1162891 = fieldWeight in 2733, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.078125 = fieldNorm(doc=2733)
        0.28 = coord(7/25)
    
  3. Soulier, L.; Jabeur, L.B.; Tamine, L.; Bahsoun, W.: On ranking relevant entities in heterogeneous networks using a language-based model (2013) 0.31
    0.3086817 = sum of:
      0.3086817 = product of:
        0.7717042 = sum of:
          0.012435053 = weight(abstract_txt:this in 664) [ClassicSimilarity], result of:
            0.012435053 = score(doc=664,freq=3.0), product of:
              0.047604337 = queryWeight, product of:
                1.02221 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.019299494 = queryNorm
              0.2612168 = fieldWeight in 664, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0625 = fieldNorm(doc=664)
          0.023895182 = weight(abstract_txt:based in 664) [ClassicSimilarity], result of:
            0.023895182 = score(doc=664,freq=3.0), product of:
              0.06924067 = queryWeight, product of:
                1.1253998 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.019299494 = queryNorm
              0.3451033 = fieldWeight in 664, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=664)
          0.023846125 = weight(abstract_txt:documents in 664) [ClassicSimilarity], result of:
            0.023846125 = score(doc=664,freq=1.0), product of:
              0.09257704 = queryWeight, product of:
                1.1639193 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.019299494 = queryNorm
              0.2575814 = fieldWeight in 664, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=664)
          0.010788581 = weight(abstract_txt:from in 664) [ClassicSimilarity], result of:
            0.010788581 = score(doc=664,freq=1.0), product of:
              0.062454645 = queryWeight, product of:
                1.1708446 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.019299494 = queryNorm
              0.17274266 = fieldWeight in 664, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=664)
          0.014651042 = weight(abstract_txt:information in 664) [ClassicSimilarity], result of:
            0.014651042 = score(doc=664,freq=3.0), product of:
              0.05590398 = queryWeight, product of:
                1.196497 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.019299494 = queryNorm
              0.26207513 = fieldWeight in 664, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=664)
          0.01197231 = weight(abstract_txt:with in 664) [ClassicSimilarity], result of:
            0.01197231 = score(doc=664,freq=1.0), product of:
              0.07663096 = queryWeight, product of:
                1.5884173 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.019299494 = queryNorm
              0.15623334 = fieldWeight in 664, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=664)
          0.07473297 = weight(abstract_txt:ranking in 664) [ClassicSimilarity], result of:
            0.07473297 = score(doc=664,freq=1.0), product of:
              0.21356803 = queryWeight, product of:
                1.9764888 = boost
                5.598813 = idf(docFreq=444, maxDocs=44218)
                0.019299494 = queryNorm
              0.34992582 = fieldWeight in 664, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.598813 = idf(docFreq=444, maxDocs=44218)
                0.0625 = fieldNorm(doc=664)
          0.17866626 = weight(abstract_txt:entity in 664) [ClassicSimilarity], result of:
            0.17866626 = score(doc=664,freq=2.0), product of:
              0.32206255 = queryWeight, product of:
                2.658808 = boost
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.019299494 = queryNorm
              0.5547564 = fieldWeight in 664, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.0625 = fieldNorm(doc=664)
          0.116446085 = weight(abstract_txt:query in 664) [ClassicSimilarity], result of:
            0.116446085 = score(doc=664,freq=2.0), product of:
              0.27713552 = queryWeight, product of:
                3.020707 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.019299494 = queryNorm
              0.4201774 = fieldWeight in 664, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.0625 = fieldNorm(doc=664)
          0.3042706 = weight(abstract_txt:entities in 664) [ClassicSimilarity], result of:
            0.3042706 = score(doc=664,freq=4.0), product of:
              0.4172909 = queryWeight, product of:
                3.706653 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.019299494 = queryNorm
              0.72915703 = fieldWeight in 664, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.0625 = fieldNorm(doc=664)
        0.4 = coord(10/25)
    
  4. Zhao, G.; Wu, J.; Wang, D.; Li, T.: Entity disambiguation to Wikipedia using collective ranking (2016) 0.30
    0.2952994 = sum of:
      0.2952994 = product of:
        0.8202761 = sum of:
          0.012691473 = weight(abstract_txt:this in 3266) [ClassicSimilarity], result of:
            0.012691473 = score(doc=3266,freq=2.0), product of:
              0.047604337 = queryWeight, product of:
                1.02221 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.019299494 = queryNorm
              0.2666033 = fieldWeight in 3266, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.078125 = fieldNorm(doc=3266)
          0.017244862 = weight(abstract_txt:based in 3266) [ClassicSimilarity], result of:
            0.017244862 = score(doc=3266,freq=1.0), product of:
              0.06924067 = queryWeight, product of:
                1.1253998 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.019299494 = queryNorm
              0.24905685 = fieldWeight in 3266, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.078125 = fieldNorm(doc=3266)
          0.048772547 = weight(abstract_txt:text in 3266) [ClassicSimilarity], result of:
            0.048772547 = score(doc=3266,freq=3.0), product of:
              0.089130834 = queryWeight, product of:
                1.1420503 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.019299494 = queryNorm
              0.54720175 = fieldWeight in 3266, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=3266)
          0.010573479 = weight(abstract_txt:information in 3266) [ClassicSimilarity], result of:
            0.010573479 = score(doc=3266,freq=1.0), product of:
              0.05590398 = queryWeight, product of:
                1.196497 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.019299494 = queryNorm
              0.18913643 = fieldWeight in 3266, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.078125 = fieldNorm(doc=3266)
          0.028643344 = weight(abstract_txt:knowledge in 3266) [ClassicSimilarity], result of:
            0.028643344 = score(doc=3266,freq=1.0), product of:
              0.103196345 = queryWeight, product of:
                1.5050434 = boost
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.019299494 = queryNorm
              0.2775616 = fieldWeight in 3266, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.078125 = fieldNorm(doc=3266)
          0.09341621 = weight(abstract_txt:ranking in 3266) [ClassicSimilarity], result of:
            0.09341621 = score(doc=3266,freq=1.0), product of:
              0.21356803 = queryWeight, product of:
                1.9764888 = boost
                5.598813 = idf(docFreq=444, maxDocs=44218)
                0.019299494 = queryNorm
              0.43740726 = fieldWeight in 3266, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.598813 = idf(docFreq=444, maxDocs=44218)
                0.078125 = fieldNorm(doc=3266)
          0.31584033 = weight(abstract_txt:entity in 3266) [ClassicSimilarity], result of:
            0.31584033 = score(doc=3266,freq=4.0), product of:
              0.32206255 = queryWeight, product of:
                2.658808 = boost
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.019299494 = queryNorm
              0.98068005 = fieldWeight in 3266, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.078125 = fieldNorm(doc=3266)
          0.10292477 = weight(abstract_txt:query in 3266) [ClassicSimilarity], result of:
            0.10292477 = score(doc=3266,freq=1.0), product of:
              0.27713552 = queryWeight, product of:
                3.020707 = boost
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.019299494 = queryNorm
              0.37138787 = fieldWeight in 3266, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7537646 = idf(docFreq=1035, maxDocs=44218)
                0.078125 = fieldNorm(doc=3266)
          0.19016911 = weight(abstract_txt:entities in 3266) [ClassicSimilarity], result of:
            0.19016911 = score(doc=3266,freq=1.0), product of:
              0.4172909 = queryWeight, product of:
                3.706653 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.019299494 = queryNorm
              0.45572314 = fieldWeight in 3266, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.078125 = fieldNorm(doc=3266)
        0.36 = coord(9/25)
    
  5. Aker, A.; Gaizauskas, R.: Generating descriptive multi-document summaries of geo-located entities using entity type models (2015) 0.29
    0.28943816 = sum of:
      0.28943816 = product of:
        0.9044943 = sum of:
          0.0071793813 = weight(abstract_txt:this in 1726) [ClassicSimilarity], result of:
            0.0071793813 = score(doc=1726,freq=1.0), product of:
              0.047604337 = queryWeight, product of:
                1.02221 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.019299494 = queryNorm
              0.1508136 = fieldWeight in 1726, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0625 = fieldNorm(doc=1726)
          0.05542223 = weight(abstract_txt:words in 1726) [ClassicSimilarity], result of:
            0.05542223 = score(doc=1726,freq=2.0), product of:
              0.11713622 = queryWeight, product of:
                1.1338288 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.019299494 = queryNorm
              0.47314343 = fieldWeight in 1726, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.0625 = fieldNorm(doc=1726)
          0.031858098 = weight(abstract_txt:text in 1726) [ClassicSimilarity], result of:
            0.031858098 = score(doc=1726,freq=2.0), product of:
              0.089130834 = queryWeight, product of:
                1.1420503 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.019299494 = queryNorm
              0.3574307 = fieldWeight in 1726, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=1726)
          0.010788581 = weight(abstract_txt:from in 1726) [ClassicSimilarity], result of:
            0.010788581 = score(doc=1726,freq=1.0), product of:
              0.062454645 = queryWeight, product of:
                1.1708446 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.019299494 = queryNorm
              0.17274266 = fieldWeight in 1726, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=1726)
          0.016931403 = weight(abstract_txt:with in 1726) [ClassicSimilarity], result of:
            0.016931403 = score(doc=1726,freq=2.0), product of:
              0.07663096 = queryWeight, product of:
                1.5884173 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.019299494 = queryNorm
              0.22094731 = fieldWeight in 1726, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=1726)
          0.33425397 = weight(abstract_txt:entity in 1726) [ClassicSimilarity], result of:
            0.33425397 = score(doc=1726,freq=7.0), product of:
              0.32206255 = queryWeight, product of:
                2.658808 = boost
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.019299494 = queryNorm
              1.0378542 = fieldWeight in 1726, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.0625 = fieldNorm(doc=1726)
          0.26350605 = weight(abstract_txt:entities in 1726) [ClassicSimilarity], result of:
            0.26350605 = score(doc=1726,freq=3.0), product of:
              0.4172909 = queryWeight, product of:
                3.706653 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.019299494 = queryNorm
              0.6314685 = fieldWeight in 1726, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.0625 = fieldNorm(doc=1726)
          0.18455458 = weight(abstract_txt:representations in 1726) [ClassicSimilarity], result of:
            0.18455458 = score(doc=1726,freq=1.0), product of:
              0.49161068 = queryWeight, product of:
                4.24084 = boost
                6.006528 = idf(docFreq=295, maxDocs=44218)
                0.019299494 = queryNorm
              0.375408 = fieldWeight in 1726, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.006528 = idf(docFreq=295, maxDocs=44218)
                0.0625 = fieldNorm(doc=1726)
        0.32 = coord(8/25)