Document (#36451)

Author
Pereira, D.A.
Ribeiro-Neto, B.
Ziviani, N.
Laender, A.H.F.
Gonçalves, M.A.
Title
¬A generic Web-based entity resolution framework
Source
Journal of the American Society for Information Science and Technology. 62(2011) no.5, pp.919-932
Year
2011
Abstract
Web data repositories usually contain references to thousands of real-world entities from multiple sources. It is not uncommon that multiple entities share the same label (polysemes) and that distinct label variations are associated with the same entity (synonyms), which frequently leads to ambiguous interpretations. Further, spelling variants, acronyms, abbreviated forms, and misspellings compound the problem. Solving this problem requires identifying which labels correspond to the same real-world entity, a process known as entity resolution. One approach to solving the entity resolution problem is to associate an authority identifier and a list of variant forms with each entity, a data structure known as an authority file. In this work, we propose a generic framework for implementing a method for generating authority files. Our method uses information from the Web to improve the quality of the authority file and, because of that, is referred to as WER (Web-based Entity Resolution). Our contribution here is threefold: (a) we discuss how to implement the WER framework, which is flexible and easy to adapt to new domains; (b) we run extended experimentation with our WER framework to show that it outperforms selected baselines; and (c) we compare the results of a specialized solution for author name resolution with those produced by the generic WER framework, and show that the WER results remain competitive.
Theme
Internet

Similar documents (author)

  1. Ribeiro-Neto, B.; Laender, A.H.F.; Lima, L.R.S. de: ¬An experimental study in automatically categorizing medical documents (2001) 2.53
    2.5277295 = sum of:
      2.5277295 = product of:
        4.4235263 = sum of:
          0.8968825 = weight(author_txt:ribeiro in 5702) [ClassicSimilarity], result of:
            0.8968825 = score(doc=5702,freq=1.0), product of:
              0.327911 = queryWeight, product of:
                1.0223159 = boost
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.036647245 = queryNorm
              2.73514 = fieldWeight in 5702, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.3125 = fieldNorm(doc=5702)
          1.1467571 = weight(author_txt:neto in 5702) [ClassicSimilarity], result of:
            1.1467571 = score(doc=5702,freq=1.0), product of:
              0.3862898 = queryWeight, product of:
                1.1095932 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.036647245 = queryNorm
              2.9686446 = fieldWeight in 5702, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.3125 = fieldNorm(doc=5702)
          1.1899432 = weight(author_txt:laender in 5702) [ClassicSimilarity], result of:
            1.1899432 = score(doc=5702,freq=1.0), product of:
              0.3959282 = queryWeight, product of:
                1.1233506 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.036647245 = queryNorm
              3.005452 = fieldWeight in 5702, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.3125 = fieldNorm(doc=5702)
          1.1899432 = weight(author_txt:a.h.f in 5702) [ClassicSimilarity], result of:
            1.1899432 = score(doc=5702,freq=1.0), product of:
              0.3959282 = queryWeight, product of:
                1.1233506 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.036647245 = queryNorm
              3.005452 = fieldWeight in 5702, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.3125 = fieldNorm(doc=5702)
        0.5714286 = coord(4/7)
    
  2. Silva, A.J.C.; Gonçalves, M.A.; Laender, A.H.F.; Modesto, M.A.B.; Cristo, M.; Ziviani, N.: Finding what is missing from a digital library : a case study in the computer science field (2009) 2.04
    2.038632 = sum of:
      2.038632 = product of:
        3.5676057 = sum of:
          0.6715374 = weight(author_txt:gonçalves in 4219) [ClassicSimilarity], result of:
            0.6715374 = score(doc=4219,freq=1.0), product of:
              0.31375146 = queryWeight, product of:
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.036647245 = queryNorm
              2.1403482 = fieldWeight in 4219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.25 = fieldNorm(doc=4219)
          0.95195454 = weight(author_txt:laender in 4219) [ClassicSimilarity], result of:
            0.95195454 = score(doc=4219,freq=1.0), product of:
              0.3959282 = queryWeight, product of:
                1.1233506 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.036647245 = queryNorm
              2.4043615 = fieldWeight in 4219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.25 = fieldNorm(doc=4219)
          0.95195454 = weight(author_txt:a.h.f in 4219) [ClassicSimilarity], result of:
            0.95195454 = score(doc=4219,freq=1.0), product of:
              0.3959282 = queryWeight, product of:
                1.1233506 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.036647245 = queryNorm
              2.4043615 = fieldWeight in 4219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.25 = fieldNorm(doc=4219)
          0.9921592 = weight(author_txt:ziviani in 4219) [ClassicSimilarity], result of:
            0.9921592 = score(doc=4219,freq=1.0), product of:
              0.40699887 = queryWeight, product of:
                1.1389476 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.036647245 = queryNorm
              2.4377444 = fieldWeight in 4219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.25 = fieldNorm(doc=4219)
        0.5714286 = coord(4/7)
    
  3. Freitas-Junior, H.R.; Ribeiro-Neto, B.A.; Freitas-Vale, R. de; Laender, A.H.F.; Lima, L.R.S. de: Categorization-driven cross-language retrieval of medical information (2006) 2.02
    2.0221834 = sum of:
      2.0221834 = product of:
        3.5388207 = sum of:
          0.717506 = weight(author_txt:ribeiro in 5282) [ClassicSimilarity], result of:
            0.717506 = score(doc=5282,freq=1.0), product of:
              0.327911 = queryWeight, product of:
                1.0223159 = boost
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.036647245 = queryNorm
              2.188112 = fieldWeight in 5282, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.25 = fieldNorm(doc=5282)
          0.91740566 = weight(author_txt:neto in 5282) [ClassicSimilarity], result of:
            0.91740566 = score(doc=5282,freq=1.0), product of:
              0.3862898 = queryWeight, product of:
                1.1095932 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.036647245 = queryNorm
              2.3749156 = fieldWeight in 5282, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.25 = fieldNorm(doc=5282)
          0.95195454 = weight(author_txt:laender in 5282) [ClassicSimilarity], result of:
            0.95195454 = score(doc=5282,freq=1.0), product of:
              0.3959282 = queryWeight, product of:
                1.1233506 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.036647245 = queryNorm
              2.4043615 = fieldWeight in 5282, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.25 = fieldNorm(doc=5282)
          0.95195454 = weight(author_txt:a.h.f in 5282) [ClassicSimilarity], result of:
            0.95195454 = score(doc=5282,freq=1.0), product of:
              0.3959282 = queryWeight, product of:
                1.1233506 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.036647245 = queryNorm
              2.4043615 = fieldWeight in 5282, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.25 = fieldNorm(doc=5282)
        0.5714286 = coord(4/7)
    
  4. Calado, P.; Cristo, M.; Gonçalves, M.A.; Moura, E.S. de; Ribeiro-Neto, B.; Ziviani, N.: Link-based similarity measures for the classification of Web documents (2006) 1.88
    1.8849189 = sum of:
      1.8849189 = product of:
        3.298608 = sum of:
          0.6715374 = weight(author_txt:gonçalves in 4921) [ClassicSimilarity], result of:
            0.6715374 = score(doc=4921,freq=1.0), product of:
              0.31375146 = queryWeight, product of:
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.036647245 = queryNorm
              2.1403482 = fieldWeight in 4921, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.25 = fieldNorm(doc=4921)
          0.717506 = weight(author_txt:ribeiro in 4921) [ClassicSimilarity], result of:
            0.717506 = score(doc=4921,freq=1.0), product of:
              0.327911 = queryWeight, product of:
                1.0223159 = boost
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.036647245 = queryNorm
              2.188112 = fieldWeight in 4921, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.25 = fieldNorm(doc=4921)
          0.91740566 = weight(author_txt:neto in 4921) [ClassicSimilarity], result of:
            0.91740566 = score(doc=4921,freq=1.0), product of:
              0.3862898 = queryWeight, product of:
                1.1095932 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.036647245 = queryNorm
              2.3749156 = fieldWeight in 4921, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.25 = fieldNorm(doc=4921)
          0.9921592 = weight(author_txt:ziviani in 4921) [ClassicSimilarity], result of:
            0.9921592 = score(doc=4921,freq=1.0), product of:
              0.40699887 = queryWeight, product of:
                1.1389476 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.036647245 = queryNorm
              2.4377444 = fieldWeight in 4921, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.25 = fieldNorm(doc=4921)
        0.5714286 = coord(4/7)
    
  5. Couto, T.; Cristo, M.; Gonçalves, M.A.; Calado, P.; Ziviani, N.; Moura, E.; Ribeiro-Neto, B.: ¬A comparative study of citations and links in document classification (2006) 1.88
    1.8849189 = sum of:
      1.8849189 = product of:
        3.298608 = sum of:
          0.6715374 = weight(author_txt:gonçalves in 2531) [ClassicSimilarity], result of:
            0.6715374 = score(doc=2531,freq=1.0), product of:
              0.31375146 = queryWeight, product of:
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.036647245 = queryNorm
              2.1403482 = fieldWeight in 2531, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.25 = fieldNorm(doc=2531)
          0.717506 = weight(author_txt:ribeiro in 2531) [ClassicSimilarity], result of:
            0.717506 = score(doc=2531,freq=1.0), product of:
              0.327911 = queryWeight, product of:
                1.0223159 = boost
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.036647245 = queryNorm
              2.188112 = fieldWeight in 2531, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.25 = fieldNorm(doc=2531)
          0.91740566 = weight(author_txt:neto in 2531) [ClassicSimilarity], result of:
            0.91740566 = score(doc=2531,freq=1.0), product of:
              0.3862898 = queryWeight, product of:
                1.1095932 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.036647245 = queryNorm
              2.3749156 = fieldWeight in 2531, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.25 = fieldNorm(doc=2531)
          0.9921592 = weight(author_txt:ziviani in 2531) [ClassicSimilarity], result of:
            0.9921592 = score(doc=2531,freq=1.0), product of:
              0.40699887 = queryWeight, product of:
                1.1389476 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.036647245 = queryNorm
              2.4377444 = fieldWeight in 2531, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.25 = fieldNorm(doc=2531)
        0.5714286 = coord(4/7)
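
The score trees above can be checked by hand. The following is a minimal sketch, not the catalog's actual implementation: it assumes the TF-IDF form that the listed numbers are consistent with, namely tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), a per-term score of queryWeight * fieldWeight, and a final multiplication by coord (matched terms / query terms). The function names are hypothetical; the boost, queryNorm and fieldNorm values are copied from the listing rather than derived.

```python
import math

MAX_DOCS = 44218  # maxDocs as reported in the listing


def idf(doc_freq, max_docs=MAX_DOCS):
    # Assumed form, inferred from the listed values:
    # 1 + ln(maxDocs / (docFreq + 1)), e.g. about 8.752448 for docFreq=18.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq):
    # Square root of the raw term frequency (1.7320508 for freq=3.0).
    return math.sqrt(freq)


def term_score(freq, doc_freq, query_norm, field_norm, boost=1.0,
               max_docs=MAX_DOCS):
    # score = queryWeight * fieldWeight, as in each "weight(...)" subtree.
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = tf(freq) * term_idf * field_norm
    return query_weight * field_weight


def document_score(term_scores, matched, total):
    # Sum of the matching term scores, scaled by coord(matched/total).
    return sum(term_scores) * (matched / total)


def score_doc_5702():
    # Entry 1 of the author list: four of seven query terms match, freq=1.0.
    query_norm, field_norm = 0.036647245, 0.3125
    # (docFreq, boost) for ribeiro, neto, laender, a.h.f, read off the listing.
    terms = [(18, 1.0223159), (8, 1.1095932), (7, 1.1233506), (7, 1.1233506)]
    scores = [term_score(1.0, df, query_norm, field_norm, boost)
              for df, boost in terms]
    return document_score(scores, matched=4, total=7)


if __name__ == "__main__":
    # Should agree with the listed total for doc 5702 up to rounding.
    print(round(score_doc_5702(), 4))
```

Terms listed without a boost line (for example gonçalves) correspond to the default boost=1.0 in this sketch. Small differences in the last digits are expected, since the listing appears to use single-precision arithmetic.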
    

Similar documents (content)

  1. Lawrie, D.; Mayfield, J.; McNamee, P.; Oard, P.W.: Cross-language person-entity linking from 20 languages (2015) 0.34
    0.33530283 = sum of:
      0.33530283 = product of:
        1.1975101 = sum of:
          0.019691372 = weight(abstract_txt:which in 1848) [ClassicSimilarity], result of:
            0.019691372 = score(doc=1848,freq=3.0), product of:
              0.049892053 = queryWeight, product of:
                1.0056303 = boost
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.017009797 = queryNorm
              0.39467955 = fieldWeight in 1848, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.078125 = fieldNorm(doc=1848)
          0.019084612 = weight(abstract_txt:with in 1848) [ClassicSimilarity], result of:
            0.019084612 = score(doc=1848,freq=4.0), product of:
              0.04886182 = queryWeight, product of:
                1.1491503 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.017009797 = queryNorm
              0.39058334 = fieldWeight in 1848, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.078125 = fieldNorm(doc=1848)
          0.061062023 = weight(abstract_txt:known in 1848) [ClassicSimilarity], result of:
            0.061062023 = score(doc=1848,freq=2.0), product of:
              0.106094986 = queryWeight, product of:
                1.1973591 = boost
                5.2092032 = idf(docFreq=656, maxDocs=44218)
                0.017009797 = queryNorm
              0.5755411 = fieldWeight in 1848, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2092032 = idf(docFreq=656, maxDocs=44218)
                0.078125 = fieldNorm(doc=1848)
          0.060628302 = weight(abstract_txt:entities in 1848) [ClassicSimilarity], result of:
            0.060628302 = score(doc=1848,freq=1.0), product of:
              0.13303758 = queryWeight, product of:
                1.3408005 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.017009797 = queryNorm
              0.45572314 = fieldWeight in 1848, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.078125 = fieldNorm(doc=1848)
          0.017595407 = weight(abstract_txt:that in 1848) [ClassicSimilarity], result of:
            0.017595407 = score(doc=1848,freq=3.0), product of:
              0.054877777 = queryWeight, product of:
                1.3615866 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.017009797 = queryNorm
              0.320629 = fieldWeight in 1848, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=1848)
          0.49080566 = weight(abstract_txt:resolution in 1848) [ClassicSimilarity], result of:
            0.49080566 = score(doc=1848,freq=3.0), product of:
              0.5047427 = queryWeight, product of:
                4.129353 = boost
                7.1860275 = idf(docFreq=90, maxDocs=44218)
                0.017009797 = queryNorm
              0.97238785 = fieldWeight in 1848, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.1860275 = idf(docFreq=90, maxDocs=44218)
                0.078125 = fieldNorm(doc=1848)
          0.5286428 = weight(abstract_txt:entity in 1848) [ClassicSimilarity], result of:
            0.5286428 = score(doc=1848,freq=4.0), product of:
              0.5390573 = queryWeight, product of:
                5.0492687 = boost
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.017009797 = queryNorm
              0.98068005 = fieldWeight in 1848, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.078125 = fieldNorm(doc=1848)
        0.28 = coord(7/25)
    
  2. Liu, X.; Zheng, W.; Fang, H.: ¬An exploration of ranking models and feedback method for related entity finding (2013) 0.24
    0.24436691 = sum of:
      0.24436691 = product of:
        0.67879695 = sum of:
          0.009095055 = weight(abstract_txt:which in 2714) [ClassicSimilarity], result of:
            0.009095055 = score(doc=2714,freq=1.0), product of:
              0.049892053 = queryWeight, product of:
                1.0056303 = boost
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.017009797 = queryNorm
              0.18229467 = fieldWeight in 2714, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.0625 = fieldNorm(doc=2714)
          0.020899689 = weight(abstract_txt:show in 2714) [ClassicSimilarity], result of:
            0.020899689 = score(doc=2714,freq=1.0), product of:
              0.07589688 = queryWeight, product of:
                1.012719 = boost
                4.4059124 = idf(docFreq=1466, maxDocs=44218)
                0.017009797 = queryNorm
              0.27536952 = fieldWeight in 2714, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4059124 = idf(docFreq=1466, maxDocs=44218)
                0.0625 = fieldNorm(doc=2714)
          0.031510826 = weight(abstract_txt:method in 2714) [ClassicSimilarity], result of:
            0.031510826 = score(doc=2714,freq=2.0), product of:
              0.07920646 = queryWeight, product of:
                1.0345638 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.017009797 = queryNorm
              0.3978315 = fieldWeight in 2714, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=2714)
          0.0076338453 = weight(abstract_txt:with in 2714) [ClassicSimilarity], result of:
            0.0076338453 = score(doc=2714,freq=1.0), product of:
              0.04886182 = queryWeight, product of:
                1.1491503 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.017009797 = queryNorm
              0.15623334 = fieldWeight in 2714, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=2714)
          0.12832592 = weight(abstract_txt:entities in 2714) [ClassicSimilarity], result of:
            0.12832592 = score(doc=2714,freq=7.0), product of:
              0.13303758 = queryWeight, product of:
                1.3408005 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.017009797 = queryNorm
              0.96458405 = fieldWeight in 2714, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.0625 = fieldNorm(doc=2714)
          0.011493271 = weight(abstract_txt:that in 2714) [ClassicSimilarity], result of:
            0.011493271 = score(doc=2714,freq=2.0), product of:
              0.054877777 = queryWeight, product of:
                1.3615866 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.017009797 = queryNorm
              0.20943399 = fieldWeight in 2714, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=2714)
          0.0460048 = weight(abstract_txt:problem in 2714) [ClassicSimilarity], result of:
            0.0460048 = score(doc=2714,freq=2.0), product of:
              0.11668631 = queryWeight, product of:
                1.5379158 = boost
                4.460548 = idf(docFreq=1388, maxDocs=44218)
                0.017009797 = queryNorm
              0.39426047 = fieldWeight in 2714, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.460548 = idf(docFreq=1388, maxDocs=44218)
                0.0625 = fieldNorm(doc=2714)
          0.05757911 = weight(abstract_txt:framework in 2714) [ClassicSimilarity], result of:
            0.05757911 = score(doc=2714,freq=1.0), product of:
              0.20243582 = queryWeight, product of:
                2.6151145 = boost
                4.550903 = idf(docFreq=1268, maxDocs=44218)
                0.017009797 = queryNorm
              0.28443143 = fieldWeight in 2714, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.550903 = idf(docFreq=1268, maxDocs=44218)
                0.0625 = fieldNorm(doc=2714)
          0.36625445 = weight(abstract_txt:entity in 2714) [ClassicSimilarity], result of:
            0.36625445 = score(doc=2714,freq=3.0), product of:
              0.5390573 = queryWeight, product of:
                5.0492687 = boost
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.017009797 = queryNorm
              0.6794351 = fieldWeight in 2714, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.0625 = fieldNorm(doc=2714)
        0.36 = coord(9/25)
    
  3. Vechtomova, O.; Robertson, S.E.: ¬A domain-independent approach to finding related entities (2012) 0.23
    0.2264375 = sum of:
      0.2264375 = product of:
        0.94348955 = sum of:
          0.027851898 = weight(abstract_txt:method in 2733) [ClassicSimilarity], result of:
            0.027851898 = score(doc=2733,freq=1.0), product of:
              0.07920646 = queryWeight, product of:
                1.0345638 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.017009797 = queryNorm
              0.3516367 = fieldWeight in 2733, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.078125 = fieldNorm(doc=2733)
          0.009542306 = weight(abstract_txt:with in 2733) [ClassicSimilarity], result of:
            0.009542306 = score(doc=2733,freq=1.0), product of:
              0.04886182 = queryWeight, product of:
                1.1491503 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.017009797 = queryNorm
              0.19529167 = fieldWeight in 2733, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.078125 = fieldNorm(doc=2733)
          0.14850841 = weight(abstract_txt:entities in 2733) [ClassicSimilarity], result of:
            0.14850841 = score(doc=2733,freq=6.0), product of:
              0.13303758 = queryWeight, product of:
                1.3408005 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.017009797 = queryNorm
              1.1162891 = fieldWeight in 2733, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.078125 = fieldNorm(doc=2733)
          0.017595407 = weight(abstract_txt:that in 2733) [ClassicSimilarity], result of:
            0.017595407 = score(doc=2733,freq=3.0), product of:
              0.054877777 = queryWeight, product of:
                1.3615866 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.017009797 = queryNorm
              0.320629 = fieldWeight in 2733, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=2733)
          0.04066288 = weight(abstract_txt:problem in 2733) [ClassicSimilarity], result of:
            0.04066288 = score(doc=2733,freq=1.0), product of:
              0.11668631 = queryWeight, product of:
                1.5379158 = boost
                4.460548 = idf(docFreq=1388, maxDocs=44218)
                0.017009797 = queryNorm
              0.3484803 = fieldWeight in 2733, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.460548 = idf(docFreq=1388, maxDocs=44218)
                0.078125 = fieldNorm(doc=2733)
          0.6993286 = weight(abstract_txt:entity in 2733) [ClassicSimilarity], result of:
            0.6993286 = score(doc=2733,freq=7.0), product of:
              0.5390573 = queryWeight, product of:
                5.0492687 = boost
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.017009797 = queryNorm
              1.2973177 = fieldWeight in 2733, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.078125 = fieldNorm(doc=2733)
        0.24 = coord(6/25)
    
  4. Soulier, L.; Jabeur, L.B.; Tamine, L.; Bahsoun, W.: On ranking relevant entities in heterogeneous networks using a language-based model (2013) 0.21
    0.2089825 = sum of:
      0.2089825 = product of:
        0.5805069 = sum of:
          0.020899689 = weight(abstract_txt:show in 664) [ClassicSimilarity], result of:
            0.020899689 = score(doc=664,freq=1.0), product of:
              0.07589688 = queryWeight, product of:
                1.012719 = boost
                4.4059124 = idf(docFreq=1466, maxDocs=44218)
                0.017009797 = queryNorm
              0.27536952 = fieldWeight in 664, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4059124 = idf(docFreq=1466, maxDocs=44218)
                0.0625 = fieldNorm(doc=664)
          0.0076338453 = weight(abstract_txt:with in 664) [ClassicSimilarity], result of:
            0.0076338453 = score(doc=664,freq=1.0), product of:
              0.04886182 = queryWeight, product of:
                1.1491503 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.017009797 = queryNorm
              0.15623334 = fieldWeight in 664, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=664)
          0.033130646 = weight(abstract_txt:multiple in 664) [ClassicSimilarity], result of:
            0.033130646 = score(doc=664,freq=1.0), product of:
              0.103185184 = queryWeight, product of:
                1.1808254 = boost
                5.137272 = idf(docFreq=705, maxDocs=44218)
                0.017009797 = queryNorm
              0.3210795 = fieldWeight in 664, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.137272 = idf(docFreq=705, maxDocs=44218)
                0.0625 = fieldNorm(doc=664)
          0.039700985 = weight(abstract_txt:forms in 664) [ClassicSimilarity], result of:
            0.039700985 = score(doc=664,freq=1.0), product of:
              0.11641213 = queryWeight, product of:
                1.2542269 = boost
                5.456611 = idf(docFreq=512, maxDocs=44218)
                0.017009797 = queryNorm
              0.3410382 = fieldWeight in 664, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.456611 = idf(docFreq=512, maxDocs=44218)
                0.0625 = fieldNorm(doc=664)
          0.097005285 = weight(abstract_txt:entities in 664) [ClassicSimilarity], result of:
            0.097005285 = score(doc=664,freq=4.0), product of:
              0.13303758 = queryWeight, product of:
                1.3408005 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.017009797 = queryNorm
              0.72915703 = fieldWeight in 664, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.0625 = fieldNorm(doc=664)
          0.011493271 = weight(abstract_txt:that in 664) [ClassicSimilarity], result of:
            0.011493271 = score(doc=664,freq=2.0), product of:
              0.054877777 = queryWeight, product of:
                1.3615866 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.017009797 = queryNorm
              0.20943399 = fieldWeight in 664, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=664)
          0.032530304 = weight(abstract_txt:problem in 664) [ClassicSimilarity], result of:
            0.032530304 = score(doc=664,freq=1.0), product of:
              0.11668631 = queryWeight, product of:
                1.5379158 = boost
                4.460548 = idf(docFreq=1388, maxDocs=44218)
                0.017009797 = queryNorm
              0.27878425 = fieldWeight in 664, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.460548 = idf(docFreq=1388, maxDocs=44218)
                0.0625 = fieldNorm(doc=664)
          0.03906738 = weight(abstract_txt:same in 664) [ClassicSimilarity], result of:
            0.03906738 = score(doc=664,freq=1.0), product of:
              0.13183701 = queryWeight, product of:
                1.6347121 = boost
                4.7412944 = idf(docFreq=1048, maxDocs=44218)
                0.017009797 = queryNorm
              0.2963309 = fieldWeight in 664, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7412944 = idf(docFreq=1048, maxDocs=44218)
                0.0625 = fieldNorm(doc=664)
          0.2990455 = weight(abstract_txt:entity in 664) [ClassicSimilarity], result of:
            0.2990455 = score(doc=664,freq=2.0), product of:
              0.5390573 = queryWeight, product of:
                5.0492687 = boost
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.017009797 = queryNorm
              0.5547564 = fieldWeight in 664, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.0625 = fieldNorm(doc=664)
        0.36 = coord(9/25)
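The per-term lines above all follow the same arithmetic: each term score is queryWeight (boost × idf × queryNorm) multiplied by fieldWeight (tf × idf × fieldNorm). A minimal sketch of that calculation follows, assuming tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which are the formulas that reproduce the values listed here. The function names are illustrative and are not part of any library API.

```python
import math


def classic_idf(doc_freq, max_docs):
    # Assumed idf formula; it reproduces e.g. 6.2763524 for docFreq=225, maxDocs=44218.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq):
    # Assumed tf formula: square root of the raw term frequency.
    return math.sqrt(freq)


def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    # queryWeight = boost * idf * queryNorm
    # fieldWeight = tf * idf * fieldNorm
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm
    field_weight = classic_tf(freq) * idf * field_norm
    return query_weight * field_weight


# Values copied from the "abstract_txt:entity in 664" entry above.
entity_score = term_score(
    freq=2.0,
    doc_freq=225,
    max_docs=44218,
    boost=5.0492687,
    query_norm=0.017009797,
    field_norm=0.0625,
)
print(f"{entity_score:.5f}")
```

The listing uses single-precision floats, so a double-precision recomputation can differ in the last digits.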
    
  5. Li, X.; Schijvenaars, B.J.A.; Rijke, M. de: Investigating queries and search failures in academic search (2017) 0.21
    0.20844476 = sum of:
      0.20844476 = product of:
        0.5211119 = sum of:
          0.007958174 = weight(abstract_txt:which in 5033) [ClassicSimilarity], result of:
            0.007958174 = score(doc=5033,freq=1.0), product of:
              0.049892053 = queryWeight, product of:
                1.0056303 = boost
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.017009797 = queryNorm
              0.15950784 = fieldWeight in 5033, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5033)
          0.027571972 = weight(abstract_txt:method in 5033) [ClassicSimilarity], result of:
            0.027571972 = score(doc=5033,freq=2.0), product of:
              0.07920646 = queryWeight, product of:
                1.0345638 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.017009797 = queryNorm
              0.34810257 = fieldWeight in 5033, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5033)
          0.006679615 = weight(abstract_txt:with in 5033) [ClassicSimilarity], result of:
            0.006679615 = score(doc=5033,freq=1.0), product of:
              0.04886182 = queryWeight, product of:
                1.1491503 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.017009797 = queryNorm
              0.13670418 = fieldWeight in 5033, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5033)
          0.030224161 = weight(abstract_txt:known in 5033) [ClassicSimilarity], result of:
            0.030224161 = score(doc=5033,freq=1.0), product of:
              0.106094986 = queryWeight, product of:
                1.1973591 = boost
                5.2092032 = idf(docFreq=656, maxDocs=44218)
                0.017009797 = queryNorm
              0.2848783 = fieldWeight in 5033, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2092032 = idf(docFreq=656, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5033)
          0.042439815 = weight(abstract_txt:entities in 5033) [ClassicSimilarity], result of:
            0.042439815 = score(doc=5033,freq=1.0), product of:
              0.13303758 = queryWeight, product of:
                1.3408005 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.017009797 = queryNorm
              0.3190062 = fieldWeight in 5033, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5033)
          0.0159009 = weight(abstract_txt:that in 5033) [ClassicSimilarity], result of:
            0.0159009 = score(doc=5033,freq=5.0), product of:
              0.054877777 = queryWeight, product of:
                1.3615866 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.017009797 = queryNorm
              0.28975117 = fieldWeight in 5033, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5033)
          0.040254198 = weight(abstract_txt:problem in 5033) [ClassicSimilarity], result of:
            0.040254198 = score(doc=5033,freq=2.0), product of:
              0.11668631 = queryWeight, product of:
                1.5379158 = boost
                4.460548 = idf(docFreq=1388, maxDocs=44218)
                0.017009797 = queryNorm
              0.34497792 = fieldWeight in 5033, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.460548 = idf(docFreq=1388, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5033)
          0.080840975 = weight(abstract_txt:label in 5033) [ClassicSimilarity], result of:
            0.080840975 = score(doc=5033,freq=1.0), product of:
              0.20443083 = queryWeight, product of:
                1.6620734 = boost
                7.230979 = idf(docFreq=86, maxDocs=44218)
                0.017009797 = queryNorm
              0.39544415 = fieldWeight in 5033, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.230979 = idf(docFreq=86, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5033)
          0.08421715 = weight(abstract_txt:generic in 5033) [ClassicSimilarity], result of:
            0.08421715 = score(doc=5033,freq=1.0), product of:
              0.24048583 = queryWeight, product of:
                2.2078388 = boost
                6.4035826 = idf(docFreq=198, maxDocs=44218)
                0.017009797 = queryNorm
              0.3501959 = fieldWeight in 5033, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4035826 = idf(docFreq=198, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5033)
          0.18502496 = weight(abstract_txt:entity in 5033) [ClassicSimilarity], result of:
            0.18502496 = score(doc=5033,freq=1.0), product of:
              0.5390573 = queryWeight, product of:
                5.0492687 = boost
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.017009797 = queryNorm
              0.34323803 = fieldWeight in 5033, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5033)
        0.4 = coord(10/25)
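The document-level score for entry 5 is the sum of the matched term scores multiplied by the coordination factor, coord(10/25), i.e. the fraction of query clauses that matched. A small sketch of that final step follows, using the ten term scores listed for doc 5033. The helper name `combined_score` is hypothetical and is used only for illustration.

```python
def combined_score(term_scores, total_clauses):
    # coord = matched clauses / total query clauses
    coord = len(term_scores) / total_clauses
    return sum(term_scores) * coord


# Term scores copied from the doc 5033 explanation above, in order:
# which, method, with, known, entities, that, problem, label, generic, entity.
doc_5033_terms = [
    0.007958174,
    0.027571972,
    0.006679615,
    0.030224161,
    0.042439815,
    0.0159009,
    0.040254198,
    0.080840975,
    0.08421715,
    0.18502496,
]

doc_5033_score = combined_score(doc_5033_terms, total_clauses=25)
print(f"{doc_5033_score:.4f}")
```

Because only 10 of the 25 query terms occur in this abstract, the raw sum of about 0.52 is scaled down to about 0.21, the value shown next to the title.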