Document (#41215)

Author
Colavizza, G.
Boyack, K.W.
Eck, N.J. van
Waltman, L.
Title
The closer the better : similarity of publication pairs at different cocitation levels
Source
Journal of the Association for Information Science and Technology. 69(2018) no.4, pp. 600-609
Year
2018
Abstract
We investigated the similarities of pairs of articles that are cocited at the different cocitation levels of the journal, article, section, paragraph, sentence, and bracket. Our results indicate that textual similarity, intellectual overlap (shared references), author overlap (shared authors), and proximity in publication time all rise monotonically as the cocitation level gets lower (from journal to bracket). While the main gain in similarity happens when moving from journal to article cocitation, all level changes entail an increase in similarity, especially section to paragraph and paragraph to sentence/bracket levels. We compared the results from four journals over the years 2010-2015: Cell, the European Journal of Operational Research, Physics Letters B, and Research Policy, with consistent general outcomes and some interesting differences. Our findings motivate the use of granular cocitation information as defined by meaningful units of text, with implications for, among others, the elaboration of maps of science and the retrieval of scholarly literature.
Content
Cf.: https://onlinelibrary.wiley.com/doi/abs/10.1002/asi.23981.
Theme
Informetrics

Similar documents (author)

  1. Boyack, K.W.; Börner, K.: Indicator-assisted evaluation and funding of research : visualizing the influence of grants on the number and citation counts of research papers (2003) 1.64
    1.6396044 = sum of:
      1.6396044 = product of:
        3.279209 = sum of:
          3.279209 = weight(author_txt:boyack in 1471) [ClassicSimilarity], result of:
            3.279209 = score(doc=1471,freq=1.0), product of:
              0.71818465 = queryWeight, product of:
                1.0159198 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.07741297 = queryNorm
              4.565969 = fieldWeight in 1471, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.5 = fieldNorm(doc=1471)
        0.5 = coord(1/2)
    
  2. Klavans, R.; Boyack, K.W.: Identifying a better measure of relatedness for mapping science (2006) 1.64
    1.6396044 = sum of:
      1.6396044 = product of:
        3.279209 = sum of:
          3.279209 = weight(author_txt:boyack in 5252) [ClassicSimilarity], result of:
            3.279209 = score(doc=5252,freq=1.0), product of:
              0.71818465 = queryWeight, product of:
                1.0159198 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.07741297 = queryNorm
              4.565969 = fieldWeight in 5252, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.5 = fieldNorm(doc=5252)
        0.5 = coord(1/2)
    
  3. Klavans, R.; Boyack, K.W.: Toward a consensus map of science (2009) 1.64
    1.6396044 = sum of:
      1.6396044 = product of:
        3.279209 = sum of:
          3.279209 = weight(author_txt:boyack in 2736) [ClassicSimilarity], result of:
            3.279209 = score(doc=2736,freq=1.0), product of:
              0.71818465 = queryWeight, product of:
                1.0159198 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.07741297 = queryNorm
              4.565969 = fieldWeight in 2736, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.5 = fieldNorm(doc=2736)
        0.5 = coord(1/2)
    
  4. Boyack, K.W.; Klavans, R.: Co-citation analysis, bibliographic coupling, and direct citation : which citation approach represents the research front most accurately? (2010) 1.64
    1.6396044 = sum of:
      1.6396044 = product of:
        3.279209 = sum of:
          3.279209 = weight(author_txt:boyack in 4111) [ClassicSimilarity], result of:
            3.279209 = score(doc=4111,freq=1.0), product of:
              0.71818465 = queryWeight, product of:
                1.0159198 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.07741297 = queryNorm
              4.565969 = fieldWeight in 4111, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.5 = fieldNorm(doc=4111)
        0.5 = coord(1/2)
    
  5. Klavans, R.; Boyack, K.W.: Using global mapping to create more accurate document-level maps of research fields (2011) 1.64
    1.6396044 = sum of:
      1.6396044 = product of:
        3.279209 = sum of:
          3.279209 = weight(author_txt:boyack in 4956) [ClassicSimilarity], result of:
            3.279209 = score(doc=4956,freq=1.0), product of:
              0.71818465 = queryWeight, product of:
                1.0159198 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.07741297 = queryNorm
              4.565969 = fieldWeight in 4956, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.5 = fieldNorm(doc=4956)
        0.5 = coord(1/2)
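
The author-match trees above all follow the same arithmetic, which can be reproduced with a minimal sketch. It assumes the standard Lucene ClassicSimilarity definitions, idf = 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq); the function and parameter names are illustrative and are not part of the catalog. The boost, queryNorm, and fieldNorm values are copied from the explanation output rather than derived, and Lucene computes in single precision, so the last digits may differ slightly.

```python
import math


def classic_idf(doc_freq, max_docs):
    """Inverse document frequency as defined by Lucene's ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_term_score(freq, doc_freq, max_docs, field_norm, query_norm, boost=1.0):
    """Score of one matching term: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm            # boost * idf * queryNorm
    field_weight = math.sqrt(freq) * idf * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight


# Values taken from the author_txt:boyack explanation (illustrative variable names).
term = classic_term_score(
    freq=1.0,
    doc_freq=12,
    max_docs=44218,
    field_norm=0.5,
    query_norm=0.07741297,
    boost=1.0159198,
)

# coord(1/2): one of the two query clauses matched the document.
total = term * (1 / 2)

print(round(term, 4), round(total, 4))
```

Because every author hit matches the same single term with the same fieldNorm, all five entries receive the identical score of about 1.64.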
    

Similar documents (content)

  1. Wang, F.; Wolfram, D.: Assessment of journal similarity based on citing discipline analysis (2015) 0.16
    0.15659763 = sum of:
      0.15659763 = product of:
        0.78298813 = sum of:
          0.011696244 = weight(abstract_txt:different in 1849) [ClassicSimilarity], result of:
            0.011696244 = score(doc=1849,freq=1.0), product of:
              0.051054373 = queryWeight, product of:
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.0139283445 = queryNorm
              0.22909386 = fieldWeight in 1849, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.0625 = fieldNorm(doc=1849)
          0.0075213555 = weight(abstract_txt:from in 1849) [ClassicSimilarity], result of:
            0.0075213555 = score(doc=1849,freq=1.0), product of:
              0.04354081 = queryWeight, product of:
                1.1310385 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0139283445 = queryNorm
              0.17274266 = fieldWeight in 1849, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=1849)
          0.11228493 = weight(abstract_txt:journal in 1849) [ClassicSimilarity], result of:
            0.11228493 = score(doc=1849,freq=3.0), product of:
              0.20145865 = queryWeight, product of:
                2.809257 = boost
                5.1486683 = idf(docFreq=697, maxDocs=44218)
                0.0139283445 = queryNorm
              0.5573597 = fieldWeight in 1849, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1486683 = idf(docFreq=697, maxDocs=44218)
                0.0625 = fieldNorm(doc=1849)
          0.18719102 = weight(abstract_txt:similarity in 1849) [ClassicSimilarity], result of:
            0.18719102 = score(doc=1849,freq=4.0), product of:
              0.25734478 = queryWeight, product of:
                3.1750913 = boost
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.0139283445 = queryNorm
              0.7273939 = fieldWeight in 1849, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.0625 = fieldNorm(doc=1849)
          0.46429458 = weight(abstract_txt:cocitation in 1849) [ClassicSimilarity], result of:
            0.46429458 = score(doc=1849,freq=3.0), product of:
              0.5590758 = queryWeight, product of:
                5.2322545 = boost
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.0139283445 = queryNorm
              0.83046806 = fieldWeight in 1849, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.0625 = fieldNorm(doc=1849)
        0.2 = coord(5/25)
    
  2. White, H.D.: Author cocitation analysis and Pearson's r (2003) 0.14
    0.14250436 = sum of:
      0.14250436 = product of:
        0.7125218 = sum of:
          0.011439228 = weight(abstract_txt:article in 2119) [ClassicSimilarity], result of:
            0.011439228 = score(doc=2119,freq=1.0), product of:
              0.05498714 = queryWeight, product of:
                1.037801 = boost
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.0139283445 = queryNorm
              0.20803462 = fieldWeight in 2119, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2119)
          0.0065811863 = weight(abstract_txt:from in 2119) [ClassicSimilarity], result of:
            0.0065811863 = score(doc=2119,freq=1.0), product of:
              0.04354081 = queryWeight, product of:
                1.1310385 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0139283445 = queryNorm
              0.15114984 = fieldWeight in 2119, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2119)
          0.083547175 = weight(abstract_txt:cocited in 2119) [ClassicSimilarity], result of:
            0.083547175 = score(doc=2119,freq=1.0), product of:
              0.16428876 = queryWeight, product of:
                1.2684474 = boost
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.0139283445 = queryNorm
              0.5085386 = fieldWeight in 2119, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2119)
          0.14184816 = weight(abstract_txt:similarity in 2119) [ClassicSimilarity], result of:
            0.14184816 = score(doc=2119,freq=3.0), product of:
              0.25734478 = queryWeight, product of:
                3.1750913 = boost
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.0139283445 = queryNorm
              0.5511989 = fieldWeight in 2119, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2119)
          0.46910605 = weight(abstract_txt:cocitation in 2119) [ClassicSimilarity], result of:
            0.46910605 = score(doc=2119,freq=4.0), product of:
              0.5590758 = queryWeight, product of:
                5.2322545 = boost
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.0139283445 = queryNorm
              0.8390742 = fieldWeight in 2119, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2119)
        0.2 = coord(5/25)
    
  3. White, H.D.: Pathfinder networks and author cocitation analysis : a remapping of paradigmatic information scientists (2003) 0.13
    0.13270913 = sum of:
      0.13270913 = product of:
        0.66354567 = sum of:
          0.018488586 = weight(abstract_txt:article in 1459) [ClassicSimilarity], result of:
            0.018488586 = score(doc=1459,freq=2.0), product of:
              0.05498714 = queryWeight, product of:
                1.037801 = boost
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.0139283445 = queryNorm
              0.33623472 = fieldWeight in 1459, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.0625 = fieldNorm(doc=1459)
          0.010636803 = weight(abstract_txt:from in 1459) [ClassicSimilarity], result of:
            0.010636803 = score(doc=1459,freq=2.0), product of:
              0.04354081 = queryWeight, product of:
                1.1310385 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0139283445 = queryNorm
              0.24429502 = fieldWeight in 1459, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=1459)
          0.09548249 = weight(abstract_txt:cocited in 1459) [ClassicSimilarity], result of:
            0.09548249 = score(doc=1459,freq=1.0), product of:
              0.16428876 = queryWeight, product of:
                1.2684474 = boost
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.0139283445 = queryNorm
              0.581187 = fieldWeight in 1459, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.0625 = fieldNorm(doc=1459)
          0.07464321 = weight(abstract_txt:pairs in 1459) [ClassicSimilarity], result of:
            0.07464321 = score(doc=1459,freq=1.0), product of:
              0.1756557 = queryWeight, product of:
                1.854875 = boost
                6.7990475 = idf(docFreq=133, maxDocs=44218)
                0.0139283445 = queryNorm
              0.42494047 = fieldWeight in 1459, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7990475 = idf(docFreq=133, maxDocs=44218)
                0.0625 = fieldNorm(doc=1459)
          0.46429458 = weight(abstract_txt:cocitation in 1459) [ClassicSimilarity], result of:
            0.46429458 = score(doc=1459,freq=3.0), product of:
              0.5590758 = queryWeight, product of:
                5.2322545 = boost
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.0139283445 = queryNorm
              0.83046806 = fieldWeight in 1459, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.0625 = fieldNorm(doc=1459)
        0.2 = coord(5/25)
    
  4. Pirkola, A.; Jarvelin, K.: The effect of anaphor and ellipsis resolution on proximity searching in a text database (1995) 0.12
    0.12248621 = sum of:
      0.12248621 = product of:
        0.61243105 = sum of:
          0.011696244 = weight(abstract_txt:different in 4088) [ClassicSimilarity], result of:
            0.011696244 = score(doc=4088,freq=1.0), product of:
              0.051054373 = queryWeight, product of:
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.0139283445 = queryNorm
              0.22909386 = fieldWeight in 4088, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.0625 = fieldNorm(doc=4088)
          0.013073404 = weight(abstract_txt:article in 4088) [ClassicSimilarity], result of:
            0.013073404 = score(doc=4088,freq=1.0), product of:
              0.05498714 = queryWeight, product of:
                1.037801 = boost
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.0139283445 = queryNorm
              0.23775385 = fieldWeight in 4088, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.0625 = fieldNorm(doc=4088)
          0.10556144 = weight(abstract_txt:pairs in 4088) [ClassicSimilarity], result of:
            0.10556144 = score(doc=4088,freq=2.0), product of:
              0.1756557 = queryWeight, product of:
                1.854875 = boost
                6.7990475 = idf(docFreq=133, maxDocs=44218)
                0.0139283445 = queryNorm
              0.60095656 = fieldWeight in 4088, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.7990475 = idf(docFreq=133, maxDocs=44218)
                0.0625 = fieldNorm(doc=4088)
          0.107709534 = weight(abstract_txt:sentence in 4088) [ClassicSimilarity], result of:
            0.107709534 = score(doc=4088,freq=2.0), product of:
              0.17803065 = queryWeight, product of:
                1.8673724 = boost
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.0139283445 = queryNorm
              0.60500556 = fieldWeight in 4088, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.0625 = fieldNorm(doc=4088)
          0.37439045 = weight(abstract_txt:paragraph in 4088) [ClassicSimilarity], result of:
            0.37439045 = score(doc=4088,freq=2.0), product of:
              0.4676335 = queryWeight, product of:
                3.7066534 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.0139283445 = queryNorm
              0.8006066 = fieldWeight in 4088, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.0625 = fieldNorm(doc=4088)
        0.2 = coord(5/25)
    
  5. Tang, X.; Yang, C.C.; Song, M.: Understanding the evolution of multiple scientific research domains using a content and network approach (2013) 0.12
    0.12232145 = sum of:
      0.12232145 = product of:
        0.5096727 = sum of:
          0.011696244 = weight(abstract_txt:different in 744) [ClassicSimilarity], result of:
            0.011696244 = score(doc=744,freq=1.0), product of:
              0.051054373 = queryWeight, product of:
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.0139283445 = queryNorm
              0.22909386 = fieldWeight in 744, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.0625 = fieldNorm(doc=744)
          0.04720894 = weight(abstract_txt:closer in 744) [ClassicSimilarity], result of:
            0.04720894 = score(doc=744,freq=1.0), product of:
              0.102724686 = queryWeight, product of:
                1.0030116 = boost
                7.3530817 = idf(docFreq=76, maxDocs=44218)
                0.0139283445 = queryNorm
              0.4595676 = fieldWeight in 744, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3530817 = idf(docFreq=76, maxDocs=44218)
                0.0625 = fieldNorm(doc=744)
          0.013073404 = weight(abstract_txt:article in 744) [ClassicSimilarity], result of:
            0.013073404 = score(doc=744,freq=1.0), product of:
              0.05498714 = queryWeight, product of:
                1.037801 = boost
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.0139283445 = queryNorm
              0.23775385 = fieldWeight in 744, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8040617 = idf(docFreq=2677, maxDocs=44218)
                0.0625 = fieldNorm(doc=744)
          0.0075213555 = weight(abstract_txt:from in 744) [ClassicSimilarity], result of:
            0.0075213555 = score(doc=744,freq=1.0), product of:
              0.04354081 = queryWeight, product of:
                1.1310385 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0139283445 = queryNorm
              0.17274266 = fieldWeight in 744, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=744)
          0.16211218 = weight(abstract_txt:similarity in 744) [ClassicSimilarity], result of:
            0.16211218 = score(doc=744,freq=3.0), product of:
              0.25734478 = queryWeight, product of:
                3.1750913 = boost
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.0139283445 = queryNorm
              0.6299416 = fieldWeight in 744, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.0625 = fieldNorm(doc=744)
          0.2680606 = weight(abstract_txt:cocitation in 744) [ClassicSimilarity], result of:
            0.2680606 = score(doc=744,freq=1.0), product of:
              0.5590758 = queryWeight, product of:
                5.2322545 = boost
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.0139283445 = queryNorm
              0.47947097 = fieldWeight in 744, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.0625 = fieldNorm(doc=744)
        0.24 = coord(6/25)
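
The content-match trees combine several terms: each matching term contributes queryWeight * fieldWeight, the contributions are summed, and the sum is scaled by coord (matched terms divided by the 25 query terms). The sketch below recomputes the first content hit (Wang and Wolfram, doc 1849) under the same assumed ClassicSimilarity definitions, idf = 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq). The names are illustrative, and the boosts, queryNorm, and fieldNorm are copied from the explanation output rather than derived.

```python
import math

MAX_DOCS = 44218          # maxDocs from the explanation output
QUERY_NORM = 0.0139283445 # queryNorm shared by all abstract_txt terms
FIELD_NORM = 0.0625       # fieldNorm(doc=1849)
QUERY_TERMS = 25          # denominator of coord(5/25)


def term_score(freq, doc_freq, boost=1.0):
    """queryWeight * fieldWeight for a single matching term."""
    idf = 1.0 + math.log(MAX_DOCS / (doc_freq + 1))
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * FIELD_NORM
    return query_weight * field_weight


# term: (termFreq, docFreq, boost), as listed for doc 1849.
# "different" shows no boost line in the output, so 1.0 is assumed.
matched = {
    "different": (1.0, 3075, 1.0),
    "from": (1.0, 7577, 1.1310385),
    "journal": (3.0, 697, 2.809257),
    "similarity": (4.0, 356, 3.1750913),
    "cocitation": (3.0, 55, 5.2322545),
}

raw_sum = sum(term_score(f, df, b) for f, df, b in matched.values())
final = raw_sum * len(matched) / QUERY_TERMS  # apply coord(5/25)

print(round(raw_sum, 4), round(final, 4))
```

The rare, heavily boosted term "cocitation" accounts for more than half of the raw sum, which is why documents that repeat it rank at the top of the content list.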