Document (#24852)

Author
Thelwall, M.
Title
Extracting macroscopic information from Web links
Source
Journal of the American Society for Information Science and technology. 52(2001) no.13, S.1157-1168
Year
2001
Abstract
Much has been written about the potential and pitfalls of macroscopic Web-based link analysis, yet there have been no studies that have provided clear statistical evidence that any of the proposed calculations can produce results over large areas of the Web that correlate with phenomena external to the Internet. This article attempts to provide such evidence through an evaluation of Ingwersen's (1998) proposed external Web Impact Factor (WIF) for the original use of the Web: the interlinking of academic research. In particular, it studies the case of the relationship between academic hyperlinks and research activity for universities in Britain, a country chosen for its variety of institutions and the existence of an official government rating exercise for research. After reviewing the numerous reasons why link counts may be unreliable, it demonstrates that four different WIFs do, in fact, correlate with the conventional academic research measures. The WIF delivering the greatest correlation with research rankings was the ratio of Web pages with links pointing at research-based pages to faculty numbers. The scarcity of links to electronic academic papers in the data set suggests that, in contrast to citation analysis, this WIF is measuring the reputations of universities and their scholars, rather than the quality of their publications
Theme
Internet
Citation indexing
Informetrie
Object
WWW

Similar documents (author)

  1. Thelwall, M.; Thelwall, S.: ¬A thematic analysis of highly retweeted early COVID-19 tweets : consensus, information, dissent and lockdown life (2020) 4.90
    4.897565 = sum of:
      4.897565 = weight(author_txt:thelwall in 178) [ClassicSimilarity], result of:
        4.897565 = fieldWeight in 178, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          6.926203 = idf(docFreq=117, maxDocs=44218)
          0.5 = fieldNorm(doc=178)
    
  2. Thelwall, M.: Conceptualizing documentation on the Web : an evaluation of different heuristic-based models for counting links between university Web sites (2002) 4.33
    4.3288765 = sum of:
      4.3288765 = weight(author_txt:thelwall in 978) [ClassicSimilarity], result of:
        4.3288765 = fieldWeight in 978, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.926203 = idf(docFreq=117, maxDocs=44218)
          0.625 = fieldNorm(doc=978)
    
  3. Thelwall, M.: Text characteristics of English language university Web sites (2005) 4.33
    4.3288765 = sum of:
      4.3288765 = weight(author_txt:thelwall in 3463) [ClassicSimilarity], result of:
        4.3288765 = fieldWeight in 3463, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.926203 = idf(docFreq=117, maxDocs=44218)
          0.625 = fieldNorm(doc=3463)
    
  4. Thelwall, M.: Bibliometrics to webometrics (2009) 4.33
    4.3288765 = sum of:
      4.3288765 = weight(author_txt:thelwall in 4239) [ClassicSimilarity], result of:
        4.3288765 = fieldWeight in 4239, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.926203 = idf(docFreq=117, maxDocs=44218)
          0.625 = fieldNorm(doc=4239)
    
  5. Thelwall, M.: ¬A layered approach for investigating the topological structure of communities in the Web (2003) 4.33
    4.3288765 = sum of:
      4.3288765 = weight(author_txt:thelwall in 4450) [ClassicSimilarity], result of:
        4.3288765 = fieldWeight in 4450, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.926203 = idf(docFreq=117, maxDocs=44218)
          0.625 = fieldNorm(doc=4450)
    

Similar documents (content)

  1. Barjak, F.; Thelwall, M.: ¬A statistical analysis of the web presences of European life sciences research teams (2008) 0.19
    0.18973082 = sum of:
      0.18973082 = product of:
        0.52703005 = sum of:
          0.024790788 = weight(abstract_txt:studies in 1383) [ClassicSimilarity], result of:
            0.024790788 = score(doc=1383,freq=1.0), product of:
              0.09293728 = queryWeight, product of:
                1.120295 = boost
                4.26796 = idf(docFreq=1683, maxDocs=44218)
                0.019437358 = queryNorm
              0.2667475 = fieldWeight in 1383, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.26796 = idf(docFreq=1683, maxDocs=44218)
                0.0625 = fieldNorm(doc=1383)
          0.108895935 = weight(abstract_txt:unreliable in 1383) [ClassicSimilarity], result of:
            0.108895935 = score(doc=1383,freq=1.0), product of:
              0.19784611 = queryWeight, product of:
                1.1558093 = boost
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.019437358 = queryNorm
              0.55040723 = fieldWeight in 1383, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.0625 = fieldNorm(doc=1383)
          0.014088198 = weight(abstract_txt:with in 1383) [ClassicSimilarity], result of:
            0.014088198 = score(doc=1383,freq=2.0), product of:
              0.0637627 = queryWeight, product of:
                1.312308 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.019437358 = queryNorm
              0.22094731 = fieldWeight in 1383, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=1383)
          0.04959891 = weight(abstract_txt:evidence in 1383) [ClassicSimilarity], result of:
            0.04959891 = score(doc=1383,freq=1.0), product of:
              0.14756313 = queryWeight, product of:
                1.4116478 = boost
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.019437358 = queryNorm
              0.33611995 = fieldWeight in 1383, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.0625 = fieldNorm(doc=1383)
          0.083864704 = weight(abstract_txt:link in 1383) [ClassicSimilarity], result of:
            0.083864704 = score(doc=1383,freq=2.0), product of:
              0.16622867 = queryWeight, product of:
                1.4982711 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.019437358 = queryNorm
              0.5045141 = fieldWeight in 1383, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=1383)
          0.014998257 = weight(abstract_txt:that in 1383) [ClassicSimilarity], result of:
            0.014998257 = score(doc=1383,freq=2.0), product of:
              0.07161329 = queryWeight, product of:
                1.5549064 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.019437358 = queryNorm
              0.20943399 = fieldWeight in 1383, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=1383)
          0.09724528 = weight(abstract_txt:links in 1383) [ClassicSimilarity], result of:
            0.09724528 = score(doc=1383,freq=2.0), product of:
              0.21002091 = queryWeight, product of:
                2.0625968 = boost
                5.2385488 = idf(docFreq=637, maxDocs=44218)
                0.019437358 = queryNorm
              0.46302667 = fieldWeight in 1383, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2385488 = idf(docFreq=637, maxDocs=44218)
                0.0625 = fieldNorm(doc=1383)
          0.065384395 = weight(abstract_txt:academic in 1383) [ClassicSimilarity], result of:
            0.065384395 = score(doc=1383,freq=1.0), product of:
              0.22352314 = queryWeight, product of:
                2.4570482 = boost
                4.6802773 = idf(docFreq=1114, maxDocs=44218)
                0.019437358 = queryNorm
              0.29251733 = fieldWeight in 1383, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6802773 = idf(docFreq=1114, maxDocs=44218)
                0.0625 = fieldNorm(doc=1383)
          0.06816357 = weight(abstract_txt:research in 1383) [ClassicSimilarity], result of:
            0.06816357 = score(doc=1383,freq=5.0), product of:
              0.1538444 = queryWeight, product of:
                2.4965422 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.019437358 = queryNorm
              0.4430682 = fieldWeight in 1383, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.0625 = fieldNorm(doc=1383)
        0.36 = coord(9/25)
    
  2. Vaughan, L.; Thelwall, M.: Scholarly use of the Web : what are the key inducers of links to journal Web sites? (2003) 0.18
    0.17565626 = sum of:
      0.17565626 = product of:
        0.5489258 = sum of:
          0.00996186 = weight(abstract_txt:with in 1236) [ClassicSimilarity], result of:
            0.00996186 = score(doc=1236,freq=1.0), product of:
              0.0637627 = queryWeight, product of:
                1.312308 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.019437358 = queryNorm
              0.15623334 = fieldWeight in 1236, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=1236)
          0.04959891 = weight(abstract_txt:evidence in 1236) [ClassicSimilarity], result of:
            0.04959891 = score(doc=1236,freq=1.0), product of:
              0.14756313 = queryWeight, product of:
                1.4116478 = boost
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.019437358 = queryNorm
              0.33611995 = fieldWeight in 1236, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.377919 = idf(docFreq=554, maxDocs=44218)
                0.0625 = fieldNorm(doc=1236)
          0.059301306 = weight(abstract_txt:link in 1236) [ClassicSimilarity], result of:
            0.059301306 = score(doc=1236,freq=1.0), product of:
              0.16622867 = queryWeight, product of:
                1.4982711 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.019437358 = queryNorm
              0.35674536 = fieldWeight in 1236, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=1236)
          0.025977744 = weight(abstract_txt:that in 1236) [ClassicSimilarity], result of:
            0.025977744 = score(doc=1236,freq=6.0), product of:
              0.07161329 = queryWeight, product of:
                1.5549064 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.019437358 = queryNorm
              0.36275032 = fieldWeight in 1236, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=1236)
          0.0738154 = weight(abstract_txt:universities in 1236) [ClassicSimilarity], result of:
            0.0738154 = score(doc=1236,freq=1.0), product of:
              0.19235098 = queryWeight, product of:
                1.6117015 = boost
                6.140059 = idf(docFreq=258, maxDocs=44218)
                0.019437358 = queryNorm
              0.3837537 = fieldWeight in 1236, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.140059 = idf(docFreq=258, maxDocs=44218)
                0.0625 = fieldNorm(doc=1236)
          0.14602867 = weight(abstract_txt:correlate in 1236) [ClassicSimilarity], result of:
            0.14602867 = score(doc=1236,freq=1.0), product of:
              0.30312505 = queryWeight, product of:
                2.0232444 = boost
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.019437358 = queryNorm
              0.48174396 = fieldWeight in 1236, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.0625 = fieldNorm(doc=1236)
          0.15375829 = weight(abstract_txt:links in 1236) [ClassicSimilarity], result of:
            0.15375829 = score(doc=1236,freq=5.0), product of:
              0.21002091 = queryWeight, product of:
                2.0625968 = boost
                5.2385488 = idf(docFreq=637, maxDocs=44218)
                0.019437358 = queryNorm
              0.7321094 = fieldWeight in 1236, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.2385488 = idf(docFreq=637, maxDocs=44218)
                0.0625 = fieldNorm(doc=1236)
          0.03048367 = weight(abstract_txt:research in 1236) [ClassicSimilarity], result of:
            0.03048367 = score(doc=1236,freq=1.0), product of:
              0.1538444 = queryWeight, product of:
                2.4965422 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.019437358 = queryNorm
              0.19814612 = fieldWeight in 1236, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.0625 = fieldNorm(doc=1236)
        0.32 = coord(8/25)
    
  3. Stone, P.: JANET : a report on its use for libraries (1990) 0.17
    0.16550262 = sum of:
      0.16550262 = product of:
        1.0343914 = sum of:
          0.6287328 = weight(subject_txt:britain in 778) [ClassicSimilarity], result of:
            0.6287328 = score(doc=778,freq=2.0), product of:
              0.17283851 = queryWeight, product of:
                1.0802958 = boost
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.019437358 = queryNorm
              3.6376894 = fieldWeight in 778, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.3125 = fieldNorm(doc=778)
          0.15658611 = weight(abstract_txt:universities in 778) [ClassicSimilarity], result of:
            0.15658611 = score(doc=778,freq=2.0), product of:
              0.19235098 = queryWeight, product of:
                1.6117015 = boost
                6.140059 = idf(docFreq=258, maxDocs=44218)
                0.019437358 = queryNorm
              0.8140645 = fieldWeight in 778, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.140059 = idf(docFreq=258, maxDocs=44218)
                0.09375 = fieldNorm(doc=778)
          0.16987363 = weight(abstract_txt:academic in 778) [ClassicSimilarity], result of:
            0.16987363 = score(doc=778,freq=3.0), product of:
              0.22352314 = queryWeight, product of:
                2.4570482 = boost
                4.6802773 = idf(docFreq=1114, maxDocs=44218)
                0.019437358 = queryNorm
              0.7599823 = fieldWeight in 778, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6802773 = idf(docFreq=1114, maxDocs=44218)
                0.09375 = fieldNorm(doc=778)
          0.0791989 = weight(abstract_txt:research in 778) [ClassicSimilarity], result of:
            0.0791989 = score(doc=778,freq=3.0), product of:
              0.1538444 = queryWeight, product of:
                2.4965422 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.019437358 = queryNorm
              0.5147987 = fieldWeight in 778, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.09375 = fieldNorm(doc=778)
        0.16 = coord(4/25)
    
  4. Thelwall, M.: ¬A comparison of sources of links for academic Web impact factor calculations (2002) 0.15
    0.15000434 = sum of:
      0.15000434 = product of:
        0.6250181 = sum of:
          0.017610246 = weight(abstract_txt:with in 4474) [ClassicSimilarity], result of:
            0.017610246 = score(doc=4474,freq=2.0), product of:
              0.0637627 = queryWeight, product of:
                1.312308 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.019437358 = queryNorm
              0.27618414 = fieldWeight in 4474, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.078125 = fieldNorm(doc=4474)
          0.037495643 = weight(abstract_txt:that in 4474) [ClassicSimilarity], result of:
            0.037495643 = score(doc=4474,freq=8.0), product of:
              0.07161329 = queryWeight, product of:
                1.5549064 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.019437358 = queryNorm
              0.52358496 = fieldWeight in 4474, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=4474)
          0.15981503 = weight(abstract_txt:universities in 4474) [ClassicSimilarity], result of:
            0.15981503 = score(doc=4474,freq=3.0), product of:
              0.19235098 = queryWeight, product of:
                1.6117015 = boost
                6.140059 = idf(docFreq=258, maxDocs=44218)
                0.019437358 = queryNorm
              0.83085114 = fieldWeight in 4474, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.140059 = idf(docFreq=258, maxDocs=44218)
                0.078125 = fieldNorm(doc=4474)
          0.25814465 = weight(abstract_txt:correlate in 4474) [ClassicSimilarity], result of:
            0.25814465 = score(doc=4474,freq=2.0), product of:
              0.30312505 = queryWeight, product of:
                2.0232444 = boost
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.019437358 = queryNorm
              0.851611 = fieldWeight in 4474, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.078125 = fieldNorm(doc=4474)
          0.0859535 = weight(abstract_txt:links in 4474) [ClassicSimilarity], result of:
            0.0859535 = score(doc=4474,freq=1.0), product of:
              0.21002091 = queryWeight, product of:
                2.0625968 = boost
                5.2385488 = idf(docFreq=637, maxDocs=44218)
                0.019437358 = queryNorm
              0.4092616 = fieldWeight in 4474, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2385488 = idf(docFreq=637, maxDocs=44218)
                0.078125 = fieldNorm(doc=4474)
          0.06599908 = weight(abstract_txt:research in 4474) [ClassicSimilarity], result of:
            0.06599908 = score(doc=4474,freq=3.0), product of:
              0.1538444 = queryWeight, product of:
                2.4965422 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.019437358 = queryNorm
              0.42899892 = fieldWeight in 4474, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.078125 = fieldNorm(doc=4474)
        0.24 = coord(6/25)
    
  5. Vaughan, L.; Thelwall, M.: ¬A modelling approach to uncover hyperlink patterns : the case of Canadian universities (2005) 0.14
    0.14121695 = sum of:
      0.14121695 = product of:
        0.7060847 = sum of:
          0.1633439 = weight(abstract_txt:interlinking in 1014) [ClassicSimilarity], result of:
            0.1633439 = score(doc=1014,freq=1.0), product of:
              0.19784611 = queryWeight, product of:
                1.1558093 = boost
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.019437358 = queryNorm
              0.8256109 = fieldWeight in 1014, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.09375 = fieldNorm(doc=1014)
          0.08895196 = weight(abstract_txt:link in 1014) [ClassicSimilarity], result of:
            0.08895196 = score(doc=1014,freq=1.0), product of:
              0.16622867 = queryWeight, product of:
                1.4982711 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.019437358 = queryNorm
              0.53511804 = fieldWeight in 1014, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.09375 = fieldNorm(doc=1014)
          0.027553555 = weight(abstract_txt:that in 1014) [ClassicSimilarity], result of:
            0.027553555 = score(doc=1014,freq=3.0), product of:
              0.07161329 = queryWeight, product of:
                1.5549064 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.019437358 = queryNorm
              0.38475478 = fieldWeight in 1014, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.09375 = fieldNorm(doc=1014)
          0.24758437 = weight(abstract_txt:universities in 1014) [ClassicSimilarity], result of:
            0.24758437 = score(doc=1014,freq=5.0), product of:
              0.19235098 = queryWeight, product of:
                1.6117015 = boost
                6.140059 = idf(docFreq=258, maxDocs=44218)
                0.019437358 = queryNorm
              1.287149 = fieldWeight in 1014, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.140059 = idf(docFreq=258, maxDocs=44218)
                0.09375 = fieldNorm(doc=1014)
          0.17865098 = weight(abstract_txt:links in 1014) [ClassicSimilarity], result of:
            0.17865098 = score(doc=1014,freq=3.0), product of:
              0.21002091 = queryWeight, product of:
                2.0625968 = boost
                5.2385488 = idf(docFreq=637, maxDocs=44218)
                0.019437358 = queryNorm
              0.8506342 = fieldWeight in 1014, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.2385488 = idf(docFreq=637, maxDocs=44218)
                0.09375 = fieldNorm(doc=1014)
        0.2 = coord(5/25)