Document (#40836)

Author
O'Neill, E.T.
Kammerer, K.A.
Bennett, R.
Title
¬The aboutness of words
Source
Journal of the Association for Information Science and Technology. 68(2017) no.10, p.2471-2483
Year
2017
Abstract
Word aboutness is defined as the relationship between words and the subjects associated with them. An aboutness coefficient is developed to estimate the strength of the aboutness relationship. Words that are randomly distributed across subjects are assumed to lack aboutness, and the degree to which their usage deviates from a random pattern indicates the strength of the aboutness. To estimate aboutness, title words and their associated subjects are extracted from the titles of English-language non-fiction books in the OCLC WorldCat database. The usage patterns of the title words are analyzed and used to compute aboutness coefficients for each of the common title words. Words with low aboutness coefficients (An and In) are commonly found in stop word lists, whereas words with high aboutness coefficients (Carbonate, Autism) are unambiguous and have a strong subject association. The aboutness coefficient can potentially enhance indexing, advance authority control, and improve retrieval.
Content
Cf.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23856/full.
Theme
Concept theory

Similar documents (author)

  1. O'Neill, E.T.; Bennett, R.; Kammerer, K.: Using authorities to improve subject searches (2012) 5.83
    5.8257585 = sum of:
      5.8257585 = sum of:
        1.3527329 = weight(author_txt:o'neill in 310) [ClassicSimilarity], result of:
          1.3527329 = score(doc=310,freq=1.0), product of:
            0.45046005 = queryWeight, product of:
              8.008008 = idf(docFreq=39, maxDocs=44218)
              0.056251198 = queryNorm
            3.0030031 = fieldWeight in 310, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.008008 = idf(docFreq=39, maxDocs=44218)
              0.375 = fieldNorm(doc=310)
        1.9131504 = weight(author_txt:bennett in 310) [ClassicSimilarity], result of:
          1.9131504 = score(doc=310,freq=1.0), product of:
            0.56756335 = queryWeight, product of:
              1.1224811 = boost
              8.988837 = idf(docFreq=14, maxDocs=44218)
              0.056251198 = queryNorm
            3.3708138 = fieldWeight in 310, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.988837 = idf(docFreq=14, maxDocs=44218)
              0.375 = fieldNorm(doc=310)
        2.559875 = weight(author_txt:kammerer in 310) [ClassicSimilarity], result of:
          2.559875 = score(doc=310,freq=1.0), product of:
            0.6891717 = queryWeight, product of:
              1.2369028 = boost
              9.905128 = idf(docFreq=5, maxDocs=44218)
              0.056251198 = queryNorm
            3.7144227 = fieldWeight in 310, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.905128 = idf(docFreq=5, maxDocs=44218)
              0.375 = fieldNorm(doc=310)
    
  2. O'Neill, E.T.; Bennett, R.; Kammerer, K.: Using authorities to improve subject searches (2014) 5.83
    5.8257585 = sum of:
      5.8257585 = sum of:
        1.3527329 = weight(author_txt:o'neill in 1970) [ClassicSimilarity], result of:
          1.3527329 = score(doc=1970,freq=1.0), product of:
            0.45046005 = queryWeight, product of:
              8.008008 = idf(docFreq=39, maxDocs=44218)
              0.056251198 = queryNorm
            3.0030031 = fieldWeight in 1970, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.008008 = idf(docFreq=39, maxDocs=44218)
              0.375 = fieldNorm(doc=1970)
        1.9131504 = weight(author_txt:bennett in 1970) [ClassicSimilarity], result of:
          1.9131504 = score(doc=1970,freq=1.0), product of:
            0.56756335 = queryWeight, product of:
              1.1224811 = boost
              8.988837 = idf(docFreq=14, maxDocs=44218)
              0.056251198 = queryNorm
            3.3708138 = fieldWeight in 1970, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.988837 = idf(docFreq=14, maxDocs=44218)
              0.375 = fieldNorm(doc=1970)
        2.559875 = weight(author_txt:kammerer in 1970) [ClassicSimilarity], result of:
          2.559875 = score(doc=1970,freq=1.0), product of:
            0.6891717 = queryWeight, product of:
              1.2369028 = boost
              9.905128 = idf(docFreq=5, maxDocs=44218)
              0.056251198 = queryNorm
            3.7144227 = fieldWeight in 1970, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.905128 = idf(docFreq=5, maxDocs=44218)
              0.375 = fieldNorm(doc=1970)
    
  3. O'Neill, E.T.; Childress, E.; Dean, R.; Kammerer, K.; Vizine-Goetz, D.; Chan, L.M.; El-Hoshy, L.: FAST: faceted application of subject terminology (2003) 1.74
    1.7389369 = sum of:
      1.7389369 = product of:
        2.6084054 = sum of:
          0.9018219 = weight(author_txt:o'neill in 3816) [ClassicSimilarity], result of:
            0.9018219 = score(doc=3816,freq=1.0), product of:
              0.45046005 = queryWeight, product of:
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.056251198 = queryNorm
              2.002002 = fieldWeight in 3816, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.25 = fieldNorm(doc=3816)
          1.7065834 = weight(author_txt:kammerer in 3816) [ClassicSimilarity], result of:
            1.7065834 = score(doc=3816,freq=1.0), product of:
              0.6891717 = queryWeight, product of:
                1.2369028 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.056251198 = queryNorm
              2.476282 = fieldWeight in 3816, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.25 = fieldNorm(doc=3816)
        0.6666667 = coord(2/3)
    
  4. Bennett, R.: Terminology and the computer : attention shifts to the micro (1994) 1.06
    1.0628614 = sum of:
      1.0628614 = product of:
        3.188584 = sum of:
          3.188584 = weight(author_txt:bennett in 608) [ClassicSimilarity], result of:
            3.188584 = score(doc=608,freq=1.0), product of:
              0.56756335 = queryWeight, product of:
                1.1224811 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.056251198 = queryNorm
              5.6180234 = fieldWeight in 608, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.625 = fieldNorm(doc=608)
        0.33333334 = coord(1/3)
    
  5. Bennett, D.C.: ¬The internationalization of scholarship and scholarly societies in the humanities and social sciences (1996) 1.06
    1.0628614 = sum of:
      1.0628614 = product of:
        3.188584 = sum of:
          3.188584 = weight(author_txt:bennett in 6037) [ClassicSimilarity], result of:
            3.188584 = score(doc=6037,freq=1.0), product of:
              0.56756335 = queryWeight, product of:
                1.1224811 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.056251198 = queryNorm
              5.6180234 = fieldWeight in 6037, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.625 = fieldNorm(doc=6037)
        0.33333334 = coord(1/3)
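
Note on the coord factor: in entries 3 to 5 above, the summed term weights are multiplied by coord(matched/total), the fraction of the three author query terms that occur in the document. The following is a minimal Python sketch of that final step only, not the search engine's own code. The function name `combine` and its parameters are hypothetical; the term weights are copied from the explanations above. Entries 1 and 2 match all three terms, so their factor would be 1, which is presumably why no coord line is printed for them.

```python
def combine(term_weights, total_query_terms):
    """Sum the per-term weights and scale by coord = matched / total.

    term_weights: weights of the query terms that matched the document
    total_query_terms: number of terms in the query (3 author names here)
    """
    coord = len(term_weights) / total_query_terms
    return sum(term_weights) * coord


# Entry 3 (doc 3816): o'neill and kammerer match, bennett does not -> coord(2/3)
fast_2003 = combine([0.9018219, 1.7065834], total_query_terms=3)

# Entry 4 (doc 608): only bennett matches -> coord(1/3)
bennett_1994 = combine([3.188584], total_query_terms=3)

print(round(fast_2003, 2), round(bennett_1994, 2))
```

Under these assumptions the two results should land close to the listed totals of 1.7389369 and 1.0628614; small differences in the last digits are expected because the listing uses single-precision arithmetic.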
    

Similar documents (content)

  1. Holley, R.M.; Joudrey, D.N.: Aboutness and conceptual analysis : a review (2021) 0.22
    0.21697968 = sum of:
      0.21697968 = product of:
        1.808164 = sum of:
          0.005249711 = weight(abstract_txt:with in 703) [ClassicSimilarity], result of:
            0.005249711 = score(doc=703,freq=1.0), product of:
              0.022401156 = queryWeight, product of:
                1.1617409 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.007713784 = queryNorm
              0.23435001 = fieldWeight in 703, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.09375 = fieldNorm(doc=703)
          0.029457541 = weight(abstract_txt:associated in 703) [ClassicSimilarity], result of:
            0.029457541 = score(doc=703,freq=1.0), product of:
              0.061794158 = queryWeight, product of:
                1.575441 = boost
                5.084846 = idf(docFreq=743, maxDocs=44218)
                0.007713784 = queryNorm
              0.4767043 = fieldWeight in 703, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.084846 = idf(docFreq=743, maxDocs=44218)
                0.09375 = fieldNorm(doc=703)
          1.7734567 = weight(abstract_txt:aboutness in 703) [ClassicSimilarity], result of:
            1.7734567 = score(doc=703,freq=8.0), product of:
              0.8377627 = queryWeight, product of:
                13.604115 = boost
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.007713784 = queryNorm
              2.1168962 = fieldWeight in 703, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.09375 = fieldNorm(doc=703)
        0.12 = coord(3/25)
    
  2. Hauser, E.; Tennis, J.T.: Episemantics: aboutness as aroundness (2019) 0.22
    0.21638758 = sum of:
      0.21638758 = product of:
        1.0819379 = sum of:
          0.0034998073 = weight(abstract_txt:with in 5640) [ClassicSimilarity], result of:
            0.0034998073 = score(doc=5640,freq=1.0), product of:
              0.022401156 = queryWeight, product of:
                1.1617409 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.007713784 = queryNorm
              0.15623334 = fieldWeight in 5640, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=5640)
          0.018174054 = weight(abstract_txt:relationship in 5640) [ClassicSimilarity], result of:
            0.018174054 = score(doc=5640,freq=1.0), product of:
              0.05868293 = queryWeight, product of:
                1.5352684 = boost
                4.9551864 = idf(docFreq=846, maxDocs=44218)
                0.007713784 = queryNorm
              0.30969915 = fieldWeight in 5640, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9551864 = idf(docFreq=846, maxDocs=44218)
                0.0625 = fieldNorm(doc=5640)
          0.03392191 = weight(abstract_txt:word in 5640) [ClassicSimilarity], result of:
            0.03392191 = score(doc=5640,freq=2.0), product of:
              0.070608035 = queryWeight, product of:
                1.684052 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.007713784 = queryNorm
              0.48042563 = fieldWeight in 5640, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0625 = fieldNorm(doc=5640)
          0.09164846 = weight(abstract_txt:words in 5640) [ClassicSimilarity], result of:
            0.09164846 = score(doc=5640,freq=1.0), product of:
              0.2739349 = queryWeight, product of:
                6.6341014 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.007713784 = queryNorm
              0.33456293 = fieldWeight in 5640, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.0625 = fieldNorm(doc=5640)
          0.93469363 = weight(abstract_txt:aboutness in 5640) [ClassicSimilarity], result of:
            0.93469363 = score(doc=5640,freq=5.0), product of:
              0.8377627 = queryWeight, product of:
                13.604115 = boost
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.007713784 = queryNorm
              1.1157022 = fieldWeight in 5640, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.0625 = fieldNorm(doc=5640)
        0.2 = coord(5/25)
    
  3. Weinberg, B.H.: Why indexing fails the researcher (1988) 0.09
    0.090077825 = sum of:
      0.090077825 = product of:
        0.75064856 = sum of:
          0.0069996146 = weight(abstract_txt:with in 703) [ClassicSimilarity], result of:
            0.0069996146 = score(doc=703,freq=4.0), product of:
              0.022401156 = queryWeight, product of:
                1.1617409 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.007713784 = queryNorm
              0.31246668 = fieldWeight in 703, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=703)
          0.019638361 = weight(abstract_txt:associated in 703) [ClassicSimilarity], result of:
            0.019638361 = score(doc=703,freq=1.0), product of:
              0.061794158 = queryWeight, product of:
                1.575441 = boost
                5.084846 = idf(docFreq=743, maxDocs=44218)
                0.007713784 = queryNorm
              0.31780288 = fieldWeight in 703, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.084846 = idf(docFreq=743, maxDocs=44218)
                0.0625 = fieldNorm(doc=703)
          0.7240106 = weight(abstract_txt:aboutness in 703) [ClassicSimilarity], result of:
            0.7240106 = score(doc=703,freq=3.0), product of:
              0.8377627 = queryWeight, product of:
                13.604115 = boost
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.007713784 = queryNorm
              0.8642192 = fieldWeight in 703, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.0625 = fieldNorm(doc=703)
        0.12 = coord(3/25)
    
  4. Tseng, Y.-H.: Automatic thesaurus generation for Chinese documents (2002) 0.08
    0.08214647 = sum of:
      0.08214647 = product of:
        0.342277 = sum of:
          0.006061844 = weight(abstract_txt:with in 5226) [ClassicSimilarity], result of:
            0.006061844 = score(doc=5226,freq=3.0), product of:
              0.022401156 = queryWeight, product of:
                1.1617409 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.007713784 = queryNorm
              0.27060407 = fieldWeight in 5226, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=5226)
          0.031792775 = weight(abstract_txt:stop in 5226) [ClassicSimilarity], result of:
            0.031792775 = score(doc=5226,freq=1.0), product of:
              0.06762172 = queryWeight, product of:
                1.1653504 = boost
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.007713784 = queryNorm
              0.47015625 = fieldWeight in 5226, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.0625 = fieldNorm(doc=5226)
          0.027772836 = weight(abstract_txt:associated in 5226) [ClassicSimilarity], result of:
            0.027772836 = score(doc=5226,freq=2.0), product of:
              0.061794158 = queryWeight, product of:
                1.575441 = boost
                5.084846 = idf(docFreq=743, maxDocs=44218)
                0.007713784 = queryNorm
              0.44944113 = fieldWeight in 5226, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.084846 = idf(docFreq=743, maxDocs=44218)
                0.0625 = fieldNorm(doc=5226)
          0.047972824 = weight(abstract_txt:word in 5226) [ClassicSimilarity], result of:
            0.047972824 = score(doc=5226,freq=4.0), product of:
              0.070608035 = queryWeight, product of:
                1.684052 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.007713784 = queryNorm
              0.67942446 = fieldWeight in 5226, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0625 = fieldNorm(doc=5226)
          0.06993692 = weight(abstract_txt:coefficient in 5226) [ClassicSimilarity], result of:
            0.06993692 = score(doc=5226,freq=1.0), product of:
              0.14410585 = queryWeight, product of:
                2.4058537 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.007713784 = queryNorm
              0.48531634 = fieldWeight in 5226, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.0625 = fieldNorm(doc=5226)
          0.15873979 = weight(abstract_txt:words in 5226) [ClassicSimilarity], result of:
            0.15873979 = score(doc=5226,freq=3.0), product of:
              0.2739349 = queryWeight, product of:
                6.6341014 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.007713784 = queryNorm
              0.57948 = fieldWeight in 5226, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.0625 = fieldNorm(doc=5226)
        0.24 = coord(6/25)
    
  5. Moraes, J.B.E. de: Aboutness in fiction : methodological perspectives for knowledge organization (2012) 0.07
    0.0732616 = sum of:
      0.0732616 = product of:
        0.91577005 = sum of:
          0.07975459 = weight(abstract_txt:fiction in 856) [ClassicSimilarity], result of:
            0.07975459 = score(doc=856,freq=3.0), product of:
              0.054531064 = queryWeight, product of:
                1.0464908 = boost
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.007713784 = queryNorm
              1.4625534 = fieldWeight in 856, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.125 = fieldNorm(doc=856)
          0.83601546 = weight(abstract_txt:aboutness in 856) [ClassicSimilarity], result of:
            0.83601546 = score(doc=856,freq=1.0), product of:
              0.8377627 = queryWeight, product of:
                13.604115 = boost
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.007713784 = queryNorm
              0.9979144 = fieldWeight in 856, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.125 = fieldNorm(doc=856)
        0.08 = coord(2/25)
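
Note on the individual term weights: each weight(...) line above is the product of a queryWeight (boost x idf x queryNorm) and a fieldWeight (tf x idf x fieldNorm). The sketch below recomputes one such weight in Python, assuming the standard Lucene ClassicSimilarity definitions idf = 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq), both of which agree with the figures printed in the listing. The function names are hypothetical, and boost, queryNorm and fieldNorm are simply copied from the listing rather than derived.

```python
import math


def classic_idf(doc_freq, max_docs):
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_weight(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    """Weight of one query term in one document (queryWeight * fieldWeight)."""
    idf = classic_idf(doc_freq, max_docs)
    tf = math.sqrt(freq)                      # term frequency is dampened by a square root
    query_weight = boost * idf * query_norm   # query-side factor
    field_weight = tf * idf * field_norm      # document-side factor
    return query_weight * field_weight


# abstract_txt:aboutness in doc 703 (entry 1 of the content-based list)
aboutness_703 = term_weight(
    freq=8.0, doc_freq=40, max_docs=44218,
    query_norm=0.007713784, field_norm=0.09375, boost=13.604115,
)

# abstract_txt:fiction in doc 856 (entry 5 of the content-based list)
fiction_856 = term_weight(
    freq=3.0, doc_freq=139, max_docs=44218,
    query_norm=0.007713784, field_norm=0.125, boost=1.0464908,
)

print(round(aboutness_703, 3), round(fiction_856, 3))
```

The two values should come out near the printed 1.7734567 and 0.07975459. Note that idf enters the product twice, once on the query side and once on the document side, so rare terms such as "aboutness" (docFreq=40) dominate the totals even before their boost is applied.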