Document (#2231)

Author
Cousins, S.A.
Title
Enhancing subject access to OPACs : controlled vocabulary vs. natural language
Source
Journal of documentation. 48(1992) no.3, S.291-309
Year
1992
Abstract
Experimental evidence suggests that enhancing the subject content of OPAC records can improve retrieval performance. This is based on the use of natural language index terms derived from the table of contents and back-of-the-book index of documents. The research reported here investigates the alternative approach of translating these natural language terms into controlled vocabulary. Subject queries were collected by interview at the catalogue, and indexing of the queries demonstrated the impressive ability of PRECIS, and to a lesser extent LCSH, to represent users' information needs. DDC performed poorly in this respect. The assumption was made that an index language adequately specific to represent users' queries should be adequate to represent document contents. Searches were carried out on three test databases, and both natural language and PRECIS enhancement of MARC records increased the number of relevant documents found, with PRECIS showing the better performance. However, with weak stemming the advantage of PRECIS was lost. Consideration must also be given to the potential advantages of controlled vocabulary, over and above basic retrieval performance measures
Theme
Verbale Doksprachen im Online-Retrieval
Kataloganreicherung
Object
LCSH
PRECIS

Similar documents (author)

  1. Cousins, S.A.: In their own words : an examination of catalogue users' subject queries (1992) 5.81
    5.81187 = sum of:
      5.81187 = weight(author_txt:cousins in 2621) [ClassicSimilarity], result of:
        5.81187 = fieldWeight in 2621, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.298992 = idf(docFreq=10, maxDocs=44218)
          0.625 = fieldNorm(doc=2621)
    
  2. Cousins, S.A.: In their own words : an examination of catalogue subject queries (1992) 5.81
    5.81187 = sum of:
      5.81187 = weight(author_txt:cousins in 3731) [ClassicSimilarity], result of:
        5.81187 = fieldWeight in 3731, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.298992 = idf(docFreq=10, maxDocs=44218)
          0.625 = fieldNorm(doc=3731)
    
  3. Cousins, G.: Professional indexing in Australia : first steps towards accreditation (1993) 5.81
    5.81187 = sum of:
      5.81187 = weight(author_txt:cousins in 7651) [ClassicSimilarity], result of:
        5.81187 = fieldWeight in 7651, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.298992 = idf(docFreq=10, maxDocs=44218)
          0.625 = fieldNorm(doc=7651)
    
  4. Cousins, S.: COPAC: new research library union catalogue (1997) 5.81
    5.81187 = sum of:
      5.81187 = weight(author_txt:cousins in 664) [ClassicSimilarity], result of:
        5.81187 = fieldWeight in 664, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.298992 = idf(docFreq=10, maxDocs=44218)
          0.625 = fieldNorm(doc=664)
    
  5. Cousins, S.A.: Duplicate detection and record consolidation in large bibliographic databases : the COPAC database experience (1998) 5.81
    5.81187 = sum of:
      5.81187 = weight(author_txt:cousins in 2833) [ClassicSimilarity], result of:
        5.81187 = fieldWeight in 2833, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.298992 = idf(docFreq=10, maxDocs=44218)
          0.625 = fieldNorm(doc=2833)
    

Similar documents (content)

  1. Austin, D.; Digger, J.A.: PRECIS: The Preserved Context Index System (1985) 0.41
    0.40799758 = sum of:
      0.40799758 = product of:
        1.6999899 = sum of:
          0.019703235 = weight(abstract_txt:were in 3652) [ClassicSimilarity], result of:
            0.019703235 = score(doc=3652,freq=4.0), product of:
              0.06871854 = queryWeight, product of:
                1.028093 = boost
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.018212432 = queryNorm
              0.28672373 = fieldWeight in 3652, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3652)
          0.026357686 = weight(abstract_txt:terms in 3652) [ClassicSimilarity], result of:
            0.026357686 = score(doc=3652,freq=4.0), product of:
              0.08342965 = queryWeight, product of:
                1.1328062 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.018212432 = queryNorm
              0.3159271 = fieldWeight in 3652, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3652)
          0.017828487 = weight(abstract_txt:subject in 3652) [ClassicSimilarity], result of:
            0.017828487 = score(doc=3652,freq=1.0), product of:
              0.11681779 = queryWeight, product of:
                1.6417066 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.018212432 = queryNorm
              0.15261792 = fieldWeight in 3652, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3652)
          0.078423545 = weight(abstract_txt:index in 3652) [ClassicSimilarity], result of:
            0.078423545 = score(doc=3652,freq=6.0), product of:
              0.17258903 = queryWeight, product of:
                1.9954813 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.018212432 = queryNorm
              0.4543947 = fieldWeight in 3652, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3652)
          0.036442325 = weight(abstract_txt:language in 3652) [ClassicSimilarity], result of:
            0.036442325 = score(doc=3652,freq=1.0), product of:
              0.22307605 = queryWeight, product of:
                2.9288146 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.018212432 = queryNorm
              0.16336279 = fieldWeight in 3652, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.0390625 = fieldNorm(doc=3652)
          1.5212346 = weight(title_txt:precis in 3652) [ClassicSimilarity], result of:
            1.5212346 = score(doc=3652,freq=1.0), product of:
              0.5516905 = queryWeight, product of:
                4.11963 = boost
                7.3530817 = idf(docFreq=76, maxDocs=44218)
                0.018212432 = queryNorm
              2.7574058 = fieldWeight in 3652, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3530817 = idf(docFreq=76, maxDocs=44218)
                0.375 = fieldNorm(doc=3652)
        0.24 = coord(6/25)
    
  2. Austin, D.: PRECIS in a multilingual context : Pt.1: PRECIS: an overview (1976) 0.40
    0.4043827 = sum of:
      0.4043827 = product of:
        2.0219135 = sum of:
          0.026357686 = weight(abstract_txt:terms in 983) [ClassicSimilarity], result of:
            0.026357686 = score(doc=983,freq=1.0), product of:
              0.08342965 = queryWeight, product of:
                1.1328062 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.018212432 = queryNorm
              0.3159271 = fieldWeight in 983, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=983)
          0.035656974 = weight(abstract_txt:subject in 983) [ClassicSimilarity], result of:
            0.035656974 = score(doc=983,freq=1.0), product of:
              0.11681779 = queryWeight, product of:
                1.6417066 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.018212432 = queryNorm
              0.30523583 = fieldWeight in 983, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.078125 = fieldNorm(doc=983)
          0.064032555 = weight(abstract_txt:index in 983) [ClassicSimilarity], result of:
            0.064032555 = score(doc=983,freq=1.0), product of:
              0.17258903 = queryWeight, product of:
                1.9954813 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.018212432 = queryNorm
              0.37101173 = fieldWeight in 983, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.078125 = fieldNorm(doc=983)
          0.10307446 = weight(abstract_txt:language in 983) [ClassicSimilarity], result of:
            0.10307446 = score(doc=983,freq=2.0), product of:
              0.22307605 = queryWeight, product of:
                2.9288146 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.018212432 = queryNorm
              0.46205974 = fieldWeight in 983, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.078125 = fieldNorm(doc=983)
          1.792792 = weight(title_txt:precis in 983) [ClassicSimilarity], result of:
            1.792792 = score(doc=983,freq=2.0), product of:
              0.5516905 = queryWeight, product of:
                4.11963 = boost
                7.3530817 = idf(docFreq=76, maxDocs=44218)
                0.018212432 = queryNorm
              3.2496336 = fieldWeight in 983, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.3530817 = idf(docFreq=76, maxDocs=44218)
                0.3125 = fieldNorm(doc=983)
        0.2 = coord(5/25)
    
  3. Biswas, S.C.; Smith, F.: Efficiency and effectiveness of deep structure based indexing languages : PRECIS vs. DSIS (1991) 0.37
    0.374575 = sum of:
      0.374575 = product of:
        1.3377678 = sum of:
          0.01845038 = weight(abstract_txt:terms in 2187) [ClassicSimilarity], result of:
            0.01845038 = score(doc=2187,freq=1.0), product of:
              0.08342965 = queryWeight, product of:
                1.1328062 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.018212432 = queryNorm
              0.22114895 = fieldWeight in 2187, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2187)
          0.019530725 = weight(abstract_txt:documents in 2187) [ClassicSimilarity], result of:
            0.019530725 = score(doc=2187,freq=1.0), product of:
              0.08665543 = queryWeight, product of:
                1.1544983 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.018212432 = queryNorm
              0.22538373 = fieldWeight in 2187, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2187)
          0.04323178 = weight(abstract_txt:subject in 2187) [ClassicSimilarity], result of:
            0.04323178 = score(doc=2187,freq=3.0), product of:
              0.11681779 = queryWeight, product of:
                1.6417066 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.018212432 = queryNorm
              0.37007877 = fieldWeight in 2187, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2187)
          0.08964557 = weight(abstract_txt:index in 2187) [ClassicSimilarity], result of:
            0.08964557 = score(doc=2187,freq=4.0), product of:
              0.17258903 = queryWeight, product of:
                1.9954813 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.018212432 = queryNorm
              0.5194164 = fieldWeight in 2187, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2187)
          0.06438511 = weight(abstract_txt:vocabulary in 2187) [ClassicSimilarity], result of:
            0.06438511 = score(doc=2187,freq=1.0), product of:
              0.21972065 = queryWeight, product of:
                2.2515235 = boost
                5.358293 = idf(docFreq=565, maxDocs=44218)
                0.018212432 = queryNorm
              0.29303166 = fieldWeight in 2187, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.358293 = idf(docFreq=565, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2187)
          0.08836795 = weight(abstract_txt:language in 2187) [ClassicSimilarity], result of:
            0.08836795 = score(doc=2187,freq=3.0), product of:
              0.22307605 = queryWeight, product of:
                2.9288146 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.018212432 = queryNorm
              0.39613372 = fieldWeight in 2187, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2187)
          1.0141563 = weight(title_txt:precis in 2187) [ClassicSimilarity], result of:
            1.0141563 = score(doc=2187,freq=1.0), product of:
              0.5516905 = queryWeight, product of:
                4.11963 = boost
                7.3530817 = idf(docFreq=76, maxDocs=44218)
                0.018212432 = queryNorm
              1.8382704 = fieldWeight in 2187, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3530817 = idf(docFreq=76, maxDocs=44218)
                0.25 = fieldNorm(doc=2187)
        0.28 = coord(7/25)
    
  4. Austin, D.: PRECIS (2009) 0.33
    0.33177447 = sum of:
      0.33177447 = product of:
        4.147181 = sum of:
          0.090555705 = weight(abstract_txt:index in 985) [ClassicSimilarity], result of:
            0.090555705 = score(doc=985,freq=2.0), product of:
              0.17258903 = queryWeight, product of:
                1.9954813 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.018212432 = queryNorm
              0.5246898 = fieldWeight in 985, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.078125 = fieldNorm(doc=985)
          4.0566254 = weight(title_txt:precis in 985) [ClassicSimilarity], result of:
            4.0566254 = score(doc=985,freq=1.0), product of:
              0.5516905 = queryWeight, product of:
                4.11963 = boost
                7.3530817 = idf(docFreq=76, maxDocs=44218)
                0.018212432 = queryNorm
              7.3530817 = fieldWeight in 985, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3530817 = idf(docFreq=76, maxDocs=44218)
                1.0 = fieldNorm(doc=985)
        0.08 = coord(2/25)
    
  5. Weintraub, D.K.: ¬An extended review of PRECIS (1979) 0.33
    0.326444 = sum of:
      0.326444 = product of:
        2.040275 = sum of:
          0.10085315 = weight(abstract_txt:subject in 1197) [ClassicSimilarity], result of:
            0.10085315 = score(doc=1197,freq=8.0), product of:
              0.11681779 = queryWeight, product of:
                1.6417066 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.018212432 = queryNorm
              0.8633373 = fieldWeight in 1197, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.078125 = fieldNorm(doc=1197)
          0.064032555 = weight(abstract_txt:index in 1197) [ClassicSimilarity], result of:
            0.064032555 = score(doc=1197,freq=1.0), product of:
              0.17258903 = queryWeight, product of:
                1.9954813 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.018212432 = queryNorm
              0.37101173 = fieldWeight in 1197, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.078125 = fieldNorm(doc=1197)
          0.100615755 = weight(abstract_txt:represent in 1197) [ClassicSimilarity], result of:
            0.100615755 = score(doc=1197,freq=1.0), product of:
              0.2332688 = queryWeight, product of:
                2.3199008 = boost
                5.52102 = idf(docFreq=480, maxDocs=44218)
                0.018212432 = queryNorm
              0.43132967 = fieldWeight in 1197, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.52102 = idf(docFreq=480, maxDocs=44218)
                0.078125 = fieldNorm(doc=1197)
          1.7747737 = weight(title_txt:precis in 1197) [ClassicSimilarity], result of:
            1.7747737 = score(doc=1197,freq=1.0), product of:
              0.5516905 = queryWeight, product of:
                4.11963 = boost
                7.3530817 = idf(docFreq=76, maxDocs=44218)
                0.018212432 = queryNorm
              3.2169733 = fieldWeight in 1197, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3530817 = idf(docFreq=76, maxDocs=44218)
                0.4375 = fieldNorm(doc=1197)
        0.16 = coord(4/25)