Document (#32925)

Author
Losee, R.M.
Title
Decisions in thesaurus construction and use
Source
Information processing and management. 43(2007) no.4, S.958-968
Year
2007
Abstract
A thesaurus and an ontology provide a set of structured terms, phrases, and metadata, often in a hierarchical arrangement, that may be used to index, search, and mine documents. We describe the decisions that should be made when including a term, deciding whether a term should be subdivided into its subclasses, or determining which of more than one set of possible subclasses should be used. Based on retrospective measurements or estimates of future performance when using thesaurus terms in document ordering, decisions are made so as to maximize performance. These decisions may be used in the automatic construction of a thesaurus. The evaluation of an existing thesaurus is described, consistent with the decision criteria developed here. These kinds of user-focused decision-theoretic techniques may be applied to other hierarchical applications, such as faceted classification systems used in information architecture or the use of hierarchical terms in "breadcrumb navigation".
Theme
Konzeption und Anwendung des Prinzips Thesaurus

Similar documents (author)

  1. Losee, R.M.: ¬A Gray code based ordering for documents on shelves : classification for browsing and retrieval (1992) 5.18
    5.184806 = sum of:
      5.184806 = weight(author_txt:losee in 2335) [ClassicSimilarity], result of:
        5.184806 = fieldWeight in 2335, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.29569 = idf(docFreq=29, maxDocs=44218)
          0.625 = fieldNorm(doc=2335)
    
  2. Losee, R.M.: ¬The relative shelf location of circulated books : a study of classification, users, and browsing (1993) 5.18
    5.184806 = sum of:
      5.184806 = weight(author_txt:losee in 4485) [ClassicSimilarity], result of:
        5.184806 = fieldWeight in 4485, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.29569 = idf(docFreq=29, maxDocs=44218)
          0.625 = fieldNorm(doc=4485)
    
  3. Losee, R.M.: Seven fundamental questions for the science of library classification (1993) 5.18
    5.184806 = sum of:
      5.184806 = weight(author_txt:losee in 4508) [ClassicSimilarity], result of:
        5.184806 = fieldWeight in 4508, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.29569 = idf(docFreq=29, maxDocs=44218)
          0.625 = fieldNorm(doc=4508)
    
  4. Losee, R.M.: Term dependence : truncating the Bahadur Lazarsfeld expansion (1994) 5.18
    5.184806 = sum of:
      5.184806 = weight(author_txt:losee in 7390) [ClassicSimilarity], result of:
        5.184806 = fieldWeight in 7390, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.29569 = idf(docFreq=29, maxDocs=44218)
          0.625 = fieldNorm(doc=7390)
    
  5. Losee, R.M.: Upper bounds for retrieval performance and their user measuring performance and generating optimal queries : can it get any better than this? (1994) 5.18
    5.184806 = sum of:
      5.184806 = weight(author_txt:losee in 7418) [ClassicSimilarity], result of:
        5.184806 = fieldWeight in 7418, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.29569 = idf(docFreq=29, maxDocs=44218)
          0.625 = fieldNorm(doc=7418)
    

Similar documents (content)

  1. Park, Y.C.; Choi, K.-S.: Automatic thesaurus construction using Bayesian networks (1996) 0.18
    0.17544176 = sum of:
      0.17544176 = product of:
        0.73100734 = sum of:
          0.12102299 = weight(abstract_txt:deciding in 6581) [ClassicSimilarity], result of:
            0.12102299 = score(doc=6581,freq=1.0), product of:
              0.16666296 = queryWeight, product of:
                1.1939517 = boost
                7.7456436 = idf(docFreq=51, maxDocs=44218)
                0.01802166 = queryNorm
              0.7261541 = fieldWeight in 6581, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7456436 = idf(docFreq=51, maxDocs=44218)
                0.09375 = fieldNorm(doc=6581)
          0.08152503 = weight(abstract_txt:term in 6581) [ClassicSimilarity], result of:
            0.08152503 = score(doc=6581,freq=2.0), product of:
              0.12807208 = queryWeight, product of:
                1.4801627 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.01802166 = queryNorm
              0.6365558 = fieldWeight in 6581, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.09375 = fieldNorm(doc=6581)
          0.11954854 = weight(abstract_txt:construction in 6581) [ClassicSimilarity], result of:
            0.11954854 = score(doc=6581,freq=2.0), product of:
              0.16530654 = queryWeight, product of:
                1.6816176 = boost
                5.4546638 = idf(docFreq=513, maxDocs=44218)
                0.01802166 = queryNorm
              0.72319305 = fieldWeight in 6581, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4546638 = idf(docFreq=513, maxDocs=44218)
                0.09375 = fieldNorm(doc=6581)
          0.11552927 = weight(abstract_txt:terms in 6581) [ClassicSimilarity], result of:
            0.11552927 = score(doc=6581,freq=5.0), product of:
              0.13628213 = queryWeight, product of:
                1.8700246 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01802166 = queryNorm
              0.84772134 = fieldWeight in 6581, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.09375 = fieldNorm(doc=6581)
          0.039491296 = weight(abstract_txt:used in 6581) [ClassicSimilarity], result of:
            0.039491296 = score(doc=6581,freq=1.0), product of:
              0.12539534 = queryWeight, product of:
                2.0712757 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.01802166 = queryNorm
              0.3149343 = fieldWeight in 6581, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.09375 = fieldNorm(doc=6581)
          0.2538902 = weight(abstract_txt:thesaurus in 6581) [ClassicSimilarity], result of:
            0.2538902 = score(doc=6581,freq=2.0), product of:
              0.3706845 = queryWeight, product of:
                3.981571 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.01802166 = queryNorm
              0.6849226 = fieldWeight in 6581, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.09375 = fieldNorm(doc=6581)
        0.24 = coord(6/25)
    
  2. Becker, C.; Rauber, A.: Decision criteria in digital preservation : what to measure and how (2011) 0.16
    0.16044764 = sum of:
      0.16044764 = product of:
        0.5730273 = sum of:
          0.12061108 = weight(abstract_txt:measurements in 4456) [ClassicSimilarity], result of:
            0.12061108 = score(doc=4456,freq=2.0), product of:
              0.17294294 = queryWeight, product of:
                1.2162383 = boost
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.01802166 = queryNorm
              0.6974039 = fieldWeight in 4456, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.0625 = fieldNorm(doc=4456)
          0.024788732 = weight(abstract_txt:when in 4456) [ClassicSimilarity], result of:
            0.024788732 = score(doc=4456,freq=1.0), product of:
              0.095609464 = queryWeight, product of:
                1.2788885 = boost
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.01802166 = queryNorm
              0.2592707 = fieldWeight in 4456, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.0625 = fieldNorm(doc=4456)
          0.033142224 = weight(abstract_txt:made in 4456) [ClassicSimilarity], result of:
            0.033142224 = score(doc=4456,freq=1.0), product of:
              0.116034135 = queryWeight, product of:
                1.4088836 = boost
                4.5699964 = idf(docFreq=1244, maxDocs=44218)
                0.01802166 = queryNorm
              0.28562477 = fieldWeight in 4456, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5699964 = idf(docFreq=1244, maxDocs=44218)
                0.0625 = fieldNorm(doc=4456)
          0.05435002 = weight(abstract_txt:term in 4456) [ClassicSimilarity], result of:
            0.05435002 = score(doc=4456,freq=2.0), product of:
              0.12807208 = queryWeight, product of:
                1.4801627 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.01802166 = queryNorm
              0.42437053 = fieldWeight in 4456, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.0625 = fieldNorm(doc=4456)
          0.12247613 = weight(abstract_txt:decision in 4456) [ClassicSimilarity], result of:
            0.12247613 = score(doc=4456,freq=4.0), product of:
              0.17472121 = queryWeight, product of:
                1.728841 = boost
                5.6078424 = idf(docFreq=440, maxDocs=44218)
                0.01802166 = queryNorm
              0.7009803 = fieldWeight in 4456, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.6078424 = idf(docFreq=440, maxDocs=44218)
                0.0625 = fieldNorm(doc=4456)
          0.02632753 = weight(abstract_txt:used in 4456) [ClassicSimilarity], result of:
            0.02632753 = score(doc=4456,freq=1.0), product of:
              0.12539534 = queryWeight, product of:
                2.0712757 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.01802166 = queryNorm
              0.2099562 = fieldWeight in 4456, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.0625 = fieldNorm(doc=4456)
          0.19133158 = weight(abstract_txt:decisions in 4456) [ClassicSimilarity], result of:
            0.19133158 = score(doc=4456,freq=2.0), product of:
              0.37341273 = queryWeight, product of:
                3.5743065 = boost
                5.79699 = idf(docFreq=364, maxDocs=44218)
                0.01802166 = queryNorm
              0.5123863 = fieldWeight in 4456, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.79699 = idf(docFreq=364, maxDocs=44218)
                0.0625 = fieldNorm(doc=4456)
        0.28 = coord(7/25)
    
  3. Fidel, R.: Thesaurus requirements for an intermediary expert system (1992) 0.16
    0.15845606 = sum of:
      0.15845606 = product of:
        0.5659145 = sum of:
          0.024788732 = weight(abstract_txt:when in 2103) [ClassicSimilarity], result of:
            0.024788732 = score(doc=2103,freq=1.0), product of:
              0.095609464 = queryWeight, product of:
                1.2788885 = boost
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.01802166 = queryNorm
              0.2592707 = fieldWeight in 2103, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.0625 = fieldNorm(doc=2103)
          0.03843127 = weight(abstract_txt:term in 2103) [ClassicSimilarity], result of:
            0.03843127 = score(doc=2103,freq=1.0), product of:
              0.12807208 = queryWeight, product of:
                1.4801627 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.01802166 = queryNorm
              0.3000753 = fieldWeight in 2103, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.0625 = fieldNorm(doc=2103)
          0.056355722 = weight(abstract_txt:construction in 2103) [ClassicSimilarity], result of:
            0.056355722 = score(doc=2103,freq=1.0), product of:
              0.16530654 = queryWeight, product of:
                1.6816176 = boost
                5.4546638 = idf(docFreq=513, maxDocs=44218)
                0.01802166 = queryNorm
              0.34091648 = fieldWeight in 2103, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4546638 = idf(docFreq=513, maxDocs=44218)
                0.0625 = fieldNorm(doc=2103)
          0.034444172 = weight(abstract_txt:terms in 2103) [ClassicSimilarity], result of:
            0.034444172 = score(doc=2103,freq=1.0), product of:
              0.13628213 = queryWeight, product of:
                1.8700246 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01802166 = queryNorm
              0.25274166 = fieldWeight in 2103, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=2103)
          0.03723275 = weight(abstract_txt:used in 2103) [ClassicSimilarity], result of:
            0.03723275 = score(doc=2103,freq=2.0), product of:
              0.12539534 = queryWeight, product of:
                2.0712757 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.01802166 = queryNorm
              0.2969229 = fieldWeight in 2103, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.0625 = fieldNorm(doc=2103)
          0.13529186 = weight(abstract_txt:decisions in 2103) [ClassicSimilarity], result of:
            0.13529186 = score(doc=2103,freq=1.0), product of:
              0.37341273 = queryWeight, product of:
                3.5743065 = boost
                5.79699 = idf(docFreq=364, maxDocs=44218)
                0.01802166 = queryNorm
              0.36231187 = fieldWeight in 2103, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.79699 = idf(docFreq=364, maxDocs=44218)
                0.0625 = fieldNorm(doc=2103)
          0.23936996 = weight(abstract_txt:thesaurus in 2103) [ClassicSimilarity], result of:
            0.23936996 = score(doc=2103,freq=4.0), product of:
              0.3706845 = queryWeight, product of:
                3.981571 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.01802166 = queryNorm
              0.6457512 = fieldWeight in 2103, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.0625 = fieldNorm(doc=2103)
        0.28 = coord(7/25)
    
  4. Losee, R.M.: ¬The effect of assigning a metadata or indexing term on document ordering (2013) 0.14
    0.13727365 = sum of:
      0.13727365 = product of:
        0.57197356 = sum of:
          0.1235461 = weight(abstract_txt:ordering in 1100) [ClassicSimilarity], result of:
            0.1235461 = score(doc=1100,freq=3.0), product of:
              0.13230014 = queryWeight, product of:
                1.0637691 = boost
                6.901097 = idf(docFreq=120, maxDocs=44218)
                0.01802166 = queryNorm
              0.933832 = fieldWeight in 1100, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.901097 = idf(docFreq=120, maxDocs=44218)
                0.078125 = fieldNorm(doc=1100)
          0.0438207 = weight(abstract_txt:when in 1100) [ClassicSimilarity], result of:
            0.0438207 = score(doc=1100,freq=2.0), product of:
              0.095609464 = queryWeight, product of:
                1.2788885 = boost
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.01802166 = queryNorm
              0.45833018 = fieldWeight in 1100, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.078125 = fieldNorm(doc=1100)
          0.09635857 = weight(abstract_txt:performance in 1100) [ClassicSimilarity], result of:
            0.09635857 = score(doc=1100,freq=5.0), product of:
              0.119122796 = queryWeight, product of:
                1.4275117 = boost
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.01802166 = queryNorm
              0.80890113 = fieldWeight in 1100, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.078125 = fieldNorm(doc=1100)
          0.09607817 = weight(abstract_txt:term in 1100) [ClassicSimilarity], result of:
            0.09607817 = score(doc=1100,freq=4.0), product of:
              0.12807208 = queryWeight, product of:
                1.4801627 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.01802166 = queryNorm
              0.75018823 = fieldWeight in 1100, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.078125 = fieldNorm(doc=1100)
          0.043055218 = weight(abstract_txt:terms in 1100) [ClassicSimilarity], result of:
            0.043055218 = score(doc=1100,freq=1.0), product of:
              0.13628213 = queryWeight, product of:
                1.8700246 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01802166 = queryNorm
              0.3159271 = fieldWeight in 1100, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=1100)
          0.16911483 = weight(abstract_txt:decisions in 1100) [ClassicSimilarity], result of:
            0.16911483 = score(doc=1100,freq=1.0), product of:
              0.37341273 = queryWeight, product of:
                3.5743065 = boost
                5.79699 = idf(docFreq=364, maxDocs=44218)
                0.01802166 = queryNorm
              0.45288983 = fieldWeight in 1100, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.79699 = idf(docFreq=364, maxDocs=44218)
                0.078125 = fieldNorm(doc=1100)
        0.24 = coord(6/25)
    
  5. Srinivasan, P.: Thesaurus construction (1992) 0.13
    0.13387766 = sum of:
      0.13387766 = product of:
        0.6693883 = sum of:
          0.14088932 = weight(abstract_txt:construction in 3504) [ClassicSimilarity], result of:
            0.14088932 = score(doc=3504,freq=4.0), product of:
              0.16530654 = queryWeight, product of:
                1.6816176 = boost
                5.4546638 = idf(docFreq=513, maxDocs=44218)
                0.01802166 = queryNorm
              0.8522912 = fieldWeight in 3504, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.4546638 = idf(docFreq=513, maxDocs=44218)
                0.078125 = fieldNorm(doc=3504)
          0.043055218 = weight(abstract_txt:terms in 3504) [ClassicSimilarity], result of:
            0.043055218 = score(doc=3504,freq=1.0), product of:
              0.13628213 = queryWeight, product of:
                1.8700246 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01802166 = queryNorm
              0.3159271 = fieldWeight in 3504, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=3504)
          0.056713503 = weight(abstract_txt:should in 3504) [ClassicSimilarity], result of:
            0.056713503 = score(doc=3504,freq=1.0), product of:
              0.16376184 = queryWeight, product of:
                2.0499072 = boost
                4.432857 = idf(docFreq=1427, maxDocs=44218)
                0.01802166 = queryNorm
              0.34631696 = fieldWeight in 3504, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.432857 = idf(docFreq=1427, maxDocs=44218)
                0.078125 = fieldNorm(doc=3504)
          0.032909412 = weight(abstract_txt:used in 3504) [ClassicSimilarity], result of:
            0.032909412 = score(doc=3504,freq=1.0), product of:
              0.12539534 = queryWeight, product of:
                2.0712757 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.01802166 = queryNorm
              0.26244524 = fieldWeight in 3504, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.078125 = fieldNorm(doc=3504)
          0.39582083 = weight(abstract_txt:thesaurus in 3504) [ClassicSimilarity], result of:
            0.39582083 = score(doc=3504,freq=7.0), product of:
              0.3706845 = queryWeight, product of:
                3.981571 = boost
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.01802166 = queryNorm
              1.0678105 = fieldWeight in 3504, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.1660094 = idf(docFreq=685, maxDocs=44218)
                0.078125 = fieldNorm(doc=3504)
        0.2 = coord(5/25)