Document (#28814)

Author
Kuhr, P.S.
Title
Putting the world back together : mapping multiple vocabularies into a single thesaurus
Source
Subject retrieval in a networked environment: Proceedings of the IFLA Satellite Meeting held in Dublin, OH, 14-16 August 2001 and sponsored by the IFLA Classification and Indexing Section, the IFLA Information Technology Section and OCLC. Ed.: I.C. McIlwaine
Imprint
München : Saur
Year
2003
Pages
pp. 37-42
Series
UBCIM publications: new series; vol.25
Abstract
This paper describes an ongoing project in which the subject headings contained in twelve controlled vocabularies, covering multiple disciplines from the humanities to the sciences and including law and education among others, are being collapsed into a single vocabulary and reference structure. The design of the database, the algorithms created to programmatically link like-concepts, and daily maintenance are detailed. The problems and pitfalls of dealing with multiple vocabularies are noted, as well as the difficulties in relying purely on computer-generated algorithms. The application of this megathesaurus to bibliographic records and the methodology of retrieval are explained.
Footnote
A contribution on the topic of merging or combining different thesauri into a
Theme
Conception and application of the thesaurus principle

Similar documents (content)

  1. Kempf, A.O.; Ritze, D.; Eckert, K.; Zapilko, B.: New ways of mapping knowledge organization systems : using a semi-automatic matching procedure for building up vocabulary crosswalks (2014) 0.11
    0.110324025 = sum of:
      0.110324025 = product of:
        0.45968345 = sum of:
          0.040561963 = weight(abstract_txt:generated in 1371) [ClassicSimilarity], result of:
            0.040561963 = score(doc=1371,freq=1.0), product of:
              0.11754919 = queryWeight, product of:
                5.52102 = idf(docFreq=480, maxDocs=44218)
                0.02129121 = queryNorm
              0.34506375 = fieldWeight in 1371, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.52102 = idf(docFreq=480, maxDocs=44218)
                0.0625 = fieldNorm(doc=1371)
          0.044822495 = weight(abstract_txt:link in 1371) [ClassicSimilarity], result of:
            0.044822495 = score(doc=1371,freq=1.0), product of:
              0.12564282 = queryWeight, product of:
                1.0338535 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.02129121 = queryNorm
              0.35674536 = fieldWeight in 1371, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=1371)
          0.061294753 = weight(abstract_txt:maintenance in 1371) [ClassicSimilarity], result of:
            0.061294753 = score(doc=1371,freq=1.0), product of:
              0.15479462 = queryWeight, product of:
                1.1475407 = boost
                6.335595 = idf(docFreq=212, maxDocs=44218)
                0.02129121 = queryNorm
              0.3959747 = fieldWeight in 1371, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.335595 = idf(docFreq=212, maxDocs=44218)
                0.0625 = fieldNorm(doc=1371)
          0.065476954 = weight(abstract_txt:back in 1371) [ClassicSimilarity], result of:
            0.065476954 = score(doc=1371,freq=1.0), product of:
              0.16175807 = queryWeight, product of:
                1.1730679 = boost
                6.4765315 = idf(docFreq=184, maxDocs=44218)
                0.02129121 = queryNorm
              0.40478322 = fieldWeight in 1371, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4765315 = idf(docFreq=184, maxDocs=44218)
                0.0625 = fieldNorm(doc=1371)
          0.098034725 = weight(abstract_txt:multiple in 1371) [ClassicSimilarity], result of:
            0.098034725 = score(doc=1371,freq=1.0), product of:
              0.30532852 = queryWeight, product of:
                2.7914798 = boost
                5.137272 = idf(docFreq=705, maxDocs=44218)
                0.02129121 = queryNorm
              0.3210795 = fieldWeight in 1371, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.137272 = idf(docFreq=705, maxDocs=44218)
                0.0625 = fieldNorm(doc=1371)
          0.14949256 = weight(abstract_txt:vocabularies in 1371) [ClassicSimilarity], result of:
            0.14949256 = score(doc=1371,freq=1.0), product of:
              0.404508 = queryWeight, product of:
                3.2130268 = boost
                5.913062 = idf(docFreq=324, maxDocs=44218)
                0.02129121 = queryNorm
              0.36956638 = fieldWeight in 1371, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.913062 = idf(docFreq=324, maxDocs=44218)
                0.0625 = fieldNorm(doc=1371)
        0.24 = coord(6/25)
    
  2. Purpura, A.; Silvello, G.; Susto, G.A.: Learning to rank from relevance judgments distributions (2022) 0.09
    0.09376246 = sum of:
      0.09376246 = product of:
        0.4688123 = sum of:
          0.040561963 = weight(abstract_txt:generated in 645) [ClassicSimilarity], result of:
            0.040561963 = score(doc=645,freq=1.0), product of:
              0.11754919 = queryWeight, product of:
                5.52102 = idf(docFreq=480, maxDocs=44218)
                0.02129121 = queryNorm
              0.34506375 = fieldWeight in 645, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.52102 = idf(docFreq=480, maxDocs=44218)
                0.0625 = fieldNorm(doc=645)
          0.1433518 = weight(abstract_txt:relying in 645) [ClassicSimilarity], result of:
            0.1433518 = score(doc=645,freq=2.0), product of:
              0.21647069 = queryWeight, product of:
                1.3570309 = boost
                7.4921947 = idf(docFreq=66, maxDocs=44218)
                0.02129121 = queryNorm
              0.6622227 = fieldWeight in 645, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.4921947 = idf(docFreq=66, maxDocs=44218)
                0.0625 = fieldNorm(doc=645)
          0.09721883 = weight(abstract_txt:single in 645) [ClassicSimilarity], result of:
            0.09721883 = score(doc=645,freq=2.0), product of:
              0.21052673 = queryWeight, product of:
                1.8925998 = boost
                5.2245407 = idf(docFreq=646, maxDocs=44218)
                0.02129121 = queryNorm
              0.4617885 = fieldWeight in 645, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2245407 = idf(docFreq=646, maxDocs=44218)
                0.0625 = fieldNorm(doc=645)
          0.08964499 = weight(abstract_txt:algorithms in 645) [ClassicSimilarity], result of:
            0.08964499 = score(doc=645,freq=1.0), product of:
              0.25128564 = queryWeight, product of:
                2.067707 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.02129121 = queryNorm
              0.35674536 = fieldWeight in 645, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=645)
          0.098034725 = weight(abstract_txt:multiple in 645) [ClassicSimilarity], result of:
            0.098034725 = score(doc=645,freq=1.0), product of:
              0.30532852 = queryWeight, product of:
                2.7914798 = boost
                5.137272 = idf(docFreq=705, maxDocs=44218)
                0.02129121 = queryNorm
              0.3210795 = fieldWeight in 645, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.137272 = idf(docFreq=705, maxDocs=44218)
                0.0625 = fieldNorm(doc=645)
        0.2 = coord(5/25)
    
  3. ISO 25964-2: Thesauri and interoperability with other vocabularies : Part 2: Interoperability with other vocabularies (2013) 0.09
    0.08897853 = sum of:
      0.08897853 = product of:
        0.74148774 = sum of:
          0.122589506 = weight(abstract_txt:maintenance in 4832) [ClassicSimilarity], result of:
            0.122589506 = score(doc=4832,freq=1.0), product of:
              0.15479462 = queryWeight, product of:
                1.1475407 = boost
                6.335595 = idf(docFreq=212, maxDocs=44218)
                0.02129121 = queryNorm
              0.7919494 = fieldWeight in 4832, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.335595 = idf(docFreq=212, maxDocs=44218)
                0.125 = fieldNorm(doc=4832)
          0.19606945 = weight(abstract_txt:multiple in 4832) [ClassicSimilarity], result of:
            0.19606945 = score(doc=4832,freq=1.0), product of:
              0.30532852 = queryWeight, product of:
                2.7914798 = boost
                5.137272 = idf(docFreq=705, maxDocs=44218)
                0.02129121 = queryNorm
              0.642159 = fieldWeight in 4832, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.137272 = idf(docFreq=705, maxDocs=44218)
                0.125 = fieldNorm(doc=4832)
          0.4228288 = weight(abstract_txt:vocabularies in 4832) [ClassicSimilarity], result of:
            0.4228288 = score(doc=4832,freq=2.0), product of:
              0.404508 = queryWeight, product of:
                3.2130268 = boost
                5.913062 = idf(docFreq=324, maxDocs=44218)
                0.02129121 = queryNorm
              1.0452915 = fieldWeight in 4832, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.913062 = idf(docFreq=324, maxDocs=44218)
                0.125 = fieldNorm(doc=4832)
        0.12 = coord(3/25)
    
  4. Will, L.: Thesaurus management software (2009) 0.09
    0.087063566 = sum of:
      0.087063566 = product of:
        0.5441473 = sum of:
          0.06974024 = weight(abstract_txt:mapping in 3892) [ClassicSimilarity], result of:
            0.06974024 = score(doc=3892,freq=1.0), product of:
              0.1287464 = queryWeight, product of:
                1.0465446 = boost
                5.777993 = idf(docFreq=371, maxDocs=44218)
                0.02129121 = queryNorm
              0.5416869 = fieldWeight in 3892, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.777993 = idf(docFreq=371, maxDocs=44218)
                0.09375 = fieldNorm(doc=3892)
          0.10311614 = weight(abstract_txt:single in 3892) [ClassicSimilarity], result of:
            0.10311614 = score(doc=3892,freq=1.0), product of:
              0.21052673 = queryWeight, product of:
                1.8925998 = boost
                5.2245407 = idf(docFreq=646, maxDocs=44218)
                0.02129121 = queryNorm
              0.4898007 = fieldWeight in 3892, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2245407 = idf(docFreq=646, maxDocs=44218)
                0.09375 = fieldNorm(doc=3892)
          0.1470521 = weight(abstract_txt:multiple in 3892) [ClassicSimilarity], result of:
            0.1470521 = score(doc=3892,freq=1.0), product of:
              0.30532852 = queryWeight, product of:
                2.7914798 = boost
                5.137272 = idf(docFreq=705, maxDocs=44218)
                0.02129121 = queryNorm
              0.48161924 = fieldWeight in 3892, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.137272 = idf(docFreq=705, maxDocs=44218)
                0.09375 = fieldNorm(doc=3892)
          0.22423883 = weight(abstract_txt:vocabularies in 3892) [ClassicSimilarity], result of:
            0.22423883 = score(doc=3892,freq=1.0), product of:
              0.404508 = queryWeight, product of:
                3.2130268 = boost
                5.913062 = idf(docFreq=324, maxDocs=44218)
                0.02129121 = queryNorm
              0.55434954 = fieldWeight in 3892, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.913062 = idf(docFreq=324, maxDocs=44218)
                0.09375 = fieldNorm(doc=3892)
        0.16 = coord(4/25)
    
  5. Grossman, D.A.; Frieder, O.: Information retrieval : algorithms and heuristics (2004) 0.08
    0.07847597 = sum of:
      0.07847597 = product of:
        0.39237982 = sum of:
          0.045856077 = weight(abstract_txt:detailed in 1486) [ClassicSimilarity], result of:
            0.045856077 = score(doc=1486,freq=1.0), product of:
              0.12756698 = queryWeight, product of:
                1.04174 = boost
                5.7514668 = idf(docFreq=381, maxDocs=44218)
                0.02129121 = queryNorm
              0.35946667 = fieldWeight in 1486, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7514668 = idf(docFreq=381, maxDocs=44218)
                0.0625 = fieldNorm(doc=1486)
          0.024475267 = weight(abstract_txt:into in 1486) [ClassicSimilarity], result of:
            0.024475267 = score(doc=1486,freq=1.0), product of:
              0.105755255 = queryWeight, product of:
                1.3413934 = boost
                3.7029297 = idf(docFreq=2962, maxDocs=44218)
                0.02129121 = queryNorm
              0.23143311 = fieldWeight in 1486, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7029297 = idf(docFreq=2962, maxDocs=44218)
                0.0625 = fieldNorm(doc=1486)
          0.06874409 = weight(abstract_txt:single in 1486) [ClassicSimilarity], result of:
            0.06874409 = score(doc=1486,freq=1.0), product of:
              0.21052673 = queryWeight, product of:
                1.8925998 = boost
                5.2245407 = idf(docFreq=646, maxDocs=44218)
                0.02129121 = queryNorm
              0.3265338 = fieldWeight in 1486, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2245407 = idf(docFreq=646, maxDocs=44218)
                0.0625 = fieldNorm(doc=1486)
          0.15526967 = weight(abstract_txt:algorithms in 1486) [ClassicSimilarity], result of:
            0.15526967 = score(doc=1486,freq=3.0), product of:
              0.25128564 = queryWeight, product of:
                2.067707 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.02129121 = queryNorm
              0.6179011 = fieldWeight in 1486, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=1486)
          0.098034725 = weight(abstract_txt:multiple in 1486) [ClassicSimilarity], result of:
            0.098034725 = score(doc=1486,freq=1.0), product of:
              0.30532852 = queryWeight, product of:
                2.7914798 = boost
                5.137272 = idf(docFreq=705, maxDocs=44218)
                0.02129121 = queryNorm
              0.3210795 = fieldWeight in 1486, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.137272 = idf(docFreq=705, maxDocs=44218)
                0.0625 = fieldNorm(doc=1486)
        0.2 = coord(5/25)
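
The score breakdowns above follow a common pattern: each term score is the product of a queryWeight (boost × idf × queryNorm) and a fieldWeight (tf × idf × fieldNorm), where tf is the square root of the term frequency, and the summed term scores are scaled by the coord factor. The following is a minimal Python sketch of that arithmetic, not the search engine's actual code. The function names (`idf`, `term_score`, `document_score`) are hypothetical, and the idf formula 1 + ln(maxDocs / (docFreq + 1)) is an assumption that happens to reproduce the idf values shown in the listing.

```python
import math


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency; assumed form: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, idf_value: float, query_norm: float,
               field_norm: float, boost: float = 1.0) -> float:
    """Score of one matching term, as decomposed in the listing."""
    query_weight = boost * idf_value * query_norm          # queryWeight
    field_weight = math.sqrt(freq) * idf_value * field_norm  # fieldWeight, tf = sqrt(freq)
    return query_weight * field_weight


def document_score(term_scores: list[float], matched: int, total: int) -> float:
    """Sum of term scores scaled by the coord factor matched/total."""
    return sum(term_scores) * (matched / total)


if __name__ == "__main__":
    # Values copied from entry 1 (doc 1371), term "generated".
    print(idf(480, 44218))
    print(term_score(freq=1.0, idf_value=5.52102,
                     query_norm=0.02129121, field_norm=0.0625))
    # The six term scores of entry 1, combined with coord(6/25).
    entry_1_terms = [0.040561963, 0.044822495, 0.061294753,
                     0.065476954, 0.098034725, 0.14949256]
    print(document_score(entry_1_terms, matched=6, total=25))
```

Because the listing reports single-precision values, results computed this way should agree with it only up to small rounding differences.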