Document (#34498)

Author
Broughton, V.
Title
Language related problems in the construction of faceted terminologies and their automatic management
Source
Culture and identity in knowledge organization: Proceedings of the Tenth International ISKO Conference 5-8 August 2008, Montreal, Canada. Ed. by Clément Arsenault and Joseph T. Tennis
Imprint
Würzburg : Ergon Verlag
Year
2008
Pages
S.43-49
Series
Advances in knowledge organization; vol.11
Content
The paper describes current work on the generation of a thesaurus format from the schedules of the Bliss Bibliographic Classification 2nd edition (BC2). The practical problems that occur in moving from a concept based approach to a terminological approach cluster around issues of vocabulary control that are not fully addressed in a systematic structure. These difficulties can be exacerbated within domains in the humanities because large numbers of culture specific terms may need to be accommodated in any thesaurus. The ways in which these problems can be resolved within the context of a semi-automated approach to the thesaurus generation have consequences for the management of classification data in the source vocabulary. The way in which the vocabulary is marked up for the purpose of machine manipulation is described, and some of the implications for editorial policy are discussed and examples given. The value of the classification notation as a language independent representation and mapping tool should not be sacrificed in such an exercise.
Footnote
Vgl. unter: http://www.ergon-verlag.de/isko_ko/tocs/0497f79b0c0b3ed06/0497f79b0c0b5550a/index.php.
Theme
Theorie verbaler Dokumentationssprachen
Wissensrepräsentation
Universale Facettenklassifikationen
Object
BC2

Similar documents (author)

  1. Broughton, V.: Classification and subject organization and retrieval (2007) 5.01
    5.005005 = sum of:
      5.005005 = weight(author_txt:broughton in 6145) [ClassicSimilarity], result of:
        5.005005 = fieldWeight in 6145, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.008008 = idf(docFreq=39, maxDocs=44218)
          0.625 = fieldNorm(doc=6145)
    
  2. Broughton, V.: Meccano, molecules, and the organization of knowledge : the continuing contribution of S.R. Ranganathan (2007) 5.01
    5.005005 = sum of:
      5.005005 = weight(author_txt:broughton in 1807) [ClassicSimilarity], result of:
        5.005005 = fieldWeight in 1807, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.008008 = idf(docFreq=39, maxDocs=44218)
          0.625 = fieldNorm(doc=1807)
    
  3. Broughton, V.: Organizing a national humanities portal : a model for the classification and subject management of digital resources (2002) 5.01
    5.005005 = sum of:
      5.005005 = weight(author_txt:broughton in 4607) [ClassicSimilarity], result of:
        5.005005 = fieldWeight in 4607, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.008008 = idf(docFreq=39, maxDocs=44218)
          0.625 = fieldNorm(doc=4607)
    
  4. Broughton, V.: ¬A new classification for the literature for religion (2000) 5.01
    5.005005 = sum of:
      5.005005 = weight(author_txt:broughton in 5398) [ClassicSimilarity], result of:
        5.005005 = fieldWeight in 5398, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.008008 = idf(docFreq=39, maxDocs=44218)
          0.625 = fieldNorm(doc=5398)
    
  5. Broughton, V.: ¬The revision process in UDC : an examination of the systematic auxiliary of 'Point-of-View' using facet-analytical methods (1998) 5.01
    5.005005 = sum of:
      5.005005 = weight(author_txt:broughton in 6367) [ClassicSimilarity], result of:
        5.005005 = fieldWeight in 6367, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.008008 = idf(docFreq=39, maxDocs=44218)
          0.625 = fieldNorm(doc=6367)
    

Similar documents (content)

  1. Grau, B.: Finding answers to questions, in text collections or Web, in open domain or specialty domains (2012) 0.33
    0.3320805 = sum of:
      0.3320805 = product of:
        0.5977449 = sum of:
          0.02165569 = weight(abstract_txt:their in 107) [ClassicSimilarity], result of:
            0.02165569 = score(doc=107,freq=1.0), product of:
              0.10966644 = queryWeight, product of:
                3.1594994 = idf(docFreq=5101, maxDocs=44218)
                0.03471007 = queryNorm
              0.19746871 = fieldWeight in 107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1594994 = idf(docFreq=5101, maxDocs=44218)
                0.0625 = fieldNorm(doc=107)
          0.10044459 = weight(abstract_txt:language in 107) [ClassicSimilarity], result of:
            0.10044459 = score(doc=107,freq=4.0), product of:
              0.1921425 = queryWeight, product of:
                1.3236551 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.03471007 = queryNorm
              0.5227609 = fieldWeight in 107, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.0625 = fieldNorm(doc=107)
          0.052375153 = weight(abstract_txt:management in 107) [ClassicSimilarity], result of:
            0.052375153 = score(doc=107,freq=1.0), product of:
              0.19759499 = queryWeight, product of:
                1.3423046 = boost
                4.2410107 = idf(docFreq=1729, maxDocs=44218)
                0.03471007 = queryNorm
              0.26506317 = fieldWeight in 107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2410107 = idf(docFreq=1729, maxDocs=44218)
                0.0625 = fieldNorm(doc=107)
          0.07706844 = weight(abstract_txt:problems in 107) [ClassicSimilarity], result of:
            0.07706844 = score(doc=107,freq=2.0), product of:
              0.20289285 = queryWeight, product of:
                1.3601804 = boost
                4.297489 = idf(docFreq=1634, maxDocs=44218)
                0.03471007 = queryNorm
              0.37984797 = fieldWeight in 107, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.297489 = idf(docFreq=1634, maxDocs=44218)
                0.0625 = fieldNorm(doc=107)
          0.346201 = weight(abstract_txt:terminologies in 107) [ClassicSimilarity], result of:
            0.346201 = score(doc=107,freq=1.0), product of:
              0.6959498 = queryWeight, product of:
                2.5191388 = boost
                7.9592175 = idf(docFreq=41, maxDocs=44218)
                0.03471007 = queryNorm
              0.4974511 = fieldWeight in 107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9592175 = idf(docFreq=41, maxDocs=44218)
                0.0625 = fieldNorm(doc=107)
        0.5555556 = coord(5/9)
    
  2. Wunner, T.; Buitelaar, P.; O'Riain, S.: Semantic, terminological and linguistic interpretation of XBRL (2010) 0.33
    0.33002442 = sum of:
      0.33002442 = product of:
        0.74255496 = sum of:
          0.06277787 = weight(abstract_txt:language in 1122) [ClassicSimilarity], result of:
            0.06277787 = score(doc=1122,freq=1.0), product of:
              0.1921425 = queryWeight, product of:
                1.3236551 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.03471007 = queryNorm
              0.32672557 = fieldWeight in 1122, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.078125 = fieldNorm(doc=1122)
          0.06380099 = weight(abstract_txt:related in 1122) [ClassicSimilarity], result of:
            0.06380099 = score(doc=1122,freq=1.0), product of:
              0.19422448 = queryWeight, product of:
                1.3308071 = boost
                4.2046843 = idf(docFreq=1793, maxDocs=44218)
                0.03471007 = queryNorm
              0.32849097 = fieldWeight in 1122, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2046843 = idf(docFreq=1793, maxDocs=44218)
                0.078125 = fieldNorm(doc=1122)
          0.18322487 = weight(abstract_txt:faceted in 1122) [ClassicSimilarity], result of:
            0.18322487 = score(doc=1122,freq=1.0), product of:
              0.39241174 = queryWeight, product of:
                1.891621 = boost
                5.9765754 = idf(docFreq=304, maxDocs=44218)
                0.03471007 = queryNorm
              0.46691996 = fieldWeight in 1122, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9765754 = idf(docFreq=304, maxDocs=44218)
                0.078125 = fieldNorm(doc=1122)
          0.43275124 = weight(abstract_txt:terminologies in 1122) [ClassicSimilarity], result of:
            0.43275124 = score(doc=1122,freq=1.0), product of:
              0.6959498 = queryWeight, product of:
                2.5191388 = boost
                7.9592175 = idf(docFreq=41, maxDocs=44218)
                0.03471007 = queryNorm
              0.6218139 = fieldWeight in 1122, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9592175 = idf(docFreq=41, maxDocs=44218)
                0.078125 = fieldNorm(doc=1122)
        0.44444445 = coord(4/9)
    
  3. Muresan, S.; Klavans, J.L.: Inducing terminologies from text : a case study for the consumer health domain (2013) 0.32
    0.31913865 = sum of:
      0.31913865 = product of:
        0.95741594 = sum of:
          0.08698757 = weight(abstract_txt:language in 682) [ClassicSimilarity], result of:
            0.08698757 = score(doc=682,freq=3.0), product of:
              0.1921425 = queryWeight, product of:
                1.3236551 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.03471007 = queryNorm
              0.45272425 = fieldWeight in 682, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.0625 = fieldNorm(doc=682)
          0.09629943 = weight(abstract_txt:automatic in 682) [ClassicSimilarity], result of:
            0.09629943 = score(doc=682,freq=1.0), product of:
              0.296557 = queryWeight, product of:
                1.644437 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.03471007 = queryNorm
              0.32472485 = fieldWeight in 682, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0625 = fieldNorm(doc=682)
          0.774129 = weight(abstract_txt:terminologies in 682) [ClassicSimilarity], result of:
            0.774129 = score(doc=682,freq=5.0), product of:
              0.6959498 = queryWeight, product of:
                2.5191388 = boost
                7.9592175 = idf(docFreq=41, maxDocs=44218)
                0.03471007 = queryNorm
              1.1123345 = fieldWeight in 682, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.9592175 = idf(docFreq=41, maxDocs=44218)
                0.0625 = fieldNorm(doc=682)
        0.33333334 = coord(3/9)
    
  4. Broughton, V.: Concepts and terms in the faceted classification : the case of UDC (2010) 0.31
    0.31111914 = sum of:
      0.31111914 = product of:
        0.70001805 = sum of:
          0.050222296 = weight(abstract_txt:language in 4065) [ClassicSimilarity], result of:
            0.050222296 = score(doc=4065,freq=1.0), product of:
              0.1921425 = queryWeight, product of:
                1.3236551 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.03471007 = queryNorm
              0.26138046 = fieldWeight in 4065, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.0625 = fieldNorm(doc=4065)
          0.09629943 = weight(abstract_txt:automatic in 4065) [ClassicSimilarity], result of:
            0.09629943 = score(doc=4065,freq=1.0), product of:
              0.296557 = queryWeight, product of:
                1.644437 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.03471007 = queryNorm
              0.32472485 = fieldWeight in 4065, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0625 = fieldNorm(doc=4065)
          0.20729528 = weight(abstract_txt:faceted in 4065) [ClassicSimilarity], result of:
            0.20729528 = score(doc=4065,freq=2.0), product of:
              0.39241174 = queryWeight, product of:
                1.891621 = boost
                5.9765754 = idf(docFreq=304, maxDocs=44218)
                0.03471007 = queryNorm
              0.52825963 = fieldWeight in 4065, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9765754 = idf(docFreq=304, maxDocs=44218)
                0.0625 = fieldNorm(doc=4065)
          0.346201 = weight(abstract_txt:terminologies in 4065) [ClassicSimilarity], result of:
            0.346201 = score(doc=4065,freq=1.0), product of:
              0.6959498 = queryWeight, product of:
                2.5191388 = boost
                7.9592175 = idf(docFreq=41, maxDocs=44218)
                0.03471007 = queryNorm
              0.4974511 = fieldWeight in 4065, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9592175 = idf(docFreq=41, maxDocs=44218)
                0.0625 = fieldNorm(doc=4065)
        0.44444445 = coord(4/9)
    
  5. Su, H.: Automatic abstracting (1996) 0.30
    0.29877448 = sum of:
      0.29877448 = product of:
        0.89632344 = sum of:
          0.13623904 = weight(abstract_txt:problems in 150) [ClassicSimilarity], result of:
            0.13623904 = score(doc=150,freq=1.0), product of:
              0.20289285 = queryWeight, product of:
                1.3601804 = boost
                4.297489 = idf(docFreq=1634, maxDocs=44218)
                0.03471007 = queryNorm
              0.6714827 = fieldWeight in 150, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.297489 = idf(docFreq=1634, maxDocs=44218)
                0.15625 = fieldNorm(doc=150)
          0.48149717 = weight(abstract_txt:automatic in 150) [ClassicSimilarity], result of:
            0.48149717 = score(doc=150,freq=4.0), product of:
              0.296557 = queryWeight, product of:
                1.644437 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.03471007 = queryNorm
              1.6236243 = fieldWeight in 150, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.15625 = fieldNorm(doc=150)
          0.27858722 = weight(abstract_txt:construction in 150) [ClassicSimilarity], result of:
            0.27858722 = score(doc=150,freq=1.0), product of:
              0.3268686 = queryWeight, product of:
                1.7264329 = boost
                5.4546638 = idf(docFreq=513, maxDocs=44218)
                0.03471007 = queryNorm
              0.8522912 = fieldWeight in 150, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4546638 = idf(docFreq=513, maxDocs=44218)
                0.15625 = fieldNorm(doc=150)
        0.33333334 = coord(3/9)