Document (#40854)

Author
Piros, A.
Title
¬The thought behind the symbol : about the automatic interpretation and representation of UDC numbers
Source
Knowledge organization. 44(2017) no.6, S.416-424
Year
2017
Abstract
Analytico-synthetic and faceted classifications, such as Universal Decimal Classification (UDC) provide facilities to express pre-coordinated subject statements using syntactic relations. In this case, the relevance, in the process of UDC-based information retrieval, can be determined by extracting the meaning of the classmarks as precisely as is possible. The central question here is how the identification mentioned above can be supported by automatic means and an analysis of the structure of complex classmarks appears to be an obvious requirement. Many bibliographic sources contain complex UDC classmarks which are stored as simple text strings and on which it is very difficult to perform any meaningful information discovery. The paper presents results from a phase of ongoing research focused on developing a new platform-independent, machine-processable data format capable of representing the whole syntactic structure of the composite UDC numbers to support their further automatic processing. An algorithm that can produce the representation of the numbers in such a format directly from their designations has also been developed and implemented. The research also includes implementing conversion methods to provide outputs that can be employed by other software directly and, as a service, make them available for other software. The paper provides an overview of the solutions developed and implemented since 2015 and outlines future research plans.
Content
Beitrag in einem Special Issue: Selected Papers from the International UDC Seminar 2017, Faceted Classification Today: Theory, Technology and End Users, 14-15 September, London UK.
Theme
International bedeutende Universalklassifikationen
Object
UDC

Similar documents (content)

  1. Piros, A.: Az ETO-jelzetek automatikus interpretálásának és elemzésének kérdései (2018) 0.42
    0.41566736 = sum of:
      0.41566736 = product of:
        1.0391684 = sum of:
          0.06456546 = weight(abstract_txt:requirement in 855) [ClassicSimilarity], result of:
            0.06456546 = score(doc=855,freq=1.0), product of:
              0.13815634 = queryWeight, product of:
                1.0290816 = boost
                7.4773793 = idf(docFreq=67, maxDocs=44218)
                0.01795443 = queryNorm
              0.4673362 = fieldWeight in 855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4773793 = idf(docFreq=67, maxDocs=44218)
                0.0625 = fieldNorm(doc=855)
          0.06742063 = weight(abstract_txt:coordinated in 855) [ClassicSimilarity], result of:
            0.06742063 = score(doc=855,freq=1.0), product of:
              0.14219987 = queryWeight, product of:
                1.0440325 = boost
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.01795443 = queryNorm
              0.47412583 = fieldWeight in 855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.0625 = fieldNorm(doc=855)
          0.0255306 = weight(abstract_txt:software in 855) [ClassicSimilarity], result of:
            0.0255306 = score(doc=855,freq=1.0), product of:
              0.093775205 = queryWeight, product of:
                1.1990117 = boost
                4.3560514 = idf(docFreq=1541, maxDocs=44218)
                0.01795443 = queryNorm
              0.27225322 = fieldWeight in 855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3560514 = idf(docFreq=1541, maxDocs=44218)
                0.0625 = fieldNorm(doc=855)
          0.025564864 = weight(abstract_txt:structure in 855) [ClassicSimilarity], result of:
            0.025564864 = score(doc=855,freq=1.0), product of:
              0.093859084 = queryWeight, product of:
                1.1995478 = boost
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.01795443 = queryNorm
              0.27237496 = fieldWeight in 855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.0625 = fieldNorm(doc=855)
          0.020878773 = weight(abstract_txt:research in 855) [ClassicSimilarity], result of:
            0.020878773 = score(doc=855,freq=2.0), product of:
              0.07450826 = queryWeight, product of:
                1.3089626 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.01795443 = queryNorm
              0.28022093 = fieldWeight in 855, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.0625 = fieldNorm(doc=855)
          0.057749793 = weight(abstract_txt:format in 855) [ClassicSimilarity], result of:
            0.057749793 = score(doc=855,freq=2.0), product of:
              0.12825401 = queryWeight, product of:
                1.4022158 = boost
                5.0942993 = idf(docFreq=736, maxDocs=44218)
                0.01795443 = queryNorm
              0.4502767 = fieldWeight in 855, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0942993 = idf(docFreq=736, maxDocs=44218)
                0.0625 = fieldNorm(doc=855)
          0.04086792 = weight(abstract_txt:complex in 855) [ClassicSimilarity], result of:
            0.04086792 = score(doc=855,freq=1.0), product of:
              0.12832236 = queryWeight, product of:
                1.4025894 = boost
                5.095657 = idf(docFreq=735, maxDocs=44218)
                0.01795443 = queryNorm
              0.31847855 = fieldWeight in 855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.095657 = idf(docFreq=735, maxDocs=44218)
                0.0625 = fieldNorm(doc=855)
          0.12142797 = weight(abstract_txt:syntactic in 855) [ClassicSimilarity], result of:
            0.12142797 = score(doc=855,freq=2.0), product of:
              0.2104989 = queryWeight, product of:
                1.7964052 = boost
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.01795443 = queryNorm
              0.576858 = fieldWeight in 855, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.0625 = fieldNorm(doc=855)
          0.23104236 = weight(abstract_txt:numbers in 855) [ClassicSimilarity], result of:
            0.23104236 = score(doc=855,freq=6.0), product of:
              0.25653997 = queryWeight, product of:
                2.4288602 = boost
                5.8827567 = idf(docFreq=334, maxDocs=44218)
                0.01795443 = queryNorm
              0.90060955 = fieldWeight in 855, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.8827567 = idf(docFreq=334, maxDocs=44218)
                0.0625 = fieldNorm(doc=855)
          0.38411993 = weight(abstract_txt:classmarks in 855) [ClassicSimilarity], result of:
            0.38411993 = score(doc=855,freq=1.0), product of:
              0.6542177 = queryWeight, product of:
                3.8787 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.01795443 = queryNorm
              0.5871439 = fieldWeight in 855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0625 = fieldNorm(doc=855)
        0.4 = coord(10/25)
    
  2. Gnoli, C.; Pullman, T.; Cousson, P.; Merli, G.; Szostak, R.: Representing the structural elements of a freely faceted classification (2011) 0.12
    0.121330276 = sum of:
      0.121330276 = product of:
        0.60665137 = sum of:
          0.025046885 = weight(abstract_txt:provide in 4825) [ClassicSimilarity], result of:
            0.025046885 = score(doc=4825,freq=1.0), product of:
              0.07978902 = queryWeight, product of:
                1.1059893 = boost
                4.0180984 = idf(docFreq=2161, maxDocs=44218)
                0.01795443 = queryNorm
              0.31391394 = fieldWeight in 4825, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0180984 = idf(docFreq=2161, maxDocs=44218)
                0.078125 = fieldNorm(doc=4825)
          0.03195608 = weight(abstract_txt:structure in 4825) [ClassicSimilarity], result of:
            0.03195608 = score(doc=4825,freq=1.0), product of:
              0.093859084 = queryWeight, product of:
                1.1995478 = boost
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.01795443 = queryNorm
              0.3404687 = fieldWeight in 4825, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.078125 = fieldNorm(doc=4825)
          0.018454403 = weight(abstract_txt:research in 4825) [ClassicSimilarity], result of:
            0.018454403 = score(doc=4825,freq=1.0), product of:
              0.07450826 = queryWeight, product of:
                1.3089626 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.01795443 = queryNorm
              0.24768265 = fieldWeight in 4825, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.078125 = fieldNorm(doc=4825)
          0.051044088 = weight(abstract_txt:format in 4825) [ClassicSimilarity], result of:
            0.051044088 = score(doc=4825,freq=1.0), product of:
              0.12825401 = queryWeight, product of:
                1.4022158 = boost
                5.0942993 = idf(docFreq=736, maxDocs=44218)
                0.01795443 = queryNorm
              0.39799213 = fieldWeight in 4825, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0942993 = idf(docFreq=736, maxDocs=44218)
                0.078125 = fieldNorm(doc=4825)
          0.48014992 = weight(abstract_txt:classmarks in 4825) [ClassicSimilarity], result of:
            0.48014992 = score(doc=4825,freq=1.0), product of:
              0.6542177 = queryWeight, product of:
                3.8787 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.01795443 = queryNorm
              0.7339299 = fieldWeight in 4825, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.078125 = fieldNorm(doc=4825)
        0.2 = coord(5/25)
    
  3. Piros, A.: Automatic interpretation of complex UDC numbers : towards support for library systems (2015) 0.12
    0.116545945 = sum of:
      0.116545945 = product of:
        0.4162355 = sum of:
          0.05183927 = weight(abstract_txt:synthetic in 2301) [ClassicSimilarity], result of:
            0.05183927 = score(doc=2301,freq=1.0), product of:
              0.13045815 = queryWeight, product of:
                7.2660704 = idf(docFreq=83, maxDocs=44218)
                0.01795443 = queryNorm
              0.39736322 = fieldWeight in 2301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2660704 = idf(docFreq=83, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2301)
          0.072925545 = weight(abstract_txt:analytico in 2301) [ClassicSimilarity], result of:
            0.072925545 = score(doc=2301,freq=1.0), product of:
              0.16378914 = queryWeight, product of:
                1.1204873 = boost
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.01795443 = queryNorm
              0.44524044 = fieldWeight in 2301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2301)
          0.031592507 = weight(abstract_txt:software in 2301) [ClassicSimilarity], result of:
            0.031592507 = score(doc=2301,freq=2.0), product of:
              0.093775205 = queryWeight, product of:
                1.1990117 = boost
                4.3560514 = idf(docFreq=1541, maxDocs=44218)
                0.01795443 = queryNorm
              0.33689618 = fieldWeight in 2301, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3560514 = idf(docFreq=1541, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2301)
          0.03573086 = weight(abstract_txt:format in 2301) [ClassicSimilarity], result of:
            0.03573086 = score(doc=2301,freq=1.0), product of:
              0.12825401 = queryWeight, product of:
                1.4022158 = boost
                5.0942993 = idf(docFreq=736, maxDocs=44218)
                0.01795443 = queryNorm
              0.2785945 = fieldWeight in 2301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0942993 = idf(docFreq=736, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2301)
          0.05057147 = weight(abstract_txt:complex in 2301) [ClassicSimilarity], result of:
            0.05057147 = score(doc=2301,freq=2.0), product of:
              0.12832236 = queryWeight, product of:
                1.4025894 = boost
                5.095657 = idf(docFreq=735, maxDocs=44218)
                0.01795443 = queryNorm
              0.3940971 = fieldWeight in 2301, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.095657 = idf(docFreq=735, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2301)
          0.056857515 = weight(abstract_txt:automatic in 2301) [ClassicSimilarity], result of:
            0.056857515 = score(doc=2301,freq=1.0), product of:
              0.20010793 = queryWeight, product of:
                2.1451476 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.01795443 = queryNorm
              0.28413424 = fieldWeight in 2301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2301)
          0.116718315 = weight(abstract_txt:numbers in 2301) [ClassicSimilarity], result of:
            0.116718315 = score(doc=2301,freq=2.0), product of:
              0.25653997 = queryWeight, product of:
                2.4288602 = boost
                5.8827567 = idf(docFreq=334, maxDocs=44218)
                0.01795443 = queryNorm
              0.45497125 = fieldWeight in 2301, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8827567 = idf(docFreq=334, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2301)
        0.28 = coord(7/25)
    
  4. Zeng, M.L.; Fan, W.; Lin, X.: SKOS for an integrated vocabulary structure (2008) 0.10
    0.100585826 = sum of:
      0.100585826 = product of:
        0.35923508 = sum of:
          0.05183927 = weight(abstract_txt:synthetic in 2654) [ClassicSimilarity], result of:
            0.05183927 = score(doc=2654,freq=1.0), product of:
              0.13045815 = queryWeight, product of:
                7.2660704 = idf(docFreq=83, maxDocs=44218)
                0.01795443 = queryNorm
              0.39736322 = fieldWeight in 2654, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2660704 = idf(docFreq=83, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2654)
          0.08342878 = weight(abstract_txt:coordinated in 2654) [ClassicSimilarity], result of:
            0.08342878 = score(doc=2654,freq=2.0), product of:
              0.14219987 = queryWeight, product of:
                1.0440325 = boost
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.01795443 = queryNorm
              0.5867008 = fieldWeight in 2654, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2654)
          0.02479515 = weight(abstract_txt:provide in 2654) [ClassicSimilarity], result of:
            0.02479515 = score(doc=2654,freq=2.0), product of:
              0.07978902 = queryWeight, product of:
                1.1059893 = boost
                4.0180984 = idf(docFreq=2161, maxDocs=44218)
                0.01795443 = queryNorm
              0.31075892 = fieldWeight in 2654, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0180984 = idf(docFreq=2161, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2654)
          0.031634904 = weight(abstract_txt:structure in 2654) [ClassicSimilarity], result of:
            0.031634904 = score(doc=2654,freq=2.0), product of:
              0.093859084 = queryWeight, product of:
                1.1995478 = boost
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.01795443 = queryNorm
              0.3370468 = fieldWeight in 2654, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2654)
          0.09604668 = weight(abstract_txt:processable in 2654) [ClassicSimilarity], result of:
            0.09604668 = score(doc=2654,freq=1.0), product of:
              0.19679777 = queryWeight, product of:
                1.2282152 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.01795443 = queryNorm
              0.48804757 = fieldWeight in 2654, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2654)
          0.03573086 = weight(abstract_txt:format in 2654) [ClassicSimilarity], result of:
            0.03573086 = score(doc=2654,freq=1.0), product of:
              0.12825401 = queryWeight, product of:
                1.4022158 = boost
                5.0942993 = idf(docFreq=736, maxDocs=44218)
                0.01795443 = queryNorm
              0.2785945 = fieldWeight in 2654, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0942993 = idf(docFreq=736, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2654)
          0.03575943 = weight(abstract_txt:complex in 2654) [ClassicSimilarity], result of:
            0.03575943 = score(doc=2654,freq=1.0), product of:
              0.12832236 = queryWeight, product of:
                1.4025894 = boost
                5.095657 = idf(docFreq=735, maxDocs=44218)
                0.01795443 = queryNorm
              0.27866873 = fieldWeight in 2654, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.095657 = idf(docFreq=735, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2654)
        0.28 = coord(7/25)
    
  5. Slavic, A.; Davies, S.: Facet analysis in UDC : questions of structure, functionality and data formality (2017) 0.10
    0.09950792 = sum of:
      0.09950792 = product of:
        0.6219245 = sum of:
          0.08378491 = weight(abstract_txt:synthetic in 3848) [ClassicSimilarity], result of:
            0.08378491 = score(doc=3848,freq=2.0), product of:
              0.13045815 = queryWeight, product of:
                7.2660704 = idf(docFreq=83, maxDocs=44218)
                0.01795443 = queryNorm
              0.64223593 = fieldWeight in 3848, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.2660704 = idf(docFreq=83, maxDocs=44218)
                0.0625 = fieldNorm(doc=3848)
          0.11786549 = weight(abstract_txt:analytico in 3848) [ClassicSimilarity], result of:
            0.11786549 = score(doc=3848,freq=2.0), product of:
              0.16378914 = queryWeight, product of:
                1.1204873 = boost
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.01795443 = queryNorm
              0.71961725 = fieldWeight in 3848, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.0625 = fieldNorm(doc=3848)
          0.036154177 = weight(abstract_txt:structure in 3848) [ClassicSimilarity], result of:
            0.036154177 = score(doc=3848,freq=2.0), product of:
              0.093859084 = queryWeight, product of:
                1.1995478 = boost
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.01795443 = queryNorm
              0.38519636 = fieldWeight in 3848, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3579993 = idf(docFreq=1538, maxDocs=44218)
                0.0625 = fieldNorm(doc=3848)
          0.38411993 = weight(abstract_txt:classmarks in 3848) [ClassicSimilarity], result of:
            0.38411993 = score(doc=3848,freq=1.0), product of:
              0.6542177 = queryWeight, product of:
                3.8787 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.01795443 = queryNorm
              0.5871439 = fieldWeight in 3848, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0625 = fieldNorm(doc=3848)
        0.16 = coord(4/25)