Document (#37468)

Alexiev, V.
Implementing CIDOC CRM search based on fundamental relations and OWLIM rules
Proceedings of the 2nd International Workshop on Semantic Digital Archives held in conjunction with the 16th Int. Conference on Theory and Practice of Digital Libraries (TPDL) on September 27, 2012 in Paphos, Cyprus. Eds.: A. Mitschik et al.
The CIDOC CRM provides an ontology for describing entities, properties and relationships appearing in cultural heritage (CH) documentation, history and archeology. CRM promotes shared understanding by providing an extensible semantic framework that any CH information can be mapped to. CRM data is usually represented in semantic web format (RDF) and comprises complex graphs of nodes and properties. An important question is how a user can search through such complex graphs, since the number of possible combinations is staggering. One approach "compresses" the semantic network by mapping many CRM entity classes to a few "Fundamental Concepts" (FC), and mapping whole networks of CRM properties to fewer "Fundamental Relations" (FR). These FCs and FRs serve as a "search index" over the CRM semantic web and allow the user to use a simpler query vocabulary. We describe an implementation of CRM FR Search based on OWLIM Rules, done as part of the ResearchSpace (RS) project. We describe the technical details, problems and difficulties encountered, benefits and disadvantages of using OWLIM rules, and preliminary performance results. We provide implementation experience that can be valuable for further implementation, definition and maintenance of CRM FRs.
See also:

Similar documents (content)

  1. Peponakis, M.; Mastora, A.; Kapidakis, S.; Doerr, M.: Expressiveness and machine processability of Knowledge Organization Systems (KOS) : an analysis of concepts and relations (2020) 0.17
    0.16517453 = sum of:
      0.16517453 = product of:
        0.589909 = sum of:
          0.053107694 = weight(abstract_txt:nodes in 5787) [ClassicSimilarity], result of:
            0.053107694 = score(doc=5787,freq=1.0), product of:
              0.13825513 = queryWeight, product of:
                1.0781192 = boost
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.018256873 = queryNorm
              0.38412818 = fieldWeight in 5787, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5787)
          0.05850165 = weight(abstract_txt:comprises in 5787) [ClassicSimilarity], result of:
            0.05850165 = score(doc=5787,freq=1.0), product of:
              0.1474648 = queryWeight, product of:
                1.113449 = boost
                7.2542357 = idf(docFreq=84, maxDocs=44218)
                0.018256873 = queryNorm
              0.39671603 = fieldWeight in 5787, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2542357 = idf(docFreq=84, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5787)
          0.040553063 = weight(abstract_txt:complex in 5787) [ClassicSimilarity], result of:
            0.040553063 = score(doc=5787,freq=1.0), product of:
              0.14552426 = queryWeight, product of:
                1.5642596 = boost
                5.095657 = idf(docFreq=735, maxDocs=44218)
                0.018256873 = queryNorm
              0.27866873 = fieldWeight in 5787, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.095657 = idf(docFreq=735, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5787)
          0.10458264 = weight(abstract_txt:relations in 5787) [ClassicSimilarity], result of:
            0.10458264 = score(doc=5787,freq=4.0), product of:
              0.17240083 = queryWeight, product of:
                1.7025928 = boost
                5.5462847 = idf(docFreq=468, maxDocs=44218)
                0.018256873 = queryNorm
              0.6066249 = fieldWeight in 5787, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.5462847 = idf(docFreq=468, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5787)
          0.066032514 = weight(abstract_txt:rules in 5787) [ClassicSimilarity], result of:
            0.066032514 = score(doc=5787,freq=1.0), product of:
              0.23056248 = queryWeight, product of:
                2.4114656 = boost
                5.236983 = idf(docFreq=638, maxDocs=44218)
                0.018256873 = queryNorm
              0.2863975 = fieldWeight in 5787, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.236983 = idf(docFreq=638, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5787)
          0.18948002 = weight(abstract_txt:cidoc in 5787) [ClassicSimilarity], result of:
            0.18948002 = score(doc=5787,freq=1.0), product of:
              0.40671974 = queryWeight, product of:
                2.6151028 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.018256873 = queryNorm
              0.4658737 = fieldWeight in 5787, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5787)
          0.07765146 = weight(abstract_txt:semantic in 5787) [ClassicSimilarity], result of:
            0.07765146 = score(doc=5787,freq=2.0), product of:
              0.22439821 = queryWeight, product of:
                2.7470453 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.018256873 = queryNorm
              0.34604314 = fieldWeight in 5787, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5787)
        0.28 = coord(7/25)
  2. Styltsvig, H.B.: Ontology-based information retrieval (2006) 0.13
    0.13407882 = sum of:
      0.13407882 = product of:
        0.47885293 = sum of:
          0.09104176 = weight(abstract_txt:nodes in 1154) [ClassicSimilarity], result of:
            0.09104176 = score(doc=1154,freq=4.0), product of:
              0.13825513 = queryWeight, product of:
                1.0781192 = boost
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.018256873 = queryNorm
              0.65850544 = fieldWeight in 1154, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.046875 = fieldNorm(doc=1154)
          0.050144266 = weight(abstract_txt:comprises in 1154) [ClassicSimilarity], result of:
            0.050144266 = score(doc=1154,freq=1.0), product of:
              0.1474648 = queryWeight, product of:
                1.113449 = boost
                7.2542357 = idf(docFreq=84, maxDocs=44218)
                0.018256873 = queryNorm
              0.3400423 = fieldWeight in 1154, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2542357 = idf(docFreq=84, maxDocs=44218)
                0.046875 = fieldNorm(doc=1154)
          0.034759767 = weight(abstract_txt:complex in 1154) [ClassicSimilarity], result of:
            0.034759767 = score(doc=1154,freq=1.0), product of:
              0.14552426 = queryWeight, product of:
                1.5642596 = boost
                5.095657 = idf(docFreq=735, maxDocs=44218)
                0.018256873 = queryNorm
              0.23885891 = fieldWeight in 1154, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.095657 = idf(docFreq=735, maxDocs=44218)
                0.046875 = fieldNorm(doc=1154)
          0.07763247 = weight(abstract_txt:relations in 1154) [ClassicSimilarity], result of:
            0.07763247 = score(doc=1154,freq=3.0), product of:
              0.17240083 = queryWeight, product of:
                1.7025928 = boost
                5.5462847 = idf(docFreq=468, maxDocs=44218)
                0.018256873 = queryNorm
              0.45030218 = fieldWeight in 1154, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.5462847 = idf(docFreq=468, maxDocs=44218)
                0.046875 = fieldNorm(doc=1154)
          0.0506766 = weight(abstract_txt:mapping in 1154) [ClassicSimilarity], result of:
            0.0506766 = score(doc=1154,freq=1.0), product of:
              0.18710661 = queryWeight, product of:
                1.7737225 = boost
                5.777993 = idf(docFreq=371, maxDocs=44218)
                0.018256873 = queryNorm
              0.27084345 = fieldWeight in 1154, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.777993 = idf(docFreq=371, maxDocs=44218)
                0.046875 = fieldNorm(doc=1154)
          0.08047033 = weight(abstract_txt:properties in 1154) [ClassicSimilarity], result of:
            0.08047033 = score(doc=1154,freq=1.0), product of:
              0.2915223 = queryWeight, product of:
                2.7115815 = boost
                5.888745 = idf(docFreq=332, maxDocs=44218)
                0.018256873 = queryNorm
              0.27603492 = fieldWeight in 1154, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.888745 = idf(docFreq=332, maxDocs=44218)
                0.046875 = fieldNorm(doc=1154)
          0.09412778 = weight(abstract_txt:semantic in 1154) [ClassicSimilarity], result of:
            0.09412778 = score(doc=1154,freq=4.0), product of:
              0.22439821 = queryWeight, product of:
                2.7470453 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.018256873 = queryNorm
              0.41946763 = fieldWeight in 1154, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.046875 = fieldNorm(doc=1154)
        0.28 = coord(7/25)
  3. Vlachidis, A.; Binding, C.; Tudhope, D.; May, K.: Excavating grey literature : a case study on the rich indexing of archaeological documents via natural language-processing techniques and knowledge-based resources (2010) 0.12
    0.1243028 = sum of:
      0.1243028 = product of:
        0.62151396 = sum of:
          0.06849582 = weight(abstract_txt:heritage in 3948) [ClassicSimilarity], result of:
            0.06849582 = score(doc=3948,freq=2.0), product of:
              0.11894542 = queryWeight, product of:
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.018256873 = queryNorm
              0.57585925 = fieldWeight in 3948, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.0625 = fieldNorm(doc=3948)
          0.07546573 = weight(abstract_txt:rules in 3948) [ClassicSimilarity], result of:
            0.07546573 = score(doc=3948,freq=1.0), product of:
              0.23056248 = queryWeight, product of:
                2.4114656 = boost
                5.236983 = idf(docFreq=638, maxDocs=44218)
                0.018256873 = queryNorm
              0.32731143 = fieldWeight in 3948, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.236983 = idf(docFreq=638, maxDocs=44218)
                0.0625 = fieldNorm(doc=3948)
          0.2165486 = weight(abstract_txt:cidoc in 3948) [ClassicSimilarity], result of:
            0.2165486 = score(doc=3948,freq=1.0), product of:
              0.40671974 = queryWeight, product of:
                2.6151028 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.018256873 = queryNorm
              0.5324271 = fieldWeight in 3948, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.0625 = fieldNorm(doc=3948)
          0.10729378 = weight(abstract_txt:properties in 3948) [ClassicSimilarity], result of:
            0.10729378 = score(doc=3948,freq=1.0), product of:
              0.2915223 = queryWeight, product of:
                2.7115815 = boost
                5.888745 = idf(docFreq=332, maxDocs=44218)
                0.018256873 = queryNorm
              0.36804655 = fieldWeight in 3948, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.888745 = idf(docFreq=332, maxDocs=44218)
                0.0625 = fieldNorm(doc=3948)
          0.15371004 = weight(abstract_txt:semantic in 3948) [ClassicSimilarity], result of:
            0.15371004 = score(doc=3948,freq=6.0), product of:
              0.22439821 = queryWeight, product of:
                2.7470453 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.018256873 = queryNorm
              0.6849878 = fieldWeight in 3948, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0625 = fieldNorm(doc=3948)
        0.2 = coord(5/25)
  4. Mao, M.: Ontology mapping : towards semantic interoperability in distributed and heterogeneous environments (2008) 0.12
    0.115213886 = sum of:
      0.115213886 = product of:
        0.5760694 = sum of:
          0.1960878 = weight(abstract_txt:mapping in 4659) [ClassicSimilarity], result of:
            0.1960878 = score(doc=4659,freq=11.0), product of:
              0.18710661 = queryWeight, product of:
                1.7737225 = boost
                5.777993 = idf(docFreq=371, maxDocs=44218)
                0.018256873 = queryNorm
              1.0480005 = fieldWeight in 4659, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                5.777993 = idf(docFreq=371, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4659)
          0.11025128 = weight(abstract_txt:graphs in 4659) [ClassicSimilarity], result of:
            0.11025128 = score(doc=4659,freq=1.0), product of:
              0.28347105 = queryWeight, product of:
                2.18321 = boost
                7.11192 = idf(docFreq=97, maxDocs=44218)
                0.018256873 = queryNorm
              0.38893312 = fieldWeight in 4659, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.11192 = idf(docFreq=97, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4659)
          0.066032514 = weight(abstract_txt:rules in 4659) [ClassicSimilarity], result of:
            0.066032514 = score(doc=4659,freq=1.0), product of:
              0.23056248 = queryWeight, product of:
                2.4114656 = boost
                5.236983 = idf(docFreq=638, maxDocs=44218)
                0.018256873 = queryNorm
              0.2863975 = fieldWeight in 4659, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.236983 = idf(docFreq=638, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4659)
          0.093882054 = weight(abstract_txt:properties in 4659) [ClassicSimilarity], result of:
            0.093882054 = score(doc=4659,freq=1.0), product of:
              0.2915223 = queryWeight, product of:
                2.7115815 = boost
                5.888745 = idf(docFreq=332, maxDocs=44218)
                0.018256873 = queryNorm
              0.32204074 = fieldWeight in 4659, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.888745 = idf(docFreq=332, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4659)
          0.10981575 = weight(abstract_txt:semantic in 4659) [ClassicSimilarity], result of:
            0.10981575 = score(doc=4659,freq=4.0), product of:
              0.22439821 = queryWeight, product of:
                2.7470453 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.018256873 = queryNorm
              0.4893789 = fieldWeight in 4659, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4659)
        0.2 = coord(5/25)
  5. Park, H.; Smiraglia, R.P.: Enhancing data curation of cultural heritage for information sharing : a case study using open Government data (2014) 0.11
    0.11470158 = sum of:
      0.11470158 = product of:
        0.71688485 = sum of:
          0.104862384 = weight(abstract_txt:heritage in 1575) [ClassicSimilarity], result of:
            0.104862384 = score(doc=1575,freq=3.0), product of:
              0.11894542 = queryWeight, product of:
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.018256873 = queryNorm
              0.88160086 = fieldWeight in 1575, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.078125 = fieldNorm(doc=1575)
          0.14475404 = weight(abstract_txt:mapped in 1575) [ClassicSimilarity], result of:
            0.14475404 = score(doc=1575,freq=3.0), product of:
              0.1474648 = queryWeight, product of:
                1.113449 = boost
                7.2542357 = idf(docFreq=84, maxDocs=44218)
                0.018256873 = queryNorm
              0.98161757 = fieldWeight in 1575, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.2542357 = idf(docFreq=84, maxDocs=44218)
                0.078125 = fieldNorm(doc=1575)
          0.08446099 = weight(abstract_txt:mapping in 1575) [ClassicSimilarity], result of:
            0.08446099 = score(doc=1575,freq=1.0), product of:
              0.18710661 = queryWeight, product of:
                1.7737225 = boost
                5.777993 = idf(docFreq=371, maxDocs=44218)
                0.018256873 = queryNorm
              0.4514057 = fieldWeight in 1575, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.777993 = idf(docFreq=371, maxDocs=44218)
                0.078125 = fieldNorm(doc=1575)
          0.38280743 = weight(abstract_txt:cidoc in 1575) [ClassicSimilarity], result of:
            0.38280743 = score(doc=1575,freq=2.0), product of:
              0.40671974 = queryWeight, product of:
                2.6151028 = boost
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.018256873 = queryNorm
              0.94120693 = fieldWeight in 1575, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.518833 = idf(docFreq=23, maxDocs=44218)
                0.078125 = fieldNorm(doc=1575)
        0.16 = coord(4/25)
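
The score trees above follow the pattern of Lucene's ClassicSimilarity: each matching term contributes queryWeight times fieldWeight, the contributions are summed, and the sum is scaled by the coord factor (matching terms divided by query terms). The following is a minimal sketch that recomputes entry 5 (doc=1575) from the numbers printed in its listing. The function names `idf` and `term_score` are illustrative and not part of any library, the formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)) are assumed from ClassicSimilarity's documented defaults, and the boost of 1.0 for "heritage" is an assumption because the listing shows no boost line for that term.

```python
import math

MAX_DOCS = 44218          # maxDocs, as printed in every idf(...) line
QUERY_NORM = 0.018256873  # queryNorm, as printed in the listing


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Inverse document frequency, assuming the ClassicSimilarity default."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, boost: float, field_norm: float) -> float:
    """Contribution of one term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm  # tf = sqrt(freq)
    return query_weight * field_weight


# Entry 5 (doc=1575): (term, freq, docFreq, boost), copied from the listing.
# The boost of 1.0 for "heritage" is assumed (no boost line is shown for it).
terms = [
    ("heritage", 3.0, 177, 1.0),
    ("mapped", 3.0, 84, 1.113449),
    ("mapping", 1.0, 371, 1.7737225),
    ("cidoc", 2.0, 23, 2.6151028),
]
FIELD_NORM = 0.078125  # fieldNorm(doc=1575)
COORD = 4 / 25         # coord(4/25): 4 of 25 query terms matched

total = sum(term_score(freq, doc_freq, boost, FIELD_NORM)
            for _, freq, doc_freq, boost in terms)
score = total * COORD

print(f"sum of term scores: {total:.6f}")
print(f"final score:        {score:.6f}")
```

Under these assumptions the result should agree with the 0.11470158 printed for entry 5 to within floating-point rounding, since Lucene computes in 32-bit floats while Python uses 64-bit. The same two functions can be applied to any other entry by substituting its freq, docFreq, boost, fieldNorm and coord values.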