Document (#18560)

Miyamoto, S.
Application of rough sets to information retrieval
Journal of the American Society for Information Science. 49(1998) no.3, pp. 195-205
The aim of the present article is to develop a method of rough retrieval, namely, an application of rough set theory to information retrieval. After a brief review of fuzzy sets, rough sets, and a fuzzy logical model for information retrieval, rough approximations for retrieved data are defined. The approximations are considered for both crisp and fuzzy cases. A fuzzy set is introduced for the rough boundary, and estimation of the membership for the results of set operations on the boundary is discussed. Rough approximations are also considered for cases in which hierarchical classes are assumed. Moreover, another approximation by a membership sequence, which refines the foregoing approximations, is discussed. Illustrative examples are shown.
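
The abstract mentions rough approximations of retrieved data without giving definitions. As a rough illustration only, the sketch below uses the standard crisp (Pawlak) lower and upper approximations; it is not taken from the article, and the function name, the document identifiers, and the partition into classes are all hypothetical.

```python
def rough_approximations(classes, target):
    """Return (lower, upper, boundary) of `target` with respect to a partition.

    `classes` is assumed to be a list of disjoint sets (equivalence classes);
    `target` is the crisp set to approximate, e.g. a set of retrieved documents.
    """
    lower, upper = set(), set()
    for block in classes:
        if block <= target:   # class lies entirely inside the target
            lower |= block
        if block & target:    # class overlaps the target at all
            upper |= block
    return lower, upper, upper - lower


# Hypothetical partition of five documents into three classes.
classes = [{"d1", "d2"}, {"d3"}, {"d4", "d5"}]
retrieved = {"d1", "d3", "d4"}

lower, upper, boundary = rough_approximations(classes, retrieved)
```

In this toy case only the class {"d3"} lies wholly inside the retrieved set, so it forms the lower approximation, while every class overlaps the retrieved set and therefore belongs to the upper approximation. The fuzzy case and the membership sequence described in the abstract are not covered by this sketch.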

Similar documents (content)

  1. Srinivasan, P.: Intelligent information retrieval using rough set approximations (1989) 0.24
    0.23897474 = sum of:
      0.23897474 = product of:
        0.8534812 = sum of:
          0.0293287 = weight(abstract_txt:retrieved in 2526) [ClassicSimilarity], result of:
            0.0293287 = score(doc=2526,freq=1.0), product of:
              0.05480791 = queryWeight, product of:
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.009602071 = queryNorm
              0.53511804 = fieldWeight in 2526, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.09375 = fieldNorm(doc=2526)
          0.029445196 = weight(abstract_txt:introduced in 2526) [ClassicSimilarity], result of:
            0.029445196 = score(doc=2526,freq=1.0), product of:
              0.054952946 = queryWeight, product of:
                1.0013223 = boost
                5.715473 = idf(docFreq=395, maxDocs=44218)
                0.009602071 = queryNorm
              0.5358256 = fieldWeight in 2526, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.715473 = idf(docFreq=395, maxDocs=44218)
                0.09375 = fieldNorm(doc=2526)
          0.0067132777 = weight(abstract_txt:information in 2526) [ClassicSimilarity], result of:
            0.0067132777 = score(doc=2526,freq=1.0), product of:
              0.029578637 = queryWeight, product of:
                1.272413 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.009602071 = queryNorm
              0.22696373 = fieldWeight in 2526, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.09375 = fieldNorm(doc=2526)
          0.030232355 = weight(abstract_txt:application in 2526) [ClassicSimilarity], result of:
            0.030232355 = score(doc=2526,freq=1.0), product of:
              0.07046487 = queryWeight, product of:
                1.6035397 = boost
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.009602071 = queryNorm
              0.4290415 = fieldWeight in 2526, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.09375 = fieldNorm(doc=2526)
          0.02647491 = weight(abstract_txt:retrieval in 2526) [ClassicSimilarity], result of:
            0.02647491 = score(doc=2526,freq=1.0), product of:
              0.081262656 = queryWeight, product of:
                2.435308 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.009602071 = queryNorm
              0.3257943 = fieldWeight in 2526, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=2526)
          0.09327733 = weight(abstract_txt:sets in 2526) [ClassicSimilarity], result of:
            0.09327733 = score(doc=2526,freq=2.0), product of:
              0.13568416 = queryWeight, product of:
                2.7252326 = boost
                5.185142 = idf(docFreq=672, maxDocs=44218)
                0.009602071 = queryNorm
              0.68745923 = fieldWeight in 2526, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.185142 = idf(docFreq=672, maxDocs=44218)
                0.09375 = fieldNorm(doc=2526)
          0.6380094 = weight(abstract_txt:rough in 2526) [ClassicSimilarity], result of:
            0.6380094 = score(doc=2526,freq=1.0), product of:
              0.81701887 = queryWeight, product of:
                10.215119 = boost
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.009602071 = queryNorm
              0.7808992 = fieldWeight in 2526, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.09375 = fieldNorm(doc=2526)
        0.28 = coord(7/25)
  2. Knowledge management in fuzzy databases (2000) 0.21
    0.2132012 = sum of:
      0.2132012 = product of:
        1.066006 = sum of:
          0.0094940085 = weight(abstract_txt:information in 4260) [ClassicSimilarity], result of:
            0.0094940085 = score(doc=4260,freq=2.0), product of:
              0.029578637 = queryWeight, product of:
                1.272413 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.009602071 = queryNorm
              0.32097518 = fieldWeight in 4260, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.09375 = fieldNorm(doc=4260)
          0.030655304 = weight(abstract_txt:discussed in 4260) [ClassicSimilarity], result of:
            0.030655304 = score(doc=4260,freq=1.0), product of:
              0.071120545 = queryWeight, product of:
                1.6109829 = boost
                4.5976853 = idf(docFreq=1210, maxDocs=44218)
                0.009602071 = queryNorm
              0.43103302 = fieldWeight in 4260, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5976853 = idf(docFreq=1210, maxDocs=44218)
                0.09375 = fieldNorm(doc=4260)
          0.037441175 = weight(abstract_txt:retrieval in 4260) [ClassicSimilarity], result of:
            0.037441175 = score(doc=4260,freq=2.0), product of:
              0.081262656 = queryWeight, product of:
                2.435308 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.009602071 = queryNorm
              0.4607427 = fieldWeight in 4260, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=4260)
          0.35040602 = weight(abstract_txt:fuzzy in 4260) [ClassicSimilarity], result of:
            0.35040602 = score(doc=4260,freq=3.0), product of:
              0.31526467 = queryWeight, product of:
                4.7967386 = boost
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.009602071 = queryNorm
              1.1114662 = fieldWeight in 4260, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.09375 = fieldNorm(doc=4260)
          0.6380094 = weight(abstract_txt:rough in 4260) [ClassicSimilarity], result of:
            0.6380094 = score(doc=4260,freq=1.0), product of:
              0.81701887 = queryWeight, product of:
                10.215119 = boost
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.009602071 = queryNorm
              0.7808992 = fieldWeight in 4260, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.09375 = fieldNorm(doc=4260)
        0.2 = coord(5/25)
  3. Hassanien, A.-E.: Rough set approach for attribute reduction and rule generation : a case of patients with suspected breast cancer (2004) 0.19
    0.1871883 = sum of:
      0.1871883 = product of:
        1.1699269 = sum of:
          0.024066644 = weight(abstract_txt:moreover in 2883) [ClassicSimilarity], result of:
            0.024066644 = score(doc=2883,freq=1.0), product of:
              0.06294857 = queryWeight, product of:
                1.0716953 = boost
                6.1171575 = idf(docFreq=264, maxDocs=44218)
                0.009602071 = queryNorm
              0.38232234 = fieldWeight in 2883, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1171575 = idf(docFreq=264, maxDocs=44218)
                0.0625 = fieldNorm(doc=2883)
          0.06002389 = weight(abstract_txt:approximation in 2883) [ClassicSimilarity], result of:
            0.06002389 = score(doc=2883,freq=1.0), product of:
              0.11576882 = queryWeight, product of:
                1.4533633 = boost
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.009602071 = queryNorm
              0.5184806 = fieldWeight in 2883, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.0625 = fieldNorm(doc=2883)
          0.043971352 = weight(abstract_txt:sets in 2883) [ClassicSimilarity], result of:
            0.043971352 = score(doc=2883,freq=1.0), product of:
              0.13568416 = queryWeight, product of:
                2.7252326 = boost
                5.185142 = idf(docFreq=672, maxDocs=44218)
                0.009602071 = queryNorm
              0.32407138 = fieldWeight in 2883, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.185142 = idf(docFreq=672, maxDocs=44218)
                0.0625 = fieldNorm(doc=2883)
          1.041865 = weight(abstract_txt:rough in 2883) [ClassicSimilarity], result of:
            1.041865 = score(doc=2883,freq=6.0), product of:
              0.81701887 = queryWeight, product of:
                10.215119 = boost
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.009602071 = queryNorm
              1.2752031 = fieldWeight in 2883, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.0625 = fieldNorm(doc=2883)
        0.16 = coord(4/25)
  4. Fuzziness in database management systems (1995) 0.17
    0.17227669 = sum of:
      0.17227669 = product of:
        0.7178196 = sum of:
          0.0094940085 = weight(abstract_txt:information in 2123) [ClassicSimilarity], result of:
            0.0094940085 = score(doc=2123,freq=2.0), product of:
              0.029578637 = queryWeight, product of:
                1.272413 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.009602071 = queryNorm
              0.32097518 = fieldWeight in 2123, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.09375 = fieldNorm(doc=2123)
          0.05309654 = weight(abstract_txt:discussed in 2123) [ClassicSimilarity], result of:
            0.05309654 = score(doc=2123,freq=3.0), product of:
              0.071120545 = queryWeight, product of:
                1.6109829 = boost
                4.5976853 = idf(docFreq=1210, maxDocs=44218)
                0.009602071 = queryNorm
              0.74657106 = fieldWeight in 2123, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5976853 = idf(docFreq=1210, maxDocs=44218)
                0.09375 = fieldNorm(doc=2123)
          0.03992777 = weight(abstract_txt:considered in 2123) [ClassicSimilarity], result of:
            0.03992777 = score(doc=2123,freq=1.0), product of:
              0.08482191 = queryWeight, product of:
                1.7593304 = boost
                5.021064 = idf(docFreq=792, maxDocs=44218)
                0.009602071 = queryNorm
              0.47072473 = fieldWeight in 2123, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.021064 = idf(docFreq=792, maxDocs=44218)
                0.09375 = fieldNorm(doc=2123)
          0.02647491 = weight(abstract_txt:retrieval in 2123) [ClassicSimilarity], result of:
            0.02647491 = score(doc=2123,freq=1.0), product of:
              0.081262656 = queryWeight, product of:
                2.435308 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.009602071 = queryNorm
              0.3257943 = fieldWeight in 2123, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=2123)
          0.09327733 = weight(abstract_txt:sets in 2123) [ClassicSimilarity], result of:
            0.09327733 = score(doc=2123,freq=2.0), product of:
              0.13568416 = queryWeight, product of:
                2.7252326 = boost
                5.185142 = idf(docFreq=672, maxDocs=44218)
                0.009602071 = queryNorm
              0.68745923 = fieldWeight in 2123, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.185142 = idf(docFreq=672, maxDocs=44218)
                0.09375 = fieldNorm(doc=2123)
          0.495549 = weight(abstract_txt:fuzzy in 2123) [ClassicSimilarity], result of:
            0.495549 = score(doc=2123,freq=6.0), product of:
              0.31526467 = queryWeight, product of:
                4.7967386 = boost
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.009602071 = queryNorm
              1.5718507 = fieldWeight in 2123, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.8448567 = idf(docFreq=127, maxDocs=44218)
                0.09375 = fieldNorm(doc=2123)
        0.24 = coord(6/25)
  5. Bell, D.A.; Guan, J.W.: Computational methods for rough classification and discovery (1998) 0.16
    0.16477713 = sum of:
      0.16477713 = product of:
        1.3731428 = sum of:
          0.031166932 = weight(abstract_txt:classes in 2909) [ClassicSimilarity], result of:
            0.031166932 = score(doc=2909,freq=1.0), product of:
              0.05707475 = queryWeight, product of:
                1.0204704 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.009602071 = queryNorm
              0.5460721 = fieldWeight in 2909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.09375 = fieldNorm(doc=2909)
          0.065957025 = weight(abstract_txt:sets in 2909) [ClassicSimilarity], result of:
            0.065957025 = score(doc=2909,freq=1.0), product of:
              0.13568416 = queryWeight, product of:
                2.7252326 = boost
                5.185142 = idf(docFreq=672, maxDocs=44218)
                0.009602071 = queryNorm
              0.48610705 = fieldWeight in 2909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.185142 = idf(docFreq=672, maxDocs=44218)
                0.09375 = fieldNorm(doc=2909)
          1.2760189 = weight(abstract_txt:rough in 2909) [ClassicSimilarity], result of:
            1.2760189 = score(doc=2909,freq=4.0), product of:
              0.81701887 = queryWeight, product of:
                10.215119 = boost
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.009602071 = queryNorm
              1.5617985 = fieldWeight in 2909, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.09375 = fieldNorm(doc=2909)
        0.12 = coord(3/25)
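
The score breakdowns above all follow the same pattern: each matching term contributes queryWeight times fieldWeight, the contributions are summed, and the sum is scaled by the coord factor. The sketch below recomputes the score of result 5 (doc 2909) from the values printed in its breakdown. It assumes the usual ClassicSimilarity formulas, idf = 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq), which agree with the numbers listed; the function names are illustrative and are not part of any Lucene API.

```python
import math


def idf(doc_freq, max_docs):
    # Assumed ClassicSimilarity inverse document frequency.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, boost, max_docs, query_norm, field_norm):
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm               # "queryWeight"
    field_weight = math.sqrt(freq) * term_idf * field_norm     # "fieldWeight"
    return query_weight * field_weight


def document_score(terms, matched, total, max_docs, query_norm, field_norm):
    summed = sum(
        term_score(t["freq"], t["doc_freq"], t["boost"],
                   max_docs, query_norm, field_norm)
        for t in terms
    )
    return (matched / total) * summed                          # "coord(matched/total)"


# Values copied from the breakdown of result 5 (doc 2909).
MAX_DOCS = 44218
QUERY_NORM = 0.009602071
terms_2909 = [
    {"freq": 1.0, "doc_freq": 354, "boost": 1.0204704},   # classes
    {"freq": 1.0, "doc_freq": 672, "boost": 2.7252326},   # sets
    {"freq": 4.0, "doc_freq": 28, "boost": 10.215119},    # rough
]

rough_2909 = term_score(4.0, 28, 10.215119, MAX_DOCS, QUERY_NORM, 0.09375)
score_2909 = document_score(terms_2909, matched=3, total=25,
                            max_docs=MAX_DOCS, query_norm=QUERY_NORM,
                            field_norm=0.09375)
print(score_2909)
```

The result should agree with the listed 0.16477713 to several decimal places; small differences in the last digits are expected because the listing appears to use single-precision arithmetic. The breakdown also shows why "rough" dominates every ranking here: it has both the highest boost (10.215119) and the highest idf (docFreq=28).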