Document (#14072)

Author
Rijsbergen, C.J. van
Title
¬A test for the separation of relevant and non-relevant documents in experimental retrieval collections
Source
Journal of documentation. 29(1973) no.3, S.251-257
Year
1973
Abstract
Many retrievalexperiments are intended to discover ways of improving performance, taking the results obtained with some particular technique as a baseline. The fact that substantial alterations to a system often have little or no effect on particular collections is puzzling. This may be due to the initially poor seperation of relevant and non-relevant documents. The paper presents a procedure for characterizing this seperation for a collection, which can be used to show whether proposed modifications of the base system are likely to be useful.
Theme
Retrievalstudien

Similar documents (author)

  1. Van Rijsbergen, C.J. -> Rijsbergen, C.J. van: 4.46
    4.45533 = sum of:
      4.45533 = weight(author_txt:rijsbergen in 4130) [ClassicSimilarity], result of:
        4.45533 = fieldWeight in 4130, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.401051 = idf(docFreq=26, maxDocs=44218)
          0.375 = fieldNorm(doc=4130)
    
  2. Rijsbergen, C.J. van: Foundations of evaluation (1974) 4.20
    4.2005253 = sum of:
      4.2005253 = weight(author_txt:rijsbergen in 1078) [ClassicSimilarity], result of:
        4.2005253 = fieldWeight in 1078, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.401051 = idf(docFreq=26, maxDocs=44218)
          0.5 = fieldNorm(doc=1078)
    
  3. Rijsbergen, C.J. van: Automatic classification in information retrieval (1978) 4.20
    4.2005253 = sum of:
      4.2005253 = weight(author_txt:rijsbergen in 2412) [ClassicSimilarity], result of:
        4.2005253 = fieldWeight in 2412, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.401051 = idf(docFreq=26, maxDocs=44218)
          0.5 = fieldNorm(doc=2412)
    
  4. Rijsbergen, C.J. van: ¬A fast hierarchic clustering algorithm (1970) 4.20
    4.2005253 = sum of:
      4.2005253 = weight(author_txt:rijsbergen in 3300) [ClassicSimilarity], result of:
        4.2005253 = fieldWeight in 3300, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.401051 = idf(docFreq=26, maxDocs=44218)
          0.5 = fieldNorm(doc=3300)
    
  5. Rijsbergen, C.J. van: Retrieval effectiveness (1981) 4.20
    4.2005253 = sum of:
      4.2005253 = weight(author_txt:rijsbergen in 3147) [ClassicSimilarity], result of:
        4.2005253 = fieldWeight in 3147, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.401051 = idf(docFreq=26, maxDocs=44218)
          0.5 = fieldNorm(doc=3147)
    

Similar documents (content)

  1. Ruthven, T.; Lalmas, M.; Rijsbergen, K.van: Incorporating user research behavior into relevance feedback (2003) 0.14
    0.14407355 = sum of:
      0.14407355 = product of:
        0.51454836 = sum of:
          0.0727102 = weight(abstract_txt:experimental in 5169) [ClassicSimilarity], result of:
            0.0727102 = score(doc=5169,freq=3.0), product of:
              0.12443049 = queryWeight, product of:
                5.397938 = idf(docFreq=543, maxDocs=44218)
                0.023051487 = queryNorm
              0.5843439 = fieldWeight in 5169, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.397938 = idf(docFreq=543, maxDocs=44218)
                0.0625 = fieldNorm(doc=5169)
          0.04272434 = weight(abstract_txt:effect in 5169) [ClassicSimilarity], result of:
            0.04272434 = score(doc=5169,freq=1.0), product of:
              0.12589851 = queryWeight, product of:
                1.0058817 = boost
                5.4296865 = idf(docFreq=526, maxDocs=44218)
                0.023051487 = queryNorm
              0.3393554 = fieldWeight in 5169, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4296865 = idf(docFreq=526, maxDocs=44218)
                0.0625 = fieldNorm(doc=5169)
          0.06592799 = weight(abstract_txt:technique in 5169) [ClassicSimilarity], result of:
            0.06592799 = score(doc=5169,freq=2.0), product of:
              0.13343617 = queryWeight, product of:
                1.0355555 = boost
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.023051487 = queryNorm
              0.49407884 = fieldWeight in 5169, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.0625 = fieldNorm(doc=5169)
          0.040944256 = weight(abstract_txt:system in 5169) [ClassicSimilarity], result of:
            0.040944256 = score(doc=5169,freq=4.0), product of:
              0.09713051 = queryWeight, product of:
                1.2494804 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.023051487 = queryNorm
              0.42153856 = fieldWeight in 5169, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.0625 = fieldNorm(doc=5169)
          0.052844476 = weight(abstract_txt:documents in 5169) [ClassicSimilarity], result of:
            0.052844476 = score(doc=5169,freq=2.0), product of:
              0.14506748 = queryWeight, product of:
                1.5269915 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.023051487 = queryNorm
              0.36427513 = fieldWeight in 5169, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=5169)
          0.055202775 = weight(abstract_txt:collections in 5169) [ClassicSimilarity], result of:
            0.055202775 = score(doc=5169,freq=1.0), product of:
              0.18817168 = queryWeight, product of:
                1.7391167 = boost
                4.693822 = idf(docFreq=1099, maxDocs=44218)
                0.023051487 = queryNorm
              0.29336387 = fieldWeight in 5169, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.693822 = idf(docFreq=1099, maxDocs=44218)
                0.0625 = fieldNorm(doc=5169)
          0.18419434 = weight(abstract_txt:relevant in 5169) [ClassicSimilarity], result of:
            0.18419434 = score(doc=5169,freq=3.0), product of:
              0.36705753 = queryWeight, product of:
                3.4350548 = boost
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.023051487 = queryNorm
              0.5018133 = fieldWeight in 5169, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.0625 = fieldNorm(doc=5169)
        0.28 = coord(7/25)
    
  2. Dadashkarimia, J.; Shakery, A.; Failia, H.; Zamani, H.: ¬An expectation-maximization algorithm for query translation based on pseudo-relevant documents (2017) 0.14
    0.14199789 = sum of:
      0.14199789 = product of:
        0.50713533 = sum of:
          0.0373838 = weight(abstract_txt:effect in 3296) [ClassicSimilarity], result of:
            0.0373838 = score(doc=3296,freq=1.0), product of:
              0.12589851 = queryWeight, product of:
                1.0058817 = boost
                5.4296865 = idf(docFreq=526, maxDocs=44218)
                0.023051487 = queryNorm
              0.29693598 = fieldWeight in 3296, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4296865 = idf(docFreq=526, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3296)
          0.044553675 = weight(abstract_txt:obtained in 3296) [ClassicSimilarity], result of:
            0.044553675 = score(doc=3296,freq=1.0), product of:
              0.14152092 = queryWeight, product of:
                1.0664657 = boost
                5.756716 = idf(docFreq=379, maxDocs=44218)
                0.023051487 = queryNorm
              0.3148204 = fieldWeight in 3296, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.756716 = idf(docFreq=379, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3296)
          0.048058506 = weight(abstract_txt:improving in 3296) [ClassicSimilarity], result of:
            0.048058506 = score(doc=3296,freq=1.0), product of:
              0.14884874 = queryWeight, product of:
                1.0937276 = boost
                5.9038734 = idf(docFreq=327, maxDocs=44218)
                0.023051487 = queryNorm
              0.32286808 = fieldWeight in 3296, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9038734 = idf(docFreq=327, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3296)
          0.07567612 = weight(abstract_txt:baseline in 3296) [ClassicSimilarity], result of:
            0.07567612 = score(doc=3296,freq=1.0), product of:
              0.20146714 = queryWeight, product of:
                1.272444 = boost
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.023051487 = queryNorm
              0.37562513 = fieldWeight in 3296, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3296)
          0.056630872 = weight(abstract_txt:documents in 3296) [ClassicSimilarity], result of:
            0.056630872 = score(doc=3296,freq=3.0), product of:
              0.14506748 = queryWeight, product of:
                1.5269915 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.023051487 = queryNorm
              0.39037606 = fieldWeight in 3296, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3296)
          0.083662264 = weight(abstract_txt:collections in 3296) [ClassicSimilarity], result of:
            0.083662264 = score(doc=3296,freq=3.0), product of:
              0.18817168 = queryWeight, product of:
                1.7391167 = boost
                4.693822 = idf(docFreq=1099, maxDocs=44218)
                0.023051487 = queryNorm
              0.444606 = fieldWeight in 3296, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.693822 = idf(docFreq=1099, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3296)
          0.16117005 = weight(abstract_txt:relevant in 3296) [ClassicSimilarity], result of:
            0.16117005 = score(doc=3296,freq=3.0), product of:
              0.36705753 = queryWeight, product of:
                3.4350548 = boost
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.023051487 = queryNorm
              0.43908662 = fieldWeight in 3296, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3296)
        0.28 = coord(7/25)
    
  3. Talvensaari, T.; Juhola, M.; Laurikkala, J.; Järvelin, K.: Corpus-based cross-language information retrieval in retrieval of highly relevant documents (2007) 0.14
    0.13663432 = sum of:
      0.13663432 = product of:
        0.56930965 = sum of:
          0.020472128 = weight(abstract_txt:system in 139) [ClassicSimilarity], result of:
            0.020472128 = score(doc=139,freq=1.0), product of:
              0.09713051 = queryWeight, product of:
                1.2494804 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.023051487 = queryNorm
              0.21076928 = fieldWeight in 139, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.0625 = fieldNorm(doc=139)
          0.086486995 = weight(abstract_txt:baseline in 139) [ClassicSimilarity], result of:
            0.086486995 = score(doc=139,freq=1.0), product of:
              0.20146714 = queryWeight, product of:
                1.272444 = boost
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.023051487 = queryNorm
              0.42928585 = fieldWeight in 139, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.0625 = fieldNorm(doc=139)
          0.08803822 = weight(abstract_txt:poor in 139) [ClassicSimilarity], result of:
            0.08803822 = score(doc=139,freq=1.0), product of:
              0.203869 = queryWeight, product of:
                1.2800065 = boost
                6.9093957 = idf(docFreq=119, maxDocs=44218)
                0.023051487 = queryNorm
              0.43183723 = fieldWeight in 139, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9093957 = idf(docFreq=119, maxDocs=44218)
                0.0625 = fieldNorm(doc=139)
          0.083554454 = weight(abstract_txt:documents in 139) [ClassicSimilarity], result of:
            0.083554454 = score(doc=139,freq=5.0), product of:
              0.14506748 = queryWeight, product of:
                1.5269915 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.023051487 = queryNorm
              0.5759696 = fieldWeight in 139, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=139)
          0.07806851 = weight(abstract_txt:collections in 139) [ClassicSimilarity], result of:
            0.07806851 = score(doc=139,freq=2.0), product of:
              0.18817168 = queryWeight, product of:
                1.7391167 = boost
                4.693822 = idf(docFreq=1099, maxDocs=44218)
                0.023051487 = queryNorm
              0.41487914 = fieldWeight in 139, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.693822 = idf(docFreq=1099, maxDocs=44218)
                0.0625 = fieldNorm(doc=139)
          0.21268933 = weight(abstract_txt:relevant in 139) [ClassicSimilarity], result of:
            0.21268933 = score(doc=139,freq=4.0), product of:
              0.36705753 = queryWeight, product of:
                3.4350548 = boost
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.023051487 = queryNorm
              0.5794441 = fieldWeight in 139, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.0625 = fieldNorm(doc=139)
        0.24 = coord(6/25)
    
  4. Lam-Adesina, A.M.; Jones, G.J.F.: Examining and improving the effectiveness of relevance feedback for retrieval of scanned text documents (2006) 0.11
    0.11211836 = sum of:
      0.11211836 = product of:
        0.40042272 = sum of:
          0.041979253 = weight(abstract_txt:experimental in 977) [ClassicSimilarity], result of:
            0.041979253 = score(doc=977,freq=1.0), product of:
              0.12443049 = queryWeight, product of:
                5.397938 = idf(docFreq=543, maxDocs=44218)
                0.023051487 = queryNorm
              0.3373711 = fieldWeight in 977, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.397938 = idf(docFreq=543, maxDocs=44218)
                0.0625 = fieldNorm(doc=977)
          0.054924008 = weight(abstract_txt:improving in 977) [ClassicSimilarity], result of:
            0.054924008 = score(doc=977,freq=1.0), product of:
              0.14884874 = queryWeight, product of:
                1.0937276 = boost
                5.9038734 = idf(docFreq=327, maxDocs=44218)
                0.023051487 = queryNorm
              0.3689921 = fieldWeight in 977, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9038734 = idf(docFreq=327, maxDocs=44218)
                0.0625 = fieldNorm(doc=977)
          0.020472128 = weight(abstract_txt:system in 977) [ClassicSimilarity], result of:
            0.020472128 = score(doc=977,freq=1.0), product of:
              0.09713051 = queryWeight, product of:
                1.2494804 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.023051487 = queryNorm
              0.21076928 = fieldWeight in 977, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.0625 = fieldNorm(doc=977)
          0.086486995 = weight(abstract_txt:baseline in 977) [ClassicSimilarity], result of:
            0.086486995 = score(doc=977,freq=1.0), product of:
              0.20146714 = queryWeight, product of:
                1.272444 = boost
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.023051487 = queryNorm
              0.42928585 = fieldWeight in 977, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.0625 = fieldNorm(doc=977)
          0.09814544 = weight(abstract_txt:modifications in 977) [ClassicSimilarity], result of:
            0.09814544 = score(doc=977,freq=1.0), product of:
              0.21918817 = queryWeight, product of:
                1.3272268 = boost
                7.1642876 = idf(docFreq=92, maxDocs=44218)
                0.023051487 = queryNorm
              0.44776797 = fieldWeight in 977, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1642876 = idf(docFreq=92, maxDocs=44218)
                0.0625 = fieldNorm(doc=977)
          0.052844476 = weight(abstract_txt:documents in 977) [ClassicSimilarity], result of:
            0.052844476 = score(doc=977,freq=2.0), product of:
              0.14506748 = queryWeight, product of:
                1.5269915 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.023051487 = queryNorm
              0.36427513 = fieldWeight in 977, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=977)
          0.04557044 = weight(abstract_txt:particular in 977) [ClassicSimilarity], result of:
            0.04557044 = score(doc=977,freq=1.0), product of:
              0.16559066 = queryWeight, product of:
                1.631434 = boost
                4.4031897 = idf(docFreq=1470, maxDocs=44218)
                0.023051487 = queryNorm
              0.27519935 = fieldWeight in 977, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4031897 = idf(docFreq=1470, maxDocs=44218)
                0.0625 = fieldNorm(doc=977)
        0.28 = coord(7/25)
    
  5. Khan, M.S.; Khor, S.: Enhanced Web document retrieval using automatic query expansion (2004) 0.10
    0.102677256 = sum of:
      0.102677256 = product of:
        0.4278219 = sum of:
          0.041979253 = weight(abstract_txt:experimental in 2091) [ClassicSimilarity], result of:
            0.041979253 = score(doc=2091,freq=1.0), product of:
              0.12443049 = queryWeight, product of:
                5.397938 = idf(docFreq=543, maxDocs=44218)
                0.023051487 = queryNorm
              0.3373711 = fieldWeight in 2091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.397938 = idf(docFreq=543, maxDocs=44218)
                0.0625 = fieldNorm(doc=2091)
          0.04892868 = weight(abstract_txt:likely in 2091) [ClassicSimilarity], result of:
            0.04892868 = score(doc=2091,freq=1.0), product of:
              0.13780956 = queryWeight, product of:
                1.0523889 = boost
                5.68073 = idf(docFreq=409, maxDocs=44218)
                0.023051487 = queryNorm
              0.35504562 = fieldWeight in 2091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.68073 = idf(docFreq=409, maxDocs=44218)
                0.0625 = fieldNorm(doc=2091)
          0.054924008 = weight(abstract_txt:improving in 2091) [ClassicSimilarity], result of:
            0.054924008 = score(doc=2091,freq=1.0), product of:
              0.14884874 = queryWeight, product of:
                1.0937276 = boost
                5.9038734 = idf(docFreq=327, maxDocs=44218)
                0.023051487 = queryNorm
              0.3689921 = fieldWeight in 2091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9038734 = idf(docFreq=327, maxDocs=44218)
                0.0625 = fieldNorm(doc=2091)
          0.1009119 = weight(abstract_txt:initially in 2091) [ClassicSimilarity], result of:
            0.1009119 = score(doc=2091,freq=1.0), product of:
              0.22328794 = queryWeight, product of:
                1.3395817 = boost
                7.230979 = idf(docFreq=86, maxDocs=44218)
                0.023051487 = queryNorm
              0.4519362 = fieldWeight in 2091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.230979 = idf(docFreq=86, maxDocs=44218)
                0.0625 = fieldNorm(doc=2091)
          0.07473338 = weight(abstract_txt:documents in 2091) [ClassicSimilarity], result of:
            0.07473338 = score(doc=2091,freq=4.0), product of:
              0.14506748 = queryWeight, product of:
                1.5269915 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.023051487 = queryNorm
              0.5151628 = fieldWeight in 2091, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=2091)
          0.10634466 = weight(abstract_txt:relevant in 2091) [ClassicSimilarity], result of:
            0.10634466 = score(doc=2091,freq=1.0), product of:
              0.36705753 = queryWeight, product of:
                3.4350548 = boost
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.023051487 = queryNorm
              0.28972206 = fieldWeight in 2091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.0625 = fieldNorm(doc=2091)
        0.24 = coord(6/25)