Document (#7840)

Author
Pao, M.L.
Title
High precision by duplicate retrieval
Source
Proceedings of the 14th National Online Meeting 1993, New York, 4-6 May 1993. Ed.: M.E. Williams
Imprint
Medford, NJ : Learned Information
Year
1993
Pages
S.337-341
Abstract
Reports results of a study of the phenomenon of retrieval overlap. Three studies reported that retrieval overlap was small, and analysis of the overlap data shows that the overlap items are much more likely to be rated as relevant to the search topic. Moreover, the odds are at least 2 to 1 that overlap items are judged to be definitely relevant. This retrieval overlap may be used as a search tactic if searchers are interested in a few items of definite relevance: Although multiple retrieval could be derived from different searches or different search methods, it appears that online searchers could easily incorporate this search tactic by searching different databases when a few definite relevant items are called for

Similar documents (content)

  1. Pao, M.L.: Relevance odds of retrieval overlaps from seven search fields (1994) 0.37
    0.37408832 = sum of:
      0.37408832 = product of:
        1.0391341 = sum of:
          0.087682 = weight(abstract_txt:duplicate in 7817) [ClassicSimilarity], result of:
            0.087682 = score(doc=7817,freq=1.0), product of:
              0.13970922 = queryWeight, product of:
                1.398013 = boost
                8.033325 = idf(docFreq=38, maxDocs=44218)
                0.012439947 = queryNorm
              0.62760353 = fieldWeight in 7817, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.033325 = idf(docFreq=38, maxDocs=44218)
                0.078125 = fieldNorm(doc=7817)
          0.24104044 = weight(abstract_txt:odds in 7817) [ClassicSimilarity], result of:
            0.24104044 = score(doc=7817,freq=5.0), product of:
              0.16033243 = queryWeight, product of:
                1.4976467 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.012439947 = queryNorm
              1.5033792 = fieldWeight in 7817, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.078125 = fieldNorm(doc=7817)
          0.008999965 = weight(abstract_txt:that in 7817) [ClassicSimilarity], result of:
            0.008999965 = score(doc=7817,freq=1.0), product of:
              0.048618175 = queryWeight, product of:
                1.6494076 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.012439947 = queryNorm
              0.18511525 = fieldWeight in 7817, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=7817)
          0.036764678 = weight(abstract_txt:could in 7817) [ClassicSimilarity], result of:
            0.036764678 = score(doc=7817,freq=1.0), product of:
              0.09860872 = queryWeight, product of:
                1.6610065 = boost
                4.772275 = idf(docFreq=1016, maxDocs=44218)
                0.012439947 = queryNorm
              0.37283397 = fieldWeight in 7817, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.772275 = idf(docFreq=1016, maxDocs=44218)
                0.078125 = fieldNorm(doc=7817)
          0.050541747 = weight(abstract_txt:relevant in 7817) [ClassicSimilarity], result of:
            0.050541747 = score(doc=7817,freq=1.0), product of:
              0.13955927 = queryWeight, product of:
                2.4201298 = boost
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.012439947 = queryNorm
              0.36215258 = fieldWeight in 7817, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.078125 = fieldNorm(doc=7817)
          0.0468327 = weight(abstract_txt:search in 7817) [ClassicSimilarity], result of:
            0.0468327 = score(doc=7817,freq=2.0), product of:
              0.1158762 = queryWeight, product of:
                2.546395 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.012439947 = queryNorm
              0.4041615 = fieldWeight in 7817, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.078125 = fieldNorm(doc=7817)
          0.0614712 = weight(abstract_txt:retrieval in 7817) [ClassicSimilarity], result of:
            0.0614712 = score(doc=7817,freq=3.0), product of:
              0.13072205 = queryWeight, product of:
                3.023835 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.012439947 = queryNorm
              0.47024357 = fieldWeight in 7817, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=7817)
          0.16611908 = weight(abstract_txt:items in 7817) [ClassicSimilarity], result of:
            0.16611908 = score(doc=7817,freq=2.0), product of:
              0.26950973 = queryWeight, product of:
                3.8834336 = boost
                5.57879 = idf(docFreq=453, maxDocs=44218)
                0.012439947 = queryNorm
              0.6163751 = fieldWeight in 7817, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.57879 = idf(docFreq=453, maxDocs=44218)
                0.078125 = fieldNorm(doc=7817)
          0.33968228 = weight(abstract_txt:overlap in 7817) [ClassicSimilarity], result of:
            0.33968228 = score(doc=7817,freq=1.0), product of:
              0.62620586 = queryWeight, product of:
                7.249914 = boost
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.012439947 = queryNorm
              0.54244506 = fieldWeight in 7817, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.078125 = fieldNorm(doc=7817)
        0.36 = coord(9/25)
    
  2. Still, J.: ¬The anthroplogy of online search strategy formation : a study of four countires (1996) 0.24
    0.23579654 = sum of:
      0.23579654 = product of:
        0.7368642 = sum of:
          0.010182299 = weight(abstract_txt:that in 4954) [ClassicSimilarity], result of:
            0.010182299 = score(doc=4954,freq=2.0), product of:
              0.048618175 = queryWeight, product of:
                1.6494076 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.012439947 = queryNorm
              0.20943399 = fieldWeight in 4954, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=4954)
          0.029411744 = weight(abstract_txt:could in 4954) [ClassicSimilarity], result of:
            0.029411744 = score(doc=4954,freq=1.0), product of:
              0.09860872 = queryWeight, product of:
                1.6610065 = boost
                4.772275 = idf(docFreq=1016, maxDocs=44218)
                0.012439947 = queryNorm
              0.2982672 = fieldWeight in 4954, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.772275 = idf(docFreq=1016, maxDocs=44218)
                0.0625 = fieldNorm(doc=4954)
          0.20348826 = weight(abstract_txt:searchers in 4954) [ClassicSimilarity], result of:
            0.20348826 = score(doc=4954,freq=12.0), product of:
              0.15638699 = queryWeight, product of:
                2.0917702 = boost
                6.009912 = idf(docFreq=294, maxDocs=44218)
                0.012439947 = queryNorm
              1.301184 = fieldWeight in 4954, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                6.009912 = idf(docFreq=294, maxDocs=44218)
                0.0625 = fieldNorm(doc=4954)
          0.0404334 = weight(abstract_txt:relevant in 4954) [ClassicSimilarity], result of:
            0.0404334 = score(doc=4954,freq=1.0), product of:
              0.13955927 = queryWeight, product of:
                2.4201298 = boost
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.012439947 = queryNorm
              0.28972206 = fieldWeight in 4954, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.0625 = fieldNorm(doc=4954)
          0.0592392 = weight(abstract_txt:search in 4954) [ClassicSimilarity], result of:
            0.0592392 = score(doc=4954,freq=5.0), product of:
              0.1158762 = queryWeight, product of:
                2.546395 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.012439947 = queryNorm
              0.5112284 = fieldWeight in 4954, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=4954)
          0.028392334 = weight(abstract_txt:retrieval in 4954) [ClassicSimilarity], result of:
            0.028392334 = score(doc=4954,freq=1.0), product of:
              0.13072205 = queryWeight, product of:
                3.023835 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.012439947 = queryNorm
              0.21719621 = fieldWeight in 4954, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=4954)
          0.09397114 = weight(abstract_txt:items in 4954) [ClassicSimilarity], result of:
            0.09397114 = score(doc=4954,freq=1.0), product of:
              0.26950973 = queryWeight, product of:
                3.8834336 = boost
                5.57879 = idf(docFreq=453, maxDocs=44218)
                0.012439947 = queryNorm
              0.3486744 = fieldWeight in 4954, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.57879 = idf(docFreq=453, maxDocs=44218)
                0.0625 = fieldNorm(doc=4954)
          0.27174583 = weight(abstract_txt:overlap in 4954) [ClassicSimilarity], result of:
            0.27174583 = score(doc=4954,freq=1.0), product of:
              0.62620586 = queryWeight, product of:
                7.249914 = boost
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.012439947 = queryNorm
              0.43395606 = fieldWeight in 4954, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.0625 = fieldNorm(doc=4954)
        0.32 = coord(8/25)
    
  3. Bellardo, T.; Saracevic, T.: Online searching and search output : relationships between overlap, relevance, recall and precision (1987) 0.21
    0.21287054 = sum of:
      0.21287054 = product of:
        0.76025194 = sum of:
          0.076900266 = weight(abstract_txt:judged in 4150) [ClassicSimilarity], result of:
            0.076900266 = score(doc=4150,freq=1.0), product of:
              0.12800787 = queryWeight, product of:
                1.3381877 = boost
                7.689554 = idf(docFreq=54, maxDocs=44218)
                0.012439947 = queryNorm
              0.6007464 = fieldWeight in 4150, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.689554 = idf(docFreq=54, maxDocs=44218)
                0.078125 = fieldNorm(doc=4150)
          0.024988836 = weight(abstract_txt:different in 4150) [ClassicSimilarity], result of:
            0.024988836 = score(doc=4150,freq=1.0), product of:
              0.087261476 = queryWeight, product of:
                1.9136856 = boost
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.012439947 = queryNorm
              0.28636733 = fieldWeight in 4150, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.078125 = fieldNorm(doc=4150)
          0.10384217 = weight(abstract_txt:searchers in 4150) [ClassicSimilarity], result of:
            0.10384217 = score(doc=4150,freq=2.0), product of:
              0.15638699 = queryWeight, product of:
                2.0917702 = boost
                6.009912 = idf(docFreq=294, maxDocs=44218)
                0.012439947 = queryNorm
              0.6640077 = fieldWeight in 4150, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.009912 = idf(docFreq=294, maxDocs=44218)
                0.078125 = fieldNorm(doc=4150)
          0.050541747 = weight(abstract_txt:relevant in 4150) [ClassicSimilarity], result of:
            0.050541747 = score(doc=4150,freq=1.0), product of:
              0.13955927 = queryWeight, product of:
                2.4201298 = boost
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.012439947 = queryNorm
              0.36215258 = fieldWeight in 4150, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.078125 = fieldNorm(doc=4150)
          0.0468327 = weight(abstract_txt:search in 4150) [ClassicSimilarity], result of:
            0.0468327 = score(doc=4150,freq=2.0), product of:
              0.1158762 = queryWeight, product of:
                2.546395 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.012439947 = queryNorm
              0.4041615 = fieldWeight in 4150, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.078125 = fieldNorm(doc=4150)
          0.11746393 = weight(abstract_txt:items in 4150) [ClassicSimilarity], result of:
            0.11746393 = score(doc=4150,freq=1.0), product of:
              0.26950973 = queryWeight, product of:
                3.8834336 = boost
                5.57879 = idf(docFreq=453, maxDocs=44218)
                0.012439947 = queryNorm
              0.435843 = fieldWeight in 4150, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.57879 = idf(docFreq=453, maxDocs=44218)
                0.078125 = fieldNorm(doc=4150)
          0.33968228 = weight(abstract_txt:overlap in 4150) [ClassicSimilarity], result of:
            0.33968228 = score(doc=4150,freq=1.0), product of:
              0.62620586 = queryWeight, product of:
                7.249914 = boost
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.012439947 = queryNorm
              0.54244506 = fieldWeight in 4150, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.078125 = fieldNorm(doc=4150)
        0.28 = coord(7/25)
    
  4. Pao, M.L.: Term and citation retrieval : a field study (1993) 0.20
    0.19610207 = sum of:
      0.19610207 = product of:
        0.81709194 = sum of:
          0.12935586 = weight(abstract_txt:odds in 3741) [ClassicSimilarity], result of:
            0.12935586 = score(doc=3741,freq=1.0), product of:
              0.16033243 = queryWeight, product of:
                1.4976467 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.012439947 = queryNorm
              0.8067979 = fieldWeight in 3741, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.09375 = fieldNorm(doc=3741)
          0.010799958 = weight(abstract_txt:that in 3741) [ClassicSimilarity], result of:
            0.010799958 = score(doc=3741,freq=1.0), product of:
              0.048618175 = queryWeight, product of:
                1.6494076 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.012439947 = queryNorm
              0.22213829 = fieldWeight in 3741, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.09375 = fieldNorm(doc=3741)
          0.08577219 = weight(abstract_txt:relevant in 3741) [ClassicSimilarity], result of:
            0.08577219 = score(doc=3741,freq=2.0), product of:
              0.13955927 = queryWeight, product of:
                2.4201298 = boost
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.012439947 = queryNorm
              0.61459327 = fieldWeight in 3741, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.09375 = fieldNorm(doc=3741)
          0.0425885 = weight(abstract_txt:retrieval in 3741) [ClassicSimilarity], result of:
            0.0425885 = score(doc=3741,freq=1.0), product of:
              0.13072205 = queryWeight, product of:
                3.023835 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.012439947 = queryNorm
              0.3257943 = fieldWeight in 3741, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=3741)
          0.1409567 = weight(abstract_txt:items in 3741) [ClassicSimilarity], result of:
            0.1409567 = score(doc=3741,freq=1.0), product of:
              0.26950973 = queryWeight, product of:
                3.8834336 = boost
                5.57879 = idf(docFreq=453, maxDocs=44218)
                0.012439947 = queryNorm
              0.52301157 = fieldWeight in 3741, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.57879 = idf(docFreq=453, maxDocs=44218)
                0.09375 = fieldNorm(doc=3741)
          0.40761876 = weight(abstract_txt:overlap in 3741) [ClassicSimilarity], result of:
            0.40761876 = score(doc=3741,freq=1.0), product of:
              0.62620586 = queryWeight, product of:
                7.249914 = boost
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.012439947 = queryNorm
              0.6509341 = fieldWeight in 3741, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.09375 = fieldNorm(doc=3741)
        0.24 = coord(6/25)
    
  5. MacCain, K.W.: Descriptor and citation retrieval in the medical behavioral sciences literature : retrieval overlaps and novelty distribution (1989) 0.18
    0.1826085 = sum of:
      0.1826085 = product of:
        0.9130425 = sum of:
          0.024988836 = weight(abstract_txt:different in 2290) [ClassicSimilarity], result of:
            0.024988836 = score(doc=2290,freq=1.0), product of:
              0.087261476 = queryWeight, product of:
                1.9136856 = boost
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.012439947 = queryNorm
              0.28636733 = fieldWeight in 2290, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.078125 = fieldNorm(doc=2290)
          0.071476825 = weight(abstract_txt:relevant in 2290) [ClassicSimilarity], result of:
            0.071476825 = score(doc=2290,freq=2.0), product of:
              0.13955927 = queryWeight, product of:
                2.4201298 = boost
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.012439947 = queryNorm
              0.5121611 = fieldWeight in 2290, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.078125 = fieldNorm(doc=2290)
          0.06623144 = weight(abstract_txt:search in 2290) [ClassicSimilarity], result of:
            0.06623144 = score(doc=2290,freq=4.0), product of:
              0.1158762 = queryWeight, product of:
                2.546395 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.012439947 = queryNorm
              0.5715707 = fieldWeight in 2290, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.078125 = fieldNorm(doc=2290)
          0.07098083 = weight(abstract_txt:retrieval in 2290) [ClassicSimilarity], result of:
            0.07098083 = score(doc=2290,freq=4.0), product of:
              0.13072205 = queryWeight, product of:
                3.023835 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.012439947 = queryNorm
              0.5429905 = fieldWeight in 2290, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=2290)
          0.67936456 = weight(abstract_txt:overlap in 2290) [ClassicSimilarity], result of:
            0.67936456 = score(doc=2290,freq=4.0), product of:
              0.62620586 = queryWeight, product of:
                7.249914 = boost
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.012439947 = queryNorm
              1.0848901 = fieldWeight in 2290, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.943297 = idf(docFreq=115, maxDocs=44218)
                0.078125 = fieldNorm(doc=2290)
        0.2 = coord(5/25)