Document (#11879)

Author
Boeri, R.J.
Hensel, M.
Title
Set up a winning text retrieval system : carefully
Source
CD-ROM professional. 8(1995) no.8, S.67-68
Year
1995
Abstract
Considers some of the practical issues involved when a company plans to develop an in house computerized document management system: conversion of paper to electronic form via optical character recognition (OCR) or rekeying; coding of document elements using SGML; indexing for information searching and retrieval (including proximity searching); and hybrid CD-ROM and online information retrieval systems
Theme
Dokumentenmanagement
Aid
SGML

Similar documents (content)

  1. Thiel, T.J.: Automated indexing of document image management systems (1992) 0.32
    0.32304794 = sum of:
      0.32304794 = product of:
        1.0095248 = sum of:
          0.012895495 = weight(abstract_txt:information in 3049) [ClassicSimilarity], result of:
            0.012895495 = score(doc=3049,freq=1.0), product of:
              0.056817424 = queryWeight, product of:
                1.1125243 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.021095358 = queryNorm
              0.22696373 = fieldWeight in 3049, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.09375 = fieldNorm(doc=3049)
          0.14737421 = weight(abstract_txt:recognition in 3049) [ClassicSimilarity], result of:
            0.14737421 = score(doc=3049,freq=2.0), product of:
              0.18160059 = queryWeight, product of:
                1.4064113 = boost
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.021095358 = queryNorm
              0.8115294 = fieldWeight in 3049, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.09375 = fieldNorm(doc=3049)
          0.17771803 = weight(abstract_txt:character in 3049) [ClassicSimilarity], result of:
            0.17771803 = score(doc=3049,freq=2.0), product of:
              0.20574245 = queryWeight, product of:
                1.4969789 = boost
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.021095358 = queryNorm
              0.86378884 = fieldWeight in 3049, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.09375 = fieldNorm(doc=3049)
          0.034854945 = weight(abstract_txt:system in 3049) [ClassicSimilarity], result of:
            0.034854945 = score(doc=3049,freq=1.0), product of:
              0.11024676 = queryWeight, product of:
                1.5497143 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.021095358 = queryNorm
              0.3161539 = fieldWeight in 3049, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.09375 = fieldNorm(doc=3049)
          0.19810286 = weight(abstract_txt:coding in 3049) [ClassicSimilarity], result of:
            0.19810286 = score(doc=3049,freq=2.0), product of:
              0.22118895 = queryWeight, product of:
                1.5521562 = boost
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.021095358 = queryNorm
              0.8956273 = fieldWeight in 3049, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.09375 = fieldNorm(doc=3049)
          0.25685608 = weight(abstract_txt:optical in 3049) [ClassicSimilarity], result of:
            0.25685608 = score(doc=3049,freq=2.0), product of:
              0.26300427 = queryWeight, product of:
                1.6925251 = boost
                7.3661537 = idf(docFreq=75, maxDocs=44218)
                0.021095358 = queryNorm
              0.97662324 = fieldWeight in 3049, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.3661537 = idf(docFreq=75, maxDocs=44218)
                0.09375 = fieldNorm(doc=3049)
          0.12451076 = weight(abstract_txt:document in 3049) [ClassicSimilarity], result of:
            0.12451076 = score(doc=3049,freq=3.0), product of:
              0.17862973 = queryWeight, product of:
                1.9726298 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.021095358 = queryNorm
              0.6970327 = fieldWeight in 3049, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.09375 = fieldNorm(doc=3049)
          0.057212435 = weight(abstract_txt:retrieval in 3049) [ClassicSimilarity], result of:
            0.057212435 = score(doc=3049,freq=1.0), product of:
              0.17560907 = queryWeight, product of:
                2.395454 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.021095358 = queryNorm
              0.3257943 = fieldWeight in 3049, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=3049)
        0.32 = coord(8/25)
    
  2. Ramsden, A.: ELINOR electronic library system (1998) 0.22
    0.21946542 = sum of:
      0.21946542 = product of:
        0.91443926 = sum of:
          0.13894574 = weight(abstract_txt:recognition in 1403) [ClassicSimilarity], result of:
            0.13894574 = score(doc=1403,freq=1.0), product of:
              0.18160059 = queryWeight, product of:
                1.4064113 = boost
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.021095358 = queryNorm
              0.7651173 = fieldWeight in 1403, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.125 = fieldNorm(doc=1403)
          0.16257086 = weight(abstract_txt:computerized in 1403) [ClassicSimilarity], result of:
            0.16257086 = score(doc=1403,freq=1.0), product of:
              0.20164256 = queryWeight, product of:
                1.4819884 = boost
                6.449863 = idf(docFreq=189, maxDocs=44218)
                0.021095358 = queryNorm
              0.80623287 = fieldWeight in 1403, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.449863 = idf(docFreq=189, maxDocs=44218)
                0.125 = fieldNorm(doc=1403)
          0.16755417 = weight(abstract_txt:character in 1403) [ClassicSimilarity], result of:
            0.16755417 = score(doc=1403,freq=1.0), product of:
              0.20574245 = queryWeight, product of:
                1.4969789 = boost
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.021095358 = queryNorm
              0.814388 = fieldWeight in 1403, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.125 = fieldNorm(doc=1403)
          0.24216624 = weight(abstract_txt:optical in 1403) [ClassicSimilarity], result of:
            0.24216624 = score(doc=1403,freq=1.0), product of:
              0.26300427 = queryWeight, product of:
                1.6925251 = boost
                7.3661537 = idf(docFreq=75, maxDocs=44218)
                0.021095358 = queryNorm
              0.9207692 = fieldWeight in 1403, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3661537 = idf(docFreq=75, maxDocs=44218)
                0.125 = fieldNorm(doc=1403)
          0.09532148 = weight(abstract_txt:searching in 1403) [ClassicSimilarity], result of:
            0.09532148 = score(doc=1403,freq=1.0), product of:
              0.17797442 = queryWeight, product of:
                1.9690081 = boost
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.021095358 = queryNorm
              0.5355909 = fieldWeight in 1403, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.125 = fieldNorm(doc=1403)
          0.10788079 = weight(abstract_txt:retrieval in 1403) [ClassicSimilarity], result of:
            0.10788079 = score(doc=1403,freq=2.0), product of:
              0.17560907 = queryWeight, product of:
                2.395454 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.021095358 = queryNorm
              0.6143236 = fieldWeight in 1403, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.125 = fieldNorm(doc=1403)
        0.24 = coord(6/25)
    
  3. Broadhurst, R.: ¬The digitisation of library material (1993) 0.21
    0.20954709 = sum of:
      0.20954709 = product of:
        0.87311286 = sum of:
          0.08035922 = weight(abstract_txt:considers in 6256) [ClassicSimilarity], result of:
            0.08035922 = score(doc=6256,freq=1.0), product of:
              0.12606007 = queryWeight, product of:
                1.1717703 = boost
                5.0997415 = idf(docFreq=732, maxDocs=44218)
                0.021095358 = queryNorm
              0.6374677 = fieldWeight in 6256, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0997415 = idf(docFreq=732, maxDocs=44218)
                0.125 = fieldNorm(doc=6256)
          0.13894574 = weight(abstract_txt:recognition in 6256) [ClassicSimilarity], result of:
            0.13894574 = score(doc=6256,freq=1.0), product of:
              0.18160059 = queryWeight, product of:
                1.4064113 = boost
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.021095358 = queryNorm
              0.7651173 = fieldWeight in 6256, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.125 = fieldNorm(doc=6256)
          0.14823906 = weight(abstract_txt:conversion in 6256) [ClassicSimilarity], result of:
            0.14823906 = score(doc=6256,freq=1.0), product of:
              0.1896104 = queryWeight, product of:
                1.4370928 = boost
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.021095358 = queryNorm
              0.7818087 = fieldWeight in 6256, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.125 = fieldNorm(doc=6256)
          0.16755417 = weight(abstract_txt:character in 6256) [ClassicSimilarity], result of:
            0.16755417 = score(doc=6256,freq=1.0), product of:
              0.20574245 = queryWeight, product of:
                1.4969789 = boost
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.021095358 = queryNorm
              0.814388 = fieldWeight in 6256, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.125 = fieldNorm(doc=6256)
          0.24216624 = weight(abstract_txt:optical in 6256) [ClassicSimilarity], result of:
            0.24216624 = score(doc=6256,freq=1.0), product of:
              0.26300427 = queryWeight, product of:
                1.6925251 = boost
                7.3661537 = idf(docFreq=75, maxDocs=44218)
                0.021095358 = queryNorm
              0.9207692 = fieldWeight in 6256, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3661537 = idf(docFreq=75, maxDocs=44218)
                0.125 = fieldNorm(doc=6256)
          0.095848426 = weight(abstract_txt:document in 6256) [ClassicSimilarity], result of:
            0.095848426 = score(doc=6256,freq=1.0), product of:
              0.17862973 = queryWeight, product of:
                1.9726298 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.021095358 = queryNorm
              0.53657603 = fieldWeight in 6256, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.125 = fieldNorm(doc=6256)
        0.24 = coord(6/25)
    
  4. Lunin, L.F.: ¬The big picture : selection and design issues for image information systems (1997) 0.17
    0.17155933 = sum of:
      0.17155933 = product of:
        0.5361229 = sum of:
          0.03746015 = weight(abstract_txt:including in 757) [ClassicSimilarity], result of:
            0.03746015 = score(doc=757,freq=1.0), product of:
              0.09181054 = queryWeight, product of:
                4.352168 = idf(docFreq=1547, maxDocs=44218)
                0.021095358 = queryNorm
              0.40801576 = fieldWeight in 757, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.352168 = idf(docFreq=1547, maxDocs=44218)
                0.09375 = fieldNorm(doc=757)
          0.018236984 = weight(abstract_txt:information in 757) [ClassicSimilarity], result of:
            0.018236984 = score(doc=757,freq=2.0), product of:
              0.056817424 = queryWeight, product of:
                1.1125243 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.021095358 = queryNorm
              0.32097518 = fieldWeight in 757, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.09375 = fieldNorm(doc=757)
          0.060269415 = weight(abstract_txt:considers in 757) [ClassicSimilarity], result of:
            0.060269415 = score(doc=757,freq=1.0), product of:
              0.12606007 = queryWeight, product of:
                1.1717703 = boost
                5.0997415 = idf(docFreq=732, maxDocs=44218)
                0.021095358 = queryNorm
              0.47810078 = fieldWeight in 757, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0997415 = idf(docFreq=732, maxDocs=44218)
                0.09375 = fieldNorm(doc=757)
          0.06367701 = weight(abstract_txt:involved in 757) [ClassicSimilarity], result of:
            0.06367701 = score(doc=757,freq=1.0), product of:
              0.13076796 = queryWeight, product of:
                1.1934505 = boost
                5.194097 = idf(docFreq=666, maxDocs=44218)
                0.021095358 = queryNorm
              0.48694658 = fieldWeight in 757, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.194097 = idf(docFreq=666, maxDocs=44218)
                0.09375 = fieldNorm(doc=757)
          0.06615951 = weight(abstract_txt:elements in 757) [ClassicSimilarity], result of:
            0.06615951 = score(doc=757,freq=1.0), product of:
              0.13414498 = queryWeight, product of:
                1.2087624 = boost
                5.260737 = idf(docFreq=623, maxDocs=44218)
                0.021095358 = queryNorm
              0.4931941 = fieldWeight in 757, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.260737 = idf(docFreq=623, maxDocs=44218)
                0.09375 = fieldNorm(doc=757)
          0.1111793 = weight(abstract_txt:conversion in 757) [ClassicSimilarity], result of:
            0.1111793 = score(doc=757,freq=1.0), product of:
              0.1896104 = queryWeight, product of:
                1.4370928 = boost
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.021095358 = queryNorm
              0.5863565 = fieldWeight in 757, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.09375 = fieldNorm(doc=757)
          0.12192814 = weight(abstract_txt:computerized in 757) [ClassicSimilarity], result of:
            0.12192814 = score(doc=757,freq=1.0), product of:
              0.20164256 = queryWeight, product of:
                1.4819884 = boost
                6.449863 = idf(docFreq=189, maxDocs=44218)
                0.021095358 = queryNorm
              0.60467464 = fieldWeight in 757, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.449863 = idf(docFreq=189, maxDocs=44218)
                0.09375 = fieldNorm(doc=757)
          0.057212435 = weight(abstract_txt:retrieval in 757) [ClassicSimilarity], result of:
            0.057212435 = score(doc=757,freq=1.0), product of:
              0.17560907 = queryWeight, product of:
                2.395454 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.021095358 = queryNorm
              0.3257943 = fieldWeight in 757, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=757)
        0.32 = coord(8/25)
    
  5. Initiatives for access (1994) 0.17
    0.16551214 = sum of:
      0.16551214 = product of:
        0.5911148 = sum of:
          0.010746244 = weight(abstract_txt:information in 3837) [ClassicSimilarity], result of:
            0.010746244 = score(doc=3837,freq=1.0), product of:
              0.056817424 = queryWeight, product of:
                1.1125243 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.021095358 = queryNorm
              0.18913643 = fieldWeight in 3837, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.078125 = fieldNorm(doc=3837)
          0.05306418 = weight(abstract_txt:involved in 3837) [ClassicSimilarity], result of:
            0.05306418 = score(doc=3837,freq=1.0), product of:
              0.13076796 = queryWeight, product of:
                1.1934505 = boost
                5.194097 = idf(docFreq=666, maxDocs=44218)
                0.021095358 = queryNorm
              0.40578884 = fieldWeight in 3837, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.194097 = idf(docFreq=666, maxDocs=44218)
                0.078125 = fieldNorm(doc=3837)
          0.08684109 = weight(abstract_txt:recognition in 3837) [ClassicSimilarity], result of:
            0.08684109 = score(doc=3837,freq=1.0), product of:
              0.18160059 = queryWeight, product of:
                1.4064113 = boost
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.021095358 = queryNorm
              0.4781983 = fieldWeight in 3837, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.078125 = fieldNorm(doc=3837)
          0.092649415 = weight(abstract_txt:conversion in 3837) [ClassicSimilarity], result of:
            0.092649415 = score(doc=3837,freq=1.0), product of:
              0.1896104 = queryWeight, product of:
                1.4370928 = boost
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.021095358 = queryNorm
              0.4886304 = fieldWeight in 3837, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.078125 = fieldNorm(doc=3837)
          0.10472136 = weight(abstract_txt:character in 3837) [ClassicSimilarity], result of:
            0.10472136 = score(doc=3837,freq=1.0), product of:
              0.20574245 = queryWeight, product of:
                1.4969789 = boost
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.021095358 = queryNorm
              0.5089925 = fieldWeight in 3837, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.078125 = fieldNorm(doc=3837)
          0.029045787 = weight(abstract_txt:system in 3837) [ClassicSimilarity], result of:
            0.029045787 = score(doc=3837,freq=1.0), product of:
              0.11024676 = queryWeight, product of:
                1.5497143 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.021095358 = queryNorm
              0.2634616 = fieldWeight in 3837, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.078125 = fieldNorm(doc=3837)
          0.21404673 = weight(abstract_txt:optical in 3837) [ClassicSimilarity], result of:
            0.21404673 = score(doc=3837,freq=2.0), product of:
              0.26300427 = queryWeight, product of:
                1.6925251 = boost
                7.3661537 = idf(docFreq=75, maxDocs=44218)
                0.021095358 = queryNorm
              0.81385267 = fieldWeight in 3837, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.3661537 = idf(docFreq=75, maxDocs=44218)
                0.078125 = fieldNorm(doc=3837)
        0.28 = coord(7/25)