Document (#21511)

Wartik, S.
Fox, E.
Heath, L.
Chen, Q.-F.
Hashing algorithms
Information retrieval: data structures and algorithms. Ed.: W.B. Frakes and R. Baeza-Yates
Englewood Cliffs, NJ : Prentice Hall
Discusses hashing, an information storage and retrieval technique useful for implementing many of the other structures in this book. The concepts underlying hashing are presented, along with two implementation strategies. The chapter also contains an extensive discussion of perfect hashing, an important optimization in information retrieval, and an O(n) algorithm for finding minimal perfect hash functions for a set of keys.
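
To make the term "minimal perfect hash function" concrete: a hash function is perfect for a key set if no two keys collide, and minimal if it also maps the n keys onto exactly the slots 0..n-1. The Python sketch below only checks that property for a given function. It is not the chapter's O(n) construction algorithm, and the names `is_minimal_perfect`, `by_first_letter` and `by_length` are hypothetical, chosen for illustration.

```python
from typing import Callable, Iterable


def is_minimal_perfect(h: Callable[[str], int], keys: Iterable[str]) -> bool:
    """Return True if h maps the distinct keys one-to-one onto 0..n-1."""
    key_list = list(keys)  # assumption: the keys are distinct
    slots = sorted(h(k) for k in key_list)
    # Perfect: no slot is used twice. Minimal: the slots are exactly 0..n-1.
    return slots == list(range(len(key_list)))


# Toy key set and two hypothetical hash functions.
keys = ["cat", "dog", "emu"]


def by_first_letter(k: str) -> int:
    return ord(k[0]) - ord("c")  # c -> 0, d -> 1, e -> 2


def by_length(k: str) -> int:
    return len(k) % 3  # every key has length 3, so all collide in slot 0


first_letter_ok = is_minimal_perfect(by_first_letter, keys)  # True
length_ok = is_minimal_perfect(by_length, keys)              # False
```

A function that passes this check allows a table of exactly n slots with no collision handling, which is why the abstract calls perfect hashing an optimization.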

Similar documents (author)

  1. Heath, F.: Libraries, information technology, and the future (1995) 2.89
    2.8879638 = sum of:
      2.8879638 = product of:
        5.7759275 = sum of:
          5.7759275 = weight(author_txt:heath in 3664) [ClassicSimilarity], result of:
            5.7759275 = score(doc=3664,freq=1.0), product of:
              0.933 = queryWeight, product of:
                1.6101422 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.0585002 = queryNorm
              6.190705 = fieldWeight in 3664, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.625 = fieldNorm(doc=3664)
        0.5 = coord(1/2)
  2. Bizer, C.; Heath, T.: Linked Data : evolving the web into a global data space (2011) 2.31
    2.3103712 = sum of:
      2.3103712 = product of:
        4.6207423 = sum of:
          4.6207423 = weight(author_txt:heath in 4725) [ClassicSimilarity], result of:
            4.6207423 = score(doc=4725,freq=1.0), product of:
              0.933 = queryWeight, product of:
                1.6101422 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.0585002 = queryNorm
              4.952564 = fieldWeight in 4725, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.5 = fieldNorm(doc=4725)
        0.5 = coord(1/2)
  3. Vikor, D.L.; Gaumond, G.; Heath, F.M.: Building electronic cooperation in the 1990s : the Maryland, Georgia, and Texas experiences (1997) 1.73
    1.7327782 = sum of:
      1.7327782 = product of:
        3.4655564 = sum of:
          3.4655564 = weight(author_txt:heath in 1680) [ClassicSimilarity], result of:
            3.4655564 = score(doc=1680,freq=1.0), product of:
              0.933 = queryWeight, product of:
                1.6101422 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.0585002 = queryNorm
              3.7144227 = fieldWeight in 1680, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.375 = fieldNorm(doc=1680)
        0.5 = coord(1/2)
  4. Bizer, C.; Cyganiak, R.; Heath, T.: How to publish Linked Data on the Web (2007) 1.73
    1.7327782 = sum of:
      1.7327782 = product of:
        3.4655564 = sum of:
          3.4655564 = weight(author_txt:heath in 3791) [ClassicSimilarity], result of:
            3.4655564 = score(doc=3791,freq=1.0), product of:
              0.933 = queryWeight, product of:
                1.6101422 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.0585002 = queryNorm
              3.7144227 = fieldWeight in 3791, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.375 = fieldNorm(doc=3791)
        0.5 = coord(1/2)
  5. Chen, Y.N.; Chen, S.J.: ¬A metadata practice of the IFLA FRBR model : a case study for the National Palace Museum in Taipei (2004) 0.78
    0.7827156 = sum of:
      0.7827156 = product of:
        1.5654312 = sum of:
          1.5654312 = weight(author_txt:chen in 3384) [ClassicSimilarity], result of:
            1.5654312 = score(doc=3384,freq=2.0), product of:
              0.35987625 = queryWeight, product of:
                6.1517096 = idf(docFreq=255, maxDocs=44218)
                0.0585002 = queryNorm
              4.3499155 = fieldWeight in 3384, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1517096 = idf(docFreq=255, maxDocs=44218)
                0.5 = fieldNorm(doc=3384)
        0.5 = coord(1/2)
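
The score trees above repeat one calculation per matching term: queryWeight (boost x idf x queryNorm) times fieldWeight (tf x idf x fieldNorm), scaled by coord. The Python sketch below reproduces that arithmetic for entries 1 and 5. It assumes tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which agree with the tf and idf values listed; the function names are illustrative and are not Lucene API calls.

```python
import math


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency; assumed form 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq: float) -> float:
    """Term frequency factor; assumed form sqrt(freq)."""
    return math.sqrt(freq)


def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    """Score of one term in one document: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = tf(freq) * term_idf * field_norm
    return query_weight * field_weight


MAX_DOCS = 44218
QUERY_NORM = 0.0585002
COORD = 1 / 2  # one of the two author query terms matched

# Entry 1: author_txt:heath in doc 3664 (listed total: 2.8879638)
heath_3664 = COORD * term_score(1.0, 5, MAX_DOCS, QUERY_NORM, 0.625, boost=1.6101422)

# Entry 5: author_txt:chen in doc 3384, freq=2 and no boost line (listed total: 0.7827156)
chen_3384 = COORD * term_score(2.0, 255, MAX_DOCS, QUERY_NORM, 0.5)
```

The results should agree with the listed totals to about three decimal places; the last digits can differ because the listing appears to be computed in single precision. Entries 1 to 4 share the same queryWeight (0.933), so their ranking is decided by fieldNorm alone (0.625, 0.5, 0.375, 0.375).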

Similar documents (content)

  1. Wartik, S.: Boolean operators (1992) 0.18
    0.18241926 = sum of:
      0.18241926 = product of:
        1.1401204 = sum of:
          0.008164233 = weight(abstract_txt:information in 3509) [ClassicSimilarity], result of:
            0.008164233 = score(doc=3509,freq=1.0), product of:
              0.026978647 = queryWeight, product of:
                1.0005207 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.011138044 = queryNorm
              0.3026183 = fieldWeight in 3509, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.125 = fieldNorm(doc=3509)
          0.040882397 = weight(abstract_txt:implementation in 3509) [ClassicSimilarity], result of:
            0.040882397 = score(doc=3509,freq=1.0), product of:
              0.06267449 = queryWeight, product of:
                1.0783168 = boost
                5.2183776 = idf(docFreq=650, maxDocs=44218)
                0.011138044 = queryNorm
              0.6522972 = fieldWeight in 3509, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2183776 = idf(docFreq=650, maxDocs=44218)
                0.125 = fieldNorm(doc=3509)
          0.024147741 = weight(abstract_txt:retrieval in 3509) [ClassicSimilarity], result of:
            0.024147741 = score(doc=3509,freq=1.0), product of:
              0.055589695 = queryWeight, product of:
                1.4361941 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.011138044 = queryNorm
              0.43439242 = fieldWeight in 3509, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.125 = fieldNorm(doc=3509)
          1.066926 = weight(abstract_txt:hashing in 3509) [ClassicSimilarity], result of:
            1.066926 = score(doc=3509,freq=1.0), product of:
              0.8753387 = queryWeight, product of:
                8.059703 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.011138044 = queryNorm
              1.2188722 = fieldWeight in 3509, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.125 = fieldNorm(doc=3509)
        0.16 = coord(4/25)
  2. Nelson, M.J.: ¬A prefix trie index for inverted files (1997) 0.17
    0.17064318 = sum of:
      0.17064318 = product of:
        1.0665199 = sum of:
          0.005102645 = weight(abstract_txt:information in 495) [ClassicSimilarity], result of:
            0.005102645 = score(doc=495,freq=1.0), product of:
              0.026978647 = queryWeight, product of:
                1.0005207 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.011138044 = queryNorm
              0.18913643 = fieldWeight in 495, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.078125 = fieldNorm(doc=495)
          0.021343788 = weight(abstract_txt:retrieval in 495) [ClassicSimilarity], result of:
            0.021343788 = score(doc=495,freq=2.0), product of:
              0.055589695 = queryWeight, product of:
                1.4361941 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.011138044 = queryNorm
              0.38395226 = fieldWeight in 495, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=495)
          0.09703525 = weight(abstract_txt:keys in 495) [ClassicSimilarity], result of:
            0.09703525 = score(doc=495,freq=1.0), product of:
              0.15255728 = queryWeight, product of:
                1.6823542 = boost
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.011138044 = queryNorm
              0.6360578 = fieldWeight in 495, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.078125 = fieldNorm(doc=495)
          0.9430382 = weight(abstract_txt:hashing in 495) [ClassicSimilarity], result of:
            0.9430382 = score(doc=495,freq=2.0), product of:
              0.8753387 = queryWeight, product of:
                8.059703 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.011138044 = queryNorm
              1.077341 = fieldWeight in 495, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.078125 = fieldNorm(doc=495)
        0.16 = coord(4/25)
  3. Hoad, T.C.; Zobel, J.: Methods for identifying versioned and plagiarized documents (2003) 0.17
    0.16509232 = sum of:
      0.16509232 = product of:
        0.82546157 = sum of:
          0.0040821163 = weight(abstract_txt:information in 5159) [ClassicSimilarity], result of:
            0.0040821163 = score(doc=5159,freq=1.0), product of:
              0.026978647 = queryWeight, product of:
                1.0005207 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.011138044 = queryNorm
              0.15130915 = fieldWeight in 5159, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=5159)
          0.019343078 = weight(abstract_txt:strategies in 5159) [ClassicSimilarity], result of:
            0.019343078 = score(doc=5159,freq=1.0), product of:
              0.060409278 = queryWeight, product of:
                1.058651 = boost
                5.123207 = idf(docFreq=715, maxDocs=44218)
                0.011138044 = queryNorm
              0.32020044 = fieldWeight in 5159, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.123207 = idf(docFreq=715, maxDocs=44218)
                0.0625 = fieldNorm(doc=5159)
          0.035531912 = weight(abstract_txt:technique in 5159) [ClassicSimilarity], result of:
            0.035531912 = score(doc=5159,freq=2.0), product of:
              0.07191547 = queryWeight, product of:
                1.1550802 = boost
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.011138044 = queryNorm
              0.49407884 = fieldWeight in 5159, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5898643 = idf(docFreq=448, maxDocs=44218)
                0.0625 = fieldNorm(doc=5159)
          0.012073871 = weight(abstract_txt:retrieval in 5159) [ClassicSimilarity], result of:
            0.012073871 = score(doc=5159,freq=1.0), product of:
              0.055589695 = queryWeight, product of:
                1.4361941 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.011138044 = queryNorm
              0.21719621 = fieldWeight in 5159, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=5159)
          0.7544306 = weight(abstract_txt:hashing in 5159) [ClassicSimilarity], result of:
            0.7544306 = score(doc=5159,freq=2.0), product of:
              0.8753387 = queryWeight, product of:
                8.059703 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.011138044 = queryNorm
              0.8618728 = fieldWeight in 5159, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.0625 = fieldNorm(doc=5159)
        0.2 = coord(5/25)
  4. Ford, D.A.; Christodoulakis, S.: File organizations for optical disks (1992) 0.15
    0.14648984 = sum of:
      0.14648984 = product of:
        0.91556156 = sum of:
          0.02892007 = weight(abstract_txt:structures in 3501) [ClassicSimilarity], result of:
            0.02892007 = score(doc=3501,freq=1.0), product of:
              0.060277972 = queryWeight, product of:
                1.0574998 = boost
                5.117636 = idf(docFreq=719, maxDocs=44218)
                0.011138044 = queryNorm
              0.4797784 = fieldWeight in 3501, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.117636 = idf(docFreq=719, maxDocs=44218)
                0.09375 = fieldNorm(doc=3501)
          0.06083446 = weight(abstract_txt:storage in 3501) [ClassicSimilarity], result of:
            0.06083446 = score(doc=3501,freq=2.0), product of:
              0.07854445 = queryWeight, product of:
                1.2071431 = boost
                5.8418155 = idf(docFreq=348, maxDocs=44218)
                0.011138044 = queryNorm
              0.77452266 = fieldWeight in 3501, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8418155 = idf(docFreq=348, maxDocs=44218)
                0.09375 = fieldNorm(doc=3501)
          0.025612546 = weight(abstract_txt:retrieval in 3501) [ClassicSimilarity], result of:
            0.025612546 = score(doc=3501,freq=2.0), product of:
              0.055589695 = queryWeight, product of:
                1.4361941 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.011138044 = queryNorm
              0.4607427 = fieldWeight in 3501, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=3501)
          0.8001945 = weight(abstract_txt:hashing in 3501) [ClassicSimilarity], result of:
            0.8001945 = score(doc=3501,freq=1.0), product of:
              0.8753387 = queryWeight, product of:
                8.059703 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.011138044 = queryNorm
              0.9141542 = fieldWeight in 3501, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.09375 = fieldNorm(doc=3501)
        0.16 = coord(4/25)
  5. Lam, W.; Wong, K.-F.; Wong, C.-Y.: Chinese document indexing based on new partitioned signature file : model and evaluation (2001) 0.13
    0.12929647 = sum of:
      0.12929647 = product of:
        0.8081029 = sum of:
          0.0040821163 = weight(abstract_txt:information in 303) [ClassicSimilarity], result of:
            0.0040821163 = score(doc=303,freq=1.0), product of:
              0.026978647 = queryWeight, product of:
                1.0005207 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.011138044 = queryNorm
              0.15130915 = fieldWeight in 303, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=303)
          0.028677637 = weight(abstract_txt:storage in 303) [ClassicSimilarity], result of:
            0.028677637 = score(doc=303,freq=1.0), product of:
              0.07854445 = queryWeight, product of:
                1.2071431 = boost
                5.8418155 = idf(docFreq=348, maxDocs=44218)
                0.011138044 = queryNorm
              0.36511347 = fieldWeight in 303, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8418155 = idf(docFreq=348, maxDocs=44218)
                0.0625 = fieldNorm(doc=303)
          0.020912558 = weight(abstract_txt:retrieval in 303) [ClassicSimilarity], result of:
            0.020912558 = score(doc=303,freq=3.0), product of:
              0.055589695 = queryWeight, product of:
                1.4361941 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.011138044 = queryNorm
              0.37619486 = fieldWeight in 303, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=303)
          0.7544306 = weight(abstract_txt:hashing in 303) [ClassicSimilarity], result of:
            0.7544306 = score(doc=303,freq=2.0), product of:
              0.8753387 = queryWeight, product of:
                8.059703 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.011138044 = queryNorm
              0.8618728 = fieldWeight in 303, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.0625 = fieldNorm(doc=303)
        0.16 = coord(4/25)
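
For the content matches, each score is the sum of the per-term weights multiplied by coord, the fraction of the 25 query terms found in the document. The Python sketch below recomputes entry 1 (doc 3509) from the boosts, document frequencies and norms listed above. It assumes the same tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)) forms that match the listed values; the helper names are illustrative.

```python
import math

MAX_DOCS = 44218
QUERY_NORM = 0.011138044
QUERY_TERMS = 25      # denominator of coord(4/25)
FIELD_NORM = 0.125    # fieldNorm(doc=3509)


def term_score(freq: float, doc_freq: int, boost: float, field_norm: float) -> float:
    """queryWeight * fieldWeight for one term, under the assumed tf/idf forms."""
    idf = 1.0 + math.log(MAX_DOCS / (doc_freq + 1))
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * field_weight


# (term, boost, docFreq, freq in doc 3509), copied from the listing above
matches = [
    ("information",    1.0005207, 10677, 1.0),
    ("implementation", 1.0783168,   650, 1.0),
    ("retrieval",      1.4361941,  3720, 1.0),
    ("hashing",        8.059703,      6, 1.0),
]

raw_sum = sum(term_score(freq, df, boost, FIELD_NORM) for _, boost, df, freq in matches)
coord = len(matches) / QUERY_TERMS   # 4/25 = 0.16
score_3509 = raw_sum * coord         # listed total: 0.18241926

# Share of the raw sum contributed by the rare, heavily boosted term "hashing"
hashing_share = term_score(1.0, 6, 8.059703, FIELD_NORM) / raw_sum
```

In this entry "hashing" (docFreq=6, boost 8.059703) accounts for well over 90% of the raw sum, which is why all five content matches are documents containing that term.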