Document (#1500)

Author
Ballard, T.
Lifshin, A.
Title
Prediction of OPAC spelling errors through a keyword inventory
Source
Information technology and libraries. 11(1992), p.139-145
Year
1992
Abstract
In order to find and correct spelling errors in the online public access catalog at Adelphi University, a visual inspection was performed of the 117,000 keywords indexed in the system. More than 1,000 errors were found. Certain long but common words such as administration, education, and commercial were found to generate many different misspellings. Most of the records were derived from bibliographic utilities, so the findings can be generalized to other OPACs. The same misspellings were also found in substantial numbers in CD-ROM databases. Misspellings were analyzed by the machine-readable catalog (MARC) field in which they were found, part of speech, and type of mistake. Lists of commonly misspelled root words and specific mistakes are included.
Theme
OPAC

Similar documents (author)

  1. Ballard, P.I.: Bound withs versus an online catalog : a practical solution (1992) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:ballard in 2968) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 2968, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=2968)
    
  2. Ballard, T.: OCLC's EPIC : report from the field (1991) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:ballard in 4859) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 4859, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=4859)
    
  3. Ballard, T.: Using FirstSearch in a bibliographic construction (1993) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:ballard in 7309) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 7309, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=7309)
    
  4. Ballard, T.: Comparative searching styles of patrons and staff (1994) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:ballard in 8501) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 8501, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=8501)
    
  5. Ballard, T.: Library systems : transaction log fever; analyzing patron searches can reveal solutions to increase search success (1996) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:ballard in 5761) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 5761, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=5761)
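
The author scores above all come out to 5.62 because each one is a single fieldWeight: tf times idf times fieldNorm. The sketch below recomputes that value in Python. It assumes tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which are the formulas that appear to reproduce the numbers shown here; the function names are illustrative and are not part of any library.

```python
import math


def classic_tf(freq: float) -> float:
    # Assumed term-frequency factor: square root of the raw frequency.
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    # Assumed inverse document frequency: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    # fieldWeight = tf * idf * fieldNorm, as in the explain tree.
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the explain output for author_txt:ballard.
author_idf = classic_idf(doc_freq=14, max_docs=44218)
author_score = field_weight(freq=1.0, doc_freq=14, max_docs=44218, field_norm=0.625)

print(round(author_idf, 4))
print(round(author_score, 4))
```

The search engine works in single precision, so the last digits of a double-precision recomputation may differ slightly from the listed values.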
    

Similar documents (content)

  1. Drabenstott, K.M.; Weller, M.S.: Handling spelling errors in online catalog searches (1996) 0.41
    0.41302794 = sum of:
      0.41302794 = product of:
        1.4750998 = sum of:
          0.18515486 = weight(abstract_txt:misspelled in 5973) [ClassicSimilarity], result of:
            0.18515486 = score(doc=5973,freq=2.0), product of:
              0.21781127 = queryWeight, product of:
                1.5760683 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.014369629 = queryNorm
              0.8500701 = fieldWeight in 5973, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.0625 = fieldNorm(doc=5973)
          0.0638527 = weight(abstract_txt:words in 5973) [ClassicSimilarity], result of:
            0.0638527 = score(doc=5973,freq=2.0), product of:
              0.13495421 = queryWeight, product of:
                1.7544584 = boost
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.014369629 = queryNorm
              0.47314343 = fieldWeight in 5973, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.353007 = idf(docFreq=568, maxDocs=44218)
                0.0625 = fieldNorm(doc=5973)
          0.2951164 = weight(abstract_txt:spelling in 5973) [ClassicSimilarity], result of:
            0.2951164 = score(doc=5973,freq=5.0), product of:
              0.27589837 = queryWeight, product of:
                2.5085597 = boost
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.014369629 = queryNorm
              1.0696561 = fieldWeight in 5973, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.0625 = fieldNorm(doc=5973)
          0.05283644 = weight(abstract_txt:found in 5973) [ClassicSimilarity], result of:
            0.05283644 = score(doc=5973,freq=1.0), product of:
              0.18881768 = queryWeight, product of:
                2.934852 = boost
                4.4772453 = idf(docFreq=1365, maxDocs=44218)
                0.014369629 = queryNorm
              0.27982783 = fieldWeight in 5973, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4772453 = idf(docFreq=1365, maxDocs=44218)
                0.0625 = fieldNorm(doc=5973)
          0.24808197 = weight(abstract_txt:errors in 5973) [ClassicSimilarity], result of:
            0.24808197 = score(doc=5973,freq=4.0), product of:
              0.30302897 = queryWeight, product of:
                3.2198644 = boost
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.014369629 = queryNorm
              0.8186741 = fieldWeight in 5973, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.0625 = fieldNorm(doc=5973)
          0.061734498 = weight(abstract_txt:were in 5973) [ClassicSimilarity], result of:
            0.061734498 = score(doc=5973,freq=2.0), product of:
              0.19030899 = queryWeight, product of:
                3.6086116 = boost
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.014369629 = queryNorm
              0.32439086 = fieldWeight in 5973, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.0625 = fieldNorm(doc=5973)
          0.56832296 = weight(abstract_txt:misspellings in 5973) [ClassicSimilarity], result of:
            0.56832296 = score(doc=5973,freq=3.0), product of:
              0.5796027 = queryWeight, product of:
                4.453082 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.014369629 = queryNorm
              0.98053885 = fieldWeight in 5973, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.0625 = fieldNorm(doc=5973)
        0.28 = coord(7/25)
    
  2. Randall, N.B.: Spelling errors in the database : shadow or substance? (1999) 0.25
    0.25469157 = sum of:
      0.25469157 = product of:
        1.2734579 = sum of:
          0.055491935 = weight(abstract_txt:correct in 106) [ClassicSimilarity], result of:
            0.055491935 = score(doc=106,freq=1.0), product of:
              0.10591241 = queryWeight, product of:
                1.0990268 = boost
                6.7064548 = idf(docFreq=146, maxDocs=44218)
                0.014369629 = queryNorm
              0.52394176 = fieldWeight in 106, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7064548 = idf(docFreq=146, maxDocs=44218)
                0.078125 = fieldNorm(doc=106)
          0.23330995 = weight(abstract_txt:spelling in 106) [ClassicSimilarity], result of:
            0.23330995 = score(doc=106,freq=2.0), product of:
              0.27589837 = queryWeight, product of:
                2.5085597 = boost
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.014369629 = queryNorm
              0.8456373 = fieldWeight in 106, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.078125 = fieldNorm(doc=106)
          0.31010246 = weight(abstract_txt:errors in 106) [ClassicSimilarity], result of:
            0.31010246 = score(doc=106,freq=4.0), product of:
              0.30302897 = queryWeight, product of:
                3.2198644 = boost
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.014369629 = queryNorm
              1.0233426 = fieldWeight in 106, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.078125 = fieldNorm(doc=106)
          0.09451126 = weight(abstract_txt:were in 106) [ClassicSimilarity], result of:
            0.09451126 = score(doc=106,freq=3.0), product of:
              0.19030899 = queryWeight, product of:
                3.6086116 = boost
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.014369629 = queryNorm
              0.49662006 = fieldWeight in 106, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.078125 = fieldNorm(doc=106)
          0.58004224 = weight(abstract_txt:misspellings in 106) [ClassicSimilarity], result of:
            0.58004224 = score(doc=106,freq=2.0), product of:
              0.5796027 = queryWeight, product of:
                4.453082 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.014369629 = queryNorm
              1.0007583 = fieldWeight in 106, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.078125 = fieldNorm(doc=106)
        0.2 = coord(5/25)
    
  3. Tüür-Fröhlich, T.: The non-trivial effects of trivial errors in scientific communication and evaluation (2016) 0.25
    0.24563923 = sum of:
      0.24563923 = product of:
        0.877283 = sum of:
          0.041382626 = weight(abstract_txt:indexed in 3137) [ClassicSimilarity], result of:
            0.041382626 = score(doc=3137,freq=2.0), product of:
              0.087686 = queryWeight, product of:
                6.1021757 = idf(docFreq=268, maxDocs=44218)
                0.014369629 = queryNorm
              0.47194108 = fieldWeight in 3137, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1021757 = idf(docFreq=268, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3137)
          0.038844354 = weight(abstract_txt:correct in 3137) [ClassicSimilarity], result of:
            0.038844354 = score(doc=3137,freq=1.0), product of:
              0.10591241 = queryWeight, product of:
                1.0990268 = boost
                6.7064548 = idf(docFreq=146, maxDocs=44218)
                0.014369629 = queryNorm
              0.36675924 = fieldWeight in 3137, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7064548 = idf(docFreq=146, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3137)
          0.1104011 = weight(abstract_txt:mistake in 3137) [ClassicSimilarity], result of:
            0.1104011 = score(doc=3137,freq=1.0), product of:
              0.21250893 = queryWeight, product of:
                1.5567664 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.014369629 = queryNorm
              0.5195128 = fieldWeight in 3137, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3137)
          0.04623189 = weight(abstract_txt:found in 3137) [ClassicSimilarity], result of:
            0.04623189 = score(doc=3137,freq=1.0), product of:
              0.18881768 = queryWeight, product of:
                2.934852 = boost
                4.4772453 = idf(docFreq=1365, maxDocs=44218)
                0.014369629 = queryNorm
              0.24484935 = fieldWeight in 3137, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4772453 = idf(docFreq=1365, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3137)
          0.28715888 = weight(abstract_txt:errors in 3137) [ClassicSimilarity], result of:
            0.28715888 = score(doc=3137,freq=7.0), product of:
              0.30302897 = queryWeight, product of:
                3.2198644 = boost
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.014369629 = queryNorm
              0.9476285 = fieldWeight in 3137, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3137)
          0.066157885 = weight(abstract_txt:were in 3137) [ClassicSimilarity], result of:
            0.066157885 = score(doc=3137,freq=3.0), product of:
              0.19030899 = queryWeight, product of:
                3.6086116 = boost
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.014369629 = queryNorm
              0.34763405 = fieldWeight in 3137, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3137)
          0.28710625 = weight(abstract_txt:misspellings in 3137) [ClassicSimilarity], result of:
            0.28710625 = score(doc=3137,freq=1.0), product of:
              0.5796027 = queryWeight, product of:
                4.453082 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.014369629 = queryNorm
              0.49535006 = fieldWeight in 3137, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3137)
        0.28 = coord(7/25)
    
  4. Ballard, T.: Spelling and typographical errors in library databases (1992) 0.21
    0.21477032 = sum of:
      0.21477032 = product of:
        1.3423145 = sum of:
          0.30505243 = weight(abstract_txt:adelphi in 5971) [ClassicSimilarity], result of:
            0.30505243 = score(doc=5971,freq=1.0), product of:
              0.20782124 = queryWeight, product of:
                1.5395005 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.014369629 = queryNorm
              1.4678597 = fieldWeight in 5971, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.15625 = fieldNorm(doc=5971)
          0.4666199 = weight(abstract_txt:spelling in 5971) [ClassicSimilarity], result of:
            0.4666199 = score(doc=5971,freq=2.0), product of:
              0.27589837 = queryWeight, product of:
                2.5085597 = boost
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.014369629 = queryNorm
              1.6912746 = fieldWeight in 5971, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.15625 = fieldNorm(doc=5971)
          0.1320911 = weight(abstract_txt:found in 5971) [ClassicSimilarity], result of:
            0.1320911 = score(doc=5971,freq=1.0), product of:
              0.18881768 = queryWeight, product of:
                2.934852 = boost
                4.4772453 = idf(docFreq=1365, maxDocs=44218)
                0.014369629 = queryNorm
              0.6995696 = fieldWeight in 5971, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4772453 = idf(docFreq=1365, maxDocs=44218)
                0.15625 = fieldNorm(doc=5971)
          0.43855107 = weight(abstract_txt:errors in 5971) [ClassicSimilarity], result of:
            0.43855107 = score(doc=5971,freq=2.0), product of:
              0.30302897 = queryWeight, product of:
                3.2198644 = boost
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.014369629 = queryNorm
              1.4472249 = fieldWeight in 5971, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.15625 = fieldNorm(doc=5971)
        0.16 = coord(4/25)
    
  5. Berget, G.; Sandnes, F.E.: Do autocomplete functions reduce the impact of dyslexia on information-searching behavior? : the case of Google (2016) 0.16
    0.16367511 = sum of:
      0.16367511 = product of:
        1.0229695 = sum of:
          0.23330995 = weight(abstract_txt:spelling in 3112) [ClassicSimilarity], result of:
            0.23330995 = score(doc=3112,freq=2.0), product of:
              0.27589837 = queryWeight, product of:
                2.5085597 = boost
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.014369629 = queryNorm
              0.8456373 = fieldWeight in 3112, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.078125 = fieldNorm(doc=3112)
          0.15505123 = weight(abstract_txt:errors in 3112) [ClassicSimilarity], result of:
            0.15505123 = score(doc=3112,freq=1.0), product of:
              0.30302897 = queryWeight, product of:
                3.2198644 = boost
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.014369629 = queryNorm
              0.5116713 = fieldWeight in 3112, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5493927 = idf(docFreq=171, maxDocs=44218)
                0.078125 = fieldNorm(doc=3112)
          0.054566104 = weight(abstract_txt:were in 3112) [ClassicSimilarity], result of:
            0.054566104 = score(doc=3112,freq=1.0), product of:
              0.19030899 = queryWeight, product of:
                3.6086116 = boost
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.014369629 = queryNorm
              0.28672373 = fieldWeight in 3112, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.078125 = fieldNorm(doc=3112)
          0.58004224 = weight(abstract_txt:misspellings in 3112) [ClassicSimilarity], result of:
            0.58004224 = score(doc=3112,freq=2.0), product of:
              0.5796027 = queryWeight, product of:
                4.453082 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.014369629 = queryNorm
              1.0007583 = fieldWeight in 3112, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.078125 = fieldNorm(doc=3112)
        0.16 = coord(4/25)
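
The content scores combine several matching terms. Each term contributes queryWeight times fieldWeight, the contributions are summed, and the sum is scaled by the coord factor (matched query terms over total query terms). The sketch below recomputes the score of the last entry (doc=3112) from the boost, docFreq, freq, fieldNorm, queryNorm and coord values printed above. The tf and idf formulas are the same assumptions that reproduce the listed numbers, and the helper names are hypothetical.

```python
import math

MAX_DOCS = 44218
QUERY_NORM = 0.014369629  # copied from the explain output


def classic_idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    # Assumed idf: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost: float, doc_freq: int, freq: float, field_norm: float) -> float:
    idf = classic_idf(doc_freq)
    query_weight = boost * idf * QUERY_NORM          # queryWeight in the tree
    field_weight = math.sqrt(freq) * idf * field_norm  # fieldWeight in the tree
    return query_weight * field_weight


# (term, boost, docFreq, termFreq) for doc=3112, copied from the output.
terms = [
    ("spelling", 2.5085597, 56, 2.0),
    ("errors", 3.2198644, 171, 1.0),
    ("were", 3.6086116, 3061, 1.0),
    ("misspellings", 4.453082, 13, 2.0),
]
FIELD_NORM = 0.078125
COORD = 4 / 25  # 4 of the 25 query terms matched

term_sum = sum(term_score(b, df, f, FIELD_NORM) for _, b, df, f in terms)
doc_score = COORD * term_sum

print(round(term_sum, 4))
print(round(doc_score, 4))
```

Because idf enters both queryWeight and fieldWeight, rare terms such as "misspellings" (docFreq=13) dominate the sum, which is consistent with the per-term weights shown in the tree.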