Document (#20914)

Author
Becker, S.
Title
A practical perspective on data quality issues
Source
Journal of database management. 9(1998) no.1, pp.35-37
Year
1998
Abstract
Explains why data quality is important. Problems that impact data quality include: data corruption due to incorrect conversion; historical and current data with different meanings; the same data having more than one data definition; missing data; hidden data; missing granularity; and violation of integrity rules. Suggests an improvement strategy to establish organizational commitment to change what has been done in promoting data quality. Misconceptions that impact data quality are: that data quality improves with the introduction of new technology; that old data quality will not have an impact on new database development; and that data quality is a database administration problem.

Similar documents (author)

  1. Becker, J.: Zentrallager : Data Warehouse - zentrale Sammelstelle für Informationen (1997) 4.68
    4.682621 = sum of:
      4.682621 = weight(author_txt:becker in 4480) [ClassicSimilarity], result of:
        4.682621 = score(doc=4480,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.4921947 = idf(docFreq=66, maxDocs=44218)
            0.13347223 = queryNorm
          4.6826215 = fieldWeight in 4480, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.4921947 = idf(docFreq=66, maxDocs=44218)
            0.625 = fieldNorm(doc=4480)
    
  2. Becker, C.A.: Community information service (1974) 4.68
    4.682621 = sum of:
      4.682621 = weight(author_txt:becker in 5737) [ClassicSimilarity], result of:
        4.682621 = score(doc=5737,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.4921947 = idf(docFreq=66, maxDocs=44218)
            0.13347223 = queryNorm
          4.6826215 = fieldWeight in 5737, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.4921947 = idf(docFreq=66, maxDocs=44218)
            0.625 = fieldNorm(doc=5737)
    
  3. Becker, J.: Strategische Ausrichtung der Informations- und Organisationsstruktur des Unternehmens (1994) 4.68
    4.682621 = sum of:
      4.682621 = weight(author_txt:becker in 8383) [ClassicSimilarity], result of:
        4.682621 = score(doc=8383,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.4921947 = idf(docFreq=66, maxDocs=44218)
            0.13347223 = queryNorm
          4.6826215 = fieldWeight in 8383, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.4921947 = idf(docFreq=66, maxDocs=44218)
            0.625 = fieldNorm(doc=8383)
    
  4. Becker, J.: Probleme des grenzüberschreitenden Datenflusses (1988) 4.68
    4.682621 = sum of:
      4.682621 = weight(author_txt:becker in 512) [ClassicSimilarity], result of:
        4.682621 = score(doc=512,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.4921947 = idf(docFreq=66, maxDocs=44218)
            0.13347223 = queryNorm
          4.6826215 = fieldWeight in 512, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.4921947 = idf(docFreq=66, maxDocs=44218)
            0.625 = fieldNorm(doc=512)
    
  5. Becker, J.: Die Postmoderne und ihr Verhältnis zur Informationstheorie (1995) 4.68
    4.682621 = sum of:
      4.682621 = weight(author_txt:becker in 1040) [ClassicSimilarity], result of:
        4.682621 = score(doc=1040,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.4921947 = idf(docFreq=66, maxDocs=44218)
            0.13347223 = queryNorm
          4.6826215 = fieldWeight in 1040, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.4921947 = idf(docFreq=66, maxDocs=44218)
            0.625 = fieldNorm(doc=1040)
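
The author-match breakdowns above follow a TF-IDF pattern that can be checked by hand. The following is a minimal sketch in Python, not the search engine's own code. The function names are hypothetical, the idf formula is inferred from the listed values (it reproduces the idf shown for docFreq=66, maxDocs=44218), and queryNorm is copied from the listing as given, because it depends on the whole query.

```python
import math


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency; formula inferred from the listed values."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw term count."""
    return math.sqrt(freq)


def term_score(freq: float, doc_freq: int, max_docs: int,
               query_norm: float, field_norm: float) -> float:
    """score = queryWeight * fieldWeight for a single matching term."""
    term_idf = idf(doc_freq, max_docs)
    query_weight = term_idf * query_norm              # idf * queryNorm
    field_weight = tf(freq) * term_idf * field_norm   # tf * idf * fieldNorm
    return query_weight * field_weight


# Values copied from the author_txt:becker breakdown.
score = term_score(freq=1.0, doc_freq=66, max_docs=44218,
                   query_norm=0.13347223, field_norm=0.625)
print(f"idf   = {idf(66, 44218):.4f}")
print(f"score = {score:.4f}")
```

Because all five author matches share the same term frequency, idf, and fieldNorm, they all receive the same score of about 4.68.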
    

Similar documents (content)

  1. Beamsley, T.G.: Securing digital image assets in museums and libraries : a risk management approach (1999) 0.14
    0.13975634 = sum of:
      0.13975634 = product of:
        0.58231807 = sum of:
          0.043108527 = weight(abstract_txt:establish in 842) [ClassicSimilarity], result of:
            0.043108527 = score(doc=842,freq=1.0), product of:
              0.10981688 = queryWeight, product of:
                1.1084651 = boost
                6.280787 = idf(docFreq=224, maxDocs=44218)
                0.01577368 = queryNorm
              0.3925492 = fieldWeight in 842, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.280787 = idf(docFreq=224, maxDocs=44218)
                0.0625 = fieldNorm(doc=842)
          0.0114517845 = weight(abstract_txt:have in 842) [ClassicSimilarity], result of:
            0.0114517845 = score(doc=842,freq=1.0), product of:
              0.0571767 = queryWeight, product of:
                1.1311287 = boost
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.01577368 = queryNorm
              0.20028761 = fieldWeight in 842, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.0625 = fieldNorm(doc=842)
          0.13511981 = weight(abstract_txt:integrity in 842) [ClassicSimilarity], result of:
            0.13511981 = score(doc=842,freq=3.0), product of:
              0.16307946 = queryWeight, product of:
                1.3507876 = boost
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.01577368 = queryNorm
              0.828552 = fieldWeight in 842, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.0625 = fieldNorm(doc=842)
          0.16131155 = weight(abstract_txt:corruption in 842) [ClassicSimilarity], result of:
            0.16131155 = score(doc=842,freq=1.0), product of:
              0.26468986 = queryWeight, product of:
                1.7209018 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.01577368 = queryNorm
              0.6094361 = fieldWeight in 842, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.0625 = fieldNorm(doc=842)
          0.05040375 = weight(abstract_txt:impact in 842) [ClassicSimilarity], result of:
            0.05040375 = score(doc=842,freq=1.0), product of:
              0.17578264 = queryWeight, product of:
                2.4290478 = boost
                4.5878253 = idf(docFreq=1222, maxDocs=44218)
                0.01577368 = queryNorm
              0.28673908 = fieldWeight in 842, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5878253 = idf(docFreq=1222, maxDocs=44218)
                0.0625 = fieldNorm(doc=842)
          0.18092267 = weight(abstract_txt:data in 842) [ClassicSimilarity], result of:
            0.18092267 = score(doc=842,freq=4.0), product of:
              0.4338221 = queryWeight, product of:
                8.243418 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.01577368 = queryNorm
              0.41704348 = fieldWeight in 842, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=842)
        0.24 = coord(6/25)
    
  2. Jiang, Z.; Liu, X.; Chen, Y.: Recovering uncaptured citations in a scholarly network : a two-step citation analysis to estimate publication importance (2016) 0.13
    0.13210218 = sum of:
      0.13210218 = product of:
        0.6605109 = sum of:
          0.0114517845 = weight(abstract_txt:have in 3018) [ClassicSimilarity], result of:
            0.0114517845 = score(doc=3018,freq=1.0), product of:
              0.0571767 = queryWeight, product of:
                1.1311287 = boost
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.01577368 = queryNorm
              0.20028761 = fieldWeight in 3018, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.0625 = fieldNorm(doc=3018)
          0.05040375 = weight(abstract_txt:impact in 3018) [ClassicSimilarity], result of:
            0.05040375 = score(doc=3018,freq=1.0), product of:
              0.17578264 = queryWeight, product of:
                2.4290478 = boost
                4.5878253 = idf(docFreq=1222, maxDocs=44218)
                0.01577368 = queryNorm
              0.28673908 = fieldWeight in 3018, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5878253 = idf(docFreq=1222, maxDocs=44218)
                0.0625 = fieldNorm(doc=3018)
          0.22787827 = weight(abstract_txt:missing in 3018) [ClassicSimilarity], result of:
            0.22787827 = score(doc=3018,freq=3.0), product of:
              0.2911154 = queryWeight, product of:
                2.5523195 = boost
                7.230979 = idf(docFreq=86, maxDocs=44218)
                0.01577368 = queryNorm
              0.7827764 = fieldWeight in 3018, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.230979 = idf(docFreq=86, maxDocs=44218)
                0.0625 = fieldNorm(doc=3018)
          0.24284546 = weight(abstract_txt:quality in 3018) [ClassicSimilarity], result of:
            0.24284546 = score(doc=3018,freq=3.0), product of:
              0.48213637 = queryWeight, product of:
                6.5692744 = boost
                4.6528544 = idf(docFreq=1145, maxDocs=44218)
                0.01577368 = queryNorm
              0.50368625 = fieldWeight in 3018, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6528544 = idf(docFreq=1145, maxDocs=44218)
                0.0625 = fieldNorm(doc=3018)
          0.12793165 = weight(abstract_txt:data in 3018) [ClassicSimilarity], result of:
            0.12793165 = score(doc=3018,freq=2.0), product of:
              0.4338221 = queryWeight, product of:
                8.243418 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.01577368 = queryNorm
              0.29489428 = fieldWeight in 3018, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=3018)
        0.2 = coord(5/25)
    
  3. Lunati, G.: On line union catalogue (OLUC) compie 25 anni (1996) 0.13
    0.13108544 = sum of:
      0.13108544 = product of:
        0.65542716 = sum of:
          0.047477562 = weight(abstract_txt:explains in 105) [ClassicSimilarity], result of:
            0.047477562 = score(doc=105,freq=1.0), product of:
              0.08937686 = queryWeight, product of:
                5.666202 = idf(docFreq=415, maxDocs=44218)
                0.01577368 = queryNorm
              0.5312064 = fieldWeight in 105, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.666202 = idf(docFreq=415, maxDocs=44218)
                0.09375 = fieldNorm(doc=105)
          0.060757466 = weight(abstract_txt:improvement in 105) [ClassicSimilarity], result of:
            0.060757466 = score(doc=105,freq=1.0), product of:
              0.105349526 = queryWeight, product of:
                1.0856848 = boost
                6.1517096 = idf(docFreq=255, maxDocs=44218)
                0.01577368 = queryNorm
              0.57672274 = fieldWeight in 105, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1517096 = idf(docFreq=255, maxDocs=44218)
                0.09375 = fieldNorm(doc=105)
          0.057870924 = weight(abstract_txt:database in 105) [ClassicSimilarity], result of:
            0.057870924 = score(doc=105,freq=2.0), product of:
              0.10198581 = queryWeight, product of:
                1.5106795 = boost
                4.2799077 = idf(docFreq=1663, maxDocs=44218)
                0.01577368 = queryNorm
              0.5674409 = fieldWeight in 105, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2799077 = idf(docFreq=1663, maxDocs=44218)
                0.09375 = fieldNorm(doc=105)
          0.29742372 = weight(abstract_txt:quality in 105) [ClassicSimilarity], result of:
            0.29742372 = score(doc=105,freq=2.0), product of:
              0.48213637 = queryWeight, product of:
                6.5692744 = boost
                4.6528544 = idf(docFreq=1145, maxDocs=44218)
                0.01577368 = queryNorm
              0.61688715 = fieldWeight in 105, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6528544 = idf(docFreq=1145, maxDocs=44218)
                0.09375 = fieldNorm(doc=105)
          0.19189748 = weight(abstract_txt:data in 105) [ClassicSimilarity], result of:
            0.19189748 = score(doc=105,freq=2.0), product of:
              0.4338221 = queryWeight, product of:
                8.243418 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.01577368 = queryNorm
              0.44234142 = fieldWeight in 105, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.09375 = fieldNorm(doc=105)
        0.2 = coord(5/25)
    
  4. Baroncini, S.; Sartini, B.; Erp, M. Van; Tomasi, F.; Gangemi, A.: Is dc:subject enough? : A landscape on iconography and iconology statements of knowledge graphs in the semantic web (2023) 0.13
    0.12746204 = sum of:
      0.12746204 = product of:
        0.6373102 = sum of:
          0.047079757 = weight(abstract_txt:meanings in 1030) [ClassicSimilarity], result of:
            0.047079757 = score(doc=1030,freq=1.0), product of:
              0.12730469 = queryWeight, product of:
                1.193465 = boost
                6.7624135 = idf(docFreq=138, maxDocs=44218)
                0.01577368 = queryNorm
              0.3698195 = fieldWeight in 1030, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7624135 = idf(docFreq=138, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1030)
          0.06873469 = weight(abstract_txt:granularity in 1030) [ClassicSimilarity], result of:
            0.06873469 = score(doc=1030,freq=1.0), product of:
              0.16383459 = queryWeight, product of:
                1.3539114 = boost
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.01577368 = queryNorm
              0.4195371 = fieldWeight in 1030, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1030)
          0.11511988 = weight(abstract_txt:missing in 1030) [ClassicSimilarity], result of:
            0.11511988 = score(doc=1030,freq=1.0), product of:
              0.2911154 = queryWeight, product of:
                2.5523195 = boost
                7.230979 = idf(docFreq=86, maxDocs=44218)
                0.01577368 = queryNorm
              0.39544415 = fieldWeight in 1030, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.230979 = idf(docFreq=86, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1030)
          0.21248978 = weight(abstract_txt:quality in 1030) [ClassicSimilarity], result of:
            0.21248978 = score(doc=1030,freq=3.0), product of:
              0.48213637 = queryWeight, product of:
                6.5692744 = boost
                4.6528544 = idf(docFreq=1145, maxDocs=44218)
                0.01577368 = queryNorm
              0.44072548 = fieldWeight in 1030, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6528544 = idf(docFreq=1145, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1030)
          0.19388612 = weight(abstract_txt:data in 1030) [ClassicSimilarity], result of:
            0.19388612 = score(doc=1030,freq=6.0), product of:
              0.4338221 = queryWeight, product of:
                8.243418 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.01577368 = queryNorm
              0.4469254 = fieldWeight in 1030, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1030)
        0.2 = coord(5/25)
    
  5. Barnett, J.: OCLC cataloging peer committees : an overview (1993) 0.11
    0.11496746 = sum of:
      0.11496746 = product of:
        0.7185466 = sum of:
          0.028629461 = weight(abstract_txt:have in 5988) [ClassicSimilarity], result of:
            0.028629461 = score(doc=5988,freq=1.0), product of:
              0.0571767 = queryWeight, product of:
                1.1311287 = boost
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.01577368 = queryNorm
              0.500719 = fieldWeight in 5988, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.15625 = fieldNorm(doc=5988)
          0.06820154 = weight(abstract_txt:database in 5988) [ClassicSimilarity], result of:
            0.06820154 = score(doc=5988,freq=1.0), product of:
              0.10198581 = queryWeight, product of:
                1.5106795 = boost
                4.2799077 = idf(docFreq=1663, maxDocs=44218)
                0.01577368 = queryNorm
              0.66873556 = fieldWeight in 5988, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2799077 = idf(docFreq=1663, maxDocs=44218)
                0.15625 = fieldNorm(doc=5988)
          0.12600937 = weight(abstract_txt:impact in 5988) [ClassicSimilarity], result of:
            0.12600937 = score(doc=5988,freq=1.0), product of:
              0.17578264 = queryWeight, product of:
                2.4290478 = boost
                4.5878253 = idf(docFreq=1222, maxDocs=44218)
                0.01577368 = queryNorm
              0.7168477 = fieldWeight in 5988, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5878253 = idf(docFreq=1222, maxDocs=44218)
                0.15625 = fieldNorm(doc=5988)
          0.49570626 = weight(abstract_txt:quality in 5988) [ClassicSimilarity], result of:
            0.49570626 = score(doc=5988,freq=2.0), product of:
              0.48213637 = queryWeight, product of:
                6.5692744 = boost
                4.6528544 = idf(docFreq=1145, maxDocs=44218)
                0.01577368 = queryNorm
              1.0281453 = fieldWeight in 5988, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6528544 = idf(docFreq=1145, maxDocs=44218)
                0.15625 = fieldNorm(doc=5988)
        0.16 = coord(4/25)
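
In the content-based breakdowns, each term's queryWeight also includes a per-term boost, and the summed term scores are scaled by a coordination factor, coord(matched terms / query terms). A minimal sketch of that arithmetic for entry 5 (doc 5988) follows. The helper name is hypothetical, and every number is copied from the listing rather than recomputed from the index.

```python
import math


def boosted_term_score(boost: float, idf: float, query_norm: float,
                       freq: float, field_norm: float) -> float:
    """queryWeight (boost * idf * queryNorm) times fieldWeight (tf * idf * fieldNorm)."""
    query_weight = boost * idf * query_norm
    field_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * field_weight


QUERY_NORM = 0.01577368   # shared by all terms of this query
FIELD_NORM = 0.15625      # fieldNorm(doc=5988)

# (term, boost, idf, freq) as printed for doc 5988
terms = [
    ("have",     1.1311287, 3.2046018, 1.0),
    ("database", 1.5106795, 4.2799077, 1.0),
    ("impact",   2.4290478, 4.5878253, 1.0),
    ("quality",  6.5692744, 4.6528544, 2.0),
]

term_scores = {
    name: boosted_term_score(boost, idf, QUERY_NORM, freq, FIELD_NORM)
    for name, boost, idf, freq in terms
}

coord = 4 / 25            # 4 of the 25 query terms matched
total = coord * sum(term_scores.values())

for name, value in term_scores.items():
    print(f"{name:<9} {value:.6f}")
print(f"total     {total:.6f}")
```

The coord factor explains why a document that matches few query terms ranks low even when an individual term such as "quality" scores highly.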