Document (#29925)

Author
Daizadeh, I.
Title
An example of information management in biology : qualitative data economizing theory applied to the Human Genome Project databases
Source
Journal of the American Society for Information Science and Technology. 57(2006) no.2, p.244-250
Year
2006
Abstract
Ironically, although much work has been done on elucidating algorithms for enabling scientists to efficiently retrieve relevant information from the glut of data derived from the efforts of the Human Genome Project and other similar projects, little has been done on optimizing the levels of data economy across databases. One technique to qualify the degree of data economization is that constructed by Boisot. Boisot's Information Space (I-Space) takes into account the degree to which data are written (codification), the degree to which the data can be understood (abstraction), and the degree to which the data are effectively communicated to an audience (diffusion). A data system is said to be more data economical if it is relatively high in these dimensions. Application of the approach to entries in two popular, publicly available biological data repositories, the Protein DataBank (PDB) and GenBank, leads to the recommendation that PDB increase its level of abstraction by establishing a larger set of detailed keywords, its diffusion by constructing hyperlinks to other databases, and its codification by constructing additional subsections. With these recommendations in place, PDB would achieve the greater data economies currently enjoyed by GenBank. A discussion of the limitations of the approach is presented.
Field
Molecular biology

Similar documents (content)

  1. Rapp, B.A.; Wheeler, D.L.: Bioinformatics resources from the National Center for Biotechnology Information : an integrated foundation for discovery (2005) 0.24
    0.24333806 = sum of:
      0.24333806 = product of:
        0.8690645 = sum of:
          0.010314618 = weight(abstract_txt:which in 5265) [ClassicSimilarity], result of:
            0.010314618 = score(doc=5265,freq=1.0), product of:
              0.05658212 = queryWeight, product of:
                1.1681412 = boost
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.016606951 = queryNorm
              0.18229467 = fieldWeight in 5265, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.0625 = fieldNorm(doc=5265)
          0.21664026 = weight(abstract_txt:protein in 5265) [ClassicSimilarity], result of:
            0.21664026 = score(doc=5265,freq=4.0), product of:
              0.18813783 = queryWeight, product of:
                1.2297963 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.016606951 = queryNorm
              1.1514976 = fieldWeight in 5265, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.0625 = fieldNorm(doc=5265)
          0.04345459 = weight(abstract_txt:space in 5265) [ClassicSimilarity], result of:
            0.04345459 = score(doc=5265,freq=1.0), product of:
              0.12893489 = queryWeight, product of:
                1.4397774 = boost
                5.3924384 = idf(docFreq=546, maxDocs=44218)
                0.016606951 = queryNorm
              0.3370274 = fieldWeight in 5265, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3924384 = idf(docFreq=546, maxDocs=44218)
                0.0625 = fieldNorm(doc=5265)
          0.03794059 = weight(abstract_txt:through in 5265) [ClassicSimilarity], result of:
            0.03794059 = score(doc=5265,freq=2.0), product of:
              0.107012995 = queryWeight, product of:
                1.6064751 = boost
                4.011184 = idf(docFreq=2176, maxDocs=44218)
                0.016606951 = queryNorm
              0.35454193 = fieldWeight in 5265, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.011184 = idf(docFreq=2176, maxDocs=44218)
                0.0625 = fieldNorm(doc=5265)
          0.035719067 = weight(abstract_txt:databases in 5265) [ClassicSimilarity], result of:
            0.035719067 = score(doc=5265,freq=1.0), product of:
              0.12951215 = queryWeight, product of:
                1.767303 = boost
                4.4127526 = idf(docFreq=1456, maxDocs=44218)
                0.016606951 = queryNorm
              0.27579704 = fieldWeight in 5265, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4127526 = idf(docFreq=1456, maxDocs=44218)
                0.0625 = fieldNorm(doc=5265)
          0.37523195 = weight(abstract_txt:genome in 5265) [ClassicSimilarity], result of:
            0.37523195 = score(doc=5265,freq=3.0), product of:
              0.37627566 = queryWeight, product of:
                2.4595926 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.016606951 = queryNorm
              0.9972262 = fieldWeight in 5265, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.0625 = fieldNorm(doc=5265)
          0.1497634 = weight(abstract_txt:data in 5265) [ClassicSimilarity], result of:
            0.1497634 = score(doc=5265,freq=7.0), product of:
              0.2714597 = queryWeight, product of:
                4.899414 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.016606951 = queryNorm
              0.55169666 = fieldWeight in 5265, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=5265)
        0.28 = coord(7/25)
    
  2. Shachak, A.: Diffusion pattern of the use of genomic databases and analysis of biological sequences from 1970-2003 : bibliographic record analysis of 12 journals (2006) 0.10
    0.10241776 = sum of:
      0.10241776 = product of:
        0.5120888 = sum of:
          0.014559606 = weight(abstract_txt:approach in 4906) [ClassicSimilarity], result of:
            0.014559606 = score(doc=4906,freq=1.0), product of:
              0.06219848 = queryWeight, product of:
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.016606951 = queryNorm
              0.234083 = fieldWeight in 4906, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.0625 = fieldNorm(doc=4906)
          0.10832013 = weight(abstract_txt:protein in 4906) [ClassicSimilarity], result of:
            0.10832013 = score(doc=4906,freq=1.0), product of:
              0.18813783 = queryWeight, product of:
                1.2297963 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.016606951 = queryNorm
              0.5757488 = fieldWeight in 4906, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.0625 = fieldNorm(doc=4906)
          0.07143813 = weight(abstract_txt:databases in 4906) [ClassicSimilarity], result of:
            0.07143813 = score(doc=4906,freq=4.0), product of:
              0.12951215 = queryWeight, product of:
                1.767303 = boost
                4.4127526 = idf(docFreq=1456, maxDocs=44218)
                0.016606951 = queryNorm
              0.5515941 = fieldWeight in 4906, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4127526 = idf(docFreq=1456, maxDocs=44218)
                0.0625 = fieldNorm(doc=4906)
          0.23771898 = weight(abstract_txt:diffusion in 4906) [ClassicSimilarity], result of:
            0.23771898 = score(doc=4906,freq=5.0), product of:
              0.23409884 = queryWeight, product of:
                1.9400358 = boost
                7.2660704 = idf(docFreq=83, maxDocs=44218)
                0.016606951 = queryNorm
              1.0154642 = fieldWeight in 4906, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.2660704 = idf(docFreq=83, maxDocs=44218)
                0.0625 = fieldNorm(doc=4906)
          0.080051914 = weight(abstract_txt:data in 4906) [ClassicSimilarity], result of:
            0.080051914 = score(doc=4906,freq=2.0), product of:
              0.2714597 = queryWeight, product of:
                4.899414 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.016606951 = queryNorm
              0.29489428 = fieldWeight in 4906, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=4906)
        0.2 = coord(5/25)
    
  3. Ingwersen, P.: Cognitive perspectives of information retrieval interaction : elements of a cognitive IR theory (1996) 0.10
    0.10114886 = sum of:
      0.10114886 = product of:
        0.4214536 = sum of:
          0.014559606 = weight(abstract_txt:approach in 3616) [ClassicSimilarity], result of:
            0.014559606 = score(doc=3616,freq=1.0), product of:
              0.06219848 = queryWeight, product of:
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.016606951 = queryNorm
              0.234083 = fieldWeight in 3616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.0625 = fieldNorm(doc=3616)
          0.07526556 = weight(abstract_txt:space in 3616) [ClassicSimilarity], result of:
            0.07526556 = score(doc=3616,freq=3.0), product of:
              0.12893489 = queryWeight, product of:
                1.4397774 = boost
                5.3924384 = idf(docFreq=546, maxDocs=44218)
                0.016606951 = queryNorm
              0.5837486 = fieldWeight in 3616, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.3924384 = idf(docFreq=546, maxDocs=44218)
                0.0625 = fieldNorm(doc=3616)
          0.026828052 = weight(abstract_txt:through in 3616) [ClassicSimilarity], result of:
            0.026828052 = score(doc=3616,freq=1.0), product of:
              0.107012995 = queryWeight, product of:
                1.6064751 = boost
                4.011184 = idf(docFreq=2176, maxDocs=44218)
                0.016606951 = queryNorm
              0.250699 = fieldWeight in 3616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.011184 = idf(docFreq=2176, maxDocs=44218)
                0.0625 = fieldNorm(doc=3616)
          0.106311165 = weight(abstract_txt:diffusion in 3616) [ClassicSimilarity], result of:
            0.106311165 = score(doc=3616,freq=1.0), product of:
              0.23409884 = queryWeight, product of:
                1.9400358 = boost
                7.2660704 = idf(docFreq=83, maxDocs=44218)
                0.016606951 = queryNorm
              0.4541294 = fieldWeight in 3616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2660704 = idf(docFreq=83, maxDocs=44218)
                0.0625 = fieldNorm(doc=3616)
          0.100446045 = weight(abstract_txt:degree in 3616) [ClassicSimilarity], result of:
            0.100446045 = score(doc=3616,freq=1.0), product of:
              0.28399578 = queryWeight, product of:
                3.0219069 = boost
                5.659016 = idf(docFreq=418, maxDocs=44218)
                0.016606951 = queryNorm
              0.3536885 = fieldWeight in 3616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.659016 = idf(docFreq=418, maxDocs=44218)
                0.0625 = fieldNorm(doc=3616)
          0.098043166 = weight(abstract_txt:data in 3616) [ClassicSimilarity], result of:
            0.098043166 = score(doc=3616,freq=3.0), product of:
              0.2714597 = queryWeight, product of:
                4.899414 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.016606951 = queryNorm
              0.36117023 = fieldWeight in 3616, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=3616)
        0.24 = coord(6/25)
    
  4. Vries, A.P. de: Content independence in multimedia databases (2001) 0.10
    0.09532499 = sum of:
      0.09532499 = product of:
        0.47662497 = sum of:
          0.021880612 = weight(abstract_txt:which in 6534) [ClassicSimilarity], result of:
            0.021880612 = score(doc=6534,freq=2.0), product of:
              0.05658212 = queryWeight, product of:
                1.1681412 = boost
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.016606951 = queryNorm
              0.3867054 = fieldWeight in 6534, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.09375 = fieldNorm(doc=6534)
          0.0535786 = weight(abstract_txt:databases in 6534) [ClassicSimilarity], result of:
            0.0535786 = score(doc=6534,freq=1.0), product of:
              0.12951215 = queryWeight, product of:
                1.767303 = boost
                4.4127526 = idf(docFreq=1456, maxDocs=44218)
                0.016606951 = queryNorm
              0.41369557 = fieldWeight in 6534, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4127526 = idf(docFreq=1456, maxDocs=44218)
                0.09375 = fieldNorm(doc=6534)
          0.14348584 = weight(abstract_txt:constructing in 6534) [ClassicSimilarity], result of:
            0.14348584 = score(doc=6534,freq=1.0), product of:
              0.21818516 = queryWeight, product of:
                1.872935 = boost
                7.014756 = idf(docFreq=107, maxDocs=44218)
                0.016606951 = queryNorm
              0.6576334 = fieldWeight in 6534, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.014756 = idf(docFreq=107, maxDocs=44218)
                0.09375 = fieldNorm(doc=6534)
          0.17277204 = weight(abstract_txt:abstraction in 6534) [ClassicSimilarity], result of:
            0.17277204 = score(doc=6534,freq=1.0), product of:
              0.2469457 = queryWeight, product of:
                1.9925574 = boost
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.016606951 = queryNorm
              0.69963574 = fieldWeight in 6534, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.09375 = fieldNorm(doc=6534)
          0.084907874 = weight(abstract_txt:data in 6534) [ClassicSimilarity], result of:
            0.084907874 = score(doc=6534,freq=1.0), product of:
              0.2714597 = queryWeight, product of:
                4.899414 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.016606951 = queryNorm
              0.31278262 = fieldWeight in 6534, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.09375 = fieldNorm(doc=6534)
        0.2 = coord(5/25)
    
  5. Hagen, L.; Patel, M.; Luna-Reyes, L.: Human-supervised data science framework for city governments : a design science approach (2023) 0.08
    0.08477685 = sum of:
      0.08477685 = product of:
        0.35323688 = sum of:
          0.018199507 = weight(abstract_txt:approach in 1016) [ClassicSimilarity], result of:
            0.018199507 = score(doc=1016,freq=1.0), product of:
              0.06219848 = queryWeight, product of:
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.016606951 = queryNorm
              0.29260373 = fieldWeight in 1016, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.078125 = fieldNorm(doc=1016)
          0.0128932735 = weight(abstract_txt:which in 1016) [ClassicSimilarity], result of:
            0.0128932735 = score(doc=1016,freq=1.0), product of:
              0.05658212 = queryWeight, product of:
                1.1681412 = boost
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.016606951 = queryNorm
              0.22786833 = fieldWeight in 1016, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.078125 = fieldNorm(doc=1016)
          0.029075075 = weight(abstract_txt:project in 1016) [ClassicSimilarity], result of:
            0.029075075 = score(doc=1016,freq=1.0), product of:
              0.08500032 = queryWeight, product of:
                1.1690159 = boost
                4.378348 = idf(docFreq=1507, maxDocs=44218)
                0.016606951 = queryNorm
              0.34205842 = fieldWeight in 1016, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.378348 = idf(docFreq=1507, maxDocs=44218)
                0.078125 = fieldNorm(doc=1016)
          0.035782076 = weight(abstract_txt:human in 1016) [ClassicSimilarity], result of:
            0.035782076 = score(doc=1016,freq=1.0), product of:
              0.09761511 = queryWeight, product of:
                1.2527622 = boost
                4.692005 = idf(docFreq=1101, maxDocs=44218)
                0.016606951 = queryNorm
              0.3665629 = fieldWeight in 1016, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.692005 = idf(docFreq=1101, maxDocs=44218)
                0.078125 = fieldNorm(doc=1016)
          0.033535067 = weight(abstract_txt:through in 1016) [ClassicSimilarity], result of:
            0.033535067 = score(doc=1016,freq=1.0), product of:
              0.107012995 = queryWeight, product of:
                1.6064751 = boost
                4.011184 = idf(docFreq=2176, maxDocs=44218)
                0.016606951 = queryNorm
              0.31337377 = fieldWeight in 1016, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.011184 = idf(docFreq=2176, maxDocs=44218)
                0.078125 = fieldNorm(doc=1016)
          0.22375189 = weight(abstract_txt:data in 1016) [ClassicSimilarity], result of:
            0.22375189 = score(doc=1016,freq=10.0), product of:
              0.2714597 = queryWeight, product of:
                4.899414 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.016606951 = queryNorm
              0.8242545 = fieldWeight in 1016, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.078125 = fieldNorm(doc=1016)
        0.24 = coord(6/25)
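
The score trees above follow Lucene's ClassicSimilarity (TF-IDF) formulas: tf = sqrt(termFreq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = boost * idf * queryNorm, fieldWeight = tf * idf * fieldNorm, and the document score is the sum of the per-term products multiplied by coord. As a minimal sketch, the following Python recomputes the score of the first similar document (doc 5265) from the values listed in its tree. The function and constant names are hypothetical, chosen for illustration only, and the arithmetic uses double precision, so the last digits may differ slightly from the single-precision values shown above.

```python
import math

# Values copied from the explain output for doc 5265.
MAX_DOCS = 44218          # maxDocs
QUERY_NORM = 0.016606951  # queryNorm
FIELD_NORM = 0.0625       # fieldNorm(doc=5265)

# (term, boost, termFreq, docFreq) for the seven matching query terms.
TERMS = [
    ("which", 1.1681412, 1.0, 6503),
    ("protein", 1.2297963, 4.0, 11),
    ("space", 1.4397774, 1.0, 546),
    ("through", 1.6064751, 2.0, 2176),
    ("databases", 1.767303, 1.0, 1456),
    ("genome", 2.4595926, 3.0, 11),
    ("data", 4.899414, 7.0, 4274),
]


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency as used by ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost, freq, doc_freq, field_norm=FIELD_NORM):
    """Score of one term: queryWeight * fieldWeight."""
    tf = math.sqrt(freq)
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = tf * idf(doc_freq) * field_norm
    return query_weight * field_weight


def document_score(terms, matched, total_clauses):
    """Sum of the term scores, scaled by coord = matched / total_clauses."""
    raw_sum = sum(term_score(b, f, df) for _, b, f, df in terms)
    return raw_sum * (matched / total_clauses)


score = document_score(TERMS, matched=7, total_clauses=25)
print(score)  # the record above reports 0.24333806 for this document
```

The coord factor can change the ranking: document 4 has a larger raw sum (0.47662497) than document 3 (0.4214536), but it matches only 5 of the 25 query terms instead of 6, so its final score (0.09532499) falls below that of document 3 (0.10114886).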