Document (#40112)

Carter, D.
Sholler, D.
Data science on the ground : hype, criticism, and everyday work
Journal of the Association for Information Science and Technology. 67(2016) no.10, pp. 2309-2319
Modern organizations often employ data scientists to improve business processes using diverse sets of data. Researchers and practitioners have both touted the benefits and warned of the drawbacks associated with data science and big data approaches, but few studies investigate how data science is carried out "on the ground." In this paper, we first review the hype and criticisms surrounding data science and big data approaches. We then present the findings of semistructured interviews with 18 data analysts from various industries and organizational roles. Using qualitative coding techniques, we evaluated these interviews in light of the hype and criticisms surrounding data science in the popular discourse. We found that although the data analysts we interviewed were sensitive to both the allure and the potential pitfalls of data science, their motivations and evaluations of their work were more nuanced. We conclude by reflecting on the relationship between data analysts' work and the discourses around data science and big data, suggesting how future research can better account for the everyday practices of this profession.
Data Mining

Similar documents (author)

  1. Carter, J.A.: PASSPORT/PRISM: authors and titles and MARC : oh my! (1993) 5.27
    5.274244 = sum of:
      5.274244 = weight(author_txt:carter in 527) [ClassicSimilarity], result of:
        5.274244 = fieldWeight in 527, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.43879 = idf(docFreq=25, maxDocs=44218)
          0.625 = fieldNorm(doc=527)
  2. Carter, R.C.: Education for serials : a presentation at the Phinazee Symposium (1993) 5.27
    5.274244 = sum of:
      5.274244 = weight(author_txt:carter in 4179) [ClassicSimilarity], result of:
        5.274244 = fieldWeight in 4179, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.43879 = idf(docFreq=25, maxDocs=44218)
          0.625 = fieldNorm(doc=4179)
  3. Carter, J.A.: A fever of excitement over keyword searching (1993) 5.27
    5.274244 = sum of:
      5.274244 = weight(author_txt:carter in 6424) [ClassicSimilarity], result of:
        5.274244 = fieldWeight in 6424, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.43879 = idf(docFreq=25, maxDocs=44218)
          0.625 = fieldNorm(doc=6424)
  4. Carter, J.A.: PRISM/PASSPORT : 'questioning authority (files)' (1994) 5.27
    5.274244 = sum of:
      5.274244 = weight(author_txt:carter in 745) [ClassicSimilarity], result of:
        5.274244 = fieldWeight in 745, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.43879 = idf(docFreq=25, maxDocs=44218)
          0.625 = fieldNorm(doc=745)
  5. Carter, J.A.: PASSPORT/PRISM: authors and titles and MARC : part two! (1994) 5.27
    5.274244 = sum of:
      5.274244 = weight(author_txt:carter in 747) [ClassicSimilarity], result of:
        5.274244 = fieldWeight in 747, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.43879 = idf(docFreq=25, maxDocs=44218)
          0.625 = fieldNorm(doc=747)
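
The author-match trees above all reduce to the same product: tf × idf × fieldNorm. As a minimal sketch, the following Python reproduces that product using the formulas Lucene documents for ClassicSimilarity (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))). The function names are hypothetical and chosen only for illustration; the input values are copied from the listing.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as defined by Lucene's ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    """Term frequency factor: square root of the raw in-field frequency."""
    return math.sqrt(freq)


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, matching the explain tree layout."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values taken from the author_txt:carter entries in the listing.
score = field_weight(freq=1.0, doc_freq=25, max_docs=44218, field_norm=0.625)
print(round(score, 4))  # should be close to the listed 5.274244
```

Because every author hit has the same term frequency, document frequency, and field norm, all five entries receive an identical score. Small differences in the last digits are expected, since Lucene computes in single precision.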

Similar documents (content)

  1. Fonseca, F.; Marcinkowski, M.; Davis, C.: Cyber-human systems of thought and understanding (2019) 0.12
    0.11919768 = sum of:
      0.11919768 = product of:
        0.5959884 = sum of:
          0.025712382 = weight(abstract_txt:approaches in 5011) [ClassicSimilarity], result of:
            0.025712382 = score(doc=5011,freq=1.0), product of:
              0.08926983 = queryWeight, product of:
                1.2766663 = boost
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.015172941 = queryNorm
              0.2880299 = fieldWeight in 5011, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.0625 = fieldNorm(doc=5011)
          0.030444235 = weight(abstract_txt:work in 5011) [ClassicSimilarity], result of:
            0.030444235 = score(doc=5011,freq=2.0), product of:
              0.090775155 = queryWeight, product of:
                1.5767186 = boost
                3.7943997 = idf(docFreq=2703, maxDocs=44218)
                0.015172941 = queryNorm
              0.3353807 = fieldWeight in 5011, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7943997 = idf(docFreq=2703, maxDocs=44218)
                0.0625 = fieldNorm(doc=5011)
          0.2223097 = weight(abstract_txt:analysts in 5011) [ClassicSimilarity], result of:
            0.2223097 = score(doc=5011,freq=1.0), product of:
              0.43047294 = queryWeight, product of:
                3.4335518 = boost
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.015172941 = queryNorm
              0.5164313 = fieldWeight in 5011, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.0625 = fieldNorm(doc=5011)
          0.074838065 = weight(abstract_txt:science in 5011) [ClassicSimilarity], result of:
            0.074838065 = score(doc=5011,freq=2.0), product of:
              0.21929947 = queryWeight, product of:
                3.7435 = boost
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.015172941 = queryNorm
              0.3412597 = fieldWeight in 5011, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.0625 = fieldNorm(doc=5011)
          0.242684 = weight(abstract_txt:data in 5011) [ClassicSimilarity], result of:
            0.242684 = score(doc=5011,freq=11.0), product of:
              0.35090816 = queryWeight, product of:
                6.931902 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.015172941 = queryNorm
              0.6915884 = fieldWeight in 5011, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=5011)
        0.2 = coord(5/25)
  2. Kelly, D.; Wacholder, N.; Rittman, R.; Sun, Y.; Kantor, P.; Small, S.; Strzalkowski, T.: Using interview data to identify evaluation criteria for interactive, analytical question-answering systems (2007) 0.11
    0.112022676 = sum of:
      0.112022676 = product of:
        0.5601134 = sum of:
          0.028116489 = weight(abstract_txt:were in 332) [ClassicSimilarity], result of:
            0.028116489 = score(doc=332,freq=3.0), product of:
              0.05661569 = queryWeight, product of:
                1.0167015 = boost
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.015172941 = queryNorm
              0.49662006 = fieldWeight in 332, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.078125 = fieldNorm(doc=332)
          0.057632506 = weight(abstract_txt:interviews in 332) [ClassicSimilarity], result of:
            0.057632506 = score(doc=332,freq=1.0), product of:
              0.13175938 = queryWeight, product of:
                1.5510143 = boost
                5.598813 = idf(docFreq=444, maxDocs=44218)
                0.015172941 = queryNorm
              0.43740726 = fieldWeight in 332, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.598813 = idf(docFreq=444, maxDocs=44218)
                0.078125 = fieldNorm(doc=332)
          0.038055293 = weight(abstract_txt:work in 332) [ClassicSimilarity], result of:
            0.038055293 = score(doc=332,freq=2.0), product of:
              0.090775155 = queryWeight, product of:
                1.5767186 = boost
                3.7943997 = idf(docFreq=2703, maxDocs=44218)
                0.015172941 = queryNorm
              0.41922587 = fieldWeight in 332, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7943997 = idf(docFreq=2703, maxDocs=44218)
                0.078125 = fieldNorm(doc=332)
          0.2778871 = weight(abstract_txt:analysts in 332) [ClassicSimilarity], result of:
            0.2778871 = score(doc=332,freq=1.0), product of:
              0.43047294 = queryWeight, product of:
                3.4335518 = boost
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.015172941 = queryNorm
              0.6455391 = fieldWeight in 332, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.078125 = fieldNorm(doc=332)
          0.15842198 = weight(abstract_txt:data in 332) [ClassicSimilarity], result of:
            0.15842198 = score(doc=332,freq=3.0), product of:
              0.35090816 = queryWeight, product of:
                6.931902 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.015172941 = queryNorm
              0.4514628 = fieldWeight in 332, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.078125 = fieldNorm(doc=332)
        0.2 = coord(5/25)
  3. Nathan, L.P.: Sustainable information practice : an ethnographic investigation (2012) 0.10
    0.102718174 = sum of:
      0.102718174 = product of:
        0.4279924 = sum of:
          0.06059044 = weight(abstract_txt:semistructured in 496) [ClassicSimilarity], result of:
            0.06059044 = score(doc=496,freq=1.0), product of:
              0.12546885 = queryWeight, product of:
                1.0702322 = boost
                7.7265954 = idf(docFreq=52, maxDocs=44218)
                0.015172941 = queryNorm
              0.4829122 = fieldWeight in 496, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7265954 = idf(docFreq=52, maxDocs=44218)
                0.0625 = fieldNorm(doc=496)
          0.046106007 = weight(abstract_txt:interviews in 496) [ClassicSimilarity], result of:
            0.046106007 = score(doc=496,freq=1.0), product of:
              0.13175938 = queryWeight, product of:
                1.5510143 = boost
                5.598813 = idf(docFreq=444, maxDocs=44218)
                0.015172941 = queryNorm
              0.34992582 = fieldWeight in 496, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.598813 = idf(docFreq=444, maxDocs=44218)
                0.0625 = fieldNorm(doc=496)
          0.030444235 = weight(abstract_txt:work in 496) [ClassicSimilarity], result of:
            0.030444235 = score(doc=496,freq=2.0), product of:
              0.090775155 = queryWeight, product of:
                1.5767186 = boost
                3.7943997 = idf(docFreq=2703, maxDocs=44218)
                0.015172941 = queryNorm
              0.3353807 = fieldWeight in 496, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7943997 = idf(docFreq=2703, maxDocs=44218)
                0.0625 = fieldNorm(doc=496)
          0.08927605 = weight(abstract_txt:ground in 496) [ClassicSimilarity], result of:
            0.08927605 = score(doc=496,freq=1.0), product of:
              0.20469151 = queryWeight, product of:
                1.9331919 = boost
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.015172941 = queryNorm
              0.43614927 = fieldWeight in 496, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.0625 = fieldNorm(doc=496)
          0.074838065 = weight(abstract_txt:science in 496) [ClassicSimilarity], result of:
            0.074838065 = score(doc=496,freq=2.0), product of:
              0.21929947 = queryWeight, product of:
                3.7435 = boost
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.015172941 = queryNorm
              0.3412597 = fieldWeight in 496, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.0625 = fieldNorm(doc=496)
          0.12673758 = weight(abstract_txt:data in 496) [ClassicSimilarity], result of:
            0.12673758 = score(doc=496,freq=3.0), product of:
              0.35090816 = queryWeight, product of:
                6.931902 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.015172941 = queryNorm
              0.36117023 = fieldWeight in 496, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=496)
        0.24 = coord(6/25)
  4. Eaglestone, B.; Ford, N.; Brown, G.J.; Moore, A.: Information systems and creativity : an empirical study (2007) 0.09
    0.09085073 = sum of:
      0.09085073 = product of:
        0.3244669 = sum of:
          0.016869891 = weight(abstract_txt:were in 834) [ClassicSimilarity], result of:
            0.016869891 = score(doc=834,freq=3.0), product of:
              0.05661569 = queryWeight, product of:
                1.0167015 = boost
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.015172941 = queryNorm
              0.29797202 = fieldWeight in 834, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.046875 = fieldNorm(doc=834)
          0.010910346 = weight(abstract_txt:both in 834) [ClassicSimilarity], result of:
            0.010910346 = score(doc=834,freq=1.0), product of:
              0.061065327 = queryWeight, product of:
                1.055899 = boost
                3.811558 = idf(docFreq=2657, maxDocs=44218)
                0.015172941 = queryNorm
              0.17866679 = fieldWeight in 834, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.811558 = idf(docFreq=2657, maxDocs=44218)
                0.046875 = fieldNorm(doc=834)
          0.027272098 = weight(abstract_txt:approaches in 834) [ClassicSimilarity], result of:
            0.027272098 = score(doc=834,freq=2.0), product of:
              0.08926983 = queryWeight, product of:
                1.2766663 = boost
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.015172941 = queryNorm
              0.30550185 = fieldWeight in 834, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.046875 = fieldNorm(doc=834)
          0.034579508 = weight(abstract_txt:interviews in 834) [ClassicSimilarity], result of:
            0.034579508 = score(doc=834,freq=1.0), product of:
              0.13175938 = queryWeight, product of:
                1.5510143 = boost
                5.598813 = idf(docFreq=444, maxDocs=44218)
                0.015172941 = queryNorm
              0.26244438 = fieldWeight in 834, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.598813 = idf(docFreq=444, maxDocs=44218)
                0.046875 = fieldNorm(doc=834)
          0.016145496 = weight(abstract_txt:work in 834) [ClassicSimilarity], result of:
            0.016145496 = score(doc=834,freq=1.0), product of:
              0.090775155 = queryWeight, product of:
                1.5767186 = boost
                3.7943997 = idf(docFreq=2703, maxDocs=44218)
                0.015172941 = queryNorm
              0.1778625 = fieldWeight in 834, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7943997 = idf(docFreq=2703, maxDocs=44218)
                0.046875 = fieldNorm(doc=834)
          0.095976435 = weight(abstract_txt:criticisms in 834) [ClassicSimilarity], result of:
            0.095976435 = score(doc=834,freq=1.0), product of:
              0.26022285 = queryWeight, product of:
                2.1797051 = boost
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.015172941 = queryNorm
              0.368824 = fieldWeight in 834, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.046875 = fieldNorm(doc=834)
          0.12271314 = weight(abstract_txt:data in 834) [ClassicSimilarity], result of:
            0.12271314 = score(doc=834,freq=5.0), product of:
              0.35090816 = queryWeight, product of:
                6.931902 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.015172941 = queryNorm
              0.34970158 = fieldWeight in 834, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.046875 = fieldNorm(doc=834)
        0.28 = coord(7/25)
  5. Hagen, L.; Patel, M.; Luna-Reyes, L.: Human-supervised data science framework for city governments : a design science approach (2023) 0.08
    0.08426516 = sum of:
      0.08426516 = product of:
        0.5266572 = sum of:
          0.01818391 = weight(abstract_txt:both in 1016) [ClassicSimilarity], result of:
            0.01818391 = score(doc=1016,freq=1.0), product of:
              0.061065327 = queryWeight, product of:
                1.055899 = boost
                3.811558 = idf(docFreq=2657, maxDocs=44218)
                0.015172941 = queryNorm
              0.29777798 = fieldWeight in 1016, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.811558 = idf(docFreq=2657, maxDocs=44218)
                0.078125 = fieldNorm(doc=1016)
          0.03214048 = weight(abstract_txt:approaches in 1016) [ClassicSimilarity], result of:
            0.03214048 = score(doc=1016,freq=1.0), product of:
              0.08926983 = queryWeight, product of:
                1.2766663 = boost
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.015172941 = queryNorm
              0.3600374 = fieldWeight in 1016, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.078125 = fieldNorm(doc=1016)
          0.18709517 = weight(abstract_txt:science in 1016) [ClassicSimilarity], result of:
            0.18709517 = score(doc=1016,freq=8.0), product of:
              0.21929947 = queryWeight, product of:
                3.7435 = boost
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.015172941 = queryNorm
              0.85314924 = fieldWeight in 1016, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.078125 = fieldNorm(doc=1016)
          0.28923765 = weight(abstract_txt:data in 1016) [ClassicSimilarity], result of:
            0.28923765 = score(doc=1016,freq=10.0), product of:
              0.35090816 = queryWeight, product of:
                6.931902 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.015172941 = queryNorm
              0.8242545 = fieldWeight in 1016, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.078125 = fieldNorm(doc=1016)
        0.16 = coord(4/25)
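
The content-match trees combine a per-term queryWeight (boost × idf × queryNorm) with the fieldWeight (tf × idf × fieldNorm), sum the per-term products, and scale the sum by the coord factor. The sketch below recomputes entry 5 (doc=1016) from the numbers in the listing. It assumes the documented ClassicSimilarity formulas; the helper names and the tuple layout are hypothetical, not part of Lucene's API.

```python
import math

MAX_DOCS = 44218          # maxDocs from the listing
QUERY_NORM = 0.015172941  # queryNorm from the listing
FIELD_NORM = 0.078125     # fieldNorm(doc=1016)
QUERY_CLAUSES = 25        # denominator of coord(4/25)


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """ClassicSimilarity idf: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost: float, freq: float, doc_freq: int, field_norm: float) -> float:
    """Per-term score = queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


# (term, boost, freq in doc 1016, docFreq), copied from the explain tree.
terms = [
    ("both", 1.055899, 1.0, 2657),
    ("approaches", 1.2766663, 1.0, 1197),
    ("science", 3.7435, 8.0, 2529),
    ("data", 6.931902, 10.0, 4274),
]

raw_sum = sum(term_score(b, f, df, FIELD_NORM) for _, b, f, df in terms)
coord = len(terms) / QUERY_CLAUSES  # fraction of query clauses that matched
total = raw_sum * coord
print(f"{total:.6f}")  # should be close to the listed 0.08426516
```

The coord factor explains why a document matching few query terms can still rank below one with weaker individual matches: entry 4 matches 7 of 25 clauses (0.28), while entry 5 matches only 4 (0.16).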