Document (#38733)

Author
Wasserman, M.
Mukherjee, S.
Scott, K.
Zeng, X.H.T.
Radicchi, F.
Amaral, L.A.N.
Title
Correlations between user voting data, budget, and box office for films in the internet movie database
Source
Journal of the Association for Information Science and Technology. 66(2015) no.4, S.858-868
Year
2015
Abstract
The Internet Movie Database (IMDb) is one of the most-visited websites in the world and the premier source for information on films. Similar to Wikipedia, much of IMDb's information is user contributed. IMDb also allows users to voice their opinion on the quality of films through voting. We investigate whether there is a connection between user voting data and economic film characteristics. We perform distribution and correlation analysis on a set of films chosen to mitigate effects of bias due to the language and country of origin of films. Production budget, box office gross, and total number of user votes for films are consistent with double-log normal distributions for certain time periods. Both total gross and user votes are consistent with a double-log normal distribution from the late 1980s onward while for budget it extends from 1935 to 1979. In addition, we find a strong correlation between number of user votes and the economic statistics, particularly budget. Remarkably, we find no evidence for a correlation between number of votes and average user rating. Our results suggest that total user votes is an indicator of a film's prominence or notability, which can be quantified by its promotional costs.
Content
Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23213/abstract.
Form
Filme
Object
Internet Movie Database

Similar documents (author)

  1. Amaral, L.A.N. -> Nunes Amaral, L.A.: 1.44
    1.4372063 = sum of:
      1.4372063 = product of:
        4.311619 = sum of:
          4.311619 = weight(author_txt:amaral in 1185) [ClassicSimilarity], result of:
            4.311619 = score(doc=1185,freq=2.0), product of:
              0.724582 = queryWeight, product of:
                1.27051 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.05929932 = queryNorm
              5.950491 = fieldWeight in 1185, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.4375 = fieldNorm(doc=1185)
        0.33333334 = coord(1/3)
    
  2. Morato Amaral, R. -> Amaral, R.M.: 1.44
    1.4372063 = sum of:
      1.4372063 = product of:
        4.311619 = sum of:
          4.311619 = weight(author_txt:amaral in 891) [ClassicSimilarity], result of:
            4.311619 = score(doc=891,freq=2.0), product of:
              0.724582 = queryWeight, product of:
                1.27051 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.05929932 = queryNorm
              5.950491 = fieldWeight in 891, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.4375 = fieldNorm(doc=891)
        0.33333334 = coord(1/3)
    
  3. Amaral, L.A. Nunes -> Nunes Amaral, L.A.: 1.23
    1.2318912 = sum of:
      1.2318912 = product of:
        3.6956732 = sum of:
          3.6956732 = weight(author_txt:amaral in 4143) [ClassicSimilarity], result of:
            3.6956732 = score(doc=4143,freq=2.0), product of:
              0.724582 = queryWeight, product of:
                1.27051 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.05929932 = queryNorm
              5.100421 = fieldWeight in 4143, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.375 = fieldNorm(doc=4143)
        0.33333334 = coord(1/3)
    
  4. Scott, D.S.: Subject classification and natural-language processing for retrieval in large databases (1989) 0.89
    0.8901781 = sum of:
      0.8901781 = product of:
        2.6705341 = sum of:
          2.6705341 = weight(author_txt:scott in 967) [ClassicSimilarity], result of:
            2.6705341 = score(doc=967,freq=1.0), product of:
              0.52295953 = queryWeight, product of:
                1.079365 = boost
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.05929932 = queryNorm
              5.106579 = fieldWeight in 967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.625 = fieldNorm(doc=967)
        0.33333334 = coord(1/3)
    
  5. Scott, E.: ¬The evolution of bibliographic systems in the USA, 1876-1945 (1976/77) 0.89
    0.8901781 = sum of:
      0.8901781 = product of:
        2.6705341 = sum of:
          2.6705341 = weight(author_txt:scott in 4365) [ClassicSimilarity], result of:
            2.6705341 = score(doc=4365,freq=1.0), product of:
              0.52295953 = queryWeight, product of:
                1.079365 = boost
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.05929932 = queryNorm
              5.106579 = fieldWeight in 4365, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.625 = fieldNorm(doc=4365)
        0.33333334 = coord(1/3)
    

Similar documents (content)

  1. Collins, B.R.: Webwatch (1996) 0.10
    0.09693578 = sum of:
      0.09693578 = product of:
        0.80779815 = sum of:
          0.038447164 = weight(abstract_txt:number in 6956) [ClassicSimilarity], result of:
            0.038447164 = score(doc=6956,freq=1.0), product of:
              0.059540953 = queryWeight, product of:
                1.3987368 = boost
                4.132649 = idf(docFreq=1927, maxDocs=44218)
                0.010300333 = queryNorm
              0.6457264 = fieldWeight in 6956, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.132649 = idf(docFreq=1927, maxDocs=44218)
                0.15625 = fieldNorm(doc=6956)
          0.20987399 = weight(abstract_txt:movie in 6956) [ClassicSimilarity], result of:
            0.20987399 = score(doc=6956,freq=1.0), product of:
              0.16125563 = queryWeight, product of:
                1.8794897 = boost
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.010300333 = queryNorm
              1.3014987 = fieldWeight in 6956, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.15625 = fieldNorm(doc=6956)
          0.55947703 = weight(abstract_txt:films in 6956) [ClassicSimilarity], result of:
            0.55947703 = score(doc=6956,freq=1.0), product of:
              0.44713405 = queryWeight, product of:
                5.4207826 = boost
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.010300333 = queryNorm
              1.2512512 = fieldWeight in 6956, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.15625 = fieldNorm(doc=6956)
        0.12 = coord(3/25)
    
  2. Knautz, K.; Stock, W.G.: Collective indexing of emotions in videos (2011) 0.09
    0.09291593 = sum of:
      0.09291593 = product of:
        0.5807246 = sum of:
          0.012069228 = weight(abstract_txt:between in 295) [ClassicSimilarity], result of:
            0.012069228 = score(doc=295,freq=1.0), product of:
              0.055756927 = queryWeight, product of:
                1.5629565 = boost
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.010300333 = queryNorm
              0.21646151 = fieldWeight in 295, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.0625 = fieldNorm(doc=295)
          0.040520396 = weight(abstract_txt:total in 295) [ClassicSimilarity], result of:
            0.040520396 = score(doc=295,freq=1.0), product of:
              0.11358353 = queryWeight, product of:
                1.9319052 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.010300333 = queryNorm
              0.35674536 = fieldWeight in 295, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=295)
          0.02904057 = weight(abstract_txt:user in 295) [ClassicSimilarity], result of:
            0.02904057 = score(doc=295,freq=1.0), product of:
              0.12614186 = queryWeight, product of:
                3.3246205 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.010300333 = queryNorm
              0.23022151 = fieldWeight in 295, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.0625 = fieldNorm(doc=295)
          0.4990944 = weight(abstract_txt:votes in 295) [ClassicSimilarity], result of:
            0.4990944 = score(doc=295,freq=2.0), product of:
              0.57006925 = queryWeight, product of:
                5.587484 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.010300333 = queryNorm
              0.8754978 = fieldWeight in 295, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.0625 = fieldNorm(doc=295)
        0.16 = coord(4/25)
    
  3. Naun, C.C.; Elhard, K.C.: Cataloguing, lies, and videotape : comparing the IMDb and the library catalogue (2005) 0.06
    0.05627303 = sum of:
      0.05627303 = product of:
        0.46894193 = sum of:
          0.1259244 = weight(abstract_txt:movie in 5734) [ClassicSimilarity], result of:
            0.1259244 = score(doc=5734,freq=1.0), product of:
              0.16125563 = queryWeight, product of:
                1.8794897 = boost
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.010300333 = queryNorm
              0.7808992 = fieldWeight in 5734, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.09375 = fieldNorm(doc=5734)
          0.29945666 = weight(abstract_txt:imdb in 5734) [ClassicSimilarity], result of:
            0.29945666 = score(doc=5734,freq=2.0), product of:
              0.22802772 = queryWeight, product of:
                2.2349937 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.010300333 = queryNorm
              1.3132467 = fieldWeight in 5734, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.09375 = fieldNorm(doc=5734)
          0.043560855 = weight(abstract_txt:user in 5734) [ClassicSimilarity], result of:
            0.043560855 = score(doc=5734,freq=1.0), product of:
              0.12614186 = queryWeight, product of:
                3.3246205 = boost
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.010300333 = queryNorm
              0.34533226 = fieldWeight in 5734, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6835442 = idf(docFreq=3020, maxDocs=44218)
                0.09375 = fieldNorm(doc=5734)
        0.12 = coord(3/25)
    
  4. Yee, M.M.: Manifestations and near-equivalents of moving image works : a research project (1994) 0.05
    0.0528248 = sum of:
      0.0528248 = product of:
        0.44020668 = sum of:
          0.012069228 = weight(abstract_txt:between in 862) [ClassicSimilarity], result of:
            0.012069228 = score(doc=862,freq=1.0), product of:
              0.055756927 = queryWeight, product of:
                1.5629565 = boost
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.010300333 = queryNorm
              0.21646151 = fieldWeight in 862, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.0625 = fieldNorm(doc=862)
          0.040520396 = weight(abstract_txt:total in 862) [ClassicSimilarity], result of:
            0.040520396 = score(doc=862,freq=1.0), product of:
              0.11358353 = queryWeight, product of:
                1.9319052 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.010300333 = queryNorm
              0.35674536 = fieldWeight in 862, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=862)
          0.38761705 = weight(abstract_txt:films in 862) [ClassicSimilarity], result of:
            0.38761705 = score(doc=862,freq=3.0), product of:
              0.44713405 = queryWeight, product of:
                5.4207826 = boost
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.010300333 = queryNorm
              0.8668923 = fieldWeight in 862, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.0625 = fieldNorm(doc=862)
        0.12 = coord(3/25)
    
  5. Száva-Kováts, E.: Indirect-collective referencing (ICR) in the elite journal literature of physics : II: a literature science study on the level of communications (2002) 0.05
    0.048409898 = sum of:
      0.048409898 = product of:
        0.20170791 = sum of:
          0.012722188 = weight(abstract_txt:find in 180) [ClassicSimilarity], result of:
            0.012722188 = score(doc=180,freq=1.0), product of:
              0.055526823 = queryWeight, product of:
                1.1028943 = boost
                4.887848 = idf(docFreq=905, maxDocs=44218)
                0.010300333 = queryNorm
              0.22911787 = fieldWeight in 180, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.887848 = idf(docFreq=905, maxDocs=44218)
                0.046875 = fieldNorm(doc=180)
          0.01153415 = weight(abstract_txt:number in 180) [ClassicSimilarity], result of:
            0.01153415 = score(doc=180,freq=1.0), product of:
              0.059540953 = queryWeight, product of:
                1.3987368 = boost
                4.132649 = idf(docFreq=1927, maxDocs=44218)
                0.010300333 = queryNorm
              0.19371793 = fieldWeight in 180, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.132649 = idf(docFreq=1927, maxDocs=44218)
                0.046875 = fieldNorm(doc=180)
          0.018103842 = weight(abstract_txt:between in 180) [ClassicSimilarity], result of:
            0.018103842 = score(doc=180,freq=4.0), product of:
              0.055756927 = queryWeight, product of:
                1.5629565 = boost
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.010300333 = queryNorm
              0.32469225 = fieldWeight in 180, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.046875 = fieldNorm(doc=180)
          0.045280464 = weight(abstract_txt:normal in 180) [ClassicSimilarity], result of:
            0.045280464 = score(doc=180,freq=1.0), product of:
              0.12944011 = queryWeight, product of:
                1.6839024 = boost
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.010300333 = queryNorm
              0.34981787 = fieldWeight in 180, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.046875 = fieldNorm(doc=180)
          0.030390298 = weight(abstract_txt:total in 180) [ClassicSimilarity], result of:
            0.030390298 = score(doc=180,freq=1.0), product of:
              0.11358353 = queryWeight, product of:
                1.9319052 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.010300333 = queryNorm
              0.26755902 = fieldWeight in 180, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.046875 = fieldNorm(doc=180)
          0.08367697 = weight(abstract_txt:correlation in 180) [ClassicSimilarity], result of:
            0.08367697 = score(doc=180,freq=4.0), product of:
              0.14056462 = queryWeight, product of:
                2.149147 = boost
                6.3497796 = idf(docFreq=209, maxDocs=44218)
                0.010300333 = queryNorm
              0.59529185 = fieldWeight in 180, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.3497796 = idf(docFreq=209, maxDocs=44218)
                0.046875 = fieldNorm(doc=180)
        0.24 = coord(6/25)