Document (#43414)

Author
Matthews, P.
Glitre, K.
Title
Genre analysis of movies using a topic model of plot summaries
Source
Journal of the Association for Information Science and Technology. 72(2021) no.12, S.1511-1527
Year
2021
Abstract
Genre plays an important role in the description, navigation, and discovery of movies, but it is rarely studied at large scale using quantitative methods. This allows an analysis of how genre labels are applied, how genres are composed and how these ingredients change, and how genres compare. We apply unsupervised topic modeling to a large collection of textual movie summaries and then use the model's topic proportions to investigate key questions in genre, including recognizability, mapping, canonicity, and change over time. We find that many genres can be quite easily predicted by their lexical signatures and this defines their position on the genre landscape. We find significant genre composition changes between periods for westerns, science fiction and road movies, reflecting changes in production and consumption values. We show that in terms of canonicity, canonical examples are often at the high end of the topic distribution profile for the genre rather than central as might be predicted by categorization theory.
Content
Vgl.: https://asistdl.onlinelibrary.wiley.com/doi/10.1002/asi.24525.
Theme
Automatisches Indexieren
Form
Filme

Similar documents (author)

  1. Matthews, J.R.: Suggested guidelines for screen layouts and design of online catalogs (1987) 5.16
    5.164313 = sum of:
      5.164313 = weight(author_txt:matthews in 1290) [ClassicSimilarity], result of:
        5.164313 = fieldWeight in 1290, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.2629 = idf(docFreq=30, maxDocs=44218)
          0.625 = fieldNorm(doc=1290)
    
  2. Matthews, J.R.: Public access to online catalogs : a planning guide for managers (1982) 5.16
    5.164313 = sum of:
      5.164313 = weight(author_txt:matthews in 1974) [ClassicSimilarity], result of:
        5.164313 = fieldWeight in 1974, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.2629 = idf(docFreq=30, maxDocs=44218)
          0.625 = fieldNorm(doc=1974)
    
  3. Matthews, J.R.: Use of knowledge about users in software development (1991) 5.16
    5.164313 = sum of:
      5.164313 = weight(author_txt:matthews in 4833) [ClassicSimilarity], result of:
        5.164313 = fieldWeight in 4833, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.2629 = idf(docFreq=30, maxDocs=44218)
          0.625 = fieldNorm(doc=4833)
    
  4. Matthews, J.R.: ¬The online catalog : time to move beyond the boundary of a 'catalog'! (1991) 5.16
    5.164313 = sum of:
      5.164313 = weight(author_txt:matthews in 7679) [ClassicSimilarity], result of:
        5.164313 = fieldWeight in 7679, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.2629 = idf(docFreq=30, maxDocs=44218)
          0.625 = fieldNorm(doc=7679)
    
  5. Matthews, J.R.: ¬The distribution of information : the role for online public access catalogs (1994) 5.16
    5.164313 = sum of:
      5.164313 = weight(author_txt:matthews in 8692) [ClassicSimilarity], result of:
        5.164313 = fieldWeight in 8692, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.2629 = idf(docFreq=30, maxDocs=44218)
          0.625 = fieldNorm(doc=8692)
    

Similar documents (content)

  1. Crowston, K.; Kwasnik, B.H.: Can document-genre metadata improve information access to large digital collections? (2004) 0.18
    0.18216388 = sum of:
      0.18216388 = product of:
        1.1385243 = sum of:
          0.029905794 = weight(abstract_txt:large in 824) [ClassicSimilarity], result of:
            0.029905794 = score(doc=824,freq=2.0), product of:
              0.07596288 = queryWeight, product of:
                1.2507664 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.013635351 = queryNorm
              0.39368957 = fieldWeight in 824, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.0625 = fieldNorm(doc=824)
          0.06209053 = weight(abstract_txt:topic in 824) [ClassicSimilarity], result of:
            0.06209053 = score(doc=824,freq=1.0), product of:
              0.19624628 = queryWeight, product of:
                2.8430939 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.013635351 = queryNorm
              0.31639087 = fieldWeight in 824, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.0625 = fieldNorm(doc=824)
          0.30641842 = weight(abstract_txt:genres in 824) [ClassicSimilarity], result of:
            0.30641842 = score(doc=824,freq=5.0), product of:
              0.30224434 = queryWeight, product of:
                3.055626 = boost
                7.2542357 = idf(docFreq=84, maxDocs=44218)
                0.013635351 = queryNorm
              1.0138103 = fieldWeight in 824, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.2542357 = idf(docFreq=84, maxDocs=44218)
                0.0625 = fieldNorm(doc=824)
          0.7401095 = weight(abstract_txt:genre in 824) [ClassicSimilarity], result of:
            0.7401095 = score(doc=824,freq=9.0), product of:
              0.5932627 = queryWeight, product of:
                6.539326 = boost
                6.653462 = idf(docFreq=154, maxDocs=44218)
                0.013635351 = queryNorm
              1.2475241 = fieldWeight in 824, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                6.653462 = idf(docFreq=154, maxDocs=44218)
                0.0625 = fieldNorm(doc=824)
        0.16 = coord(4/25)
    
  2. Hajibayova, L.; Jacob, E.K.: User-generated genre tags through the lens of genre theories (2014) 0.16
    0.15991361 = sum of:
      0.15991361 = product of:
        0.9994601 = sum of:
          0.015161038 = weight(abstract_txt:analysis in 1450) [ClassicSimilarity], result of:
            0.015161038 = score(doc=1450,freq=3.0), product of:
              0.05111079 = queryWeight, product of:
                1.0259632 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.013635351 = queryNorm
              0.2966309 = fieldWeight in 1450, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.046875 = fieldNorm(doc=1450)
          0.015859943 = weight(abstract_txt:large in 1450) [ClassicSimilarity], result of:
            0.015859943 = score(doc=1450,freq=1.0), product of:
              0.07596288 = queryWeight, product of:
                1.2507664 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.013635351 = queryNorm
              0.20878543 = fieldWeight in 1450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.046875 = fieldNorm(doc=1450)
          0.20555171 = weight(abstract_txt:genres in 1450) [ClassicSimilarity], result of:
            0.20555171 = score(doc=1450,freq=4.0), product of:
              0.30224434 = queryWeight, product of:
                3.055626 = boost
                7.2542357 = idf(docFreq=84, maxDocs=44218)
                0.013635351 = queryNorm
              0.6800846 = fieldWeight in 1450, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.2542357 = idf(docFreq=84, maxDocs=44218)
                0.046875 = fieldNorm(doc=1450)
          0.7628874 = weight(abstract_txt:genre in 1450) [ClassicSimilarity], result of:
            0.7628874 = score(doc=1450,freq=17.0), product of:
              0.5932627 = queryWeight, product of:
                6.539326 = boost
                6.653462 = idf(docFreq=154, maxDocs=44218)
                0.013635351 = queryNorm
              1.2859185 = fieldWeight in 1450, product of:
                4.1231055 = tf(freq=17.0), with freq of:
                  17.0 = termFreq=17.0
                6.653462 = idf(docFreq=154, maxDocs=44218)
                0.046875 = fieldNorm(doc=1450)
        0.16 = coord(4/25)
    
  3. Wu, I.-C.; Niu, Y.-F.: Effects of anchoring process under preference stabilities for interactive movie recommendations (2015) 0.16
    0.15541647 = sum of:
      0.15541647 = product of:
        0.971353 = sum of:
          0.13830402 = weight(abstract_txt:movie in 2130) [ClassicSimilarity], result of:
            0.13830402 = score(doc=2130,freq=4.0), product of:
              0.1328315 = queryWeight, product of:
                1.1695291 = boost
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.013635351 = queryNorm
              1.041199 = fieldWeight in 2130, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.0625 = fieldNorm(doc=2130)
          0.27406895 = weight(abstract_txt:genres in 2130) [ClassicSimilarity], result of:
            0.27406895 = score(doc=2130,freq=4.0), product of:
              0.30224434 = queryWeight, product of:
                3.055626 = boost
                7.2542357 = idf(docFreq=84, maxDocs=44218)
                0.013635351 = queryNorm
              0.90677947 = fieldWeight in 2130, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.2542357 = idf(docFreq=84, maxDocs=44218)
                0.0625 = fieldNorm(doc=2130)
          0.21008904 = weight(abstract_txt:movies in 2130) [ClassicSimilarity], result of:
            0.21008904 = score(doc=2130,freq=1.0), product of:
              0.40185916 = queryWeight, product of:
                3.5233684 = boost
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.013635351 = queryNorm
              0.5227927 = fieldWeight in 2130, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.0625 = fieldNorm(doc=2130)
          0.34889096 = weight(abstract_txt:genre in 2130) [ClassicSimilarity], result of:
            0.34889096 = score(doc=2130,freq=2.0), product of:
              0.5932627 = queryWeight, product of:
                6.539326 = boost
                6.653462 = idf(docFreq=154, maxDocs=44218)
                0.013635351 = queryNorm
              0.5880885 = fieldWeight in 2130, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.653462 = idf(docFreq=154, maxDocs=44218)
                0.0625 = fieldNorm(doc=2130)
        0.16 = coord(4/25)
    
  4. Nahotko, M.: Genre groups in knowledge organization (2016) 0.14
    0.13628016 = sum of:
      0.13628016 = product of:
        1.135668 = sum of:
          0.01750646 = weight(abstract_txt:analysis in 5139) [ClassicSimilarity], result of:
            0.01750646 = score(doc=5139,freq=1.0), product of:
              0.05111079 = queryWeight, product of:
                1.0259632 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.013635351 = queryNorm
              0.34251985 = fieldWeight in 5139, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.09375 = fieldNorm(doc=5139)
          0.29069403 = weight(abstract_txt:genres in 5139) [ClassicSimilarity], result of:
            0.29069403 = score(doc=5139,freq=2.0), product of:
              0.30224434 = queryWeight, product of:
                3.055626 = boost
                7.2542357 = idf(docFreq=84, maxDocs=44218)
                0.013635351 = queryNorm
              0.96178484 = fieldWeight in 5139, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.2542357 = idf(docFreq=84, maxDocs=44218)
                0.09375 = fieldNorm(doc=5139)
          0.82746756 = weight(abstract_txt:genre in 5139) [ClassicSimilarity], result of:
            0.82746756 = score(doc=5139,freq=5.0), product of:
              0.5932627 = queryWeight, product of:
                6.539326 = boost
                6.653462 = idf(docFreq=154, maxDocs=44218)
                0.013635351 = queryNorm
              1.3947743 = fieldWeight in 5139, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.653462 = idf(docFreq=154, maxDocs=44218)
                0.09375 = fieldNorm(doc=5139)
        0.12 = coord(3/25)
    
  5. Finn, A.; Kushmerick, N.: Learning to classify documents according to genre (2006) 0.12
    0.12280663 = sum of:
      0.12280663 = product of:
        0.76754147 = sum of:
          0.014588716 = weight(abstract_txt:analysis in 6010) [ClassicSimilarity], result of:
            0.014588716 = score(doc=6010,freq=1.0), product of:
              0.05111079 = queryWeight, product of:
                1.0259632 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.013635351 = queryNorm
              0.2854332 = fieldWeight in 6010, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.078125 = fieldNorm(doc=6010)
          0.026433239 = weight(abstract_txt:large in 6010) [ClassicSimilarity], result of:
            0.026433239 = score(doc=6010,freq=1.0), product of:
              0.07596288 = queryWeight, product of:
                1.2507664 = boost
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.013635351 = queryNorm
              0.34797573 = fieldWeight in 6010, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.454089 = idf(docFreq=1397, maxDocs=44218)
                0.078125 = fieldNorm(doc=6010)
          0.10976159 = weight(abstract_txt:topic in 6010) [ClassicSimilarity], result of:
            0.10976159 = score(doc=6010,freq=2.0), product of:
              0.19624628 = queryWeight, product of:
                2.8430939 = boost
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.013635351 = queryNorm
              0.5593053 = fieldWeight in 6010, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.062254 = idf(docFreq=760, maxDocs=44218)
                0.078125 = fieldNorm(doc=6010)
          0.6167579 = weight(abstract_txt:genre in 6010) [ClassicSimilarity], result of:
            0.6167579 = score(doc=6010,freq=4.0), product of:
              0.5932627 = queryWeight, product of:
                6.539326 = boost
                6.653462 = idf(docFreq=154, maxDocs=44218)
                0.013635351 = queryNorm
              1.0396035 = fieldWeight in 6010, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.653462 = idf(docFreq=154, maxDocs=44218)
                0.078125 = fieldNorm(doc=6010)
        0.16 = coord(4/25)