Document (#38138)

Author
Darányi, S.
Wittek, P.
Title
Demonstrating conceptual dynamics in an evolving text collection
Source
Journal of the American Society for Information Science and Technology. 64(2013) no.12, pp. 2564-2572
Year
2013
Abstract
Based on real-world user demands, we demonstrate how animated visualization of evolving text corpora displays the underlying dynamics of semantic content. To interpret the results, one needs a dynamic theory of word meaning. We suggest that conceptual dynamics as the interaction between kinds of intellectual and emotional content and language is key for such a theory. We demonstrate our method by two-way seriation, which is a popular technique to analyze groups of similar instances and their features as well as the connections between the groups themselves. The two-way seriated data may be visualized as a two-dimensional heat map or as a three-dimensional landscape in which color codes or height correspond to the values in the matrix. In this article, we focus on two-way seriation of sparse data in the Reuters-21578 test collection. To achieve a meaningful visualization, we introduce a compactly supported convolution kernel similar to filter kernels used in image reconstruction and geostatistics. This filter populates the high-dimensional sparse space with values that interpolate nearby elements and provides insight into the clustering structure. We also extend two-way seriation to deal with online updates of both the row and column spaces and, combined with the convolution kernel, demonstrate a three-dimensional visualization of dynamics.
Theme
Visualization
Semantic environment in indexing and retrieval
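
The abstract describes a compactly supported convolution kernel that fills a sparse matrix with values interpolated from nearby elements. The record does not give the kernel itself, so the following is only a minimal sketch under stated assumptions: the cone-shaped kernel, the `radius` parameter, the dictionary-based sparse format, and the function names `cone_kernel` and `smooth_sparse` are all hypothetical and chosen for illustration, not taken from the article.

```python
import math


def cone_kernel(distance, radius):
    """Hypothetical compactly supported kernel: positive inside `radius`, exactly zero outside."""
    return max(0.0, 1.0 - distance / radius)


def smooth_sparse(entries, n_rows, n_cols, radius=2.0):
    """Convolve sparse entries {(row, col): value} with the cone kernel on a dense grid."""
    dense = [[0.0] * n_cols for _ in range(n_rows)]
    reach = int(math.ceil(radius))
    for (r, c), value in entries.items():
        # Compact support: only cells within `reach` of a nonzero entry can change.
        for i in range(max(0, r - reach), min(n_rows, r + reach + 1)):
            for j in range(max(0, c - reach), min(n_cols, c + reach + 1)):
                weight = cone_kernel(math.hypot(i - r, j - c), radius)
                dense[i][j] += value * weight
    return dense


# Two isolated nonzero cells in a 5 x 5 matrix (illustrative data only).
entries = {(1, 1): 1.0, (1, 3): 2.0}
grid = smooth_sparse(entries, 5, 5, radius=2.0)

for row in grid:
    print(" ".join(f"{v:4.2f}" for v in row))
```

The cell between the two entries receives a contribution from both, which is the kind of fill-in that makes a heat map or height landscape of a sparse matrix readable. Because the kernel vanishes beyond `radius`, the cost scales with the number of nonzero entries rather than with the full matrix size.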

Similar documents (content)

  1. Li, J.; Zhang, Z.; Li, X.; Chen, H.: Kernel-based learning for biomedical relation extraction (2008) 0.10
    0.10268018 = sum of:
      0.10268018 = product of:
        0.8556682 = sum of:
          0.030673916 = weight(abstract_txt:text in 1611) [ClassicSimilarity], result of:
            0.030673916 = score(doc=1611,freq=2.0), product of:
              0.06865424 = queryWeight, product of:
                1.0035279 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01691769 = queryNorm
              0.44678837 = fieldWeight in 1611, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=1611)
          0.22538702 = weight(abstract_txt:kernels in 1611) [ClassicSimilarity], result of:
            0.22538702 = score(doc=1611,freq=2.0), product of:
              0.20595096 = queryWeight, product of:
                1.2290305 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.01691769 = queryNorm
              1.0943723 = fieldWeight in 1611, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.078125 = fieldNorm(doc=1611)
          0.5996073 = weight(abstract_txt:kernel in 1611) [ClassicSimilarity], result of:
            0.5996073 = score(doc=1611,freq=9.0), product of:
              0.30176 = queryWeight, product of:
                2.103907 = boost
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.01691769 = queryNorm
              1.9870338 = fieldWeight in 1611, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.078125 = fieldNorm(doc=1611)
        0.12 = coord(3/25)
    
  2. Zhang, M.; Zhou, G.D.; Aw, A.: Exploring syntactic structured features over parse trees for relation extraction using kernel methods (2008) 0.08
    0.08285246 = sum of:
      0.08285246 = product of:
        0.6904372 = sum of:
          0.017351788 = weight(abstract_txt:text in 2055) [ClassicSimilarity], result of:
            0.017351788 = score(doc=2055,freq=1.0), product of:
              0.06865424 = queryWeight, product of:
                1.0035279 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.01691769 = queryNorm
              0.25274166 = fieldWeight in 2055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=2055)
          0.22083327 = weight(abstract_txt:kernels in 2055) [ClassicSimilarity], result of:
            0.22083327 = score(doc=2055,freq=3.0), product of:
              0.20595096 = queryWeight, product of:
                1.2290305 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.01691769 = queryNorm
              1.0722615 = fieldWeight in 2055, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.0625 = fieldNorm(doc=2055)
          0.45225215 = weight(abstract_txt:kernel in 2055) [ClassicSimilarity], result of:
            0.45225215 = score(doc=2055,freq=8.0), product of:
              0.30176 = queryWeight, product of:
                2.103907 = boost
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.01691769 = queryNorm
              1.4987148 = fieldWeight in 2055, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                8.478011 = idf(docFreq=24, maxDocs=44218)
                0.0625 = fieldNorm(doc=2055)
        0.12 = coord(3/25)
    
  3. Oh, K.E.; Halpern, D.; Tremaine, M.; Chiang, J.; Silver, D.; Bemis, K.: Blocked: when the information is hidden by the visualization (2016) 0.08
    0.076915175 = sum of:
      0.076915175 = product of:
        0.48071986 = sum of:
          0.031960383 = weight(abstract_txt:three in 2888) [ClassicSimilarity], result of:
            0.031960383 = score(doc=2888,freq=2.0), product of:
              0.08187837 = queryWeight, product of:
                1.0959238 = boost
                4.41619 = idf(docFreq=1451, maxDocs=44218)
                0.01691769 = queryNorm
              0.39033973 = fieldWeight in 2888, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.41619 = idf(docFreq=1451, maxDocs=44218)
                0.0625 = fieldNorm(doc=2888)
          0.02448977 = weight(abstract_txt:theory in 2888) [ClassicSimilarity], result of:
            0.02448977 = score(doc=2888,freq=1.0), product of:
              0.08638288 = queryWeight, product of:
                1.1256661 = boost
                4.5360413 = idf(docFreq=1287, maxDocs=44218)
                0.01691769 = queryNorm
              0.28350258 = fieldWeight in 2888, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5360413 = idf(docFreq=1287, maxDocs=44218)
                0.0625 = fieldNorm(doc=2888)
          0.19023646 = weight(abstract_txt:visualization in 2888) [ClassicSimilarity], result of:
            0.19023646 = score(doc=2888,freq=4.0), product of:
              0.24433039 = queryWeight, product of:
                2.318623 = boost
                6.228827 = idf(docFreq=236, maxDocs=44218)
                0.01691769 = queryNorm
              0.7786034 = fieldWeight in 2888, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.228827 = idf(docFreq=236, maxDocs=44218)
                0.0625 = fieldNorm(doc=2888)
          0.23403324 = weight(abstract_txt:dimensional in 2888) [ClassicSimilarity], result of:
            0.23403324 = score(doc=2888,freq=2.0), product of:
              0.38900596 = queryWeight, product of:
                3.3782275 = boost
                6.806538 = idf(docFreq=132, maxDocs=44218)
                0.01691769 = queryNorm
              0.60161865 = fieldWeight in 2888, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.806538 = idf(docFreq=132, maxDocs=44218)
                0.0625 = fieldNorm(doc=2888)
        0.16 = coord(4/25)
    
  4. Lin, N.; Li, D.; Ding, Y.; He, B.; Qin, Z.; Tang, J.; Li, J.; Dong, T.: The dynamic features of Delicious, Flickr, and YouTube (2012) 0.07
    0.06901654 = sum of:
      0.06901654 = product of:
        0.3450827 = sum of:
          0.039143313 = weight(abstract_txt:three in 4970) [ClassicSimilarity], result of:
            0.039143313 = score(doc=4970,freq=3.0), product of:
              0.08187837 = queryWeight, product of:
                1.0959238 = boost
                4.41619 = idf(docFreq=1451, maxDocs=44218)
                0.01691769 = queryNorm
              0.4780666 = fieldWeight in 4970, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.41619 = idf(docFreq=1451, maxDocs=44218)
                0.0625 = fieldNorm(doc=4970)
          0.034801517 = weight(abstract_txt:groups in 4970) [ClassicSimilarity], result of:
            0.034801517 = score(doc=4970,freq=1.0), product of:
              0.10918676 = queryWeight, product of:
                1.2655542 = boost
                5.0997415 = idf(docFreq=732, maxDocs=44218)
                0.01691769 = queryNorm
              0.31873384 = fieldWeight in 4970, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0997415 = idf(docFreq=732, maxDocs=44218)
                0.0625 = fieldNorm(doc=4970)
          0.03712346 = weight(abstract_txt:similar in 4970) [ClassicSimilarity], result of:
            0.03712346 = score(doc=4970,freq=1.0), product of:
              0.1139909 = queryWeight, product of:
                1.2930963 = boost
                5.2107263 = idf(docFreq=655, maxDocs=44218)
                0.01691769 = queryNorm
              0.3256704 = fieldWeight in 4970, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2107263 = idf(docFreq=655, maxDocs=44218)
                0.0625 = fieldNorm(doc=4970)
          0.07275183 = weight(abstract_txt:evolving in 4970) [ClassicSimilarity], result of:
            0.07275183 = score(doc=4970,freq=1.0), product of:
              0.17851189 = queryWeight, product of:
                1.6181893 = boost
                6.5207376 = idf(docFreq=176, maxDocs=44218)
                0.01691769 = queryNorm
              0.4075461 = fieldWeight in 4970, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5207376 = idf(docFreq=176, maxDocs=44218)
                0.0625 = fieldNorm(doc=4970)
          0.16126256 = weight(abstract_txt:dynamics in 4970) [ClassicSimilarity], result of:
            0.16126256 = score(doc=4970,freq=1.0), product of:
              0.38235804 = queryWeight, product of:
                3.349237 = boost
                6.7481275 = idf(docFreq=140, maxDocs=44218)
                0.01691769 = queryNorm
              0.42175797 = fieldWeight in 4970, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7481275 = idf(docFreq=140, maxDocs=44218)
                0.0625 = fieldNorm(doc=4970)
        0.2 = coord(5/25)
    
  5. Chen, C.; Kuljis, J.: The rising landscape : a visual exploration of superstring revolutions in physics (2003) 0.07
    0.06858812 = sum of:
      0.06858812 = product of:
        0.42867577 = sum of:
          0.10301658 = weight(abstract_txt:visualized in 1469) [ClassicSimilarity], result of:
            0.10301658 = score(doc=1469,freq=1.0), product of:
              0.13634476 = queryWeight, product of:
                8.059301 = idf(docFreq=37, maxDocs=44218)
                0.01691769 = queryNorm
              0.7555595 = fieldWeight in 1469, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.059301 = idf(docFreq=37, maxDocs=44218)
                0.09375 = fieldNorm(doc=1469)
          0.14624716 = weight(abstract_txt:animated in 1469) [ClassicSimilarity], result of:
            0.14624716 = score(doc=1469,freq=1.0), product of:
              0.17222334 = queryWeight, product of:
                1.1238977 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.01691769 = queryNorm
              0.8491715 = fieldWeight in 1469, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.09375 = fieldNorm(doc=1469)
          0.036734655 = weight(abstract_txt:theory in 1469) [ClassicSimilarity], result of:
            0.036734655 = score(doc=1469,freq=1.0), product of:
              0.08638288 = queryWeight, product of:
                1.1256661 = boost
                4.5360413 = idf(docFreq=1287, maxDocs=44218)
                0.01691769 = queryNorm
              0.42525387 = fieldWeight in 1469, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5360413 = idf(docFreq=1287, maxDocs=44218)
                0.09375 = fieldNorm(doc=1469)
          0.14267735 = weight(abstract_txt:visualization in 1469) [ClassicSimilarity], result of:
            0.14267735 = score(doc=1469,freq=1.0), product of:
              0.24433039 = queryWeight, product of:
                2.318623 = boost
                6.228827 = idf(docFreq=236, maxDocs=44218)
                0.01691769 = queryNorm
              0.58395255 = fieldWeight in 1469, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.228827 = idf(docFreq=236, maxDocs=44218)
                0.09375 = fieldNorm(doc=1469)
        0.16 = coord(4/25)
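
The score breakdowns above follow Lucene's ClassicSimilarity (TF-IDF) pattern: each matched term contributes queryWeight times fieldWeight, and the sum is scaled by the coord factor. As a rough check, the sketch below recomputes the score of entry 1 (doc 1611) from the values listed in its tree. The function names are hypothetical, and the formulas (tf as the square root of the frequency, idf as 1 + ln(maxDocs / (docFreq + 1))) are inferred from the listed numbers and the usual ClassicSimilarity definitions. Small differences from the listed figures are expected because Lucene computes in single precision.

```python
import math

MAX_DOCS = 44218         # maxDocs, as listed in every idf line
QUERY_NORM = 0.01691769  # queryNorm, as listed in every queryWeight block


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, boost, field_norm):
    """Score of one matched term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


def document_score(terms, matched, query_terms):
    """Sum of term scores scaled by coord = matched / query_terms."""
    total = sum(term_score(*values) for values in terms.values())
    return total * (matched / query_terms)


# (freq, docFreq, boost, fieldNorm) copied from the tree for doc 1611.
terms_1611 = {
    "text": (2.0, 2106, 1.0035279, 0.078125),
    "kernels": (2.0, 5, 1.2290305, 0.078125),
    "kernel": (9.0, 24, 2.103907, 0.078125),
}

score_1611 = document_score(terms_1611, matched=3, query_terms=25)
print(f"recomputed score for doc 1611: {score_1611:.6f}")
```

The result should land close to the 0.10268018 shown for entry 1. The same functions can be applied to the other entries by copying their freq, docFreq, boost, and fieldNorm values and adjusting the coord fraction.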