Document (#24676)

Author
Rorvig, M.
Smith, M.M.
Uemura, A.
Title
The N-gram hypothesis applied to matched sets of visualized Japanese-English technical documents
Source
Knowledge: creation, organization and use. Proceedings of the 62nd Annual Meeting of the American Society for Information Science, October 31-November 4, 1999. Ed.: L. Woods
Imprint
Medford, NJ : Information Today
Year
1999
Pages
pp. 359-364
Series
Proceedings of the American Society for Information Science; vol.36
Abstract
Shape Recovery Analysis (SHERA), a new visual analytical technique, is applied to the N-gram hypothesis on matched Japanese-English technical documents supplied by the National Center for Science Information Systems (NACSIS) in Japan. The results of the SHERA study reveal compaction in the translation of Japanese subject terms to English subject terms. Surprisingly, the bigram approach to the Japanese data yields a remarkable similarity to the matching visualized English texts.
Theme
Computational linguistics

Similar documents (author)

  1. Rorvig, M.E.: A method for automatically abstracting visual documents (1993) 2.52
    2.5237334 = sum of:
      2.5237334 = product of:
        5.0474668 = sum of:
          5.0474668 = weight(author_txt:rorvig in 2723) [ClassicSimilarity], result of:
            5.0474668 = score(doc=2723,freq=1.0), product of:
              0.8915984 = queryWeight, product of:
                1.4031965 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.07014983 = queryNorm
              5.661144 = fieldWeight in 2723, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.625 = fieldNorm(doc=2723)
        0.5 = coord(1/2)
    
  2. Rorvig, M.E.: Image information retrieval (1987) 2.52
    2.5237334 = sum of:
      2.5237334 = product of:
        5.0474668 = sum of:
          5.0474668 = weight(author_txt:rorvig in 5640) [ClassicSimilarity], result of:
            5.0474668 = score(doc=5640,freq=1.0), product of:
              0.8915984 = queryWeight, product of:
                1.4031965 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.07014983 = queryNorm
              5.661144 = fieldWeight in 5640, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.625 = fieldNorm(doc=5640)
        0.5 = coord(1/2)
    
  3. Rorvig, M.E.: The bibliographic control of microcomputer software (1988) 2.52
    2.5237334 = sum of:
      2.5237334 = product of:
        5.0474668 = sum of:
          5.0474668 = weight(author_txt:rorvig in 1275) [ClassicSimilarity], result of:
            5.0474668 = score(doc=1275,freq=1.0), product of:
              0.8915984 = queryWeight, product of:
                1.4031965 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.07014983 = queryNorm
              5.661144 = fieldWeight in 1275, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.625 = fieldNorm(doc=1275)
        0.5 = coord(1/2)
    
  4. Rorvig, M.E.: Psychometric measurement and information retrieval (1989) 2.52
    2.5237334 = sum of:
      2.5237334 = product of:
        5.0474668 = sum of:
          5.0474668 = weight(author_txt:rorvig in 333) [ClassicSimilarity], result of:
            5.0474668 = score(doc=333,freq=1.0), product of:
              0.8915984 = queryWeight, product of:
                1.4031965 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.07014983 = queryNorm
              5.661144 = fieldWeight in 333, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.625 = fieldNorm(doc=333)
        0.5 = coord(1/2)
    
  5. Rorvig, M.: Scaled structure in visualized TREC data and query feedback (1998) 2.52
    2.5237334 = sum of:
      2.5237334 = product of:
        5.0474668 = sum of:
          5.0474668 = weight(author_txt:rorvig in 3269) [ClassicSimilarity], result of:
            5.0474668 = score(doc=3269,freq=1.0), product of:
              0.8915984 = queryWeight, product of:
                1.4031965 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.07014983 = queryNorm
              5.661144 = fieldWeight in 3269, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.625 = fieldNorm(doc=3269)
        0.5 = coord(1/2)
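
The five author matches above share one score breakdown: a single term weight multiplied by coord(1/2). The sketch below recomputes that breakdown from the listed inputs. It assumes tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); these formulas are not stated in the record and were chosen because they reproduce the printed values. The function names are illustrative only.

```python
import math


def idf(doc_freq: int, max_docs: int) -> float:
    # Assumed formula; it reproduces idf(docFreq=13, maxDocs=44218) = 9.05783.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq: float) -> float:
    # Assumed formula; tf(freq=1.0) = 1.0 as printed in the explain tree.
    return math.sqrt(freq)


def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm      # "queryWeight, product of"
    field_weight = tf(freq) * term_idf * field_norm   # "fieldWeight, product of"
    return query_weight * field_weight


# Inputs copied from the first author match (doc=2723).
raw = term_score(
    freq=1.0,
    doc_freq=13,
    max_docs=44218,
    boost=1.4031965,
    query_norm=0.07014983,
    field_norm=0.625,
)
final = raw * (1 / 2)  # coord(1/2): one of two query clauses matched

print(f"term score = {raw:.4f}, final score = {final:.4f}")
```

Because every author match has the same freq, docFreq and fieldNorm, all five entries end up with the same displayed score of 2.52.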
    

Similar documents (content)

  1. Lee, Y.-S.; Wu, Y.-C.; Yang, J.-C.: BVideoQA : Online English/Chinese bilingual video question answering (2009) 0.17
    0.16711494 = sum of:
      0.16711494 = product of:
        0.6963123 = sum of:
          0.03112841 = weight(abstract_txt:matching in 2739) [ClassicSimilarity], result of:
            0.03112841 = score(doc=2739,freq=1.0), product of:
              0.082351476 = queryWeight, product of:
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.013616511 = queryNorm
              0.37799457 = fieldWeight in 2739, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.0625 = fieldNorm(doc=2739)
          0.018610617 = weight(abstract_txt:terms in 2739) [ClassicSimilarity], result of:
            0.018610617 = score(doc=2739,freq=1.0), product of:
              0.07363494 = queryWeight, product of:
                1.3372767 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.013616511 = queryNorm
              0.25274166 = fieldWeight in 2739, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=2739)
          0.11907209 = weight(abstract_txt:matched in 2739) [ClassicSimilarity], result of:
            0.11907209 = score(doc=2739,freq=1.0), product of:
              0.25377572 = queryWeight, product of:
                2.4825861 = boost
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.013616511 = queryNorm
              0.46920204 = fieldWeight in 2739, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.0625 = fieldNorm(doc=2739)
          0.14064486 = weight(abstract_txt:gram in 2739) [ClassicSimilarity], result of:
            0.14064486 = score(doc=2739,freq=1.0), product of:
              0.28356937 = queryWeight, product of:
                2.6242728 = boost
                7.935687 = idf(docFreq=42, maxDocs=44218)
                0.013616511 = queryNorm
              0.49598044 = fieldWeight in 2739, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.935687 = idf(docFreq=42, maxDocs=44218)
                0.0625 = fieldNorm(doc=2739)
          0.13788225 = weight(abstract_txt:english in 2739) [ClassicSimilarity], result of:
            0.13788225 = score(doc=2739,freq=2.0), product of:
              0.27984378 = queryWeight, product of:
                3.6868217 = boost
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.013616511 = queryNorm
              0.49271148 = fieldWeight in 2739, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.0625 = fieldNorm(doc=2739)
          0.24897406 = weight(abstract_txt:japanese in 2739) [ClassicSimilarity], result of:
            0.24897406 = score(doc=2739,freq=1.0), product of:
              0.52282476 = queryWeight, product of:
                5.039325 = boost
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.013616511 = queryNorm
              0.47620937 = fieldWeight in 2739, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.0625 = fieldNorm(doc=2739)
        0.24 = coord(6/25)
    
  2. Li, Q.; Chen, Y.P.; Myaeng, S.-H.; Jin, Y.; Kang, B.-Y.: Concept unification of terms in different languages via web mining for Information Retrieval (2009) 0.13
    0.1336932 = sum of:
      0.1336932 = product of:
        0.557055 = sum of:
          0.046865813 = weight(abstract_txt:translation in 4215) [ClassicSimilarity], result of:
            0.046865813 = score(doc=4215,freq=2.0), product of:
              0.085860655 = queryWeight, product of:
                1.0210838 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.013616511 = queryNorm
              0.54583573 = fieldWeight in 4215, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.0625 = fieldNorm(doc=4215)
          0.041614607 = weight(abstract_txt:terms in 4215) [ClassicSimilarity], result of:
            0.041614607 = score(doc=4215,freq=5.0), product of:
              0.07363494 = queryWeight, product of:
                1.3372767 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.013616511 = queryNorm
              0.5651476 = fieldWeight in 4215, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=4215)
          0.019700343 = weight(abstract_txt:documents in 4215) [ClassicSimilarity], result of:
            0.019700343 = score(doc=4215,freq=1.0), product of:
              0.076482005 = queryWeight, product of:
                1.3628842 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.013616511 = queryNorm
              0.2575814 = fieldWeight in 4215, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=4215)
          0.031029645 = weight(abstract_txt:applied in 4215) [ClassicSimilarity], result of:
            0.031029645 = score(doc=4215,freq=1.0), product of:
              0.10353677 = queryWeight, product of:
                1.5857204 = boost
                4.79515 = idf(docFreq=993, maxDocs=44218)
                0.013616511 = queryNorm
              0.29969686 = fieldWeight in 4215, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.79515 = idf(docFreq=993, maxDocs=44218)
                0.0625 = fieldNorm(doc=4215)
          0.16887058 = weight(abstract_txt:english in 4215) [ClassicSimilarity], result of:
            0.16887058 = score(doc=4215,freq=3.0), product of:
              0.27984378 = queryWeight, product of:
                3.6868217 = boost
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.013616511 = queryNorm
              0.6034459 = fieldWeight in 4215, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.0625 = fieldNorm(doc=4215)
          0.24897406 = weight(abstract_txt:japanese in 4215) [ClassicSimilarity], result of:
            0.24897406 = score(doc=4215,freq=1.0), product of:
              0.52282476 = queryWeight, product of:
                5.039325 = boost
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.013616511 = queryNorm
              0.47620937 = fieldWeight in 4215, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.0625 = fieldNorm(doc=4215)
        0.24 = coord(6/25)
    
  3. Kawamura, K.; Kita, K.; Shiba, M.: Prospects for the Japanese version of the Broad System of Ordering : the design and uses of machine-readable form (1996) 0.12
    0.11585737 = sum of:
      0.11585737 = product of:
        0.9654781 = sum of:
          0.06627827 = weight(abstract_txt:translation in 516) [ClassicSimilarity], result of:
            0.06627827 = score(doc=516,freq=1.0), product of:
              0.085860655 = queryWeight, product of:
                1.0210838 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.013616511 = queryNorm
              0.7719283 = fieldWeight in 516, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.125 = fieldNorm(doc=516)
          0.19499494 = weight(abstract_txt:english in 516) [ClassicSimilarity], result of:
            0.19499494 = score(doc=516,freq=1.0), product of:
              0.27984378 = queryWeight, product of:
                3.6868217 = boost
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.013616511 = queryNorm
              0.6967993 = fieldWeight in 516, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.125 = fieldNorm(doc=516)
          0.7042049 = weight(abstract_txt:japanese in 516) [ClassicSimilarity], result of:
            0.7042049 = score(doc=516,freq=2.0), product of:
              0.52282476 = queryWeight, product of:
                5.039325 = boost
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.013616511 = queryNorm
              1.3469235 = fieldWeight in 516, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.125 = fieldNorm(doc=516)
        0.12 = coord(3/25)
    
  4. Bellaachia, A.; Amor-Tijani, G.: Proper nouns in English-Arabic cross language information retrieval (2008) 0.11
    0.11280432 = sum of:
      0.11280432 = product of:
        0.47001803 = sum of:
          0.04402222 = weight(abstract_txt:matching in 2372) [ClassicSimilarity], result of:
            0.04402222 = score(doc=2372,freq=2.0), product of:
              0.082351476 = queryWeight, product of:
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.013616511 = queryNorm
              0.53456503 = fieldWeight in 2372, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.047913 = idf(docFreq=283, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
          0.018610617 = weight(abstract_txt:terms in 2372) [ClassicSimilarity], result of:
            0.018610617 = score(doc=2372,freq=1.0), product of:
              0.07363494 = queryWeight, product of:
                1.3372767 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.013616511 = queryNorm
              0.25274166 = fieldWeight in 2372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
          0.031029645 = weight(abstract_txt:applied in 2372) [ClassicSimilarity], result of:
            0.031029645 = score(doc=2372,freq=1.0), product of:
              0.10353677 = queryWeight, product of:
                1.5857204 = boost
                4.79515 = idf(docFreq=993, maxDocs=44218)
                0.013616511 = queryNorm
              0.29969686 = fieldWeight in 2372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.79515 = idf(docFreq=993, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
          0.03525401 = weight(abstract_txt:technical in 2372) [ClassicSimilarity], result of:
            0.03525401 = score(doc=2372,freq=1.0), product of:
              0.112732485 = queryWeight, product of:
                1.6546413 = boost
                5.0035634 = idf(docFreq=806, maxDocs=44218)
                0.013616511 = queryNorm
              0.3127227 = fieldWeight in 2372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0035634 = idf(docFreq=806, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
          0.24360405 = weight(abstract_txt:gram in 2372) [ClassicSimilarity], result of:
            0.24360405 = score(doc=2372,freq=3.0), product of:
              0.28356937 = queryWeight, product of:
                2.6242728 = boost
                7.935687 = idf(docFreq=42, maxDocs=44218)
                0.013616511 = queryNorm
              0.8590633 = fieldWeight in 2372, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.935687 = idf(docFreq=42, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
          0.09749747 = weight(abstract_txt:english in 2372) [ClassicSimilarity], result of:
            0.09749747 = score(doc=2372,freq=1.0), product of:
              0.27984378 = queryWeight, product of:
                3.6868217 = boost
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.013616511 = queryNorm
              0.34839964 = fieldWeight in 2372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.0625 = fieldNorm(doc=2372)
        0.24 = coord(6/25)
    
  5. Klein, R.D.: The problem of cataloguing world literature using the Nippon Decimal Classification (1994) 0.11
    0.11019802 = sum of:
      0.11019802 = product of:
        0.91831684 = sum of:
          0.13151701 = weight(abstract_txt:remarkable in 867) [ClassicSimilarity], result of:
            0.13151701 = score(doc=867,freq=1.0), product of:
              0.14820494 = queryWeight, product of:
                1.3415153 = boost
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.013616511 = queryNorm
              0.8873996 = fieldWeight in 867, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.109375 = fieldNorm(doc=867)
          0.17062058 = weight(abstract_txt:english in 867) [ClassicSimilarity], result of:
            0.17062058 = score(doc=867,freq=1.0), product of:
              0.27984378 = queryWeight, product of:
                3.6868217 = boost
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.013616511 = queryNorm
              0.60969937 = fieldWeight in 867, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.574394 = idf(docFreq=455, maxDocs=44218)
                0.109375 = fieldNorm(doc=867)
          0.6161793 = weight(abstract_txt:japanese in 867) [ClassicSimilarity], result of:
            0.6161793 = score(doc=867,freq=2.0), product of:
              0.52282476 = queryWeight, product of:
                5.039325 = boost
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.013616511 = queryNorm
              1.178558 = fieldWeight in 867, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.109375 = fieldNorm(doc=867)
        0.12 = coord(3/25)
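
For the content matches, each score is the sum of the per-term weights scaled by coord(matched terms / 25). The following sketch recomputes the third content match (doc=516) from its listed inputs. As before, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)) are assumptions that happen to reproduce the printed numbers, and all names are illustrative.

```python
import math

MAX_DOCS = 44218
QUERY_NORM = 0.013616511   # queryNorm shared by all content matches
FIELD_NORM = 0.125         # fieldNorm(doc=516)
QUERY_TERMS = 25           # denominator of coord(3/25)


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    # Assumed formula, chosen because it matches the listed idf values.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, boost: float) -> float:
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * FIELD_NORM
    return query_weight * field_weight


# (term, freq, docFreq, boost) copied from the explain tree for doc=516.
matches = [
    ("translation", 1.0, 249, 1.0210838),
    ("english", 1.0, 455, 3.6868217),
    ("japanese", 2.0, 58, 5.039325),
]

term_scores = {term: term_score(freq, df, boost) for term, freq, df, boost in matches}
raw_sum = sum(term_scores.values())
coord = len(matches) / QUERY_TERMS
total = raw_sum * coord

for term, score in term_scores.items():
    print(f"{term:12s} {score:.5f}")
print(f"sum = {raw_sum:.5f}, coord = {coord:.2f}, total = {total:.5f}")
```

The coord factor explains why doc=516 ranks below the first two content matches despite having the largest raw sum: it matches only 3 of the 25 query terms, while they match 6.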