Document (#21242)

Author
Yongcheng, W.
Xiaoming, G.
Lixia, W.
Title
Automatic indexing on subject of Chinese text
Source
Journal of the China Society for Scientific and Technical Information. 17(1998) no.3, S.219-225
Year
1998
Abstract
Outlines the underlying ideas, the basic algorithm and structure of CSAIS 2.1, an automatic indexing system for the subjects of Chinese documents, developed by the authors in 1993
Footnote
[In Chinesisch]
Theme
Automatisches Indexieren

Similar documents (content)

  1. Li, Z.: Research on dynamic morphological indexing (1998) 0.39
    0.38907936 = sum of:
      0.38907936 = product of:
        1.3228698 = sum of:
          0.062465895 = weight(abstract_txt:documents in 3242) [ClassicSimilarity], result of:
            0.062465895 = score(doc=3242,freq=1.0), product of:
              0.12125466 = queryWeight, product of:
                1.2221013 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.02407447 = queryNorm
              0.5151628 = fieldWeight in 3242, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.125 = fieldNorm(doc=3242)
          0.16573071 = weight(abstract_txt:algorithm in 3242) [ClassicSimilarity], result of:
            0.16573071 = score(doc=3242,freq=1.0), product of:
              0.23238343 = queryWeight, product of:
                1.6918449 = boost
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.02407447 = queryNorm
              0.71317786 = fieldWeight in 3242, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.125 = fieldNorm(doc=3242)
          0.29372686 = weight(abstract_txt:indexing in 3242) [ClassicSimilarity], result of:
            0.29372686 = score(doc=3242,freq=4.0), product of:
              0.27011928 = queryWeight, product of:
                2.5795906 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.02407447 = queryNorm
              1.0873969 = fieldWeight in 3242, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.125 = fieldNorm(doc=3242)
          0.35399 = weight(abstract_txt:automatic in 3242) [ClassicSimilarity], result of:
            0.35399 = score(doc=3242,freq=2.0), product of:
              0.38541666 = queryWeight, product of:
                3.08133 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.02407447 = queryNorm
              0.91846055 = fieldWeight in 3242, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.125 = fieldNorm(doc=3242)
          0.4469563 = weight(abstract_txt:chinese in 3242) [ClassicSimilarity], result of:
            0.4469563 = score(doc=3242,freq=1.0), product of:
              0.56727004 = queryWeight, product of:
                3.7382462 = boost
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.02407447 = queryNorm
              0.7879075 = fieldWeight in 3242, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.125 = fieldNorm(doc=3242)
        0.29411766 = coord(5/17)
    
  2. Wan, T.-L.; Evens, M.; Wan, Y.-W.; Pao, Y.-Y.: Experiments with automatic indexing and a relational thesaurus in a Chinese information retrieval system (1997) 0.37
    0.37261355 = sum of:
      0.37261355 = product of:
        1.266886 = sum of:
          0.044457316 = weight(abstract_txt:system in 956) [ClassicSimilarity], result of:
            0.044457316 = score(doc=956,freq=3.0), product of:
              0.08118654 = queryWeight, product of:
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.02407447 = queryNorm
              0.54759467 = fieldWeight in 956, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.09375 = fieldNorm(doc=956)
          0.04684942 = weight(abstract_txt:documents in 956) [ClassicSimilarity], result of:
            0.04684942 = score(doc=956,freq=1.0), product of:
              0.12125466 = queryWeight, product of:
                1.2221013 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.02407447 = queryNorm
              0.38637212 = fieldWeight in 956, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.09375 = fieldNorm(doc=956)
          0.26980534 = weight(abstract_txt:indexing in 956) [ClassicSimilarity], result of:
            0.26980534 = score(doc=956,freq=6.0), product of:
              0.27011928 = queryWeight, product of:
                2.5795906 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.02407447 = queryNorm
              0.9988378 = fieldWeight in 956, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.09375 = fieldNorm(doc=956)
          0.32516056 = weight(abstract_txt:automatic in 956) [ClassicSimilarity], result of:
            0.32516056 = score(doc=956,freq=3.0), product of:
              0.38541666 = queryWeight, product of:
                3.08133 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.02407447 = queryNorm
              0.8436599 = fieldWeight in 956, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.09375 = fieldNorm(doc=956)
          0.5806133 = weight(abstract_txt:chinese in 956) [ClassicSimilarity], result of:
            0.5806133 = score(doc=956,freq=3.0), product of:
              0.56727004 = queryWeight, product of:
                3.7382462 = boost
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.02407447 = queryNorm
              1.0235219 = fieldWeight in 956, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.09375 = fieldNorm(doc=956)
        0.29411766 = coord(5/17)
    
  3. Yang, C.C.; Li, K.W.: ¬A heuristic method based on a statistical approach for chinese text segmentation (2005) 0.37
    0.3682671 = sum of:
      0.3682671 = product of:
        1.0434234 = sum of:
          0.07806367 = weight(abstract_txt:text in 4580) [ClassicSimilarity], result of:
            0.07806367 = score(doc=4580,freq=7.0), product of:
              0.11674092 = queryWeight, product of:
                1.199139 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.02407447 = queryNorm
              0.6686916 = fieldWeight in 4580, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=4580)
          0.032957584 = weight(abstract_txt:developed in 4580) [ClassicSimilarity], result of:
            0.032957584 = score(doc=4580,freq=1.0), product of:
              0.12567823 = queryWeight, product of:
                1.2441937 = boost
                4.195805 = idf(docFreq=1809, maxDocs=44218)
                0.02407447 = queryNorm
              0.26223782 = fieldWeight in 4580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.195805 = idf(docFreq=1809, maxDocs=44218)
                0.0625 = fieldNorm(doc=4580)
          0.0633816 = weight(abstract_txt:authors in 4580) [ClassicSimilarity], result of:
            0.0633816 = score(doc=4580,freq=2.0), product of:
              0.1542607 = queryWeight, product of:
                1.3784329 = boost
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.02407447 = queryNorm
              0.4108733 = fieldWeight in 4580, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.0625 = fieldNorm(doc=4580)
          0.073431715 = weight(abstract_txt:indexing in 4580) [ClassicSimilarity], result of:
            0.073431715 = score(doc=4580,freq=1.0), product of:
              0.27011928 = queryWeight, product of:
                2.5795906 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.02407447 = queryNorm
              0.27184922 = fieldWeight in 4580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.0625 = fieldNorm(doc=4580)
          0.12515436 = weight(abstract_txt:automatic in 4580) [ClassicSimilarity], result of:
            0.12515436 = score(doc=4580,freq=1.0), product of:
              0.38541666 = queryWeight, product of:
                3.08133 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.02407447 = queryNorm
              0.32472485 = fieldWeight in 4580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0625 = fieldNorm(doc=4580)
          0.6704344 = weight(abstract_txt:chinese in 4580) [ClassicSimilarity], result of:
            0.6704344 = score(doc=4580,freq=9.0), product of:
              0.56727004 = queryWeight, product of:
                3.7382462 = boost
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.02407447 = queryNorm
              1.1818612 = fieldWeight in 4580, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.0625 = fieldNorm(doc=4580)
        0.3529412 = coord(6/17)
    
  4. Wang, F.L.; Yang, C.C.: Mining Web data for Chinese segmentation (2007) 0.29
    0.29336408 = sum of:
      0.29336408 = product of:
        0.99743783 = sum of:
          0.029505294 = weight(abstract_txt:text in 604) [ClassicSimilarity], result of:
            0.029505294 = score(doc=604,freq=1.0), product of:
              0.11674092 = queryWeight, product of:
                1.199139 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.02407447 = queryNorm
              0.25274166 = fieldWeight in 604, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=604)
          0.06983899 = weight(abstract_txt:documents in 604) [ClassicSimilarity], result of:
            0.06983899 = score(doc=604,freq=5.0), product of:
              0.12125466 = queryWeight, product of:
                1.2221013 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.02407447 = queryNorm
              0.5759696 = fieldWeight in 604, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=604)
          0.20297785 = weight(abstract_txt:algorithm in 604) [ClassicSimilarity], result of:
            0.20297785 = score(doc=604,freq=6.0), product of:
              0.23238343 = queryWeight, product of:
                1.6918449 = boost
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.02407447 = queryNorm
              0.87346095 = fieldWeight in 604, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.705423 = idf(docFreq=399, maxDocs=44218)
                0.0625 = fieldNorm(doc=604)
          0.10384813 = weight(abstract_txt:indexing in 604) [ClassicSimilarity], result of:
            0.10384813 = score(doc=604,freq=2.0), product of:
              0.27011928 = queryWeight, product of:
                2.5795906 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.02407447 = queryNorm
              0.38445285 = fieldWeight in 604, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.0625 = fieldNorm(doc=604)
          0.5912676 = weight(abstract_txt:chinese in 604) [ClassicSimilarity], result of:
            0.5912676 = score(doc=604,freq=7.0), product of:
              0.56727004 = queryWeight, product of:
                3.7382462 = boost
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.02407447 = queryNorm
              1.0423036 = fieldWeight in 604, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.0625 = fieldNorm(doc=604)
        0.29411766 = coord(5/17)
    
  5. Shen, Z.: CJK: the unique need of Chinese, Japanese, and Korean language cataloging (1993) 0.29
    0.28692967 = sum of:
      0.28692967 = product of:
        0.9755608 = sum of:
          0.048398998 = weight(abstract_txt:system in 3726) [ClassicSimilarity], result of:
            0.048398998 = score(doc=3726,freq=2.0), product of:
              0.08118654 = queryWeight, product of:
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.02407447 = queryNorm
              0.5961456 = fieldWeight in 3726, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.125 = fieldNorm(doc=3726)
          0.06591517 = weight(abstract_txt:developed in 3726) [ClassicSimilarity], result of:
            0.06591517 = score(doc=3726,freq=1.0), product of:
              0.12567823 = queryWeight, product of:
                1.2441937 = boost
                4.195805 = idf(docFreq=1809, maxDocs=44218)
                0.02407447 = queryNorm
              0.52447563 = fieldWeight in 3726, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.195805 = idf(docFreq=1809, maxDocs=44218)
                0.125 = fieldNorm(doc=3726)
          0.19319394 = weight(abstract_txt:outlines in 3726) [ClassicSimilarity], result of:
            0.19319394 = score(doc=3726,freq=2.0), product of:
              0.20429394 = queryWeight, product of:
                1.5863014 = boost
                5.349498 = idf(docFreq=570, maxDocs=44218)
                0.02407447 = queryNorm
              0.94566655 = fieldWeight in 3726, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.349498 = idf(docFreq=570, maxDocs=44218)
                0.125 = fieldNorm(doc=3726)
          0.22109638 = weight(abstract_txt:1993 in 3726) [ClassicSimilarity], result of:
            0.22109638 = score(doc=3726,freq=1.0), product of:
              0.28161615 = queryWeight, product of:
                1.8624592 = boost
                6.280787 = idf(docFreq=224, maxDocs=44218)
                0.02407447 = queryNorm
              0.7850984 = fieldWeight in 3726, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.280787 = idf(docFreq=224, maxDocs=44218)
                0.125 = fieldNorm(doc=3726)
          0.4469563 = weight(abstract_txt:chinese in 3726) [ClassicSimilarity], result of:
            0.4469563 = score(doc=3726,freq=1.0), product of:
              0.56727004 = queryWeight, product of:
                3.7382462 = boost
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.02407447 = queryNorm
              0.7879075 = fieldWeight in 3726, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.30326 = idf(docFreq=219, maxDocs=44218)
                0.125 = fieldNorm(doc=3726)
        0.29411766 = coord(5/17)