Document (#22578)

Author
Chowdhury, G.G.
Title
Template mining for information extraction from digital documents
Source
Library trends. 48(1999) no.1, S.182-208
Year
1999
Theme
Data Mining

Similar documents (author)

  1. Chowdhury, G.G.; Chowdhury, S.: ¬An overview of the information retrieval features of twenty digital libraries (2000) 5.65
    5.6450562 = sum of:
      5.6450562 = weight(author_txt:chowdhury in 519) [ClassicSimilarity], result of:
        5.6450562 = fieldWeight in 519, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          7.983315 = idf(docFreq=40, maxDocs=44218)
          0.5 = fieldNorm(doc=519)
    
  2. Chowdhury, S.; Chowdhury, G.G.: Text retrieval system : an overview (1992) 5.65
    5.6450562 = sum of:
      5.6450562 = weight(author_txt:chowdhury in 6508) [ClassicSimilarity], result of:
        5.6450562 = fieldWeight in 6508, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          7.983315 = idf(docFreq=40, maxDocs=44218)
          0.5 = fieldNorm(doc=6508)
    
  3. Chowdhury, S.; Chowdhury, G.G.: Development of library management system using Micro-CDS/ISIS (1992) 5.65
    5.6450562 = sum of:
      5.6450562 = weight(author_txt:chowdhury in 440) [ClassicSimilarity], result of:
        5.6450562 = fieldWeight in 440, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          7.983315 = idf(docFreq=40, maxDocs=44218)
          0.5 = fieldNorm(doc=440)
    
  4. Chowdhury, G.G.; Chowdhury, S.: Text retrieval and library management software in India (1994) 5.65
    5.6450562 = sum of:
      5.6450562 = weight(author_txt:chowdhury in 1542) [ClassicSimilarity], result of:
        5.6450562 = fieldWeight in 1542, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          7.983315 = idf(docFreq=40, maxDocs=44218)
          0.5 = fieldNorm(doc=1542)
    
  5. Chowdhury, S.; Chowdhury, G.G.: Using DDC to create a visual knowledge map as an aid to online information retrieval (2004) 5.65
    5.6450562 = sum of:
      5.6450562 = weight(author_txt:chowdhury in 2643) [ClassicSimilarity], result of:
        5.6450562 = fieldWeight in 2643, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          7.983315 = idf(docFreq=40, maxDocs=44218)
          0.5 = fieldNorm(doc=2643)
    

Similar documents (content)

  1. Lawson, M.: Automatic extraction of citations from the text of English-language patents : an example of template mining (1996) 0.82
    0.8189213 = sum of:
      0.8189213 = product of:
        1.1464899 = sum of:
          0.020548532 = weight(abstract_txt:from in 2654) [ClassicSimilarity], result of:
            0.020548532 = score(doc=2654,freq=2.0), product of:
              0.08411359 = queryWeight, product of:
                1.1416538 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.026657056 = queryNorm
              0.24429502 = fieldWeight in 2654, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=2654)
          0.06812798 = weight(abstract_txt:documents in 2654) [ClassicSimilarity], result of:
            0.06812798 = score(doc=2654,freq=2.0), product of:
              0.18702343 = queryWeight, product of:
                1.7023519 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.026657056 = queryNorm
              0.36427513 = fieldWeight in 2654, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=2654)
          0.16207196 = weight(abstract_txt:mining in 2654) [ClassicSimilarity], result of:
            0.16207196 = score(doc=2654,freq=1.0), product of:
              0.41991454 = queryWeight, product of:
                2.5508316 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.026657056 = queryNorm
              0.38596416 = fieldWeight in 2654, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.0625 = fieldNorm(doc=2654)
          0.23100498 = weight(abstract_txt:extraction in 2654) [ClassicSimilarity], result of:
            0.23100498 = score(doc=2654,freq=2.0), product of:
              0.4221109 = queryWeight, product of:
                2.557494 = boost
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.026657056 = queryNorm
              0.54726136 = fieldWeight in 2654, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.0625 = fieldNorm(doc=2654)
          0.6647364 = weight(abstract_txt:template in 2654) [ClassicSimilarity], result of:
            0.6647364 = score(doc=2654,freq=3.0), product of:
              0.74601614 = queryWeight, product of:
                3.3999727 = boost
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.026657056 = queryNorm
              0.89104825 = fieldWeight in 2654, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.0625 = fieldNorm(doc=2654)
        0.71428573 = coord(5/7)
    
  2. Barrio, P.; Gravano, L.: Sampling strategies for information extraction over the deep web (2017) 0.48
    0.47649035 = sum of:
      0.47649035 = product of:
        0.6670865 = sum of:
          0.024166603 = weight(abstract_txt:information in 3412) [ClassicSimilarity], result of:
            0.024166603 = score(doc=3412,freq=8.0), product of:
              0.064535305 = queryWeight, product of:
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.026657056 = queryNorm
              0.37447104 = fieldWeight in 3412, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3412)
          0.0127137555 = weight(abstract_txt:from in 3412) [ClassicSimilarity], result of:
            0.0127137555 = score(doc=3412,freq=1.0), product of:
              0.08411359 = queryWeight, product of:
                1.1416538 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.026657056 = queryNorm
              0.15114984 = fieldWeight in 3412, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3412)
          0.059611984 = weight(abstract_txt:documents in 3412) [ClassicSimilarity], result of:
            0.059611984 = score(doc=3412,freq=2.0), product of:
              0.18702343 = queryWeight, product of:
                1.7023519 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.026657056 = queryNorm
              0.31874073 = fieldWeight in 3412, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3412)
          0.14181297 = weight(abstract_txt:mining in 3412) [ClassicSimilarity], result of:
            0.14181297 = score(doc=3412,freq=1.0), product of:
              0.41991454 = queryWeight, product of:
                2.5508316 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.026657056 = queryNorm
              0.33771864 = fieldWeight in 3412, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3412)
          0.42878115 = weight(abstract_txt:extraction in 3412) [ClassicSimilarity], result of:
            0.42878115 = score(doc=3412,freq=9.0), product of:
              0.4221109 = queryWeight, product of:
                2.557494 = boost
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.026657056 = queryNorm
              1.0158021 = fieldWeight in 3412, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3412)
        0.71428573 = coord(5/7)
    
  3. Yim, W.-w.; Kwan, S.W.; Yetisgen, M.: Classifying tumor event attributes in radiology reports (2017) 0.46
    0.46355522 = sum of:
      0.46355522 = product of:
        0.8112216 = sum of:
          0.016913097 = weight(abstract_txt:information in 3929) [ClassicSimilarity], result of:
            0.016913097 = score(doc=3929,freq=3.0), product of:
              0.064535305 = queryWeight, product of:
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.026657056 = queryNorm
              0.26207513 = fieldWeight in 3929, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=3929)
          0.020548532 = weight(abstract_txt:from in 3929) [ClassicSimilarity], result of:
            0.020548532 = score(doc=3929,freq=2.0), product of:
              0.08411359 = queryWeight, product of:
                1.1416538 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.026657056 = queryNorm
              0.24429502 = fieldWeight in 3929, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=3929)
          0.23100498 = weight(abstract_txt:extraction in 3929) [ClassicSimilarity], result of:
            0.23100498 = score(doc=3929,freq=2.0), product of:
              0.4221109 = queryWeight, product of:
                2.557494 = boost
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.026657056 = queryNorm
              0.54726136 = fieldWeight in 3929, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.0625 = fieldNorm(doc=3929)
          0.542755 = weight(abstract_txt:template in 3929) [ClassicSimilarity], result of:
            0.542755 = score(doc=3929,freq=2.0), product of:
              0.74601614 = queryWeight, product of:
                3.3999727 = boost
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.026657056 = queryNorm
              0.7275379 = fieldWeight in 3929, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.0625 = fieldNorm(doc=3929)
        0.5714286 = coord(4/7)
    
  4. Ku, L.-W.; Chen, H.-H.: Mining opinions from the Web : beyond relevance retrieval (2007) 0.43
    0.4304833 = sum of:
      0.4304833 = product of:
        0.60267663 = sum of:
          0.021834716 = weight(abstract_txt:information in 605) [ClassicSimilarity], result of:
            0.021834716 = score(doc=605,freq=5.0), product of:
              0.064535305 = queryWeight, product of:
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.026657056 = queryNorm
              0.33833754 = fieldWeight in 605, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=605)
          0.029060012 = weight(abstract_txt:from in 605) [ClassicSimilarity], result of:
            0.029060012 = score(doc=605,freq=4.0), product of:
              0.08411359 = queryWeight, product of:
                1.1416538 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.026657056 = queryNorm
              0.34548533 = fieldWeight in 605, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=605)
          0.10771981 = weight(abstract_txt:documents in 605) [ClassicSimilarity], result of:
            0.10771981 = score(doc=605,freq=5.0), product of:
              0.18702343 = queryWeight, product of:
                1.7023519 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.026657056 = queryNorm
              0.5759696 = fieldWeight in 605, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=605)
          0.28071687 = weight(abstract_txt:mining in 605) [ClassicSimilarity], result of:
            0.28071687 = score(doc=605,freq=3.0), product of:
              0.41991454 = queryWeight, product of:
                2.5508316 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.026657056 = queryNorm
              0.66850954 = fieldWeight in 605, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.0625 = fieldNorm(doc=605)
          0.16334519 = weight(abstract_txt:extraction in 605) [ClassicSimilarity], result of:
            0.16334519 = score(doc=605,freq=1.0), product of:
              0.4221109 = queryWeight, product of:
                2.557494 = boost
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.026657056 = queryNorm
              0.38697222 = fieldWeight in 605, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.0625 = fieldNorm(doc=605)
        0.71428573 = coord(5/7)
    
  5. Yang, T.-H.; Hsieh, Y.-L.; Liu, S.-H.; Chang, Y.-C.; Hsu, W.-L.: ¬A flexible template generation and matching method with applications for publication reference metadata extraction (2021) 0.40
    0.4031492 = sum of:
      0.4031492 = product of:
        0.94068146 = sum of:
          0.009764782 = weight(abstract_txt:information in 63) [ClassicSimilarity], result of:
            0.009764782 = score(doc=63,freq=1.0), product of:
              0.064535305 = queryWeight, product of:
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.026657056 = queryNorm
              0.15130915 = fieldWeight in 63, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=63)
          0.16334519 = weight(abstract_txt:extraction in 63) [ClassicSimilarity], result of:
            0.16334519 = score(doc=63,freq=1.0), product of:
              0.4221109 = queryWeight, product of:
                2.557494 = boost
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.026657056 = queryNorm
              0.38697222 = fieldWeight in 63, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.0625 = fieldNorm(doc=63)
          0.7675715 = weight(abstract_txt:template in 63) [ClassicSimilarity], result of:
            0.7675715 = score(doc=63,freq=4.0), product of:
              0.74601614 = queryWeight, product of:
                3.3999727 = boost
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.026657056 = queryNorm
              1.028894 = fieldWeight in 63, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.0625 = fieldNorm(doc=63)
        0.42857143 = coord(3/7)