Document (#38122)

Author
Manning, C.D.
Title
Part-of-Speech Tagging from 97% to 100% : is it time for some linguistics?
Source
Computational Linguistics and Intelligent Text Processing, 12th International Conference, CICLing 2011, Proceedings, Part I. Ed.: Alexander Gelbukh
Imprint
Berlin : Springer
Year
2011
Pages
pp. 171-189
Series
Lecture notes in computer science; 6608
Abstract
I examine what would be necessary to move part-of-speech tagging performance from its current level of about 97.3% token accuracy (56% sentence accuracy) to close to 100% accuracy. I suggest that it must still be possible to greatly increase tagging performance and examine some useful improvements that have recently been made to the Stanford Part-of-Speech Tagger. However, an error analysis of some of the remaining errors suggests that there is limited further mileage to be had either from better machine learning or better features in a discriminative sequence classifier. The prospects for further gains from semisupervised learning also seem quite limited. Rather, I suggest and begin to demonstrate that the largest opportunity for further progress comes from improving the taxonomic basis of the linguistic resources from which taggers are trained. That is, from improved descriptive linguistics. However, I conclude by suggesting that there are also limits to this process. The status of some words may not be able to be adequately captured by assigning them to one of a small number of categories. While conventions can be used in such cases to improve tagging consistency, they lack a strong linguistic basis.
Content
Cf.: http://nlp.stanford.edu/~manning/papers/CICLing2011-manning-tagging.pdf.
Theme
Computational linguistics

Similar documents (author)

  1. Manning, R.W.: The Anglo-American Cataloguing Rules and their future (1999) 5.76
    5.7574883 = sum of:
      5.7574883 = weight(author_txt:manning in 6809) [ClassicSimilarity], result of:
        5.7574883 = fieldWeight in 6809, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.211981 = idf(docFreq=11, maxDocs=44218)
          0.625 = fieldNorm(doc=6809)
    
  2. Manning, R.W.: The Anglo American Cataloguing Rules and their future (2000) 5.76
    5.7574883 = sum of:
      5.7574883 = weight(author_txt:manning in 189) [ClassicSimilarity], result of:
        5.7574883 = fieldWeight in 189, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.211981 = idf(docFreq=11, maxDocs=44218)
          0.625 = fieldNorm(doc=189)
    
  3. Mallett, J.; Manning, C.: Multimedia and database design : a discussion of database technology and its use in multimedia (1993) 4.61
    4.6059904 = sum of:
      4.6059904 = weight(author_txt:manning in 6277) [ClassicSimilarity], result of:
        4.6059904 = fieldWeight in 6277, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.211981 = idf(docFreq=11, maxDocs=44218)
          0.5 = fieldNorm(doc=6277)
    
  4. Toutanova, K.; Manning, C.D.: Enriching the knowledge sources used in a maximum entropy Part-of-Speech Tagger (2000) 4.61
    4.6059904 = sum of:
      4.6059904 = weight(author_txt:manning in 1060) [ClassicSimilarity], result of:
        4.6059904 = fieldWeight in 1060, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.211981 = idf(docFreq=11, maxDocs=44218)
          0.5 = fieldNorm(doc=1060)
    
  5. Manning, C.D.; Schütze, H.: Foundations of statistical natural language processing (2000) 4.61
    4.6059904 = sum of:
      4.6059904 = weight(author_txt:manning in 1603) [ClassicSimilarity], result of:
        4.6059904 = fieldWeight in 1603, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.211981 = idf(docFreq=11, maxDocs=44218)
          0.5 = fieldNorm(doc=1603)
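
The author-match scores above can be reproduced by hand. The following is a minimal sketch, assuming the figures follow the classic TF-IDF formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). Both formulas are inferred from the numbers listed in this record, and the function names are illustrative, not part of any library.

```python
import math


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw term count (assumed)."""
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency, assumed form: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the breakdowns listed above."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the listing: author term "manning", docFreq=11, maxDocs=44218.
idf_manning = classic_idf(11, 44218)
score_doc_6809 = field_weight(1.0, 11, 44218, 0.625)  # fieldNorm=0.625
score_doc_1603 = field_weight(1.0, 11, 44218, 0.5)    # fieldNorm=0.5

print(idf_manning, score_doc_6809, score_doc_1603)
```

Under these assumptions the computed values agree with the listed 9.211981, 5.7574883 and 4.6059904 up to small rounding differences, which is expected if the listed figures were produced in single precision.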
    

Similar documents (content)

  1. L'Homme, D.; L'Homme, M.-C.; Lemay, C.: Benchmarking the performance of two Part-of-Speech (POS) taggers for terminological purposes (2002) 0.50
    0.502506 = sum of:
      0.502506 = product of:
        1.256265 = sum of:
          0.031682994 = weight(abstract_txt:however in 1855) [ClassicSimilarity], result of:
            0.031682994 = score(doc=1855,freq=1.0), product of:
              0.09618078 = queryWeight, product of:
                1.0530605 = boost
                4.216459 = idf(docFreq=1772, maxDocs=44218)
                0.021661429 = queryNorm
              0.32941085 = fieldWeight in 1855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.216459 = idf(docFreq=1772, maxDocs=44218)
                0.078125 = fieldNorm(doc=1855)
          0.31129226 = weight(abstract_txt:taggers in 1855) [ClassicSimilarity], result of:
            0.31129226 = score(doc=1855,freq=5.0), product of:
              0.2047936 = queryWeight, product of:
                1.0865567 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.021661429 = queryNorm
              1.5200292 = fieldWeight in 1855, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.078125 = fieldNorm(doc=1855)
          0.19687851 = weight(abstract_txt:tagger in 1855) [ClassicSimilarity], result of:
            0.19687851 = score(doc=1855,freq=2.0), product of:
              0.2047936 = queryWeight, product of:
                1.0865567 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.021661429 = queryNorm
              0.96135086 = fieldWeight in 1855, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.078125 = fieldNorm(doc=1855)
          0.04196081 = weight(abstract_txt:performance in 1855) [ClassicSimilarity], result of:
            0.04196081 = score(doc=1855,freq=1.0), product of:
              0.11599343 = queryWeight, product of:
                1.1564474 = boost
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.021661429 = queryNorm
              0.3617516 = fieldWeight in 1855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.078125 = fieldNorm(doc=1855)
          0.06092711 = weight(abstract_txt:part in 1855) [ClassicSimilarity], result of:
            0.06092711 = score(doc=1855,freq=1.0), product of:
              0.17025831 = queryWeight, product of:
                1.7159672 = boost
                4.580493 = idf(docFreq=1231, maxDocs=44218)
                0.021661429 = queryNorm
              0.35785103 = fieldWeight in 1855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.580493 = idf(docFreq=1231, maxDocs=44218)
                0.078125 = fieldNorm(doc=1855)
          0.016867874 = weight(abstract_txt:that in 1855) [ClassicSimilarity], result of:
            0.016867874 = score(doc=1855,freq=1.0), product of:
              0.09112093 = queryWeight, product of:
                1.7753292 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.021661429 = queryNorm
              0.18511525 = fieldWeight in 1855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=1855)
          0.13489798 = weight(abstract_txt:accuracy in 1855) [ClassicSimilarity], result of:
            0.13489798 = score(doc=1855,freq=1.0), product of:
              0.2892266 = queryWeight, product of:
                2.236526 = boost
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.021661429 = queryNorm
              0.46640933 = fieldWeight in 1855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.078125 = fieldNorm(doc=1855)
          0.044169705 = weight(abstract_txt:from in 1855) [ClassicSimilarity], result of:
            0.044169705 = score(doc=1855,freq=2.0), product of:
              0.14464381 = queryWeight, product of:
                2.4159791 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.021661429 = queryNorm
              0.30536878 = fieldWeight in 1855, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.078125 = fieldNorm(doc=1855)
          0.20543459 = weight(abstract_txt:speech in 1855) [ClassicSimilarity], result of:
            0.20543459 = score(doc=1855,freq=1.0), product of:
              0.3828397 = queryWeight, product of:
                2.5731394 = boost
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.021661429 = queryNorm
              0.5366073 = fieldWeight in 1855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.078125 = fieldNorm(doc=1855)
          0.2121532 = weight(abstract_txt:tagging in 1855) [ClassicSimilarity], result of:
            0.2121532 = score(doc=1855,freq=1.0), product of:
              0.43050733 = queryWeight, product of:
                3.150754 = boost
                6.3078156 = idf(docFreq=218, maxDocs=44218)
                0.021661429 = queryNorm
              0.4927981 = fieldWeight in 1855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3078156 = idf(docFreq=218, maxDocs=44218)
                0.078125 = fieldNorm(doc=1855)
        0.4 = coord(10/25)
    
  2. Toutanova, K.; Manning, C.D.: Enriching the knowledge sources used in a maximum entropy Part-of-Speech Tagger (2000) 0.30
    0.29685092 = sum of:
      0.29685092 = product of:
        1.0601819 = sum of:
          0.23625422 = weight(abstract_txt:tagger in 1060) [ClassicSimilarity], result of:
            0.23625422 = score(doc=1060,freq=2.0), product of:
              0.2047936 = queryWeight, product of:
                1.0865567 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.021661429 = queryNorm
              1.1536211 = fieldWeight in 1060, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.09375 = fieldNorm(doc=1060)
          0.05035297 = weight(abstract_txt:performance in 1060) [ClassicSimilarity], result of:
            0.05035297 = score(doc=1060,freq=1.0), product of:
              0.11599343 = queryWeight, product of:
                1.1564474 = boost
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.021661429 = queryNorm
              0.43410188 = fieldWeight in 1060, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.09375 = fieldNorm(doc=1060)
          0.07311253 = weight(abstract_txt:part in 1060) [ClassicSimilarity], result of:
            0.07311253 = score(doc=1060,freq=1.0), product of:
              0.17025831 = queryWeight, product of:
                1.7159672 = boost
                4.580493 = idf(docFreq=1231, maxDocs=44218)
                0.021661429 = queryNorm
              0.42942122 = fieldWeight in 1060, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.580493 = idf(docFreq=1231, maxDocs=44218)
                0.09375 = fieldNorm(doc=1060)
          0.16187757 = weight(abstract_txt:accuracy in 1060) [ClassicSimilarity], result of:
            0.16187757 = score(doc=1060,freq=1.0), product of:
              0.2892266 = queryWeight, product of:
                2.236526 = boost
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.021661429 = queryNorm
              0.5596912 = fieldWeight in 1060, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9700394 = idf(docFreq=306, maxDocs=44218)
                0.09375 = fieldNorm(doc=1060)
          0.037479237 = weight(abstract_txt:from in 1060) [ClassicSimilarity], result of:
            0.037479237 = score(doc=1060,freq=1.0), product of:
              0.14464381 = queryWeight, product of:
                2.4159791 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.021661429 = queryNorm
              0.259114 = fieldWeight in 1060, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.09375 = fieldNorm(doc=1060)
          0.2465215 = weight(abstract_txt:speech in 1060) [ClassicSimilarity], result of:
            0.2465215 = score(doc=1060,freq=1.0), product of:
              0.3828397 = queryWeight, product of:
                2.5731394 = boost
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.021661429 = queryNorm
              0.64392877 = fieldWeight in 1060, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.09375 = fieldNorm(doc=1060)
          0.25458384 = weight(abstract_txt:tagging in 1060) [ClassicSimilarity], result of:
            0.25458384 = score(doc=1060,freq=1.0), product of:
              0.43050733 = queryWeight, product of:
                3.150754 = boost
                6.3078156 = idf(docFreq=218, maxDocs=44218)
                0.021661429 = queryNorm
              0.5913577 = fieldWeight in 1060, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3078156 = idf(docFreq=218, maxDocs=44218)
                0.09375 = fieldNorm(doc=1060)
        0.28 = coord(7/25)
    
  3. Xu, C.; Ma, B.; Chen, X.; Ma, F.: Social tagging in the scholarly world (2013) 0.28
    0.27898717 = sum of:
      0.27898717 = product of:
        0.9963828 = sum of:
          0.023294989 = weight(abstract_txt:there in 1091) [ClassicSimilarity], result of:
            0.023294989 = score(doc=1091,freq=1.0), product of:
              0.090918556 = queryWeight, product of:
                1.0238479 = boost
                4.099491 = idf(docFreq=1992, maxDocs=44218)
                0.021661429 = queryNorm
              0.2562182 = fieldWeight in 1091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.099491 = idf(docFreq=1992, maxDocs=44218)
                0.0625 = fieldNorm(doc=1091)
          0.1113713 = weight(abstract_txt:taggers in 1091) [ClassicSimilarity], result of:
            0.1113713 = score(doc=1091,freq=1.0), product of:
              0.2047936 = queryWeight, product of:
                1.0865567 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.021661429 = queryNorm
              0.54382217 = fieldWeight in 1091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.0625 = fieldNorm(doc=1091)
          0.047753587 = weight(abstract_txt:suggest in 1091) [ClassicSimilarity], result of:
            0.047753587 = score(doc=1091,freq=1.0), product of:
              0.14671737 = queryWeight, product of:
                1.3006186 = boost
                5.207682 = idf(docFreq=657, maxDocs=44218)
                0.021661429 = queryNorm
              0.32548013 = fieldWeight in 1091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.207682 = idf(docFreq=657, maxDocs=44218)
                0.0625 = fieldNorm(doc=1091)
          0.055470362 = weight(abstract_txt:limited in 1091) [ClassicSimilarity], result of:
            0.055470362 = score(doc=1091,freq=1.0), product of:
              0.16212557 = queryWeight, product of:
                1.3672092 = boost
                5.474311 = idf(docFreq=503, maxDocs=44218)
                0.021661429 = queryNorm
              0.34214443 = fieldWeight in 1091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.474311 = idf(docFreq=503, maxDocs=44218)
                0.0625 = fieldNorm(doc=1091)
          0.023372808 = weight(abstract_txt:that in 1091) [ClassicSimilarity], result of:
            0.023372808 = score(doc=1091,freq=3.0), product of:
              0.09112093 = queryWeight, product of:
                1.7753292 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.021661429 = queryNorm
              0.2565032 = fieldWeight in 1091, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=1091)
          0.035335764 = weight(abstract_txt:from in 1091) [ClassicSimilarity], result of:
            0.035335764 = score(doc=1091,freq=2.0), product of:
              0.14464381 = queryWeight, product of:
                2.4159791 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.021661429 = queryNorm
              0.24429502 = fieldWeight in 1091, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=1091)
          0.699784 = weight(abstract_txt:tagging in 1091) [ClassicSimilarity], result of:
            0.699784 = score(doc=1091,freq=17.0), product of:
              0.43050733 = queryWeight, product of:
                3.150754 = boost
                6.3078156 = idf(docFreq=218, maxDocs=44218)
                0.021661429 = queryNorm
              1.6254869 = fieldWeight in 1091, product of:
                4.1231055 = tf(freq=17.0), with freq of:
                  17.0 = termFreq=17.0
                6.3078156 = idf(docFreq=218, maxDocs=44218)
                0.0625 = fieldNorm(doc=1091)
        0.28 = coord(7/25)
    
  4. Heckner, M.; Mühlbacher, S.; Wolff, C.: Tagging tagging : a classification model for user keywords in scientific bibliography management systems (2007) 0.26
    0.2575048 = sum of:
      0.2575048 = product of:
        0.7152911 = sum of:
          0.019009795 = weight(abstract_txt:however in 533) [ClassicSimilarity], result of:
            0.019009795 = score(doc=533,freq=1.0), product of:
              0.09618078 = queryWeight, product of:
                1.0530605 = boost
                4.216459 = idf(docFreq=1772, maxDocs=44218)
                0.021661429 = queryNorm
              0.1976465 = fieldWeight in 533, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.216459 = idf(docFreq=1772, maxDocs=44218)
                0.046875 = fieldNorm(doc=533)
          0.083528474 = weight(abstract_txt:tagger in 533) [ClassicSimilarity], result of:
            0.083528474 = score(doc=533,freq=1.0), product of:
              0.2047936 = queryWeight, product of:
                1.0865567 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.021661429 = queryNorm
              0.40786663 = fieldWeight in 533, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.046875 = fieldNorm(doc=533)
          0.026043452 = weight(abstract_txt:basis in 533) [ClassicSimilarity], result of:
            0.026043452 = score(doc=533,freq=1.0), product of:
              0.11864125 = queryWeight, product of:
                1.1695722 = boost
                4.682972 = idf(docFreq=1111, maxDocs=44218)
                0.021661429 = queryNorm
              0.21951431 = fieldWeight in 533, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.682972 = idf(docFreq=1111, maxDocs=44218)
                0.046875 = fieldNorm(doc=533)
          0.050115272 = weight(abstract_txt:linguistic in 533) [ClassicSimilarity], result of:
            0.050115272 = score(doc=533,freq=1.0), product of:
              0.18354818 = queryWeight, product of:
                1.4547362 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.021661429 = queryNorm
              0.27303606 = fieldWeight in 533, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.046875 = fieldNorm(doc=533)
          0.07447612 = weight(abstract_txt:linguistics in 533) [ClassicSimilarity], result of:
            0.07447612 = score(doc=533,freq=1.0), product of:
              0.2390276 = queryWeight, product of:
                1.660096 = boost
                6.6470313 = idf(docFreq=155, maxDocs=44218)
                0.021661429 = queryNorm
              0.31157959 = fieldWeight in 533, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6470313 = idf(docFreq=155, maxDocs=44218)
                0.046875 = fieldNorm(doc=533)
          0.017529607 = weight(abstract_txt:that in 533) [ClassicSimilarity], result of:
            0.017529607 = score(doc=533,freq=3.0), product of:
              0.09112093 = queryWeight, product of:
                1.7753292 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.021661429 = queryNorm
              0.19237739 = fieldWeight in 533, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.046875 = fieldNorm(doc=533)
          0.025233364 = weight(abstract_txt:some in 533) [ClassicSimilarity], result of:
            0.025233364 = score(doc=533,freq=1.0), product of:
              0.1463626 = queryWeight, product of:
                1.8371273 = boost
                3.6779325 = idf(docFreq=3037, maxDocs=44218)
                0.021661429 = queryNorm
              0.17240308 = fieldWeight in 533, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6779325 = idf(docFreq=3037, maxDocs=44218)
                0.046875 = fieldNorm(doc=533)
          0.037479237 = weight(abstract_txt:from in 533) [ClassicSimilarity], result of:
            0.037479237 = score(doc=533,freq=4.0), product of:
              0.14464381 = queryWeight, product of:
                2.4159791 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.021661429 = queryNorm
              0.259114 = fieldWeight in 533, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.046875 = fieldNorm(doc=533)
          0.38187575 = weight(abstract_txt:tagging in 533) [ClassicSimilarity], result of:
            0.38187575 = score(doc=533,freq=9.0), product of:
              0.43050733 = queryWeight, product of:
                3.150754 = boost
                6.3078156 = idf(docFreq=218, maxDocs=44218)
                0.021661429 = queryNorm
              0.88703656 = fieldWeight in 533, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                6.3078156 = idf(docFreq=218, maxDocs=44218)
                0.046875 = fieldNorm(doc=533)
        0.36 = coord(9/25)
    
  5. Losee, R.M.: Learning syntactic rules and tags with genetic algorithms for information retrieval and filtering : an empirical basis for grammatical rules (1996) 0.25
    0.25090855 = sum of:
      0.25090855 = product of:
        0.6969682 = sum of:
          0.04196081 = weight(abstract_txt:performance in 4068) [ClassicSimilarity], result of:
            0.04196081 = score(doc=4068,freq=1.0), product of:
              0.11599343 = queryWeight, product of:
                1.1564474 = boost
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.021661429 = queryNorm
              0.3617516 = fieldWeight in 4068, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.078125 = fieldNorm(doc=4068)
          0.04532136 = weight(abstract_txt:learning in 4068) [ClassicSimilarity], result of:
            0.04532136 = score(doc=4068,freq=1.0), product of:
              0.12210669 = queryWeight, product of:
                1.1865306 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.021661429 = queryNorm
              0.37116197 = fieldWeight in 4068, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.078125 = fieldNorm(doc=4068)
          0.08352546 = weight(abstract_txt:linguistic in 4068) [ClassicSimilarity], result of:
            0.08352546 = score(doc=4068,freq=1.0), product of:
              0.18354818 = queryWeight, product of:
                1.4547362 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.021661429 = queryNorm
              0.45506012 = fieldWeight in 4068, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.078125 = fieldNorm(doc=4068)
          0.06092711 = weight(abstract_txt:part in 4068) [ClassicSimilarity], result of:
            0.06092711 = score(doc=4068,freq=1.0), product of:
              0.17025831 = queryWeight, product of:
                1.7159672 = boost
                4.580493 = idf(docFreq=1231, maxDocs=44218)
                0.021661429 = queryNorm
              0.35785103 = fieldWeight in 4068, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.580493 = idf(docFreq=1231, maxDocs=44218)
                0.078125 = fieldNorm(doc=4068)
          0.06462504 = weight(abstract_txt:further in 4068) [ClassicSimilarity], result of:
            0.06462504 = score(doc=4068,freq=1.0), product of:
              0.17707959 = queryWeight, product of:
                1.7500042 = boost
                4.671349 = idf(docFreq=1124, maxDocs=44218)
                0.021661429 = queryNorm
              0.36494914 = fieldWeight in 4068, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.671349 = idf(docFreq=1124, maxDocs=44218)
                0.078125 = fieldNorm(doc=4068)
          0.023854773 = weight(abstract_txt:that in 4068) [ClassicSimilarity], result of:
            0.023854773 = score(doc=4068,freq=2.0), product of:
              0.09112093 = queryWeight, product of:
                1.7753292 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.021661429 = queryNorm
              0.26179248 = fieldWeight in 4068, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=4068)
          0.042055607 = weight(abstract_txt:some in 4068) [ClassicSimilarity], result of:
            0.042055607 = score(doc=4068,freq=1.0), product of:
              0.1463626 = queryWeight, product of:
                1.8371273 = boost
                3.6779325 = idf(docFreq=3037, maxDocs=44218)
                0.021661429 = queryNorm
              0.28733847 = fieldWeight in 4068, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6779325 = idf(docFreq=3037, maxDocs=44218)
                0.078125 = fieldNorm(doc=4068)
          0.044169705 = weight(abstract_txt:from in 4068) [ClassicSimilarity], result of:
            0.044169705 = score(doc=4068,freq=2.0), product of:
              0.14464381 = queryWeight, product of:
                2.4159791 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.021661429 = queryNorm
              0.30536878 = fieldWeight in 4068, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.078125 = fieldNorm(doc=4068)
          0.2905284 = weight(abstract_txt:speech in 4068) [ClassicSimilarity], result of:
            0.2905284 = score(doc=4068,freq=2.0), product of:
              0.3828397 = queryWeight, product of:
                2.5731394 = boost
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.021661429 = queryNorm
              0.75887734 = fieldWeight in 4068, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.078125 = fieldNorm(doc=4068)
        0.36 = coord(9/25)
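
The content-match scores above combine more factors than the author scores. The sketch below assumes that each term score is queryWeight * fieldWeight, with queryWeight = boost * idf * queryNorm and fieldWeight = tf * idf * fieldNorm, and that the sum over matching terms is scaled by coord = matched terms / query terms. The formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)) are inferred from the listed figures; all function names are illustrative.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Assumed idf form: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, max_docs: int,
               boost: float, query_norm: float, field_norm: float) -> float:
    """Score of one query term in one document: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm
    field_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * field_weight


def coord(matched_terms: int, query_terms: int) -> float:
    """Fraction of query terms that occur in the document."""
    return matched_terms / query_terms


QUERY_NORM = 0.021661429  # copied from the listing
MAX_DOCS = 44218          # copied from the listing

# "taggers" in doc 1855: freq=5, docFreq=19, boost=1.0865567, fieldNorm=0.078125
taggers_1855 = term_score(5.0, 19, MAX_DOCS, 1.0865567, QUERY_NORM, 0.078125)

# "tagging" in doc 1855: freq=1, docFreq=218, boost=3.150754, fieldNorm=0.078125
tagging_1855 = term_score(1.0, 218, MAX_DOCS, 3.150754, QUERY_NORM, 0.078125)

# Final score for doc 1855: listed sum of term scores (1.256265) times coord(10/25).
total_1855 = coord(10, 25) * 1.256265

print(taggers_1855, tagging_1855, total_1855)
```

Under these assumptions the values agree with the listed 0.31129226, 0.2121532 and 0.502506 up to rounding. The coord factor explains why a document matching 10 of the 25 query terms can outrank one with a higher score on a single term, as with documents 1855 and 1091 above.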