Document (#38863)

Author
Soo, J.
Frieder, O.
Title
On searching misspelled collections
Source
Journal of the Association for Information Science and Technology. 66(2015) no.6, S.1294-1298
Year
2015
Series
Brief communication
Abstract
We describe an unsupervised, language-independent spelling correction search system. We compare the proposed approach with unsupervised and supervised algorithms. The described approach consistently outperforms other unsupervised efforts and nearly matches the performance of a current state-of-the-art supervised approach.
Content
Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23240/abstract.
Theme
Computerlinguistik

Similar documents (author)

  1. Ruocco, A.S.; Frieder, O.: Clustering and classification of large document bases in a parallel environment (1997) 4.46
    4.462149 = sum of:
      4.462149 = weight(author_txt:frieder in 1661) [ClassicSimilarity], result of:
        4.462149 = fieldWeight in 1661, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.924298 = idf(docFreq=15, maxDocs=44218)
          0.5 = fieldNorm(doc=1661)
    
  2. Grossman, D.A.; Frieder, O.: Information retrieval : algorithms and heuristics (1998) 4.46
    4.462149 = sum of:
      4.462149 = weight(author_txt:frieder in 2182) [ClassicSimilarity], result of:
        4.462149 = fieldWeight in 2182, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.924298 = idf(docFreq=15, maxDocs=44218)
          0.5 = fieldNorm(doc=2182)
    
  3. Grossman, D.A.; Frieder, O.: Information retrieval : algorithms and heuristics (2004) 4.46
    4.462149 = sum of:
      4.462149 = weight(author_txt:frieder in 1486) [ClassicSimilarity], result of:
        4.462149 = fieldWeight in 1486, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.924298 = idf(docFreq=15, maxDocs=44218)
          0.5 = fieldNorm(doc=1486)
    
  4. Aljlayl, M.; Frieder, O.; Grossman, D.: On bidirectional English-Arabic search (2002) 3.35
    3.346612 = sum of:
      3.346612 = weight(author_txt:frieder in 5227) [ClassicSimilarity], result of:
        3.346612 = fieldWeight in 5227, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.924298 = idf(docFreq=15, maxDocs=44218)
          0.375 = fieldNorm(doc=5227)
    
  5. Urbain, J.; Goharian, N.; Frieder, O.: Probabilistic passage models for semantic search of genomics literature (2008) 3.35
    3.346612 = sum of:
      3.346612 = weight(author_txt:frieder in 2380) [ClassicSimilarity], result of:
        3.346612 = fieldWeight in 2380, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.924298 = idf(docFreq=15, maxDocs=44218)
          0.375 = fieldNorm(doc=2380)
    

Similar documents (content)

  1. Hubert, G.; Pitarch, Y.; Pinel-Sauvagnat, K.; Tournier, R.; Laporte, L.: TournaRank : when retrieval becomes document competition (2018) 0.27
    0.27227592 = sum of:
      0.27227592 = product of:
        0.85086226 = sum of:
          0.009647168 = weight(abstract_txt:other in 5087) [ClassicSimilarity], result of:
            0.009647168 = score(doc=5087,freq=1.0), product of:
              0.04384459 = queryWeight, product of:
                3.5204957 = idf(docFreq=3555, maxDocs=44218)
                0.012454096 = queryNorm
              0.22003098 = fieldWeight in 5087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5204957 = idf(docFreq=3555, maxDocs=44218)
                0.0625 = fieldNorm(doc=5087)
          0.021651957 = weight(abstract_txt:proposed in 5087) [ClassicSimilarity], result of:
            0.021651957 = score(doc=5087,freq=1.0), product of:
              0.07515898 = queryWeight, product of:
                1.3092797 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.012454096 = queryNorm
              0.2880821 = fieldWeight in 5087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.0625 = fieldNorm(doc=5087)
          0.022864845 = weight(abstract_txt:collections in 5087) [ClassicSimilarity], result of:
            0.022864845 = score(doc=5087,freq=1.0), product of:
              0.07794022 = queryWeight, product of:
                1.3332844 = boost
                4.693822 = idf(docFreq=1099, maxDocs=44218)
                0.012454096 = queryNorm
              0.29336387 = fieldWeight in 5087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.693822 = idf(docFreq=1099, maxDocs=44218)
                0.0625 = fieldNorm(doc=5087)
          0.039134584 = weight(abstract_txt:compare in 5087) [ClassicSimilarity], result of:
            0.039134584 = score(doc=5087,freq=1.0), product of:
              0.11152099 = queryWeight, product of:
                1.5948516 = boost
                5.6146684 = idf(docFreq=437, maxDocs=44218)
                0.012454096 = queryNorm
              0.35091677 = fieldWeight in 5087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6146684 = idf(docFreq=437, maxDocs=44218)
                0.0625 = fieldNorm(doc=5087)
          0.16939585 = weight(abstract_txt:matches in 5087) [ClassicSimilarity], result of:
            0.16939585 = score(doc=5087,freq=3.0), product of:
              0.2053734 = queryWeight, product of:
                2.1642833 = boost
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.012454096 = queryNorm
              0.8248188 = fieldWeight in 5087, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.0625 = fieldNorm(doc=5087)
          0.034848113 = weight(abstract_txt:approach in 5087) [ClassicSimilarity], result of:
            0.034848113 = score(doc=5087,freq=1.0), product of:
              0.14887075 = queryWeight, product of:
                3.1915915 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.012454096 = queryNorm
              0.234083 = fieldWeight in 5087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.0625 = fieldNorm(doc=5087)
          0.25991747 = weight(abstract_txt:supervised in 5087) [ClassicSimilarity], result of:
            0.25991747 = score(doc=5087,freq=2.0), product of:
              0.3940395 = queryWeight, product of:
                4.2396193 = boost
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.012454096 = queryNorm
              0.65962285 = fieldWeight in 5087, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.0625 = fieldNorm(doc=5087)
          0.29340225 = weight(abstract_txt:unsupervised in 5087) [ClassicSimilarity], result of:
            0.29340225 = score(doc=5087,freq=1.0), product of:
              0.6161203 = queryWeight, product of:
                6.49285 = boost
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.012454096 = queryNorm
              0.47620937 = fieldWeight in 5087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.0625 = fieldNorm(doc=5087)
        0.32 = coord(8/25)
    
  2. Ferreira, A.A.; Veloso, A.; Gonçalves, M.A.; Laender, A.H.F.: Self-training author name disambiguation for information scarce scenarios (2014) 0.24
    0.2377116 = sum of:
      0.2377116 = product of:
        0.74284875 = sum of:
          0.009647168 = weight(abstract_txt:other in 1292) [ClassicSimilarity], result of:
            0.009647168 = score(doc=1292,freq=1.0), product of:
              0.04384459 = queryWeight, product of:
                3.5204957 = idf(docFreq=3555, maxDocs=44218)
                0.012454096 = queryNorm
              0.22003098 = fieldWeight in 1292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5204957 = idf(docFreq=3555, maxDocs=44218)
                0.0625 = fieldNorm(doc=1292)
          0.030620491 = weight(abstract_txt:proposed in 1292) [ClassicSimilarity], result of:
            0.030620491 = score(doc=1292,freq=2.0), product of:
              0.07515898 = queryWeight, product of:
                1.3092797 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.012454096 = queryNorm
              0.4074096 = fieldWeight in 1292, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.0625 = fieldNorm(doc=1292)
          0.021950763 = weight(abstract_txt:performance in 1292) [ClassicSimilarity], result of:
            0.021950763 = score(doc=1292,freq=1.0), product of:
              0.075848885 = queryWeight, product of:
                1.3152751 = boost
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.012454096 = queryNorm
              0.28940126 = fieldWeight in 1292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.0625 = fieldNorm(doc=1292)
          0.022864845 = weight(abstract_txt:collections in 1292) [ClassicSimilarity], result of:
            0.022864845 = score(doc=1292,freq=1.0), product of:
              0.07794022 = queryWeight, product of:
                1.3332844 = boost
                4.693822 = idf(docFreq=1099, maxDocs=44218)
                0.012454096 = queryNorm
              0.29336387 = fieldWeight in 1292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.693822 = idf(docFreq=1099, maxDocs=44218)
                0.0625 = fieldNorm(doc=1292)
          0.024912508 = weight(abstract_txt:state in 1292) [ClassicSimilarity], result of:
            0.024912508 = score(doc=1292,freq=1.0), product of:
              0.08252669 = queryWeight, product of:
                1.3719529 = boost
                4.829954 = idf(docFreq=959, maxDocs=44218)
                0.012454096 = queryNorm
              0.30187213 = fieldWeight in 1292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.829954 = idf(docFreq=959, maxDocs=44218)
                0.0625 = fieldNorm(doc=1292)
          0.079533294 = weight(abstract_txt:outperforms in 1292) [ClassicSimilarity], result of:
            0.079533294 = score(doc=1292,freq=1.0), product of:
              0.17892957 = queryWeight, product of:
                2.0201473 = boost
                7.11192 = idf(docFreq=97, maxDocs=44218)
                0.012454096 = queryNorm
              0.444495 = fieldWeight in 1292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.11192 = idf(docFreq=97, maxDocs=44218)
                0.0625 = fieldNorm(doc=1292)
          0.25991747 = weight(abstract_txt:supervised in 1292) [ClassicSimilarity], result of:
            0.25991747 = score(doc=1292,freq=2.0), product of:
              0.3940395 = queryWeight, product of:
                4.2396193 = boost
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.012454096 = queryNorm
              0.65962285 = fieldWeight in 1292, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.0625 = fieldNorm(doc=1292)
          0.29340225 = weight(abstract_txt:unsupervised in 1292) [ClassicSimilarity], result of:
            0.29340225 = score(doc=1292,freq=1.0), product of:
              0.6161203 = queryWeight, product of:
                6.49285 = boost
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.012454096 = queryNorm
              0.47620937 = fieldWeight in 1292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.0625 = fieldNorm(doc=1292)
        0.32 = coord(8/25)
    
  3. Kiela, D.; Clark, S.: Detecting compositionality of multi-word expressions using nearest neighbours in vector space models (2013) 0.20
    0.19561607 = sum of:
      0.19561607 = product of:
        0.97808033 = sum of:
          0.038413834 = weight(abstract_txt:performance in 1161) [ClassicSimilarity], result of:
            0.038413834 = score(doc=1161,freq=1.0), product of:
              0.075848885 = queryWeight, product of:
                1.3152751 = boost
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.012454096 = queryNorm
              0.5064522 = fieldWeight in 1161, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.109375 = fieldNorm(doc=1161)
          0.043596886 = weight(abstract_txt:state in 1161) [ClassicSimilarity], result of:
            0.043596886 = score(doc=1161,freq=1.0), product of:
              0.08252669 = queryWeight, product of:
                1.3719529 = boost
                4.829954 = idf(docFreq=959, maxDocs=44218)
                0.012454096 = queryNorm
              0.5282762 = fieldWeight in 1161, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.829954 = idf(docFreq=959, maxDocs=44218)
                0.109375 = fieldNorm(doc=1161)
          0.060984198 = weight(abstract_txt:approach in 1161) [ClassicSimilarity], result of:
            0.060984198 = score(doc=1161,freq=1.0), product of:
              0.14887075 = queryWeight, product of:
                3.1915915 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.012454096 = queryNorm
              0.40964526 = fieldWeight in 1161, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.109375 = fieldNorm(doc=1161)
          0.32163146 = weight(abstract_txt:supervised in 1161) [ClassicSimilarity], result of:
            0.32163146 = score(doc=1161,freq=1.0), product of:
              0.3940395 = queryWeight, product of:
                4.2396193 = boost
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.012454096 = queryNorm
              0.8162417 = fieldWeight in 1161, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.109375 = fieldNorm(doc=1161)
          0.51345396 = weight(abstract_txt:unsupervised in 1161) [ClassicSimilarity], result of:
            0.51345396 = score(doc=1161,freq=1.0), product of:
              0.6161203 = queryWeight, product of:
                6.49285 = boost
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.012454096 = queryNorm
              0.8333664 = fieldWeight in 1161, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.109375 = fieldNorm(doc=1161)
        0.2 = coord(5/25)
    
  4. Li, M.; Li, H.; Zhou, Z.-H.: Semi-supervised document retrieval (2009) 0.19
    0.19278626 = sum of:
      0.19278626 = product of:
        0.8032761 = sum of:
          0.010822764 = weight(abstract_txt:search in 4218) [ClassicSimilarity], result of:
            0.010822764 = score(doc=4218,freq=1.0), product of:
              0.04733782 = queryWeight, product of:
                1.0390731 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.012454096 = queryNorm
              0.22862828 = fieldWeight in 4218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=4218)
          0.021651957 = weight(abstract_txt:proposed in 4218) [ClassicSimilarity], result of:
            0.021651957 = score(doc=4218,freq=1.0), product of:
              0.07515898 = queryWeight, product of:
                1.3092797 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.012454096 = queryNorm
              0.2880821 = fieldWeight in 4218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.0625 = fieldNorm(doc=4218)
          0.079533294 = weight(abstract_txt:outperforms in 4218) [ClassicSimilarity], result of:
            0.079533294 = score(doc=4218,freq=1.0), product of:
              0.17892957 = queryWeight, product of:
                2.0201473 = boost
                7.11192 = idf(docFreq=97, maxDocs=44218)
                0.012454096 = queryNorm
              0.444495 = fieldWeight in 4218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.11192 = idf(docFreq=97, maxDocs=44218)
                0.0625 = fieldNorm(doc=4218)
          0.079533294 = weight(abstract_txt:consistently in 4218) [ClassicSimilarity], result of:
            0.079533294 = score(doc=4218,freq=1.0), product of:
              0.17892957 = queryWeight, product of:
                2.0201473 = boost
                7.11192 = idf(docFreq=97, maxDocs=44218)
                0.012454096 = queryNorm
              0.444495 = fieldWeight in 4218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.11192 = idf(docFreq=97, maxDocs=44218)
                0.0625 = fieldNorm(doc=4218)
          0.31833258 = weight(abstract_txt:supervised in 4218) [ClassicSimilarity], result of:
            0.31833258 = score(doc=4218,freq=3.0), product of:
              0.3940395 = queryWeight, product of:
                4.2396193 = boost
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.012454096 = queryNorm
              0.80786973 = fieldWeight in 4218, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.0625 = fieldNorm(doc=4218)
          0.29340225 = weight(abstract_txt:unsupervised in 4218) [ClassicSimilarity], result of:
            0.29340225 = score(doc=4218,freq=1.0), product of:
              0.6161203 = queryWeight, product of:
                6.49285 = boost
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.012454096 = queryNorm
              0.47620937 = fieldWeight in 4218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.0625 = fieldNorm(doc=4218)
        0.24 = coord(6/25)
    
  5. Luo, Z.; Yu, Y.; Osborne, M.; Wang, T.: Structuring tweets for improving Twitter search (2015) 0.17
    0.1705393 = sum of:
      0.1705393 = product of:
        0.60906893 = sum of:
          0.010822764 = weight(abstract_txt:search in 2335) [ClassicSimilarity], result of:
            0.010822764 = score(doc=2335,freq=1.0), product of:
              0.04733782 = queryWeight, product of:
                1.0390731 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.012454096 = queryNorm
              0.22862828 = fieldWeight in 2335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.0625 = fieldNorm(doc=2335)
          0.017392335 = weight(abstract_txt:searching in 2335) [ClassicSimilarity], result of:
            0.017392335 = score(doc=2335,freq=1.0), product of:
              0.064946346 = queryWeight, product of:
                1.2170806 = boost
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.012454096 = queryNorm
              0.26779544 = fieldWeight in 2335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.0625 = fieldNorm(doc=2335)
          0.043901525 = weight(abstract_txt:performance in 2335) [ClassicSimilarity], result of:
            0.043901525 = score(doc=2335,freq=4.0), product of:
              0.075848885 = queryWeight, product of:
                1.3152751 = boost
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.012454096 = queryNorm
              0.5788025 = fieldWeight in 2335, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.63042 = idf(docFreq=1171, maxDocs=44218)
                0.0625 = fieldNorm(doc=2335)
          0.024912508 = weight(abstract_txt:state in 2335) [ClassicSimilarity], result of:
            0.024912508 = score(doc=2335,freq=1.0), product of:
              0.08252669 = queryWeight, product of:
                1.3719529 = boost
                4.829954 = idf(docFreq=959, maxDocs=44218)
                0.012454096 = queryNorm
              0.30187213 = fieldWeight in 2335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.829954 = idf(docFreq=959, maxDocs=44218)
                0.0625 = fieldNorm(doc=2335)
          0.034848113 = weight(abstract_txt:approach in 2335) [ClassicSimilarity], result of:
            0.034848113 = score(doc=2335,freq=1.0), product of:
              0.14887075 = queryWeight, product of:
                3.1915915 = boost
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.012454096 = queryNorm
              0.234083 = fieldWeight in 2335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.745328 = idf(docFreq=2839, maxDocs=44218)
                0.0625 = fieldNorm(doc=2335)
          0.18378942 = weight(abstract_txt:supervised in 2335) [ClassicSimilarity], result of:
            0.18378942 = score(doc=2335,freq=1.0), product of:
              0.3940395 = queryWeight, product of:
                4.2396193 = boost
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.012454096 = queryNorm
              0.4664238 = fieldWeight in 2335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.462781 = idf(docFreq=68, maxDocs=44218)
                0.0625 = fieldNorm(doc=2335)
          0.29340225 = weight(abstract_txt:unsupervised in 2335) [ClassicSimilarity], result of:
            0.29340225 = score(doc=2335,freq=1.0), product of:
              0.6161203 = queryWeight, product of:
                6.49285 = boost
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.012454096 = queryNorm
              0.47620937 = fieldWeight in 2335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.61935 = idf(docFreq=58, maxDocs=44218)
                0.0625 = fieldNorm(doc=2335)
        0.28 = coord(7/25)