Document (#38593)

Author
Nédellec, C.
Bossy, R.
Valsamou, D.
Ranoux, M.
Golik, W.
Sourdille, P.
Title
Information extraction from bbliography for marker-assisted selection in wheat
Source
Metadata and semantics research: 8th Research Conference, MTSR 2014, Karlsruhe, Germany, November 27-29, 2014, Proceedings. Eds.: S. Closs et al
Imprint
Cham : Springer
Year
2014
Pages
S.301-313
Series
Communications in computer and information science; 478
Abstract
Improvement of most animal and plant species of agronomical interest in the near future has become an international stake because of the increasing demand for feeding a growing world population and to mitigate the reduction of the industrial resources. The recent advent of genomic tools contributed to improve the discovery of linkage between molecular markers and genes that are involved in the control of traits of agronomical interest such as grain number or disease resistance. This information is mostly published as scientific papers but rarely available in databases. Here, we present a method aiming at automatically extract this information from the scientific literature and relying on a knowledge model of the target information and on the WheatPhenotype ontology that we developed for this purpose. The information extraction results were evaluated and integrated into the on-line semantic search engine AlvisIR WheatMarker.
Field
Agrarwissenschaften

Similar documents (content)

  1. Hofmann-Apitius, M.: Direct use of information extraction from scientific text for modeling and simulation in the life sciences (2009) 0.14
    0.13930209 = sum of:
      0.13930209 = product of:
        0.69651043 = sum of:
          0.1729441 = weight(abstract_txt:disease in 2814) [ClassicSimilarity], result of:
            0.1729441 = score(doc=2814,freq=4.0), product of:
              0.17949794 = queryWeight, product of:
                1.0822399 = boost
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.02151789 = queryNorm
              0.9634879 = fieldWeight in 2814, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.0625 = fieldNorm(doc=2814)
          0.14118163 = weight(abstract_txt:molecular in 2814) [ClassicSimilarity], result of:
            0.14118163 = score(doc=2814,freq=2.0), product of:
              0.19753821 = queryWeight, product of:
                1.1353228 = boost
                8.085969 = idf(docFreq=36, maxDocs=44218)
                0.02151789 = queryNorm
              0.7147054 = fieldWeight in 2814, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.085969 = idf(docFreq=36, maxDocs=44218)
                0.0625 = fieldNorm(doc=2814)
          0.25567406 = weight(abstract_txt:genes in 2814) [ClassicSimilarity], result of:
            0.25567406 = score(doc=2814,freq=3.0), product of:
              0.25638524 = queryWeight, product of:
                1.2934222 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.02151789 = queryNorm
              0.9972262 = fieldWeight in 2814, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.0625 = fieldNorm(doc=2814)
          0.09991758 = weight(abstract_txt:scientific in 2814) [ClassicSimilarity], result of:
            0.09991758 = score(doc=2814,freq=7.0), product of:
              0.130181 = queryWeight, product of:
                1.3034147 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.02151789 = queryNorm
              0.7675282 = fieldWeight in 2814, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.0625 = fieldNorm(doc=2814)
          0.026793072 = weight(abstract_txt:information in 2814) [ClassicSimilarity], result of:
            0.026793072 = score(doc=2814,freq=4.0), product of:
              0.088537514 = queryWeight, product of:
                1.6995833 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.02151789 = queryNorm
              0.3026183 = fieldWeight in 2814, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=2814)
        0.2 = coord(5/25)
    
  2. Dextre Clarke, S.G.: ¬The Information Retrieval Thesaurus (2019) 0.10
    0.10161255 = sum of:
      0.10161255 = product of:
        0.42338565 = sum of:
          0.014070019 = weight(abstract_txt:this in 5210) [ClassicSimilarity], result of:
            0.014070019 = score(doc=5210,freq=2.0), product of:
              0.052775115 = queryWeight, product of:
                1.0164102 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.02151789 = queryNorm
              0.2666033 = fieldWeight in 5210, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.078125 = fieldNorm(doc=5210)
          0.09099583 = weight(abstract_txt:industrial in 5210) [ClassicSimilarity], result of:
            0.09099583 = score(doc=5210,freq=1.0), product of:
              0.16003563 = queryWeight, product of:
                1.0218853 = boost
                7.2780466 = idf(docFreq=82, maxDocs=44218)
                0.02151789 = queryNorm
              0.5685974 = fieldWeight in 5210, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2780466 = idf(docFreq=82, maxDocs=44218)
                0.078125 = fieldNorm(doc=5210)
          0.18451688 = weight(abstract_txt:genes in 5210) [ClassicSimilarity], result of:
            0.18451688 = score(doc=5210,freq=1.0), product of:
              0.25638524 = queryWeight, product of:
                1.2934222 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.02151789 = queryNorm
              0.71968603 = fieldWeight in 5210, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.078125 = fieldNorm(doc=5210)
          0.047206625 = weight(abstract_txt:scientific in 5210) [ClassicSimilarity], result of:
            0.047206625 = score(doc=5210,freq=1.0), product of:
              0.130181 = queryWeight, product of:
                1.3034147 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.02151789 = queryNorm
              0.362623 = fieldWeight in 5210, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.078125 = fieldNorm(doc=5210)
          0.062914334 = weight(abstract_txt:interest in 5210) [ClassicSimilarity], result of:
            0.062914334 = score(doc=5210,freq=1.0), product of:
              0.15765655 = queryWeight, product of:
                1.434382 = boost
                5.1079607 = idf(docFreq=726, maxDocs=44218)
                0.02151789 = queryNorm
              0.3990594 = fieldWeight in 5210, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1079607 = idf(docFreq=726, maxDocs=44218)
                0.078125 = fieldNorm(doc=5210)
          0.023681952 = weight(abstract_txt:information in 5210) [ClassicSimilarity], result of:
            0.023681952 = score(doc=5210,freq=2.0), product of:
              0.088537514 = queryWeight, product of:
                1.6995833 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.02151789 = queryNorm
              0.2674793 = fieldWeight in 5210, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.078125 = fieldNorm(doc=5210)
        0.24 = coord(6/25)
    
  3. Liu, R.-L.: ¬A passage extractor for classification of disease aspect information (2013) 0.08
    0.0779395 = sum of:
      0.0779395 = product of:
        0.3896975 = sum of:
          0.007959205 = weight(abstract_txt:this in 1107) [ClassicSimilarity], result of:
            0.007959205 = score(doc=1107,freq=1.0), product of:
              0.052775115 = queryWeight, product of:
                1.0164102 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.02151789 = queryNorm
              0.1508136 = fieldWeight in 1107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0625 = fieldNorm(doc=1107)
          0.21181239 = weight(abstract_txt:disease in 1107) [ClassicSimilarity], result of:
            0.21181239 = score(doc=1107,freq=6.0), product of:
              0.17949794 = queryWeight, product of:
                1.0822399 = boost
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.02151789 = queryNorm
              1.1800269 = fieldWeight in 1107, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.0625 = fieldNorm(doc=1107)
          0.050331466 = weight(abstract_txt:interest in 1107) [ClassicSimilarity], result of:
            0.050331466 = score(doc=1107,freq=1.0), product of:
              0.15765655 = queryWeight, product of:
                1.434382 = boost
                5.1079607 = idf(docFreq=726, maxDocs=44218)
                0.02151789 = queryNorm
              0.31924754 = fieldWeight in 1107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1079607 = idf(docFreq=726, maxDocs=44218)
                0.0625 = fieldNorm(doc=1107)
          0.029955564 = weight(abstract_txt:information in 1107) [ClassicSimilarity], result of:
            0.029955564 = score(doc=1107,freq=5.0), product of:
              0.088537514 = queryWeight, product of:
                1.6995833 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.02151789 = queryNorm
              0.33833754 = fieldWeight in 1107, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=1107)
          0.08963885 = weight(abstract_txt:extraction in 1107) [ClassicSimilarity], result of:
            0.08963885 = score(doc=1107,freq=1.0), product of:
              0.23164156 = queryWeight, product of:
                1.7386695 = boost
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.02151789 = queryNorm
              0.38697222 = fieldWeight in 1107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.0625 = fieldNorm(doc=1107)
        0.2 = coord(5/25)
    
  4. Michon, J.: Biomedicine and the Semantic Web : a knowledge model for visual phenotype (2006) 0.08
    0.07731932 = sum of:
      0.07731932 = product of:
        0.38659656 = sum of:
          0.013785746 = weight(abstract_txt:this in 246) [ClassicSimilarity], result of:
            0.013785746 = score(doc=246,freq=3.0), product of:
              0.052775115 = queryWeight, product of:
                1.0164102 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.02151789 = queryNorm
              0.2612168 = fieldWeight in 246, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0625 = fieldNorm(doc=246)
          0.08647205 = weight(abstract_txt:disease in 246) [ClassicSimilarity], result of:
            0.08647205 = score(doc=246,freq=1.0), product of:
              0.17949794 = queryWeight, product of:
                1.0822399 = boost
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.02151789 = queryNorm
              0.48174396 = fieldWeight in 246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.0625 = fieldNorm(doc=246)
          0.09983049 = weight(abstract_txt:molecular in 246) [ClassicSimilarity], result of:
            0.09983049 = score(doc=246,freq=1.0), product of:
              0.19753821 = queryWeight, product of:
                1.1353228 = boost
                8.085969 = idf(docFreq=36, maxDocs=44218)
                0.02151789 = queryNorm
              0.50537306 = fieldWeight in 246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.085969 = idf(docFreq=36, maxDocs=44218)
                0.0625 = fieldNorm(doc=246)
          0.15655272 = weight(abstract_txt:genomic in 246) [ClassicSimilarity], result of:
            0.15655272 = score(doc=246,freq=1.0), product of:
              0.26663432 = queryWeight, product of:
                1.3190213 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.02151789 = queryNorm
              0.5871439 = fieldWeight in 246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0625 = fieldNorm(doc=246)
          0.029955564 = weight(abstract_txt:information in 246) [ClassicSimilarity], result of:
            0.029955564 = score(doc=246,freq=5.0), product of:
              0.088537514 = queryWeight, product of:
                1.6995833 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.02151789 = queryNorm
              0.33833754 = fieldWeight in 246, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=246)
        0.2 = coord(5/25)
    
  5. Sy, M.-F.; Ranwez, S.; Montmain, J.; Ragnault, A.; Crampes, M.; Ranwez, V.: User centered and ontology based information retrieval system for life sciences (2012) 0.08
    0.07591926 = sum of:
      0.07591926 = product of:
        0.37959632 = sum of:
          0.015572658 = weight(abstract_txt:this in 699) [ClassicSimilarity], result of:
            0.015572658 = score(doc=699,freq=5.0), product of:
              0.052775115 = queryWeight, product of:
                1.0164102 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.02151789 = queryNorm
              0.29507577 = fieldWeight in 699, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0546875 = fieldNorm(doc=699)
          0.07166714 = weight(abstract_txt:aiming in 699) [ClassicSimilarity], result of:
            0.07166714 = score(doc=699,freq=1.0), product of:
              0.17312123 = queryWeight, product of:
                1.0628426 = boost
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.02151789 = queryNorm
              0.41397086 = fieldWeight in 699, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5697527 = idf(docFreq=61, maxDocs=44218)
                0.0546875 = fieldNorm(doc=699)
          0.1291618 = weight(abstract_txt:genes in 699) [ClassicSimilarity], result of:
            0.1291618 = score(doc=699,freq=1.0), product of:
              0.25638524 = queryWeight, product of:
                1.2934222 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.02151789 = queryNorm
              0.5037802 = fieldWeight in 699, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.0546875 = fieldNorm(doc=699)
          0.13698362 = weight(abstract_txt:genomic in 699) [ClassicSimilarity], result of:
            0.13698362 = score(doc=699,freq=1.0), product of:
              0.26663432 = queryWeight, product of:
                1.3190213 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.02151789 = queryNorm
              0.5137509 = fieldWeight in 699, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0546875 = fieldNorm(doc=699)
          0.02621112 = weight(abstract_txt:information in 699) [ClassicSimilarity], result of:
            0.02621112 = score(doc=699,freq=5.0), product of:
              0.088537514 = queryWeight, product of:
                1.6995833 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.02151789 = queryNorm
              0.29604536 = fieldWeight in 699, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0546875 = fieldNorm(doc=699)
        0.2 = coord(5/25)