Document (#26851)

Author
Ferret, O.
Grau, B.
Hurault-Plantet, M.
Illouz, G.
Jacquemin, C.
Monceaux, L.
Robba, I.
Vilnat, A.
Title
How NLP can improve question answering
Source
Knowledge organization. 29(2002) nos.3/4, S.135-155
Year
2002
Abstract
Answering open-domain factual questions requires Natural Language processing for refining document selection and answer identification. With our system QALC, we have participated in the Question Answering track of the TREC8, TREC9 and TREC10 evaluations. QALC performs an analysis of documents relying an multiword term searches and their linguistic variation both to minimize the number of documents selected and to provide additional clues when comparing question and sentence representations. This comparison process also makes use of the results of a syntactic parsing of the questions and Named Entity recognition functionalities. Answer extraction relies an the application of syntactic patterns chosen according to the kind of information that is sought, and categorized depending an the syntactic form of the question. These patterns allow QALC to handle nicely linguistic variations at the answer level.
Theme
Computerlinguistik
Retrievalstudien
Sprachretrieval
Object
TREC

Similar documents (author)

  1. Grau, O.: Infos lokal gewoben : die WWW-Sprache HTML und die passende Software (1994) 6.01
    6.010904 = sum of:
      6.010904 = weight(author_txt:grau in 7566) [ClassicSimilarity], result of:
        6.010904 = fieldWeight in 7566, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.617446 = idf(docFreq=7, maxDocs=44218)
          0.625 = fieldNorm(doc=7566)
    
  2. Grau, O.: Alles integriert : Informationssurfen im World Wide Web (1994) 6.01
    6.010904 = sum of:
      6.010904 = weight(author_txt:grau in 7613) [ClassicSimilarity], result of:
        6.010904 = fieldWeight in 7613, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.617446 = idf(docFreq=7, maxDocs=44218)
          0.625 = fieldNorm(doc=7613)
    
  3. Grau, B.: Finding answers to questions, in text collections or Web, in open domain or specialty domains (2012) 6.01
    6.010904 = sum of:
      6.010904 = weight(author_txt:grau in 107) [ClassicSimilarity], result of:
        6.010904 = fieldWeight in 107, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.617446 = idf(docFreq=7, maxDocs=44218)
          0.625 = fieldNorm(doc=107)
    
  4. Grau, J.E.; Mehrotra, R.: Similar shape retrieval using a structural feature index (1993) 4.81
    4.808723 = sum of:
      4.808723 = weight(author_txt:grau in 7332) [ClassicSimilarity], result of:
        4.808723 = fieldWeight in 7332, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.617446 = idf(docFreq=7, maxDocs=44218)
          0.5 = fieldNorm(doc=7332)
    
  5. Ferret, O.; Grau, B.; Masson, N.: Utilisation d'un réseau de cooccurences lexikales pour a méliorer une analyse thématique fondée sur la distribution des mots (1999) 3.61
    3.606542 = sum of:
      3.606542 = weight(author_txt:grau in 6295) [ClassicSimilarity], result of:
        3.606542 = fieldWeight in 6295, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.617446 = idf(docFreq=7, maxDocs=44218)
          0.375 = fieldNorm(doc=6295)
    

Similar documents (content)

  1. Grau, B.: Finding answers to questions, in text collections or Web, in open domain or specialty domains (2012) 0.29
    0.28621396 = sum of:
      0.28621396 = product of:
        0.89441866 = sum of:
          0.105513565 = weight(abstract_txt:factual in 107) [ClassicSimilarity], result of:
            0.105513565 = score(doc=107,freq=2.0), product of:
              0.158034 = queryWeight, product of:
                1.1035662 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.018957864 = queryNorm
              0.6676637 = fieldWeight in 107, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.0625 = fieldNorm(doc=107)
          0.09441809 = weight(abstract_txt:clues in 107) [ClassicSimilarity], result of:
            0.09441809 = score(doc=107,freq=1.0), product of:
              0.184895 = queryWeight, product of:
                1.1936738 = boost
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.018957864 = queryNorm
              0.5106579 = fieldWeight in 107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.0625 = fieldNorm(doc=107)
          0.024234753 = weight(abstract_txt:documents in 107) [ClassicSimilarity], result of:
            0.024234753 = score(doc=107,freq=1.0), product of:
              0.0940858 = queryWeight, product of:
                1.2042042 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.018957864 = queryNorm
              0.2575814 = fieldWeight in 107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=107)
          0.05740809 = weight(abstract_txt:questions in 107) [ClassicSimilarity], result of:
            0.05740809 = score(doc=107,freq=2.0), product of:
              0.13269985 = queryWeight, product of:
                1.4301227 = boost
                4.8944926 = idf(docFreq=899, maxDocs=44218)
                0.018957864 = queryNorm
              0.4326161 = fieldWeight in 107, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.8944926 = idf(docFreq=899, maxDocs=44218)
                0.0625 = fieldNorm(doc=107)
          0.068418145 = weight(abstract_txt:linguistic in 107) [ClassicSimilarity], result of:
            0.068418145 = score(doc=107,freq=1.0), product of:
              0.18793711 = queryWeight, product of:
                1.7019405 = boost
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.018957864 = queryNorm
              0.3640481 = fieldWeight in 107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8247695 = idf(docFreq=354, maxDocs=44218)
                0.0625 = fieldNorm(doc=107)
          0.10753319 = weight(abstract_txt:answer in 107) [ClassicSimilarity], result of:
            0.10753319 = score(doc=107,freq=1.0), product of:
              0.29081967 = queryWeight, product of:
                2.592959 = boost
                5.916144 = idf(docFreq=323, maxDocs=44218)
                0.018957864 = queryNorm
              0.369759 = fieldWeight in 107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.916144 = idf(docFreq=323, maxDocs=44218)
                0.0625 = fieldNorm(doc=107)
          0.21822575 = weight(abstract_txt:answering in 107) [ClassicSimilarity], result of:
            0.21822575 = score(doc=107,freq=2.0), product of:
              0.3699895 = queryWeight, product of:
                2.9246807 = boost
                6.6730065 = idf(docFreq=151, maxDocs=44218)
                0.018957864 = queryNorm
              0.58981603 = fieldWeight in 107, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.6730065 = idf(docFreq=151, maxDocs=44218)
                0.0625 = fieldNorm(doc=107)
          0.2186671 = weight(abstract_txt:question in 107) [ClassicSimilarity], result of:
            0.2186671 = score(doc=107,freq=5.0), product of:
              0.30045122 = queryWeight, product of:
                3.0432673 = boost
                5.207682 = idf(docFreq=657, maxDocs=44218)
                0.018957864 = queryNorm
              0.7277957 = fieldWeight in 107, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.207682 = idf(docFreq=657, maxDocs=44218)
                0.0625 = fieldNorm(doc=107)
        0.32 = coord(8/25)
    
  2. Lin, J.; Katz, B.: Building a reusable test collection for question answering (2006) 0.18
    0.18471101 = sum of:
      0.18471101 = product of:
        0.923555 = sum of:
          0.052469775 = weight(abstract_txt:documents in 5045) [ClassicSimilarity], result of:
            0.052469775 = score(doc=5045,freq=3.0), product of:
              0.0940858 = queryWeight, product of:
                1.2042042 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.018957864 = queryNorm
              0.5576801 = fieldWeight in 5045, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.078125 = fieldNorm(doc=5045)
          0.050742064 = weight(abstract_txt:questions in 5045) [ClassicSimilarity], result of:
            0.050742064 = score(doc=5045,freq=1.0), product of:
              0.13269985 = queryWeight, product of:
                1.4301227 = boost
                4.8944926 = idf(docFreq=899, maxDocs=44218)
                0.018957864 = queryNorm
              0.38238224 = fieldWeight in 5045, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8944926 = idf(docFreq=899, maxDocs=44218)
                0.078125 = fieldNorm(doc=5045)
          0.19009362 = weight(abstract_txt:answer in 5045) [ClassicSimilarity], result of:
            0.19009362 = score(doc=5045,freq=2.0), product of:
              0.29081967 = queryWeight, product of:
                2.592959 = boost
                5.916144 = idf(docFreq=323, maxDocs=44218)
                0.018957864 = queryNorm
              0.6536477 = fieldWeight in 5045, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.916144 = idf(docFreq=323, maxDocs=44218)
                0.078125 = fieldNorm(doc=5045)
          0.38577226 = weight(abstract_txt:answering in 5045) [ClassicSimilarity], result of:
            0.38577226 = score(doc=5045,freq=4.0), product of:
              0.3699895 = queryWeight, product of:
                2.9246807 = boost
                6.6730065 = idf(docFreq=151, maxDocs=44218)
                0.018957864 = queryNorm
              1.0426573 = fieldWeight in 5045, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.6730065 = idf(docFreq=151, maxDocs=44218)
                0.078125 = fieldNorm(doc=5045)
          0.24447726 = weight(abstract_txt:question in 5045) [ClassicSimilarity], result of:
            0.24447726 = score(doc=5045,freq=4.0), product of:
              0.30045122 = queryWeight, product of:
                3.0432673 = boost
                5.207682 = idf(docFreq=657, maxDocs=44218)
                0.018957864 = queryNorm
              0.8137003 = fieldWeight in 5045, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.207682 = idf(docFreq=657, maxDocs=44218)
                0.078125 = fieldNorm(doc=5045)
        0.2 = coord(5/25)
    
  3. Saint-Dizier, P.; Moens, M.-F.: Knowledge and reasoning for question answering : research perspectives (2011) 0.18
    0.18095261 = sum of:
      0.18095261 = product of:
        1.1309538 = sum of:
          0.11191403 = weight(abstract_txt:factual in 2746) [ClassicSimilarity], result of:
            0.11191403 = score(doc=2746,freq=1.0), product of:
              0.158034 = queryWeight, product of:
                1.1035662 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.018957864 = queryNorm
              0.7081643 = fieldWeight in 2746, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.09375 = fieldNorm(doc=2746)
          0.22811233 = weight(abstract_txt:answer in 2746) [ClassicSimilarity], result of:
            0.22811233 = score(doc=2746,freq=2.0), product of:
              0.29081967 = queryWeight, product of:
                2.592959 = boost
                5.916144 = idf(docFreq=323, maxDocs=44218)
                0.018957864 = queryNorm
              0.7843772 = fieldWeight in 2746, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.916144 = idf(docFreq=323, maxDocs=44218)
                0.09375 = fieldNorm(doc=2746)
          0.46292672 = weight(abstract_txt:answering in 2746) [ClassicSimilarity], result of:
            0.46292672 = score(doc=2746,freq=4.0), product of:
              0.3699895 = queryWeight, product of:
                2.9246807 = boost
                6.6730065 = idf(docFreq=151, maxDocs=44218)
                0.018957864 = queryNorm
              1.2511888 = fieldWeight in 2746, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.6730065 = idf(docFreq=151, maxDocs=44218)
                0.09375 = fieldNorm(doc=2746)
          0.3280007 = weight(abstract_txt:question in 2746) [ClassicSimilarity], result of:
            0.3280007 = score(doc=2746,freq=5.0), product of:
              0.30045122 = queryWeight, product of:
                3.0432673 = boost
                5.207682 = idf(docFreq=657, maxDocs=44218)
                0.018957864 = queryNorm
              1.0916936 = fieldWeight in 2746, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.207682 = idf(docFreq=657, maxDocs=44218)
                0.09375 = fieldNorm(doc=2746)
        0.16 = coord(4/25)
    
  4. Liu, Z.; Jansen, B.J.: ASK: A taxonomy of accuracy, social, and knowledge information seeking posts in social question and answering (2017) 0.18
    0.1794153 = sum of:
      0.1794153 = product of:
        0.8970765 = sum of:
          0.0811873 = weight(abstract_txt:questions in 3345) [ClassicSimilarity], result of:
            0.0811873 = score(doc=3345,freq=4.0), product of:
              0.13269985 = queryWeight, product of:
                1.4301227 = boost
                4.8944926 = idf(docFreq=899, maxDocs=44218)
                0.018957864 = queryNorm
              0.6118116 = fieldWeight in 3345, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.8944926 = idf(docFreq=899, maxDocs=44218)
                0.0625 = fieldNorm(doc=3345)
          0.10753319 = weight(abstract_txt:answer in 3345) [ClassicSimilarity], result of:
            0.10753319 = score(doc=3345,freq=1.0), product of:
              0.29081967 = queryWeight, product of:
                2.592959 = boost
                5.916144 = idf(docFreq=323, maxDocs=44218)
                0.018957864 = queryNorm
              0.369759 = fieldWeight in 3345, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.916144 = idf(docFreq=323, maxDocs=44218)
                0.0625 = fieldNorm(doc=3345)
          0.20415643 = weight(abstract_txt:syntactic in 3345) [ClassicSimilarity], result of:
            0.20415643 = score(doc=3345,freq=2.0), product of:
              0.35391107 = queryWeight, product of:
                2.8604267 = boost
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.018957864 = queryNorm
              0.576858 = fieldWeight in 3345, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5264034 = idf(docFreq=175, maxDocs=44218)
                0.0625 = fieldNorm(doc=3345)
          0.3086178 = weight(abstract_txt:answering in 3345) [ClassicSimilarity], result of:
            0.3086178 = score(doc=3345,freq=4.0), product of:
              0.3699895 = queryWeight, product of:
                2.9246807 = boost
                6.6730065 = idf(docFreq=151, maxDocs=44218)
                0.018957864 = queryNorm
              0.8341258 = fieldWeight in 3345, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.6730065 = idf(docFreq=151, maxDocs=44218)
                0.0625 = fieldNorm(doc=3345)
          0.19558181 = weight(abstract_txt:question in 3345) [ClassicSimilarity], result of:
            0.19558181 = score(doc=3345,freq=4.0), product of:
              0.30045122 = queryWeight, product of:
                3.0432673 = boost
                5.207682 = idf(docFreq=657, maxDocs=44218)
                0.018957864 = queryNorm
              0.65096027 = fieldWeight in 3345, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.207682 = idf(docFreq=657, maxDocs=44218)
                0.0625 = fieldNorm(doc=3345)
        0.2 = coord(5/25)
    
  5. Galitsky, B.: Can many agents answer questions better than one? (2005) 0.16
    0.1620369 = sum of:
      0.1620369 = product of:
        1.0127306 = sum of:
          0.050742064 = weight(abstract_txt:questions in 3094) [ClassicSimilarity], result of:
            0.050742064 = score(doc=3094,freq=1.0), product of:
              0.13269985 = queryWeight, product of:
                1.4301227 = boost
                4.8944926 = idf(docFreq=899, maxDocs=44218)
                0.018957864 = queryNorm
              0.38238224 = fieldWeight in 3094, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8944926 = idf(docFreq=899, maxDocs=44218)
                0.078125 = fieldNorm(doc=3094)
          0.19009362 = weight(abstract_txt:answer in 3094) [ClassicSimilarity], result of:
            0.19009362 = score(doc=3094,freq=2.0), product of:
              0.29081967 = queryWeight, product of:
                2.592959 = boost
                5.916144 = idf(docFreq=323, maxDocs=44218)
                0.018957864 = queryNorm
              0.6536477 = fieldWeight in 3094, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.916144 = idf(docFreq=323, maxDocs=44218)
                0.078125 = fieldNorm(doc=3094)
          0.47247258 = weight(abstract_txt:answering in 3094) [ClassicSimilarity], result of:
            0.47247258 = score(doc=3094,freq=6.0), product of:
              0.3699895 = queryWeight, product of:
                2.9246807 = boost
                6.6730065 = idf(docFreq=151, maxDocs=44218)
                0.018957864 = queryNorm
              1.2769891 = fieldWeight in 3094, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.6730065 = idf(docFreq=151, maxDocs=44218)
                0.078125 = fieldNorm(doc=3094)
          0.2994223 = weight(abstract_txt:question in 3094) [ClassicSimilarity], result of:
            0.2994223 = score(doc=3094,freq=6.0), product of:
              0.30045122 = queryWeight, product of:
                3.0432673 = boost
                5.207682 = idf(docFreq=657, maxDocs=44218)
                0.018957864 = queryNorm
              0.99657536 = fieldWeight in 3094, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.207682 = idf(docFreq=657, maxDocs=44218)
                0.078125 = fieldNorm(doc=3094)
        0.16 = coord(4/25)