Document (#30048)

Author
Crestani, F.
Du, H.
Title
Written versus spoken queries : a qualitative and quantitative comparative analysis
Source
Journal of the American Society for Information Science and Technology. 57(2006) no.7, S.881-890
Year
2006
Abstract
The authors report on an experimental study on the differences between spoken and written queries. A set of written and spontaneous spoken queries are generated by users from written topics. These two sets of queries are compared in qualitative terms and in terms of their retrieval effectiveness. Written and spoken queries are compared in terms of length, duration, and part of speech. In addition, assuming perfect transcription of the spoken queries, written and spoken queries are compared in terms of their aptitude to describe relevant documents. The retrieval effectiveness of spoken and written queries is compared using three different information retrieval models. The results show that using speech to formulate one's information need provides a way to express it more naturally and encourages the formulation of longer queries. Despite that, longer spoken queries do not seem to significantly improve retrieval effectiveness compared with written queries.
Theme
Suchtaktik

Similar documents (author)

  1. Crestani, F.: Combination of similarity measures for effective spoken document retrieval (2003) 5.44
    5.438222 = sum of:
      5.438222 = weight(author_txt:crestani in 4690) [ClassicSimilarity], result of:
        5.438222 = fieldWeight in 4690, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.701155 = idf(docFreq=19, maxDocs=44218)
          0.625 = fieldNorm(doc=4690)
    
  2. Crestani, F.; Lee, P.L.: Searching the web by constraining spreading activities (2000) 4.35
    4.3505774 = sum of:
      4.3505774 = weight(author_txt:crestani in 1326) [ClassicSimilarity], result of:
        4.3505774 = fieldWeight in 1326, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.701155 = idf(docFreq=19, maxDocs=44218)
          0.5 = fieldNorm(doc=1326)
    
  3. Tombros, T.; Crestani, F.: Users' perception of relevance of spoken documents (2000) 4.35
    4.3505774 = sum of:
      4.3505774 = weight(author_txt:crestani in 4996) [ClassicSimilarity], result of:
        4.3505774 = fieldWeight in 4996, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.701155 = idf(docFreq=19, maxDocs=44218)
          0.5 = fieldNorm(doc=4996)
    
  4. Crestani, F.; Wu, S.: Testing the cluster hypothesis in distributed information retrieval (2006) 4.35
    4.3505774 = sum of:
      4.3505774 = weight(author_txt:crestani in 984) [ClassicSimilarity], result of:
        4.3505774 = fieldWeight in 984, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.701155 = idf(docFreq=19, maxDocs=44218)
          0.5 = fieldNorm(doc=984)
    
  5. Crestani, F.; Rijsbergen, C.J. van: Information retrieval by logical imaging (1995) 3.81
    3.806755 = sum of:
      3.806755 = weight(author_txt:crestani in 1759) [ClassicSimilarity], result of:
        3.806755 = fieldWeight in 1759, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.701155 = idf(docFreq=19, maxDocs=44218)
          0.4375 = fieldNorm(doc=1759)
    

Similar documents (content)

  1. Sparck Jones, K.; Jones, G.J.F.; Foote, J.T.; Young, S.J.: Experiments in spoken document retrieval (1996) 0.19
    0.1901941 = sum of:
      0.1901941 = product of:
        1.1887132 = sum of:
          0.090924844 = weight(abstract_txt:transcription in 1951) [ClassicSimilarity], result of:
            0.090924844 = score(doc=1951,freq=1.0), product of:
              0.1094202 = queryWeight, product of:
                1.3731188 = boost
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.00899033 = queryNorm
              0.83096945 = fieldWeight in 1951, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.863674 = idf(docFreq=16, maxDocs=44218)
                0.09375 = fieldNorm(doc=1951)
          0.11967016 = weight(abstract_txt:speech in 1951) [ClassicSimilarity], result of:
            0.11967016 = score(doc=1951,freq=2.0), product of:
              0.1314114 = queryWeight, product of:
                2.1280944 = boost
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.00899033 = queryNorm
              0.91065276 = fieldWeight in 1951, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8685737 = idf(docFreq=124, maxDocs=44218)
                0.09375 = fieldNorm(doc=1951)
          0.04901205 = weight(abstract_txt:retrieval in 1951) [ClassicSimilarity], result of:
            0.04901205 = score(doc=1951,freq=5.0), product of:
              0.0672782 = queryWeight, product of:
                2.153409 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.00899033 = queryNorm
              0.7284982 = fieldWeight in 1951, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=1951)
          0.9291061 = weight(abstract_txt:spoken in 1951) [ClassicSimilarity], result of:
            0.9291061 = score(doc=1951,freq=3.0), product of:
              0.71451104 = queryWeight, product of:
                9.924504 = boost
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.00899033 = queryNorm
              1.3003384 = fieldWeight in 1951, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.09375 = fieldNorm(doc=1951)
        0.16 = coord(4/25)
    
  2. Bacchin, M.; Ferro, N.; Melucci, M.: ¬A probabilistic model for stemmer generation (2005) 0.17
    0.16872752 = sum of:
      0.16872752 = product of:
        0.8436376 = sum of:
          0.025831617 = weight(abstract_txt:retrieval in 1001) [ClassicSimilarity], result of:
            0.025831617 = score(doc=1001,freq=2.0), product of:
              0.0672782 = queryWeight, product of:
                2.153409 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.00899033 = queryNorm
              0.38395226 = fieldWeight in 1001, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=1001)
          0.043259084 = weight(abstract_txt:effectiveness in 1001) [ClassicSimilarity], result of:
            0.043259084 = score(doc=1001,freq=1.0), product of:
              0.10860636 = queryWeight, product of:
                2.3694503 = boost
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.00899033 = queryNorm
              0.39831078 = fieldWeight in 1001, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.078125 = fieldNorm(doc=1001)
          0.1681465 = weight(abstract_txt:written in 1001) [ClassicSimilarity], result of:
            0.1681465 = score(doc=1001,freq=1.0), product of:
              0.37232184 = queryWeight, product of:
                7.1641326 = boost
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.00899033 = queryNorm
              0.45161602 = fieldWeight in 1001, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.078125 = fieldNorm(doc=1001)
          0.15938395 = weight(abstract_txt:queries in 1001) [ClassicSimilarity], result of:
            0.15938395 = score(doc=1001,freq=1.0), product of:
              0.39950657 = queryWeight, product of:
                8.701972 = boost
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.00899033 = queryNorm
              0.39895204 = fieldWeight in 1001, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.078125 = fieldNorm(doc=1001)
          0.44701642 = weight(abstract_txt:spoken in 1001) [ClassicSimilarity], result of:
            0.44701642 = score(doc=1001,freq=1.0), product of:
              0.71451104 = queryWeight, product of:
                9.924504 = boost
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.00899033 = queryNorm
              0.6256256 = fieldWeight in 1001, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.078125 = fieldNorm(doc=1001)
        0.2 = coord(5/25)
    
  3. SARA (SGML Aware Retrieval Application) Workshop, 19th June 1994 (1994) 0.13
    0.13363092 = sum of:
      0.13363092 = product of:
        0.8351932 = sum of:
          0.009038411 = weight(abstract_txt:using in 756) [ClassicSimilarity], result of:
            0.009038411 = score(doc=756,freq=1.0), product of:
              0.033406783 = queryWeight, product of:
                1.0729802 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.00899033 = queryNorm
              0.27055615 = fieldWeight in 756, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.078125 = fieldNorm(doc=756)
          0.025831617 = weight(abstract_txt:retrieval in 756) [ClassicSimilarity], result of:
            0.025831617 = score(doc=756,freq=2.0), product of:
              0.0672782 = queryWeight, product of:
                2.153409 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.00899033 = queryNorm
              0.38395226 = fieldWeight in 756, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=756)
          0.1681465 = weight(abstract_txt:written in 756) [ClassicSimilarity], result of:
            0.1681465 = score(doc=756,freq=1.0), product of:
              0.37232184 = queryWeight, product of:
                7.1641326 = boost
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.00899033 = queryNorm
              0.45161602 = fieldWeight in 756, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.078125 = fieldNorm(doc=756)
          0.6321767 = weight(abstract_txt:spoken in 756) [ClassicSimilarity], result of:
            0.6321767 = score(doc=756,freq=2.0), product of:
              0.71451104 = queryWeight, product of:
                9.924504 = boost
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.00899033 = queryNorm
              0.88476825 = fieldWeight in 756, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.078125 = fieldNorm(doc=756)
        0.16 = coord(4/25)
    
  4. Pilch, H.: Empirical linguistics (1976) 0.13
    0.12537265 = sum of:
      0.12537265 = product of:
        0.7835791 = sum of:
          0.010846092 = weight(abstract_txt:using in 7860) [ClassicSimilarity], result of:
            0.010846092 = score(doc=7860,freq=1.0), product of:
              0.033406783 = queryWeight, product of:
                1.0729802 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.00899033 = queryNorm
              0.32466736 = fieldWeight in 7860, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.09375 = fieldNorm(doc=7860)
          0.03453756 = weight(abstract_txt:terms in 7860) [ClassicSimilarity], result of:
            0.03453756 = score(doc=7860,freq=1.0), product of:
              0.091101095 = queryWeight, product of:
                2.5058274 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.00899033 = queryNorm
              0.37911248 = fieldWeight in 7860, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.09375 = fieldNorm(doc=7860)
          0.2017758 = weight(abstract_txt:written in 7860) [ClassicSimilarity], result of:
            0.2017758 = score(doc=7860,freq=1.0), product of:
              0.37232184 = queryWeight, product of:
                7.1641326 = boost
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.00899033 = queryNorm
              0.5419392 = fieldWeight in 7860, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.09375 = fieldNorm(doc=7860)
          0.5364197 = weight(abstract_txt:spoken in 7860) [ClassicSimilarity], result of:
            0.5364197 = score(doc=7860,freq=1.0), product of:
              0.71451104 = queryWeight, product of:
                9.924504 = boost
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.00899033 = queryNorm
              0.7507508 = fieldWeight in 7860, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.09375 = fieldNorm(doc=7860)
        0.16 = coord(4/25)
    
  5. Srinivasan, P.: Query expansion and MEDLINE (1996) 0.12
    0.120435834 = sum of:
      0.120435834 = product of:
        0.60217917 = sum of:
          0.021916978 = weight(abstract_txt:using in 8453) [ClassicSimilarity], result of:
            0.021916978 = score(doc=8453,freq=3.0), product of:
              0.033406783 = queryWeight, product of:
                1.0729802 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.00899033 = queryNorm
              0.65606374 = fieldWeight in 8453, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.109375 = fieldNorm(doc=8453)
          0.051143993 = weight(abstract_txt:retrieval in 8453) [ClassicSimilarity], result of:
            0.051143993 = score(doc=8453,freq=4.0), product of:
              0.0672782 = queryWeight, product of:
                2.153409 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.00899033 = queryNorm
              0.76018673 = fieldWeight in 8453, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.109375 = fieldNorm(doc=8453)
          0.08564862 = weight(abstract_txt:effectiveness in 8453) [ClassicSimilarity], result of:
            0.08564862 = score(doc=8453,freq=2.0), product of:
              0.10860636 = queryWeight, product of:
                2.3694503 = boost
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.00899033 = queryNorm
              0.7886151 = fieldWeight in 8453, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.098378 = idf(docFreq=733, maxDocs=44218)
                0.109375 = fieldNorm(doc=8453)
          0.056984074 = weight(abstract_txt:terms in 8453) [ClassicSimilarity], result of:
            0.056984074 = score(doc=8453,freq=2.0), product of:
              0.091101095 = queryWeight, product of:
                2.5058274 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.00899033 = queryNorm
              0.6255037 = fieldWeight in 8453, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.109375 = fieldNorm(doc=8453)
          0.38648555 = weight(abstract_txt:queries in 8453) [ClassicSimilarity], result of:
            0.38648555 = score(doc=8453,freq=3.0), product of:
              0.39950657 = queryWeight, product of:
                8.701972 = boost
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.00899033 = queryNorm
              0.9674072 = fieldWeight in 8453, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.106586 = idf(docFreq=727, maxDocs=44218)
                0.109375 = fieldNorm(doc=8453)
        0.2 = coord(5/25)