Document (#20638)

Author
Gil-Leiva, I.
Munoz, J.V.R.
Title
Analisis de los descriptores de diferentes areas del conocimiento indizades en bases de datos del CSIC : Aplicacion a la indizacion automatica
Source
Revista Española de Documentaçion Cientifica. 20(1997) no.2, S.150-160
Year
1997
Abstract
Studies the value of scientific articles' titles and abstracts as sources of terms for document indexing in relation to 6 areas of knowledge: library and information science, medicine, chemistry, biology, psychology and physics, indexed in the databases ISOC, IME and ICYT of the CSIC. Also examines the syntagmatic structures of the indexing terms found in the field 'descriptors'. as well as the relationship between length of document and number of descriptors. Concludes that if the abstracts are not well made and the titles are not precise, they are not definitive sources for the extractions of concepts; the most common syntactic structure is the noun phrase, followed by noun+adjective and noun+noun; and no significant relationship was found between length of document and number of descriptors assigned to it
Footnote
Übers. d. Titels: Descriptors analysis on different knowledge ares in CSIC databases: application on automatic indexing
Theme
Automatisches Indexieren
Field
Physik
Chemie
Medizin
Psychologie
Biologie
Informationswissenschaft
Bibliothekswesen

Similar documents (author)

  1. Gil-Leiva, I.; Munoz, V.R.: ¬Los origines del almacenamiento y recuperacion de informacion (1996) 5.69
    5.6869993 = sum of:
      5.6869993 = sum of:
        2.6555915 = weight(author_txt:leiva in 5517) [ClassicSimilarity], result of:
          2.6555915 = score(doc=5517,freq=1.0), product of:
            0.6752735 = queryWeight, product of:
              8.988837 = idf(docFreq=14, maxDocs=44218)
              0.07512356 = queryNorm
            3.9326162 = fieldWeight in 5517, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.988837 = idf(docFreq=14, maxDocs=44218)
              0.4375 = fieldNorm(doc=5517)
        3.0314076 = weight(author_txt:munoz in 5517) [ClassicSimilarity], result of:
          3.0314076 = score(doc=5517,freq=1.0), product of:
            0.7375674 = queryWeight, product of:
              1.0451076 = boost
              9.394302 = idf(docFreq=9, maxDocs=44218)
              0.07512356 = queryNorm
            4.1100073 = fieldWeight in 5517, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.394302 = idf(docFreq=9, maxDocs=44218)
              0.4375 = fieldNorm(doc=5517)
    
  2. Munoz, A.M.; Munoz, F.A.: Nuevas areas de conocimiento y la problematica documental : la prospectiva de la paz en la Universidad de Granada (1997) 2.45
    2.449747 = sum of:
      2.449747 = product of:
        4.899494 = sum of:
          4.899494 = weight(author_txt:munoz in 340) [ClassicSimilarity], result of:
            4.899494 = score(doc=340,freq=2.0), product of:
              0.7375674 = queryWeight, product of:
                1.0451076 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.07512356 = queryNorm
              6.6427746 = fieldWeight in 340, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.5 = fieldNorm(doc=340)
        0.5 = coord(1/2)
    
  3. Munoz, J.V.R.: Documentos electronicos y normalizacion : informacion y conocimiento (1997) 2.17
    2.165291 = sum of:
      2.165291 = product of:
        4.330582 = sum of:
          4.330582 = weight(author_txt:munoz in 2813) [ClassicSimilarity], result of:
            4.330582 = score(doc=2813,freq=1.0), product of:
              0.7375674 = queryWeight, product of:
                1.0451076 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.07512356 = queryNorm
              5.871439 = fieldWeight in 2813, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.625 = fieldNorm(doc=2813)
        0.5 = coord(1/2)
    
  4. Leiva, I.G. -> Gil-Leiva, I.: 1.88
    1.8777868 = sum of:
      1.8777868 = product of:
        3.7555735 = sum of:
          3.7555735 = weight(author_txt:leiva in 98) [ClassicSimilarity], result of:
            3.7555735 = score(doc=98,freq=2.0), product of:
              0.6752735 = queryWeight, product of:
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.07512356 = queryNorm
              5.561559 = fieldWeight in 98, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.4375 = fieldNorm(doc=98)
        0.5 = coord(1/2)
    
  5. Fernández, F.J. Munoz- -> Munoz-Fernández, F.J.: 1.84
    1.8373103 = sum of:
      1.8373103 = product of:
        3.6746206 = sum of:
          3.6746206 = weight(author_txt:munoz in 2707) [ClassicSimilarity], result of:
            3.6746206 = score(doc=2707,freq=2.0), product of:
              0.7375674 = queryWeight, product of:
                1.0451076 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.07512356 = queryNorm
              4.982081 = fieldWeight in 2707, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.375 = fieldNorm(doc=2707)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Mesquita, L.A.P.; Souza, R.R.; Baracho Porto, R.M.A.: Noun phrases in automatic indexing: : a structural analysis of the distribution of relevant terms in doctoral theses (2014) 0.18
    0.17947893 = sum of:
      0.17947893 = product of:
        0.64099616 = sum of:
          0.0133922715 = weight(abstract_txt:between in 1442) [ClassicSimilarity], result of:
            0.0133922715 = score(doc=1442,freq=2.0), product of:
              0.05833071 = queryWeight, product of:
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.016842114 = queryNorm
              0.22959211 = fieldWeight in 1442, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.046875 = fieldNorm(doc=1442)
          0.013590508 = weight(abstract_txt:well in 1442) [ClassicSimilarity], result of:
            0.013590508 = score(doc=1442,freq=1.0), product of:
              0.07421555 = queryWeight, product of:
                1.1279733 = boost
                3.9066048 = idf(docFreq=2416, maxDocs=44218)
                0.016842114 = queryNorm
              0.1831221 = fieldWeight in 1442, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9066048 = idf(docFreq=2416, maxDocs=44218)
                0.046875 = fieldNorm(doc=1442)
          0.036923535 = weight(abstract_txt:terms in 1442) [ClassicSimilarity], result of:
            0.036923535 = score(doc=1442,freq=6.0), product of:
              0.079522416 = queryWeight, product of:
                1.1676055 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.016842114 = queryNorm
              0.46431607 = fieldWeight in 1442, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.046875 = fieldNorm(doc=1442)
          0.026527507 = weight(abstract_txt:indexing in 1442) [ClassicSimilarity], result of:
            0.026527507 = score(doc=1442,freq=2.0), product of:
              0.0920009 = queryWeight, product of:
                1.2558779 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.016842114 = queryNorm
              0.28833964 = fieldWeight in 1442, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.046875 = fieldNorm(doc=1442)
          0.020458316 = weight(abstract_txt:found in 1442) [ClassicSimilarity], result of:
            0.020458316 = score(doc=1442,freq=1.0), product of:
              0.09748049 = queryWeight, product of:
                1.2927371 = boost
                4.4772453 = idf(docFreq=1365, maxDocs=44218)
                0.016842114 = queryNorm
              0.20987087 = fieldWeight in 1442, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4772453 = idf(docFreq=1365, maxDocs=44218)
                0.046875 = fieldNorm(doc=1442)
          0.052809037 = weight(abstract_txt:areas in 1442) [ClassicSimilarity], result of:
            0.052809037 = score(doc=1442,freq=4.0), product of:
              0.11555532 = queryWeight, product of:
                1.4074934 = boost
                4.87469 = idf(docFreq=917, maxDocs=44218)
                0.016842114 = queryNorm
              0.4570022 = fieldWeight in 1442, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.87469 = idf(docFreq=917, maxDocs=44218)
                0.046875 = fieldNorm(doc=1442)
          0.47729498 = weight(abstract_txt:noun in 1442) [ClassicSimilarity], result of:
            0.47729498 = score(doc=1442,freq=5.0), product of:
              0.58642936 = queryWeight, product of:
                4.484089 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.016842114 = queryNorm
              0.81390023 = fieldWeight in 1442, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.046875 = fieldNorm(doc=1442)
        0.28 = coord(7/25)
    
  2. Souza, R.R.; Raghavan, K.S.: ¬A methodology for noun phrase-based automatic indexing (2006) 0.18
    0.17719738 = sum of:
      0.17719738 = product of:
        0.8859869 = sum of:
          0.025123285 = weight(abstract_txt:terms in 173) [ClassicSimilarity], result of:
            0.025123285 = score(doc=173,freq=1.0), product of:
              0.079522416 = queryWeight, product of:
                1.1676055 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.016842114 = queryNorm
              0.3159271 = fieldWeight in 173, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=173)
          0.031262964 = weight(abstract_txt:indexing in 173) [ClassicSimilarity], result of:
            0.031262964 = score(doc=173,freq=1.0), product of:
              0.0920009 = queryWeight, product of:
                1.2558779 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.016842114 = queryNorm
              0.3398115 = fieldWeight in 173, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.078125 = fieldNorm(doc=173)
          0.045075543 = weight(abstract_txt:document in 173) [ClassicSimilarity], result of:
            0.045075543 = score(doc=173,freq=1.0), product of:
              0.13440941 = queryWeight, product of:
                1.859139 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.016842114 = queryNorm
              0.33536002 = fieldWeight in 173, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.078125 = fieldNorm(doc=173)
          0.16833991 = weight(abstract_txt:descriptors in 173) [ClassicSimilarity], result of:
            0.16833991 = score(doc=173,freq=1.0), product of:
              0.32353935 = queryWeight, product of:
                2.8844335 = boost
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.016842114 = queryNorm
              0.52030736 = fieldWeight in 173, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.078125 = fieldNorm(doc=173)
          0.6161852 = weight(abstract_txt:noun in 173) [ClassicSimilarity], result of:
            0.6161852 = score(doc=173,freq=3.0), product of:
              0.58642936 = queryWeight, product of:
                4.484089 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.016842114 = queryNorm
              1.0507407 = fieldWeight in 173, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.078125 = fieldNorm(doc=173)
        0.2 = coord(5/25)
    
  3. Rodriguez Bravo, B.: ¬The visibility of women in indexing languages (2006) 0.15
    0.15411226 = sum of:
      0.15411226 = product of:
        0.77056134 = sum of:
          0.031262964 = weight(abstract_txt:indexing in 263) [ClassicSimilarity], result of:
            0.031262964 = score(doc=263,freq=1.0), product of:
              0.0920009 = queryWeight, product of:
                1.2558779 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.016842114 = queryNorm
              0.3398115 = fieldWeight in 263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.078125 = fieldNorm(doc=263)
          0.16897997 = weight(abstract_txt:adjective in 263) [ClassicSimilarity], result of:
            0.16897997 = score(doc=263,freq=1.0), product of:
              0.22489792 = queryWeight, product of:
                1.3884463 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.016842114 = queryNorm
              0.751363 = fieldWeight in 263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.078125 = fieldNorm(doc=263)
          0.046223834 = weight(abstract_txt:relationship in 263) [ClassicSimilarity], result of:
            0.046223834 = score(doc=263,freq=1.0), product of:
              0.11940319 = queryWeight, product of:
                1.4307355 = boost
                4.9551864 = idf(docFreq=846, maxDocs=44218)
                0.016842114 = queryNorm
              0.38712394 = fieldWeight in 263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9551864 = idf(docFreq=846, maxDocs=44218)
                0.078125 = fieldNorm(doc=263)
          0.16833991 = weight(abstract_txt:descriptors in 263) [ClassicSimilarity], result of:
            0.16833991 = score(doc=263,freq=1.0), product of:
              0.32353935 = queryWeight, product of:
                2.8844335 = boost
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.016842114 = queryNorm
              0.52030736 = fieldWeight in 263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.078125 = fieldNorm(doc=263)
          0.35575467 = weight(abstract_txt:noun in 263) [ClassicSimilarity], result of:
            0.35575467 = score(doc=263,freq=1.0), product of:
              0.58642936 = queryWeight, product of:
                4.484089 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.016842114 = queryNorm
              0.6066454 = fieldWeight in 263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.078125 = fieldNorm(doc=263)
        0.2 = coord(5/25)
    
  4. Lopez-Ostenero, F.; Gonzalo, J.; Verdejo, F.: Noun phrases as building blocks for cross-language search assistance (2005) 0.14
    0.14168906 = sum of:
      0.14168906 = product of:
        0.7084453 = sum of:
          0.06803859 = weight(abstract_txt:phrase in 1021) [ClassicSimilarity], result of:
            0.06803859 = score(doc=1021,freq=1.0), product of:
              0.12263059 = queryWeight, product of:
                1.0252641 = boost
                7.1017675 = idf(docFreq=98, maxDocs=44218)
                0.016842114 = queryNorm
              0.5548256 = fieldWeight in 1021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1017675 = idf(docFreq=98, maxDocs=44218)
                0.078125 = fieldNorm(doc=1021)
          0.025123285 = weight(abstract_txt:terms in 1021) [ClassicSimilarity], result of:
            0.025123285 = score(doc=1021,freq=1.0), product of:
              0.079522416 = queryWeight, product of:
                1.1676055 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.016842114 = queryNorm
              0.3159271 = fieldWeight in 1021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=1021)
          0.034097195 = weight(abstract_txt:found in 1021) [ClassicSimilarity], result of:
            0.034097195 = score(doc=1021,freq=1.0), product of:
              0.09748049 = queryWeight, product of:
                1.2927371 = boost
                4.4772453 = idf(docFreq=1365, maxDocs=44218)
                0.016842114 = queryNorm
              0.3497848 = fieldWeight in 1021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4772453 = idf(docFreq=1365, maxDocs=44218)
                0.078125 = fieldNorm(doc=1021)
          0.07807314 = weight(abstract_txt:document in 1021) [ClassicSimilarity], result of:
            0.07807314 = score(doc=1021,freq=3.0), product of:
              0.13440941 = queryWeight, product of:
                1.859139 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.016842114 = queryNorm
              0.5808606 = fieldWeight in 1021, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.078125 = fieldNorm(doc=1021)
          0.5031131 = weight(abstract_txt:noun in 1021) [ClassicSimilarity], result of:
            0.5031131 = score(doc=1021,freq=2.0), product of:
              0.58642936 = queryWeight, product of:
                4.484089 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.016842114 = queryNorm
              0.85792613 = fieldWeight in 1021, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.078125 = fieldNorm(doc=1021)
        0.2 = coord(5/25)
    
  5. Larouk, O.: Modelling users need : schemas of interrogation and filtering of answers from the WEB in co-operative mode (1998) 0.14
    0.13575988 = sum of:
      0.13575988 = product of:
        0.56566614 = sum of:
          0.017586298 = weight(abstract_txt:terms in 60) [ClassicSimilarity], result of:
            0.017586298 = score(doc=60,freq=1.0), product of:
              0.079522416 = queryWeight, product of:
                1.1676055 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.016842114 = queryNorm
              0.22114895 = fieldWeight in 60, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0546875 = fieldNorm(doc=60)
          0.030948758 = weight(abstract_txt:indexing in 60) [ClassicSimilarity], result of:
            0.030948758 = score(doc=60,freq=2.0), product of:
              0.0920009 = queryWeight, product of:
                1.2558779 = boost
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.016842114 = queryNorm
              0.33639625 = fieldWeight in 60, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3495874 = idf(docFreq=1551, maxDocs=44218)
                0.0546875 = fieldNorm(doc=60)
          0.07049981 = weight(abstract_txt:titles in 60) [ClassicSimilarity], result of:
            0.07049981 = score(doc=60,freq=2.0), product of:
              0.1592778 = queryWeight, product of:
                1.6524525 = boost
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.016842114 = queryNorm
              0.44262168 = fieldWeight in 60, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.0546875 = fieldNorm(doc=60)
          0.07976505 = weight(abstract_txt:abstracts in 60) [ClassicSimilarity], result of:
            0.07976505 = score(doc=60,freq=2.0), product of:
              0.17294382 = queryWeight, product of:
                1.721884 = boost
                5.963546 = idf(docFreq=308, maxDocs=44218)
                0.016842114 = queryNorm
              0.46121946 = fieldWeight in 60, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.963546 = idf(docFreq=308, maxDocs=44218)
                0.0546875 = fieldNorm(doc=60)
          0.117837936 = weight(abstract_txt:descriptors in 60) [ClassicSimilarity], result of:
            0.117837936 = score(doc=60,freq=1.0), product of:
              0.32353935 = queryWeight, product of:
                2.8844335 = boost
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.016842114 = queryNorm
              0.36421517 = fieldWeight in 60, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6599345 = idf(docFreq=153, maxDocs=44218)
                0.0546875 = fieldNorm(doc=60)
          0.24902828 = weight(abstract_txt:noun in 60) [ClassicSimilarity], result of:
            0.24902828 = score(doc=60,freq=1.0), product of:
              0.58642936 = queryWeight, product of:
                4.484089 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.016842114 = queryNorm
              0.4246518 = fieldWeight in 60, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.0546875 = fieldNorm(doc=60)
        0.24 = coord(6/25)