Document (#1811)

Author
Ruge, G.
Title
Experiments on linguistically-based term associations
Source
Information processing and management. 28(1992) no.3, p.317-332
Year
1992
Abstract
Describes the hyperterm system REALIST (REtrieval Aids by LInguistic and STatistics) and its semantic component, which generates semantic term relations such as synonyms. The component takes a free-text data base as input and outputs term pairs that are semantically related with respect to their meanings in that data base. In the first step, an automatic syntactic analysis provides linguistic knowledge about the terms of the data base. In the second step, this knowledge is compared by statistical similarity computation. Various experiments with different similarity measures are described.
Theme
Computational linguistics
Object
REALIST
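
The two steps named in the abstract (syntactic analysis, then statistical similarity computation) can be illustrated with a minimal sketch. It is not the REALIST implementation: the abstract does not say which similarity measures were tested, so Jaccard overlap is used here purely as a stand-in, and the `contexts` table (each term mapped to words it was found syntactically linked to) is invented toy data standing in for the output of the first step.

```python
from itertools import combinations

def jaccard(a, b):
    """Jaccard overlap of two context sets (assumed measure, not from the paper)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def related_pairs(contexts, threshold=0.3):
    """Return (term1, term2, similarity) for pairs at or above the threshold."""
    pairs = []
    for t1, t2 in combinations(sorted(contexts), 2):
        sim = jaccard(contexts[t1], contexts[t2])
        if sim >= threshold:
            pairs.append((t1, t2, sim))
    # Most similar pairs first.
    return sorted(pairs, key=lambda p: -p[2])

# Hypothetical step-1 output: words each term was syntactically linked to.
contexts = {
    "car": {"fast", "red", "drive", "repair"},
    "automobile": {"fast", "drive", "repair", "new"},
    "banana": {"yellow", "ripe", "eat"},
}

for t1, t2, sim in related_pairs(contexts):
    print(t1, t2, sim)
```

Terms that share many syntactic contexts are proposed as related; swapping `jaccard` for another measure changes only one function, which is the kind of comparison the abstract refers to.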

Similar documents (author)

  1. Ruge, G.: A spreading activation network for automatic generation of thesaurus relationships (1991) 5.94
    5.937289 = sum of:
      5.937289 = weight(author_txt:ruge in 4506) [ClassicSimilarity], result of:
        5.937289 = fieldWeight in 4506, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.625 = fieldNorm(doc=4506)
    
  2. Ruge, G.: Sprache und Computer : Wortbedeutung und Termassoziation. Methoden zur automatischen semantischen Klassifikation (1995) 5.94
    5.937289 = sum of:
      5.937289 = weight(author_txt:ruge in 1534) [ClassicSimilarity], result of:
        5.937289 = fieldWeight in 1534, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.625 = fieldNorm(doc=1534)
    
  3. Ruge, G.; Schwarz, C.: Term association and computational linguistics (1991) 4.75
    4.749831 = sum of:
      4.749831 = weight(author_txt:ruge in 2310) [ClassicSimilarity], result of:
        4.749831 = fieldWeight in 2310, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.5 = fieldNorm(doc=2310)
    
  4. Ruge, G.; Schwarz, C.: Linguistically based term associations : a new semantic component for a hyperterm system (1990) 4.75
    4.749831 = sum of:
      4.749831 = weight(author_txt:ruge in 5544) [ClassicSimilarity], result of:
        4.749831 = fieldWeight in 5544, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.5 = fieldNorm(doc=5544)
    
  5. Ruge, G.; Schwarz, C.: Natural language access to free-text data bases (1989) 4.75
    4.749831 = sum of:
      4.749831 = weight(author_txt:ruge in 3567) [ClassicSimilarity], result of:
        4.749831 = fieldWeight in 3567, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.5 = fieldNorm(doc=3567)
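
The author scores above are each a single fieldWeight, the product of tf, idf and fieldNorm. The sketch below recomputes them, assuming the standard Lucene ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); the function names are illustrative, and the inputs are copied from the explain trees.

```python
import math

def classic_idf(doc_freq, max_docs):
    """Inverse document frequency as defined by ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def field_weight(freq, doc_freq, max_docs, field_norm):
    """fieldWeight = tf * idf * fieldNorm, with tf = sqrt(freq)."""
    tf = math.sqrt(freq)
    return tf * classic_idf(doc_freq, max_docs) * field_norm

# Hits 1 and 2 (fieldNorm 0.625); the page reports 5.937289.
print(field_weight(1.0, 8, 44218, 0.625))
# Hits 3 to 5 (fieldNorm 0.5); the page reports 4.749831.
print(field_weight(1.0, 8, 44218, 0.5))
```

The only input that differs between the two groups is fieldNorm. In ClassicSimilarity it shrinks as the field gets longer, which fits the single-author records scoring higher than the co-authored ones.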
    

Similar documents (content)

  1. Ruge, G.; Schwarz, C.: Linguistically based term associations : a new semantic component for a hyperterm system (1990) 0.14
    0.14208485 = sum of:
      0.14208485 = product of:
        0.88803035 = sum of:
          0.093499094 = weight(abstract_txt:statistics in 5544) [ClassicSimilarity], result of:
            0.093499094 = score(doc=5544,freq=1.0), product of:
              0.11917635 = queryWeight, product of:
                1.0443335 = boost
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.01818208 = queryNorm
              0.78454405 = fieldWeight in 5544, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.125 = fieldNorm(doc=5544)
          0.11964876 = weight(abstract_txt:aids in 5544) [ClassicSimilarity], result of:
            0.11964876 = score(doc=5544,freq=1.0), product of:
              0.14047228 = queryWeight, product of:
                1.1338079 = boost
                6.8140855 = idf(docFreq=131, maxDocs=44218)
                0.01818208 = queryNorm
              0.8517607 = fieldWeight in 5544, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8140855 = idf(docFreq=131, maxDocs=44218)
                0.125 = fieldNorm(doc=5544)
          0.12556176 = weight(abstract_txt:term in 5544) [ClassicSimilarity], result of:
            0.12556176 = score(doc=5544,freq=1.0), product of:
              0.20921709 = queryWeight, product of:
                2.3966432 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.01818208 = queryNorm
              0.6001506 = fieldWeight in 5544, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.125 = fieldNorm(doc=5544)
          0.54932076 = weight(abstract_txt:realist in 5544) [ClassicSimilarity], result of:
            0.54932076 = score(doc=5544,freq=1.0), product of:
              0.48889148 = queryWeight, product of:
                2.9913373 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.01818208 = queryNorm
              1.1236047 = fieldWeight in 5544, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.125 = fieldNorm(doc=5544)
        0.16 = coord(4/25)
    
  2. Ru, C.; Tang, J.; Li, S.; Xie, S.; Wang, T.: Using semantic similarity to reduce wrong labels in distant supervision for relation extraction (2018) 0.14
    0.14146765 = sum of:
      0.14146765 = product of:
        0.5052416 = sum of:
          0.074966125 = weight(abstract_txt:input in 5055) [ClassicSimilarity], result of:
            0.074966125 = score(doc=5055,freq=3.0), product of:
              0.11320739 = queryWeight, product of:
                1.0178448 = boost
                6.1171575 = idf(docFreq=264, maxDocs=44218)
                0.01818208 = queryNorm
              0.6622017 = fieldWeight in 5055, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1171575 = idf(docFreq=264, maxDocs=44218)
                0.0625 = fieldNorm(doc=5055)
          0.016958741 = weight(abstract_txt:knowledge in 5055) [ClassicSimilarity], result of:
            0.016958741 = score(doc=5055,freq=1.0), product of:
              0.07637377 = queryWeight, product of:
                1.1823097 = boost
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.01818208 = queryNorm
              0.2220493 = fieldWeight in 5055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.0625 = fieldNorm(doc=5055)
          0.042132836 = weight(abstract_txt:data in 5055) [ClassicSimilarity], result of:
            0.042132836 = score(doc=5055,freq=4.0), product of:
              0.101027444 = queryWeight, product of:
                1.6654227 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.01818208 = queryNorm
              0.41704348 = fieldWeight in 5055, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=5055)
          0.105385154 = weight(abstract_txt:similarity in 5055) [ClassicSimilarity], result of:
            0.105385154 = score(doc=5055,freq=2.0), product of:
              0.20489189 = queryWeight, product of:
                1.936518 = boost
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.01818208 = queryNorm
              0.51434517 = fieldWeight in 5055, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.0625 = fieldNorm(doc=5055)
          0.10162212 = weight(abstract_txt:semantic in 5055) [ClassicSimilarity], result of:
            0.10162212 = score(doc=5055,freq=4.0), product of:
              0.18169838 = queryWeight, product of:
                2.233471 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.01818208 = queryNorm
              0.5592902 = fieldWeight in 5055, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0625 = fieldNorm(doc=5055)
          0.06278088 = weight(abstract_txt:term in 5055) [ClassicSimilarity], result of:
            0.06278088 = score(doc=5055,freq=1.0), product of:
              0.20921709 = queryWeight, product of:
                2.3966432 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.01818208 = queryNorm
              0.3000753 = fieldWeight in 5055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.0625 = fieldNorm(doc=5055)
          0.101395704 = weight(abstract_txt:base in 5055) [ClassicSimilarity], result of:
            0.101395704 = score(doc=5055,freq=1.0), product of:
              0.28799963 = queryWeight, product of:
                2.8119056 = boost
                5.633102 = idf(docFreq=429, maxDocs=44218)
                0.01818208 = queryNorm
              0.35206887 = fieldWeight in 5055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.633102 = idf(docFreq=429, maxDocs=44218)
                0.0625 = fieldNorm(doc=5055)
        0.28 = coord(7/25)
    
  3. Tudhope, D.; Taylor, C.: Navigation via similarity (1997) 0.14
    0.14026852 = sum of:
      0.14026852 = product of:
        0.58445215 = sum of:
          0.05237714 = weight(abstract_txt:takes in 155) [ClassicSimilarity], result of:
            0.05237714 = score(doc=155,freq=1.0), product of:
              0.11078806 = queryWeight, product of:
                1.00691 = boost
                6.0514402 = idf(docFreq=282, maxDocs=44218)
                0.01818208 = queryNorm
              0.47276878 = fieldWeight in 155, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0514402 = idf(docFreq=282, maxDocs=44218)
                0.078125 = fieldNorm(doc=155)
          0.076857775 = weight(abstract_txt:semantically in 155) [ClassicSimilarity], result of:
            0.076857775 = score(doc=155,freq=1.0), product of:
              0.1430618 = queryWeight, product of:
                1.1442107 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.01818208 = queryNorm
              0.5372348 = fieldWeight in 155, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.078125 = fieldNorm(doc=155)
          0.026434226 = weight(abstract_txt:describes in 155) [ClassicSimilarity], result of:
            0.026434226 = score(doc=155,freq=1.0), product of:
              0.088481575 = queryWeight, product of:
                1.2725815 = boost
                3.8240511 = idf(docFreq=2624, maxDocs=44218)
                0.01818208 = queryNorm
              0.298754 = fieldWeight in 155, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8240511 = idf(docFreq=2624, maxDocs=44218)
                0.078125 = fieldNorm(doc=155)
          0.2082857 = weight(abstract_txt:similarity in 155) [ClassicSimilarity], result of:
            0.2082857 = score(doc=155,freq=5.0), product of:
              0.20489189 = queryWeight, product of:
                1.936518 = boost
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.01818208 = queryNorm
              1.0165639 = fieldWeight in 155, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.8191514 = idf(docFreq=356, maxDocs=44218)
                0.078125 = fieldNorm(doc=155)
          0.14202122 = weight(abstract_txt:semantic in 155) [ClassicSimilarity], result of:
            0.14202122 = score(doc=155,freq=5.0), product of:
              0.18169838 = queryWeight, product of:
                2.233471 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.01818208 = queryNorm
              0.78163177 = fieldWeight in 155, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.078125 = fieldNorm(doc=155)
          0.0784761 = weight(abstract_txt:term in 155) [ClassicSimilarity], result of:
            0.0784761 = score(doc=155,freq=1.0), product of:
              0.20921709 = queryWeight, product of:
                2.3966432 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.01818208 = queryNorm
              0.37509412 = fieldWeight in 155, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.078125 = fieldNorm(doc=155)
        0.24 = coord(6/25)
    
  4. Kantardzic, M.: Data mining : concepts, models, methods, and algorithms (2003) 0.14
    0.13673018 = sum of:
      0.13673018 = product of:
        0.4272818 = sum of:
          0.046749547 = weight(abstract_txt:statistics in 2291) [ClassicSimilarity], result of:
            0.046749547 = score(doc=2291,freq=1.0), product of:
              0.11917635 = queryWeight, product of:
                1.0443335 = boost
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.01818208 = queryNorm
              0.39227203 = fieldWeight in 2291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.0625 = fieldNorm(doc=2291)
          0.05982438 = weight(abstract_txt:aids in 2291) [ClassicSimilarity], result of:
            0.05982438 = score(doc=2291,freq=1.0), product of:
              0.14047228 = queryWeight, product of:
                1.1338079 = boost
                6.8140855 = idf(docFreq=131, maxDocs=44218)
                0.01818208 = queryNorm
              0.42588034 = fieldWeight in 2291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8140855 = idf(docFreq=131, maxDocs=44218)
                0.0625 = fieldNorm(doc=2291)
          0.016958741 = weight(abstract_txt:knowledge in 2291) [ClassicSimilarity], result of:
            0.016958741 = score(doc=2291,freq=1.0), product of:
              0.07637377 = queryWeight, product of:
                1.1823097 = boost
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.01818208 = queryNorm
              0.2220493 = fieldWeight in 2291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.0625 = fieldNorm(doc=2291)
          0.02114738 = weight(abstract_txt:describes in 2291) [ClassicSimilarity], result of:
            0.02114738 = score(doc=2291,freq=1.0), product of:
              0.088481575 = queryWeight, product of:
                1.2725815 = boost
                3.8240511 = idf(docFreq=2624, maxDocs=44218)
                0.01818208 = queryNorm
              0.2390032 = fieldWeight in 2291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8240511 = idf(docFreq=2624, maxDocs=44218)
                0.0625 = fieldNorm(doc=2291)
          0.086589314 = weight(abstract_txt:computation in 2291) [ClassicSimilarity], result of:
            0.086589314 = score(doc=2291,freq=1.0), product of:
              0.17974135 = queryWeight, product of:
                1.2825319 = boost
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.01818208 = queryNorm
              0.48174396 = fieldWeight in 2291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7079034 = idf(docFreq=53, maxDocs=44218)
                0.0625 = fieldNorm(doc=2291)
          0.075956054 = weight(abstract_txt:data in 2291) [ClassicSimilarity], result of:
            0.075956054 = score(doc=2291,freq=13.0), product of:
              0.101027444 = queryWeight, product of:
                1.6654227 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.01818208 = queryNorm
              0.7518358 = fieldWeight in 2291, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=2291)
          0.05727551 = weight(abstract_txt:experiments in 2291) [ClassicSimilarity], result of:
            0.05727551 = score(doc=2291,freq=1.0), product of:
              0.17192055 = queryWeight, product of:
                1.7738751 = boost
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.01818208 = queryNorm
              0.33315104 = fieldWeight in 2291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.0625 = fieldNorm(doc=2291)
          0.06278088 = weight(abstract_txt:term in 2291) [ClassicSimilarity], result of:
            0.06278088 = score(doc=2291,freq=1.0), product of:
              0.20921709 = queryWeight, product of:
                2.3966432 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.01818208 = queryNorm
              0.3000753 = fieldWeight in 2291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.0625 = fieldNorm(doc=2291)
        0.32 = coord(8/25)
    
  5. Quillian, M.R.: Word concepts : a theory and simulation of some basic semantic capabilities (1967) 0.12
    0.124550685 = sum of:
      0.124550685 = product of:
        0.44482386 = sum of:
          0.055212732 = weight(abstract_txt:associations in 4414) [ClassicSimilarity], result of:
            0.055212732 = score(doc=4414,freq=1.0), product of:
              0.1331572 = queryWeight, product of:
                1.1038917 = boost
                6.634292 = idf(docFreq=157, maxDocs=44218)
                0.01818208 = queryNorm
              0.41464326 = fieldWeight in 4414, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.634292 = idf(docFreq=157, maxDocs=44218)
                0.0625 = fieldNorm(doc=4414)
          0.082694314 = weight(abstract_txt:meanings in 4414) [ClassicSimilarity], result of:
            0.082694314 = score(doc=4414,freq=2.0), product of:
              0.13834992 = queryWeight, product of:
                1.12521 = boost
                6.7624135 = idf(docFreq=138, maxDocs=44218)
                0.01818208 = queryNorm
              0.59771854 = fieldWeight in 4414, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.7624135 = idf(docFreq=138, maxDocs=44218)
                0.0625 = fieldNorm(doc=4414)
          0.059429172 = weight(abstract_txt:pairs in 4414) [ClassicSimilarity], result of:
            0.059429172 = score(doc=4414,freq=1.0), product of:
              0.13985294 = queryWeight, product of:
                1.1313057 = boost
                6.7990475 = idf(docFreq=133, maxDocs=44218)
                0.01818208 = queryNorm
              0.42494047 = fieldWeight in 4414, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7990475 = idf(docFreq=133, maxDocs=44218)
                0.0625 = fieldNorm(doc=4414)
          0.016958741 = weight(abstract_txt:knowledge in 4414) [ClassicSimilarity], result of:
            0.016958741 = score(doc=4414,freq=1.0), product of:
              0.07637377 = queryWeight, product of:
                1.1823097 = boost
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.01818208 = queryNorm
              0.2220493 = fieldWeight in 4414, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.0625 = fieldNorm(doc=4414)
          0.05727551 = weight(abstract_txt:experiments in 4414) [ClassicSimilarity], result of:
            0.05727551 = score(doc=4414,freq=1.0), product of:
              0.17192055 = queryWeight, product of:
                1.7738751 = boost
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.01818208 = queryNorm
              0.33315104 = fieldWeight in 4414, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.0625 = fieldNorm(doc=4414)
          0.07185769 = weight(abstract_txt:semantic in 4414) [ClassicSimilarity], result of:
            0.07185769 = score(doc=4414,freq=2.0), product of:
              0.18169838 = queryWeight, product of:
                2.233471 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.01818208 = queryNorm
              0.39547786 = fieldWeight in 4414, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0625 = fieldNorm(doc=4414)
          0.101395704 = weight(abstract_txt:base in 4414) [ClassicSimilarity], result of:
            0.101395704 = score(doc=4414,freq=1.0), product of:
              0.28799963 = queryWeight, product of:
                2.8119056 = boost
                5.633102 = idf(docFreq=429, maxDocs=44218)
                0.01818208 = queryNorm
              0.35206887 = fieldWeight in 4414, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.633102 = idf(docFreq=429, maxDocs=44218)
                0.0625 = fieldNorm(doc=4414)
        0.28 = coord(7/25)
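
Each content score above is coord times the sum of per-term scores, where a term score is queryWeight (boost * idf * queryNorm) times fieldWeight (tf * idf * fieldNorm). The sketch below recomputes the first content hit (doc 5544), assuming the standard ClassicSimilarity formulas; the function names are illustrative, and the boosts, document frequencies, norms and queryNorm are copied from the explain tree.

```python
import math

MAX_DOCS = 44218
QUERY_NORM = 0.01818208  # taken from the explain output

def idf(doc_freq, max_docs=MAX_DOCS):
    """ClassicSimilarity idf: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(boost, doc_freq, freq, field_norm):
    """Score of one matching term: queryWeight * fieldWeight."""
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight

def hit_score(terms, query_terms):
    """coord(matched / query_terms) times the sum of the term scores."""
    coord = len(terms) / query_terms
    return coord * sum(term_score(*t) for t in terms)

# (boost, docFreq, termFreq, fieldNorm) for the four terms matched in doc 5544.
DOC_5544 = [
    (1.0443335, 225, 1.0, 0.125),  # statistics
    (1.1338079, 131, 1.0, 0.125),  # aids
    (2.3966432, 987, 1.0, 0.125),  # term
    (2.9913373, 14, 1.0, 0.125),   # realist
]

score = hit_score(DOC_5544, 25)
print(score)  # the page reports 0.14208485 for this hit
```

Because idf enters both queryWeight and fieldWeight, a rare term such as "realist" (docFreq 14) contributes far more than a common one, while coord penalizes hits that match only a few of the 25 query terms.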