Document (#9153)

Author
Schulze, U.
Title
Erfahrungen bei der Anwendung automatischer Klassifizierungsverfahren zur Inhaltsanalyse einer Dokumentenmenge
Source
Kooperation in der Klassifikation I. Proc. der Sekt.1-3 der 2. Fachtagung der Gesellschaft für Klassifikation, Frankfurt-Hoechst, 6.-7.4.1978. Bearb.: W. Dahlberg
Imprint
Frankfurt : Gesellschaft für Klassifikation
Year
1978
Pages
S.166-185
Series
Studien zur Klassifikation; Bd.2
Abstract
Die der Analyse zugrundeliegende Dokumentenmenge besteht aus 1.000 Entscheidungen des Bundesverfassungsgerichtes, deren volle Texte maschinenlesbar zur Verfügung standen. Vorgestellt werden die Anwendung eines iterativen Centroidverfahrens auf etwa 1.000 Wörter und die Anwendung eines Single-Linkage-Verfahrens in einer nicht-hierarchischen Variante, sowie die auf der Graphentheorie basierenden Verfahren und die verschiedener Ähnlichkeitsfunktionen und der Einfluß auf die Ergebnisse
Theme
Automatisches Klassifizieren

Similar documents (author)

  1. Schulze, E.: ¬Der Terminus : Eigenschaften und Wesen sowie seine Abgrenzung von anderen Lexemarten (1993) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:schulze in 4627) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 4627, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=4627)
    
  2. Schulze, G.: ¬Die Rolle der Europäischen Union beim Aufbau transeuropäischer Netze (1996) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:schulze in 6036) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 6036, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=6036)
    
  3. Schulze, S.: Ahnenforschung und Alterszucker : Noch sind sie eine Minderheit - Senioren surfen durchs Internet, sammeln Informationen und schließen online Freundschaften (1998) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:schulze in 474) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 474, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=474)
    
  4. Schulze, M.: ¬Das Projekt "nestor" : Aufbau eines Kompetenznetzwerks Langzeitarchivierung und Langzeitverfügbarkeit digitaler Ressourcen für Deutschland (2004) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:schulze in 4534) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 4534, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=4534)
    
  5. Schulze, V.: ¬Die Klassifikation der Kunstgeschichte : Geschichte der Ordnungsgrundsätze und Erörterung des Entwurfs der Systematik 'Kunst' für die Universitätsbibliothek Bremen (1967) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:schulze in 5268) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 5268, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=5268)
    

Similar documents (content)

  1. Lepsky, K.: Automatische Indexierung zur Erschließung deutschsprachiger Dokumente (1999) 0.09
    0.09184971 = sum of:
      0.09184971 = product of:
        0.5740607 = sum of:
          0.08532645 = weight(abstract_txt:texte in 4656) [ClassicSimilarity], result of:
            0.08532645 = score(doc=4656,freq=1.0), product of:
              0.13430151 = queryWeight, product of:
                1.1723362 = boost
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.016904302 = queryNorm
              0.63533497 = fieldWeight in 4656, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.09375 = fieldNorm(doc=4656)
          0.25360242 = weight(abstract_txt:verfahrens in 4656) [ClassicSimilarity], result of:
            0.25360242 = score(doc=4656,freq=3.0), product of:
              0.19249538 = queryWeight, product of:
                1.4035306 = boost
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.016904302 = queryNorm
              1.3174468 = fieldWeight in 4656, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.09375 = fieldNorm(doc=4656)
          0.052553706 = weight(abstract_txt:eines in 4656) [ClassicSimilarity], result of:
            0.052553706 = score(doc=4656,freq=1.0), product of:
              0.12249096 = queryWeight, product of:
                1.5833566 = boost
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.016904302 = queryNorm
              0.4290415 = fieldWeight in 4656, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.09375 = fieldNorm(doc=4656)
          0.18257806 = weight(abstract_txt:anwendung in 4656) [ClassicSimilarity], result of:
            0.18257806 = score(doc=4656,freq=1.0), product of:
              0.32163596 = queryWeight, product of:
                3.1423507 = boost
                6.0549803 = idf(docFreq=281, maxDocs=44218)
                0.016904302 = queryNorm
              0.5676544 = fieldWeight in 4656, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0549803 = idf(docFreq=281, maxDocs=44218)
                0.09375 = fieldNorm(doc=4656)
        0.16 = coord(4/25)
    
  2. Scheele, M.: ¬Die automatische Indexierung beliebiger Titel und Schlagwörter auf der Grundlage eines Modells für einen Gesamtthesaurus des Wissens (1983) 0.09
    0.08956581 = sum of:
      0.08956581 = product of:
        0.44782907 = sum of:
          0.06763856 = weight(abstract_txt:erfahrungen in 110) [ClassicSimilarity], result of:
            0.06763856 = score(doc=110,freq=1.0), product of:
              0.11503272 = queryWeight, product of:
                1.0849817 = boost
                6.2719374 = idf(docFreq=226, maxDocs=44218)
                0.016904302 = queryNorm
              0.5879941 = fieldWeight in 110, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2719374 = idf(docFreq=226, maxDocs=44218)
                0.09375 = fieldNorm(doc=110)
          0.12377591 = weight(abstract_txt:wörter in 110) [ClassicSimilarity], result of:
            0.12377591 = score(doc=110,freq=1.0), product of:
              0.17210066 = queryWeight, product of:
                1.327098 = boost
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.016904302 = queryNorm
              0.71920645 = fieldWeight in 110, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.09375 = fieldNorm(doc=110)
          0.045422513 = weight(abstract_txt:einer in 110) [ClassicSimilarity], result of:
            0.045422513 = score(doc=110,freq=2.0), product of:
              0.08821435 = queryWeight, product of:
                1.3436816 = boost
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.016904302 = queryNorm
              0.5149107 = fieldWeight in 110, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.09375 = fieldNorm(doc=110)
          0.15843837 = weight(abstract_txt:automatischer in 110) [ClassicSimilarity], result of:
            0.15843837 = score(doc=110,freq=1.0), product of:
              0.20289221 = queryWeight, product of:
                1.4409351 = boost
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.016904302 = queryNorm
              0.7808992 = fieldWeight in 110, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.09375 = fieldNorm(doc=110)
          0.052553706 = weight(abstract_txt:eines in 110) [ClassicSimilarity], result of:
            0.052553706 = score(doc=110,freq=1.0), product of:
              0.12249096 = queryWeight, product of:
                1.5833566 = boost
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.016904302 = queryNorm
              0.4290415 = fieldWeight in 110, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.09375 = fieldNorm(doc=110)
        0.2 = coord(5/25)
    
  3. Umlauf, K.: Sacherschließung auf der VLBPlus-CD-ROM durch Klassifikation : Die Warengruppen-Systematik des Buchhandels (2001) 0.09
    0.087420255 = sum of:
      0.087420255 = product of:
        0.5463766 = sum of:
          0.044131216 = weight(abstract_txt:analyse in 1404) [ClassicSimilarity], result of:
            0.044131216 = score(doc=1404,freq=1.0), product of:
              0.09771845 = queryWeight, product of:
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.016904302 = queryNorm
              0.45161602 = fieldWeight in 1404, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.780685 = idf(docFreq=370, maxDocs=44218)
                0.078125 = fieldNorm(doc=1404)
          0.037852097 = weight(abstract_txt:einer in 1404) [ClassicSimilarity], result of:
            0.037852097 = score(doc=1404,freq=2.0), product of:
              0.08821435 = queryWeight, product of:
                1.3436816 = boost
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.016904302 = queryNorm
              0.4290923 = fieldWeight in 1404, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.078125 = fieldNorm(doc=1404)
          0.24922298 = weight(abstract_txt:1.000 in 1404) [ClassicSimilarity], result of:
            0.24922298 = score(doc=1404,freq=1.0), product of:
              0.39043435 = queryWeight, product of:
                2.8268368 = boost
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.016904302 = queryNorm
              0.63832235 = fieldWeight in 1404, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.078125 = fieldNorm(doc=1404)
          0.21517031 = weight(abstract_txt:anwendung in 1404) [ClassicSimilarity], result of:
            0.21517031 = score(doc=1404,freq=2.0), product of:
              0.32163596 = queryWeight, product of:
                3.1423507 = boost
                6.0549803 = idf(docFreq=281, maxDocs=44218)
                0.016904302 = queryNorm
              0.6689871 = fieldWeight in 1404, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.0549803 = idf(docFreq=281, maxDocs=44218)
                0.078125 = fieldNorm(doc=1404)
        0.16 = coord(4/25)
    
  4. Kompakt Brockhaus multimedial : das digitale Lexikon von A bis Z (1996) 0.08
    0.082004055 = sum of:
      0.082004055 = product of:
        1.0250508 = sum of:
          0.2275372 = weight(abstract_txt:texte in 6024) [ClassicSimilarity], result of:
            0.2275372 = score(doc=6024,freq=1.0), product of:
              0.13430151 = queryWeight, product of:
                1.1723362 = boost
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.016904302 = queryNorm
              1.6942266 = fieldWeight in 6024, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.25 = fieldNorm(doc=6024)
          0.79751354 = weight(abstract_txt:1.000 in 6024) [ClassicSimilarity], result of:
            0.79751354 = score(doc=6024,freq=1.0), product of:
              0.39043435 = queryWeight, product of:
                2.8268368 = boost
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.016904302 = queryNorm
              2.0426316 = fieldWeight in 6024, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.25 = fieldNorm(doc=6024)
        0.08 = coord(2/25)
    
  5. Nöther, I.: Modell einer Konkordanz-Klassifikation für systematische Kataloge : T.1-2 (1994) 0.08
    0.08155029 = sum of:
      0.08155029 = product of:
        0.40775147 = sum of:
          0.053180914 = weight(abstract_txt:etwa in 2135) [ClassicSimilarity], result of:
            0.053180914 = score(doc=2135,freq=1.0), product of:
              0.097993135 = queryWeight, product of:
                1.0014045 = boost
                5.788804 = idf(docFreq=367, maxDocs=44218)
                0.016904302 = queryNorm
              0.5427004 = fieldWeight in 2135, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.788804 = idf(docFreq=367, maxDocs=44218)
                0.09375 = fieldNorm(doc=2135)
          0.068805985 = weight(abstract_txt:besteht in 2135) [ClassicSimilarity], result of:
            0.068805985 = score(doc=2135,freq=1.0), product of:
              0.11635256 = queryWeight, product of:
                1.0911883 = boost
                6.3078156 = idf(docFreq=218, maxDocs=44218)
                0.016904302 = queryNorm
              0.5913577 = fieldWeight in 2135, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3078156 = idf(docFreq=218, maxDocs=44218)
                0.09375 = fieldNorm(doc=2135)
          0.055630997 = weight(abstract_txt:einer in 2135) [ClassicSimilarity], result of:
            0.055630997 = score(doc=2135,freq=3.0), product of:
              0.08821435 = queryWeight, product of:
                1.3436816 = boost
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.016904302 = queryNorm
              0.6306343 = fieldWeight in 2135, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.09375 = fieldNorm(doc=2135)
          0.17757986 = weight(abstract_txt:variante in 2135) [ClassicSimilarity], result of:
            0.17757986 = score(doc=2135,freq=1.0), product of:
              0.2189211 = queryWeight, product of:
                1.4967716 = boost
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.016904302 = queryNorm
              0.8111592 = fieldWeight in 2135, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.652365 = idf(docFreq=20, maxDocs=44218)
                0.09375 = fieldNorm(doc=2135)
          0.052553706 = weight(abstract_txt:eines in 2135) [ClassicSimilarity], result of:
            0.052553706 = score(doc=2135,freq=1.0), product of:
              0.12249096 = queryWeight, product of:
                1.5833566 = boost
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.016904302 = queryNorm
              0.4290415 = fieldWeight in 2135, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5764427 = idf(docFreq=1236, maxDocs=44218)
                0.09375 = fieldNorm(doc=2135)
        0.2 = coord(5/25)