Document (#23783)

Author
Danowski, P.
Voß, J.
Title
Wikipedia sammelt Metadaten
Source
Bibliotheksdienst. 39(2005) H.3, S.385
Year
2005
Abstract
Im Rahmen der Vorbereitung auf die Wikipedia-DVD, die zur Buchmesse in Leipzig erscheinen soll, wurden fast 30.000 Artikel der freien Enzyklopädie Wikipedia mit Personendaten versehen. Damit sind die biographischen Artikel erstmals mit strukturierten Metadaten versehen, die wie alle Inhalte des Projekts unter den Bedingungen der GFDL frei weiterverwendet werden können. Die Personendaten umfassen Angaben zu Namen, Geburtsdatum, Geburtsort, Sterbedatum und Sterbeort. Gleichzeitig wird eine Kurzbeschreibung zu den einzelnen Personen gespeichert. Bisher waren diese Daten nur im Fließtext und Personennamen nur in der Form "Vorname Nachname" abgespeichert. Da auf der DVD jedoch eine gezielte Suche nach Personen möglich sein soll, müssen die Namen und anderen Angaben einheitlich, wie es in bibliothekarischen Datenbanken die Regel ist, in der Form "Nachname, Vorname" angesetzt werden. Ziel der Sammlung von Personendaten ist die dokumentarische Erschließung aller biographischen Artikel. Da wie an der gesamten Wikipedia viele Freiwillige an diesem Prozess beteiligt sind, entsprechen die Ergebnisse sicherlich noch nicht professionellen Regelwerken wie RAK. Sie sind ein erster Schritt um die Wikipedia besser automatisch weiterverwendbar zu machen und somit neue Möglichkeiten der Anwendung zu erschließen. Die Personendaten wurden zum größten Teil in einer vom Verlag Directmedia Publishing ausgerichteten "Tagging-Party" vom 28. bis 30. Januar mit Hilfe eines selbst entwickelten Softwaretools direkt in Online-Enzyklopädie eingetragen. Dazu wurden alle Artikel angeschaut und Fehler in den Datenfeldern korrigiert. Die Strukturierung der Personendaten könnte noch wesentlich durch bestehende bibliothekarische Datenbanken wie die Personennormdatei (PND) verbessert werden. Bibliotheken könnten im Gegenzug die Informationen aus der Wikipedia zur Kataloganreicherung nutzen - beispielsweise zur Anzeige von Kurzbiographien zu einzelnen Autoren. Auch weitere Kooperationsmöglichkeiten sind denkbar. Bei Interesse können Sie sich an Jakob Voss oder Patrick Danowski wenden.
Footnote
Ansprechpartner: zu den Personendaten [email protected] (Jakob Voß), [email protected] (Patrick Danowski); zur DVD [email protected]; zu allgemeinen Fragen über die Wikipedia [email protected]. Informationen zur deutschsprachigen Wikipedia: http://www.wikipedia.de/
Theme
Informationsmittel
Internet
Object
Wikipedia

Similar documents (author)

  1. Danowski, J.A.: Network analysis of message content (1993) 5.58
    5.5776863 = sum of:
      5.5776863 = weight(author_txt:danowski in 839) [ClassicSimilarity], result of:
        5.5776863 = fieldWeight in 839, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.924298 = idf(docFreq=15, maxDocs=44218)
          0.625 = fieldNorm(doc=839)
    
  2. Danowski, P.: Kontext Open Access : Creative Commons (2012) 5.58
    5.5776863 = sum of:
      5.5776863 = weight(author_txt:danowski in 828) [ClassicSimilarity], result of:
        5.5776863 = fieldWeight in 828, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.924298 = idf(docFreq=15, maxDocs=44218)
          0.625 = fieldNorm(doc=828)
    
  3. Danowski, P.: Authority files and Web 2.0 : Wikipedia and the PND. An Example (2007) 5.58
    5.5776863 = sum of:
      5.5776863 = weight(author_txt:danowski in 1291) [ClassicSimilarity], result of:
        5.5776863 = fieldWeight in 1291, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.924298 = idf(docFreq=15, maxDocs=44218)
          0.625 = fieldNorm(doc=1291)
    
  4. Danowski, P.: Step one: blow up the silo! : Open bibliographic data, the first step towards Linked Open Data (2010) 5.58
    5.5776863 = sum of:
      5.5776863 = weight(author_txt:danowski in 3962) [ClassicSimilarity], result of:
        5.5776863 = fieldWeight in 3962, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.924298 = idf(docFreq=15, maxDocs=44218)
          0.625 = fieldNorm(doc=3962)
    
  5. Voß, J.; Danowski, P.: Bibliothek, Information und Dokumentation in der Wikipedia (2004) 4.46
    4.462149 = sum of:
      4.462149 = weight(author_txt:danowski in 3046) [ClassicSimilarity], result of:
        4.462149 = fieldWeight in 3046, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.924298 = idf(docFreq=15, maxDocs=44218)
          0.5 = fieldNorm(doc=3046)
    

Similar documents (content)

  1. Online-Enzyklopädie Wikipedia (2003) 0.23
    0.23020492 = sum of:
      0.23020492 = product of:
        0.7193904 = sum of:
          0.027950862 = weight(abstract_txt:alle in 1410) [ClassicSimilarity], result of:
            0.027950862 = score(doc=1410,freq=2.0), product of:
              0.09981851 = queryWeight, product of:
                1.079132 = boost
                5.0688457 = idf(docFreq=755, maxDocs=44218)
                0.018248511 = queryNorm
              0.2800168 = fieldWeight in 1410, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0688457 = idf(docFreq=755, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1410)
          0.016995562 = weight(abstract_txt:werden in 1410) [ClassicSimilarity], result of:
            0.016995562 = score(doc=1410,freq=3.0), product of:
              0.07164259 = queryWeight, product of:
                1.1196964 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.018248511 = queryNorm
              0.23722707 = fieldWeight in 1410, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1410)
          0.050057955 = weight(abstract_txt:personen in 1410) [ClassicSimilarity], result of:
            0.050057955 = score(doc=1410,freq=1.0), product of:
              0.18546972 = queryWeight, product of:
                1.4709758 = boost
                6.9093957 = idf(docFreq=119, maxDocs=44218)
                0.018248511 = queryNorm
              0.26989827 = fieldWeight in 1410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9093957 = idf(docFreq=119, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1410)
          0.058217257 = weight(abstract_txt:enzyklopädie in 1410) [ClassicSimilarity], result of:
            0.058217257 = score(doc=1410,freq=1.0), product of:
              0.20511249 = queryWeight, product of:
                1.54691 = boost
                7.2660704 = idf(docFreq=83, maxDocs=44218)
                0.018248511 = queryNorm
              0.28383088 = fieldWeight in 1410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2660704 = idf(docFreq=83, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1410)
          0.058505595 = weight(abstract_txt:angaben in 1410) [ClassicSimilarity], result of:
            0.058505595 = score(doc=1410,freq=1.0), product of:
              0.20578918 = queryWeight, product of:
                1.5494597 = boost
                7.2780466 = idf(docFreq=82, maxDocs=44218)
                0.018248511 = queryNorm
              0.2842987 = fieldWeight in 1410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2780466 = idf(docFreq=82, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1410)
          0.040800773 = weight(abstract_txt:sind in 1410) [ClassicSimilarity], result of:
            0.040800773 = score(doc=1410,freq=5.0), product of:
              0.11924034 = queryWeight, product of:
                1.6679983 = boost
                3.9174201 = idf(docFreq=2390, maxDocs=44218)
                0.018248511 = queryNorm
              0.34217256 = fieldWeight in 1410, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.9174201 = idf(docFreq=2390, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1410)
          0.17030087 = weight(abstract_txt:artikel in 1410) [ClassicSimilarity], result of:
            0.17030087 = score(doc=1410,freq=10.0), product of:
              0.24534592 = queryWeight, product of:
                2.3926184 = boost
                5.619245 = idf(docFreq=435, maxDocs=44218)
                0.018248511 = queryNorm
              0.69412553 = fieldWeight in 1410, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                5.619245 = idf(docFreq=435, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1410)
          0.2965615 = weight(abstract_txt:wikipedia in 1410) [ClassicSimilarity], result of:
            0.2965615 = score(doc=1410,freq=7.0), product of:
              0.45783454 = queryWeight, product of:
                4.0029845 = boost
                6.2675414 = idf(docFreq=227, maxDocs=44218)
                0.018248511 = queryNorm
              0.64774823 = fieldWeight in 1410, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.2675414 = idf(docFreq=227, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1410)
        0.32 = coord(8/25)
    
  2. Kleinz, T.: Wikipedia professionalisiert sich : Das Büro der deutschen Sektion soll im Oktober in Frankfurt eröffnen - Schreiber und Spender werden umworben (2006) 0.21
    0.20542121 = sum of:
      0.20542121 = product of:
        0.85592175 = sum of:
          0.052595977 = weight(abstract_txt:soll in 2871) [ClassicSimilarity], result of:
            0.052595977 = score(doc=2871,freq=1.0), product of:
              0.10693465 = queryWeight, product of:
                1.1169358 = boost
                5.2464166 = idf(docFreq=632, maxDocs=44218)
                0.018248511 = queryNorm
              0.49185157 = fieldWeight in 2871, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2464166 = idf(docFreq=632, maxDocs=44218)
                0.09375 = fieldNorm(doc=2871)
          0.023549741 = weight(abstract_txt:werden in 2871) [ClassicSimilarity], result of:
            0.023549741 = score(doc=2871,freq=1.0), product of:
              0.07164259 = queryWeight, product of:
                1.1196964 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.018248511 = queryNorm
              0.32871145 = fieldWeight in 2871, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.09375 = fieldNorm(doc=2871)
          0.13972141 = weight(abstract_txt:enzyklopädie in 2871) [ClassicSimilarity], result of:
            0.13972141 = score(doc=2871,freq=1.0), product of:
              0.20511249 = queryWeight, product of:
                1.54691 = boost
                7.2660704 = idf(docFreq=83, maxDocs=44218)
                0.018248511 = queryNorm
              0.68119407 = fieldWeight in 2871, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2660704 = idf(docFreq=83, maxDocs=44218)
                0.09375 = fieldNorm(doc=2871)
          0.07682341 = weight(abstract_txt:wurden in 2871) [ClassicSimilarity], result of:
            0.07682341 = score(doc=2871,freq=1.0), product of:
              0.15758309 = queryWeight, product of:
                1.6606169 = boost
                5.2001123 = idf(docFreq=662, maxDocs=44218)
                0.018248511 = queryNorm
              0.48751053 = fieldWeight in 2871, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2001123 = idf(docFreq=662, maxDocs=44218)
                0.09375 = fieldNorm(doc=2871)
          0.18278608 = weight(abstract_txt:artikel in 2871) [ClassicSimilarity], result of:
            0.18278608 = score(doc=2871,freq=2.0), product of:
              0.24534592 = queryWeight, product of:
                2.3926184 = boost
                5.619245 = idf(docFreq=435, maxDocs=44218)
                0.018248511 = queryNorm
              0.7450137 = fieldWeight in 2871, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.619245 = idf(docFreq=435, maxDocs=44218)
                0.09375 = fieldNorm(doc=2871)
          0.38044512 = weight(abstract_txt:wikipedia in 2871) [ClassicSimilarity], result of:
            0.38044512 = score(doc=2871,freq=2.0), product of:
              0.45783454 = queryWeight, product of:
                4.0029845 = boost
                6.2675414 = idf(docFreq=227, maxDocs=44218)
                0.018248511 = queryNorm
              0.8309664 = fieldWeight in 2871, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2675414 = idf(docFreq=227, maxDocs=44218)
                0.09375 = fieldNorm(doc=2871)
        0.24 = coord(6/25)
    
  3. Ersch, J.S.; Gruber, J.G.: Allgemeine Encyclopädie der Wissenschaften und Künste (1996) 0.20
    0.19679771 = sum of:
      0.19679771 = product of:
        0.9839886 = sum of:
          0.095555454 = weight(abstract_txt:einzelnen in 1859) [ClassicSimilarity], result of:
            0.095555454 = score(doc=1859,freq=1.0), product of:
              0.13143003 = queryWeight, product of:
                1.2382728 = boost
                5.8163543 = idf(docFreq=357, maxDocs=44218)
                0.018248511 = queryNorm
              0.7270443 = fieldWeight in 1859, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8163543 = idf(docFreq=357, maxDocs=44218)
                0.125 = fieldNorm(doc=1859)
          0.23090473 = weight(abstract_txt:versehen in 1859) [ClassicSimilarity], result of:
            0.23090473 = score(doc=1859,freq=1.0), product of:
              0.23667161 = queryWeight, product of:
                1.6616597 = boost
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.018248511 = queryNorm
              0.9756334 = fieldWeight in 1859, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.125 = fieldNorm(doc=1859)
          0.08257496 = weight(abstract_txt:sind in 1859) [ClassicSimilarity], result of:
            0.08257496 = score(doc=1859,freq=2.0), product of:
              0.11924034 = queryWeight, product of:
                1.6679983 = boost
                3.9174201 = idf(docFreq=2390, maxDocs=44218)
                0.018248511 = queryNorm
              0.6925086 = fieldWeight in 1859, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9174201 = idf(docFreq=2390, maxDocs=44218)
                0.125 = fieldNorm(doc=1859)
          0.4026211 = weight(abstract_txt:biographischen in 1859) [ClassicSimilarity], result of:
            0.4026211 = score(doc=1859,freq=1.0), product of:
              0.34286407 = queryWeight, product of:
                2.0 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.018248511 = queryNorm
              1.1742878 = fieldWeight in 1859, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.125 = fieldNorm(doc=1859)
          0.17233236 = weight(abstract_txt:artikel in 1859) [ClassicSimilarity], result of:
            0.17233236 = score(doc=1859,freq=1.0), product of:
              0.24534592 = queryWeight, product of:
                2.3926184 = boost
                5.619245 = idf(docFreq=435, maxDocs=44218)
                0.018248511 = queryNorm
              0.70240563 = fieldWeight in 1859, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.619245 = idf(docFreq=435, maxDocs=44218)
                0.125 = fieldNorm(doc=1859)
        0.2 = coord(5/25)
    
  4. Portal "Bibliothek Information Dokumentation" eingestellt (2004) 0.19
    0.18616897 = sum of:
      0.18616897 = product of:
        1.1635561 = sum of:
          0.27944282 = weight(abstract_txt:enzyklopädie in 3293) [ClassicSimilarity], result of:
            0.27944282 = score(doc=3293,freq=1.0), product of:
              0.20511249 = queryWeight, product of:
                1.54691 = boost
                7.2660704 = idf(docFreq=83, maxDocs=44218)
                0.018248511 = queryNorm
              1.3623881 = fieldWeight in 3293, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2660704 = idf(docFreq=83, maxDocs=44218)
                0.1875 = fieldNorm(doc=3293)
          0.087583974 = weight(abstract_txt:sind in 3293) [ClassicSimilarity], result of:
            0.087583974 = score(doc=3293,freq=1.0), product of:
              0.11924034 = queryWeight, product of:
                1.6679983 = boost
                3.9174201 = idf(docFreq=2390, maxDocs=44218)
                0.018248511 = queryNorm
              0.73451626 = fieldWeight in 3293, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9174201 = idf(docFreq=2390, maxDocs=44218)
                0.1875 = fieldNorm(doc=3293)
          0.25849852 = weight(abstract_txt:artikel in 3293) [ClassicSimilarity], result of:
            0.25849852 = score(doc=3293,freq=1.0), product of:
              0.24534592 = queryWeight, product of:
                2.3926184 = boost
                5.619245 = idf(docFreq=435, maxDocs=44218)
                0.018248511 = queryNorm
              1.0536084 = fieldWeight in 3293, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.619245 = idf(docFreq=435, maxDocs=44218)
                0.1875 = fieldNorm(doc=3293)
          0.5380307 = weight(abstract_txt:wikipedia in 3293) [ClassicSimilarity], result of:
            0.5380307 = score(doc=3293,freq=1.0), product of:
              0.45783454 = queryWeight, product of:
                4.0029845 = boost
                6.2675414 = idf(docFreq=227, maxDocs=44218)
                0.018248511 = queryNorm
              1.175164 = fieldWeight in 3293, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2675414 = idf(docFreq=227, maxDocs=44218)
                0.1875 = fieldNorm(doc=3293)
        0.16 = coord(4/25)
    
  5. Stöcklin, N.: Wikipedia clever nutzen : in Schule und Beruf (2010) 0.17
    0.16991061 = sum of:
      0.16991061 = product of:
        0.7079609 = sum of:
          0.03162279 = weight(abstract_txt:alle in 4531) [ClassicSimilarity], result of:
            0.03162279 = score(doc=4531,freq=1.0), product of:
              0.09981851 = queryWeight, product of:
                1.079132 = boost
                5.0688457 = idf(docFreq=755, maxDocs=44218)
                0.018248511 = queryNorm
              0.31680286 = fieldWeight in 4531, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0688457 = idf(docFreq=755, maxDocs=44218)
                0.0625 = fieldNorm(doc=4531)
          0.022202909 = weight(abstract_txt:werden in 4531) [ClassicSimilarity], result of:
            0.022202909 = score(doc=4531,freq=2.0), product of:
              0.07164259 = queryWeight, product of:
                1.1196964 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.018248511 = queryNorm
              0.30991215 = fieldWeight in 4531, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.0625 = fieldNorm(doc=4531)
          0.08009273 = weight(abstract_txt:personen in 4531) [ClassicSimilarity], result of:
            0.08009273 = score(doc=4531,freq=1.0), product of:
              0.18546972 = queryWeight, product of:
                1.4709758 = boost
                6.9093957 = idf(docFreq=119, maxDocs=44218)
                0.018248511 = queryNorm
              0.43183723 = fieldWeight in 4531, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9093957 = idf(docFreq=119, maxDocs=44218)
                0.0625 = fieldNorm(doc=4531)
          0.13173062 = weight(abstract_txt:enzyklopädie in 4531) [ClassicSimilarity], result of:
            0.13173062 = score(doc=4531,freq=2.0), product of:
              0.20511249 = queryWeight, product of:
                1.54691 = boost
                7.2660704 = idf(docFreq=83, maxDocs=44218)
                0.018248511 = queryNorm
              0.64223593 = fieldWeight in 4531, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.2660704 = idf(docFreq=83, maxDocs=44218)
                0.0625 = fieldNorm(doc=4531)
          0.04128748 = weight(abstract_txt:sind in 4531) [ClassicSimilarity], result of:
            0.04128748 = score(doc=4531,freq=2.0), product of:
              0.11924034 = queryWeight, product of:
                1.6679983 = boost
                3.9174201 = idf(docFreq=2390, maxDocs=44218)
                0.018248511 = queryNorm
              0.3462543 = fieldWeight in 4531, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9174201 = idf(docFreq=2390, maxDocs=44218)
                0.0625 = fieldNorm(doc=4531)
          0.40102437 = weight(abstract_txt:wikipedia in 4531) [ClassicSimilarity], result of:
            0.40102437 = score(doc=4531,freq=5.0), product of:
              0.45783454 = queryWeight, product of:
                4.0029845 = boost
                6.2675414 = idf(docFreq=227, maxDocs=44218)
                0.018248511 = queryNorm
              0.8759155 = fieldWeight in 4531, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.2675414 = idf(docFreq=227, maxDocs=44218)
                0.0625 = fieldNorm(doc=4531)
        0.24 = coord(6/25)