Document (#28062)

Author
Batlle, E.
Neuschmied, H.
Uray, P.
Ackermann, G.
Title
Recognition and analysis of audio for copyright protection : the RAA project
Source
Journal of the American Society for Information Science and Technology. 55(2004) no.12, p.1084-1091
Year
2004
Abstract
Automatic generation of play lists for commercial broadcast radio stations has become a major research topic. Audio identification systems have been around for a while, and they show good performance for clean audio files. However, songs transmitted by commercial radio stations are highly distorted to cause greater impact on the casual listener. This impact helps increase the probability that the listener will stay tuned in, but the price we have to pay is a severe modification in the audio itself. This causes the failure of traditional identification systems. Another problem is the fact that songs are never played from the beginning to the end. Actually, they are put on the air several seconds after their real beginning and almost always under the voice of a speaker. The same thing happens at the end. In this article, we present the RAA project, which was conceived to deal with real broadcast audio problems. The idea behind this project is to extract automatically an audio fingerprint (the so-called AudioDNA) that identifies the fragment of audio. This AudioDNA has to be robust enough to appear almost the same under several degrees of distortion. Once this AudioDNA is extracted from the broadcast audio, a matching algorithm is able to find its fragments inside a database. With this approach, the system can find not only a whole song but also small fragments of it, even with high distortion caused by broadcast (and DJ) manipulations.
Footnote
Contribution to a special issue on music indexing and music retrieval
Field
Music

Similar documents (author)

  1. Ackermann, A.: Zur Rolle der Inhaltsanalyse bei der Sacherschließung : theoretischer Anspruch und praktische Wirklichkeit in der RSWK (2001) 6.09
    6.094361 = sum of:
      6.094361 = weight(author_txt:ackermann in 2061) [ClassicSimilarity], result of:
        6.094361 = fieldWeight in 2061, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.625 = fieldNorm(doc=2061)
    
  2. Ackermann, J.: Knuth-Morris-Pratt (2005) 6.09
    6.094361 = sum of:
      6.094361 = weight(author_txt:ackermann in 865) [ClassicSimilarity], result of:
        6.094361 = fieldWeight in 865, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.625 = fieldNorm(doc=865)
    
  3. Ackermann, E.: Piaget's constructivism, Papert's constructionism : what's the difference? (2001) 6.09
    6.094361 = sum of:
      6.094361 = weight(author_txt:ackermann in 692) [ClassicSimilarity], result of:
        6.094361 = fieldWeight in 692, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.625 = fieldNorm(doc=692)
    
  4. Ackermann, U.; Schumann, N.: DissOnline Portal (2007) 4.88
    4.8754888 = sum of:
      4.8754888 = weight(author_txt:ackermann in 2404) [ClassicSimilarity], result of:
        4.8754888 = fieldWeight in 2404, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.5 = fieldNorm(doc=2404)
    
  5. Payome, T.; Ackermann-Stommel, K.: Berufen zum Teletutor? : Interview mit Kerstin Ackermann-Stommel (2005) 4.27
    4.2660527 = sum of:
      4.2660527 = weight(author_txt:ackermann in 3520) [ClassicSimilarity], result of:
        4.2660527 = fieldWeight in 3520, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.4375 = fieldNorm(doc=3520)
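
The author-match scores above can be reproduced by hand. The following is a minimal sketch, assuming the classic Lucene TF-IDF definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); this idf variant is an assumption chosen because it agrees with the idf values listed here. The function names are illustrative and are not part of any Lucene API.

```python
import math

def classic_tf(freq):
    # Term-frequency factor: square root of the raw term frequency.
    return math.sqrt(freq)

def classic_idf(doc_freq, max_docs):
    # Assumed idf variant; it reproduces idf(docFreq=6, maxDocs=44218) ~ 9.75098.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def field_weight(freq, doc_freq, max_docs, field_norm):
    # fieldWeight = tf * idf * fieldNorm, as in the breakdowns above.
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

# Entries 1-3 (fieldNorm 0.625), entry 4 (0.5) and entry 5 (0.4375).
w_0625 = field_weight(1.0, 6, 44218, 0.625)
w_0500 = field_weight(1.0, 6, 44218, 0.5)
w_0437 = field_weight(1.0, 6, 44218, 0.4375)
print(round(w_0625, 4), round(w_0500, 4), round(w_0437, 4))
```

Because tf and idf are identical for all five hits, the ranking here is driven entirely by fieldNorm, which is smaller for documents with longer author fields.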
    

Similar documents (content)

  1. Inskip, C.: Music information retrieval research (2011) 0.12
    0.11925762 = sum of:
      0.11925762 = product of:
        0.49690676 = sum of:
          0.015688391 = weight(abstract_txt:impact in 13) [ClassicSimilarity], result of:
            0.015688391 = score(doc=13,freq=1.0), product of:
              0.06252929 = queryWeight, product of:
                1.0207828 = boost
                4.5878253 = idf(docFreq=1222, maxDocs=44218)
                0.013351906 = queryNorm
              0.2508967 = fieldWeight in 13, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5878253 = idf(docFreq=1222, maxDocs=44218)
                0.0546875 = fieldNorm(doc=13)
          0.018971901 = weight(abstract_txt:find in 13) [ClassicSimilarity], result of:
            0.018971901 = score(doc=13,freq=1.0), product of:
              0.07097495 = queryWeight, product of:
                1.0875373 = boost
                4.887848 = idf(docFreq=905, maxDocs=44218)
                0.013351906 = queryNorm
              0.26730418 = fieldWeight in 13, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.887848 = idf(docFreq=905, maxDocs=44218)
                0.0546875 = fieldNorm(doc=13)
          0.011298539 = weight(abstract_txt:this in 13) [ClassicSimilarity], result of:
            0.011298539 = score(doc=13,freq=2.0), product of:
              0.060542278 = queryWeight, product of:
                1.8791221 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.013351906 = queryNorm
              0.1866223 = fieldWeight in 13, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0546875 = fieldNorm(doc=13)
          0.11799629 = weight(abstract_txt:songs in 13) [ClassicSimilarity], result of:
            0.11799629 = score(doc=13,freq=1.0), product of:
              0.24003622 = queryWeight, product of:
                2.0 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.013351906 = queryNorm
              0.49157703 = fieldWeight in 13, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.0546875 = fieldNorm(doc=13)
          0.13927794 = weight(abstract_txt:listener in 13) [ClassicSimilarity], result of:
            0.13927794 = score(doc=13,freq=1.0), product of:
              0.26809338 = queryWeight, product of:
                2.1136577 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.013351906 = queryNorm
              0.5195128 = fieldWeight in 13, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.0546875 = fieldNorm(doc=13)
          0.19367369 = weight(abstract_txt:audio in 13) [ClassicSimilarity], result of:
            0.19367369 = score(doc=13,freq=1.0), product of:
              0.5301901 = queryWeight, product of:
                5.9448023 = boost
                6.6796074 = idf(docFreq=150, maxDocs=44218)
                0.013351906 = queryNorm
              0.36529103 = fieldWeight in 13, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6796074 = idf(docFreq=150, maxDocs=44218)
                0.0546875 = fieldNorm(doc=13)
        0.24 = coord(6/25)
    
  2. Turner, J.M.; Colinet, E.: Using audio description for indexing moving images (2004) 0.10
    0.095570326 = sum of:
      0.095570326 = product of:
        0.7964194 = sum of:
          0.011413248 = weight(abstract_txt:this in 3724) [ClassicSimilarity], result of:
            0.011413248 = score(doc=3724,freq=1.0), product of:
              0.060542278 = queryWeight, product of:
                1.8791221 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.013351906 = queryNorm
              0.18851699 = fieldWeight in 3724, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.078125 = fieldNorm(doc=3724)
          0.305788 = weight(abstract_txt:broadcast in 3724) [ClassicSimilarity], result of:
            0.305788 = score(doc=3724,freq=1.0), product of:
              0.44983527 = queryWeight, product of:
                3.8719823 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.013351906 = queryNorm
              0.67977774 = fieldWeight in 3724, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.078125 = fieldNorm(doc=3724)
          0.4792181 = weight(abstract_txt:audio in 3724) [ClassicSimilarity], result of:
            0.4792181 = score(doc=3724,freq=3.0), product of:
              0.5301901 = queryWeight, product of:
                5.9448023 = boost
                6.6796074 = idf(docFreq=150, maxDocs=44218)
                0.013351906 = queryNorm
              0.90386087 = fieldWeight in 3724, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.6796074 = idf(docFreq=150, maxDocs=44218)
                0.078125 = fieldNorm(doc=3724)
        0.12 = coord(3/25)
    
  3. Hu, X.; Choi, K.; Downie, J.S.: A framework for evaluating multimodal music mood classification (2017) 0.09
    0.09274748 = sum of:
      0.09274748 = product of:
        0.7728957 = sum of:
          0.0247372 = weight(abstract_txt:same in 3354) [ClassicSimilarity], result of:
            0.0247372 = score(doc=3354,freq=1.0), product of:
              0.06678264 = queryWeight, product of:
                1.0549294 = boost
                4.7412944 = idf(docFreq=1048, maxDocs=44218)
                0.013351906 = queryNorm
              0.37041363 = fieldWeight in 3354, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7412944 = idf(docFreq=1048, maxDocs=44218)
                0.078125 = fieldNorm(doc=3354)
          0.01614077 = weight(abstract_txt:this in 3354) [ClassicSimilarity], result of:
            0.01614077 = score(doc=3354,freq=2.0), product of:
              0.060542278 = queryWeight, product of:
                1.8791221 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.013351906 = queryNorm
              0.2666033 = fieldWeight in 3354, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.078125 = fieldNorm(doc=3354)
          0.7320177 = weight(abstract_txt:audio in 3354) [ClassicSimilarity], result of:
            0.7320177 = score(doc=3354,freq=7.0), product of:
              0.5301901 = queryWeight, product of:
                5.9448023 = boost
                6.6796074 = idf(docFreq=150, maxDocs=44218)
                0.013351906 = queryNorm
              1.3806702 = fieldWeight in 3354, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.6796074 = idf(docFreq=150, maxDocs=44218)
                0.078125 = fieldNorm(doc=3354)
        0.12 = coord(3/25)
    
  4. Huthwaite, A.: IASA Cataloguing Rules for Audiovisual Media with emphasis on sound recordings : project goal and progress report (1996) 0.09
    0.08889993 = sum of:
      0.08889993 = product of:
        0.7408328 = sum of:
          0.040908154 = weight(abstract_txt:project in 7229) [ClassicSimilarity], result of:
            0.040908154 = score(doc=7229,freq=1.0), product of:
              0.08542433 = queryWeight, product of:
                1.4612617 = boost
                4.378348 = idf(docFreq=1507, maxDocs=44218)
                0.013351906 = queryNorm
              0.4788818 = fieldWeight in 7229, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.378348 = idf(docFreq=1507, maxDocs=44218)
                0.109375 = fieldNorm(doc=7229)
          0.15213268 = weight(abstract_txt:radio in 7229) [ClassicSimilarity], result of:
            0.15213268 = score(doc=7229,freq=1.0), product of:
              0.17912637 = queryWeight, product of:
                1.727712 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.013351906 = queryNorm
              0.8493036 = fieldWeight in 7229, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.109375 = fieldNorm(doc=7229)
          0.54779196 = weight(abstract_txt:audio in 7229) [ClassicSimilarity], result of:
            0.54779196 = score(doc=7229,freq=2.0), product of:
              0.5301901 = queryWeight, product of:
                5.9448023 = boost
                6.6796074 = idf(docFreq=150, maxDocs=44218)
                0.013351906 = queryNorm
              1.0331991 = fieldWeight in 7229, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.6796074 = idf(docFreq=150, maxDocs=44218)
                0.109375 = fieldNorm(doc=7229)
        0.12 = coord(3/25)
    
  5. Christel, M.G.: Automated metadata in multimedia information systems : creation, refinement, use in surrogates, and evaluation (2009) 0.08
    0.08008804 = sum of:
      0.08008804 = product of:
        0.50055027 = sum of:
          0.025447927 = weight(abstract_txt:under in 3086) [ClassicSimilarity], result of:
            0.025447927 = score(doc=3086,freq=1.0), product of:
              0.07897171 = queryWeight, product of:
                1.1471689 = boost
                5.155857 = idf(docFreq=692, maxDocs=44218)
                0.013351906 = queryNorm
              0.32224107 = fieldWeight in 3086, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.155857 = idf(docFreq=692, maxDocs=44218)
                0.0625 = fieldNorm(doc=3086)
          0.009130599 = weight(abstract_txt:this in 3086) [ClassicSimilarity], result of:
            0.009130599 = score(doc=3086,freq=1.0), product of:
              0.060542278 = queryWeight, product of:
                1.8791221 = boost
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.013351906 = queryNorm
              0.1508136 = fieldWeight in 3086, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4130175 = idf(docFreq=10762, maxDocs=44218)
                0.0625 = fieldNorm(doc=3086)
          0.2446304 = weight(abstract_txt:broadcast in 3086) [ClassicSimilarity], result of:
            0.2446304 = score(doc=3086,freq=1.0), product of:
              0.44983527 = queryWeight, product of:
                3.8719823 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.013351906 = queryNorm
              0.54382217 = fieldWeight in 3086, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.0625 = fieldNorm(doc=3086)
          0.22134136 = weight(abstract_txt:audio in 3086) [ClassicSimilarity], result of:
            0.22134136 = score(doc=3086,freq=1.0), product of:
              0.5301901 = queryWeight, product of:
                5.9448023 = boost
                6.6796074 = idf(docFreq=150, maxDocs=44218)
                0.013351906 = queryNorm
              0.41747546 = fieldWeight in 3086, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6796074 = idf(docFreq=150, maxDocs=44218)
                0.0625 = fieldNorm(doc=3086)
        0.16 = coord(4/25)
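
The content-similarity scores combine per-term weights with a coordination factor. The sketch below recomputes the score of entry 2 (Turner and Colinet) from the boost, docFreq, and termFreq values listed in its breakdown. It assumes the same tf and idf definitions as classic Lucene TF-IDF scoring (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))); the helper names are illustrative only.

```python
import math

MAX_DOCS = 44218
QUERY_NORM = 0.013351906  # taken from the breakdown above

def classic_idf(doc_freq, max_docs=MAX_DOCS):
    # Assumed idf variant that matches the listed idf values.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(boost, doc_freq, freq, field_norm):
    idf = classic_idf(doc_freq)
    query_weight = boost * idf * QUERY_NORM          # queryWeight
    field_weight = math.sqrt(freq) * idf * field_norm  # fieldWeight
    return query_weight * field_weight

# (boost, docFreq, termFreq) for the terms matched in doc 3724.
matched_terms = {
    "this": (1.8791221, 10762, 1.0),
    "broadcast": (3.8719823, 19, 1.0),
    "audio": (5.9448023, 150, 3.0),
}
field_norm = 0.078125

term_sum = sum(
    term_score(boost, df, freq, field_norm)
    for boost, df, freq in matched_terms.values()
)
coord = 3 / 25  # 3 of the 25 query terms matched
score = term_sum * coord
print(round(term_sum, 5), round(score, 5))
```

The coord factor explains why entry 1 ranks first despite lower per-term weights: it matches 6 of 25 query terms, so its sum is scaled by 0.24 rather than 0.12.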