Document (#37522)

Author
Chen, Y.-L.
Liu, Y.-H.
Ho, W.-L.
Title
¬A text mining approach to assist the general public in the retrieval of legal documents
Source
Journal of the American Society for Information Science and Technology. 64(2013) no.2, S.280-290
Year
2013
Abstract
Applying text mining techniques to legal issues has been an emerging research topic in recent years. Although some previous studies focused on assisting professionals in the retrieval of related legal documents, they did not take into account the general public and their difficulty in describing legal problems in professional legal terms. Because this problem has not been addressed by previous research, this study aims to design a text-mining-based method that allows the general public to use everyday vocabulary to search for and retrieve criminal judgments. The experimental results indicate that our method can help the general public, who are not familiar with professional legal terms, to acquire relevant criminal judgments more accurately and effectively.
Theme
Data Mining
Field
Rechtswissenschaft

Similar documents (author)

  1. Chen, Y.N.; Chen, S.J.: ¬A metadata practice of the OFLA FRBR model : a case study for the National Palace Museum in Taipai (2004) 4.35
    4.3499155 = sum of:
      4.3499155 = weight(author_txt:chen in 3384) [ClassicSimilarity], result of:
        4.3499155 = fieldWeight in 3384, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          6.1517096 = idf(docFreq=255, maxDocs=44218)
          0.5 = fieldNorm(doc=3384)
    
  2. Chen, C.C.; Chen, H.H.; Chen, K.H.: ¬The design of the XML/Metadata management system (2000) 4.00
    3.9956524 = sum of:
      3.9956524 = weight(author_txt:chen in 4633) [ClassicSimilarity], result of:
        3.9956524 = fieldWeight in 4633, product of:
          1.7320508 = tf(freq=3.0), with freq of:
            3.0 = termFreq=3.0
          6.1517096 = idf(docFreq=255, maxDocs=44218)
          0.375 = fieldNorm(doc=4633)
    
  3. Chen, W.Y.: Observations on cataloguing and classification (1991) 3.84
    3.8448186 = sum of:
      3.8448186 = weight(author_txt:chen in 4184) [ClassicSimilarity], result of:
        3.8448186 = fieldWeight in 4184, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.1517096 = idf(docFreq=255, maxDocs=44218)
          0.625 = fieldNorm(doc=4184)
    
  4. Chen, H.: Knowledge-based document retrieval : framework and design (1992) 3.84
    3.8448186 = sum of:
      3.8448186 = weight(author_txt:chen in 5283) [ClassicSimilarity], result of:
        3.8448186 = fieldWeight in 5283, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.1517096 = idf(docFreq=255, maxDocs=44218)
          0.625 = fieldNorm(doc=5283)
    
  5. Chen, P.S.: On inference rules of logic-based information retrieval systems (1994) 3.84
    3.8448186 = sum of:
      3.8448186 = weight(author_txt:chen in 6731) [ClassicSimilarity], result of:
        3.8448186 = fieldWeight in 6731, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.1517096 = idf(docFreq=255, maxDocs=44218)
          0.625 = fieldNorm(doc=6731)
    

Similar documents (content)

  1. Berry, M.W.; Esau, R.; Kiefer, B.: ¬The use of text mining techniques in electronic discovery for legal matters (2012) 0.23
    0.23084718 = sum of:
      0.23084718 = product of:
        0.8244542 = sum of:
          0.02036495 = weight(abstract_txt:retrieval in 91) [ClassicSimilarity], result of:
            0.02036495 = score(doc=91,freq=1.0), product of:
              0.06250861 = queryWeight, product of:
                1.1754111 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0153030455 = queryNorm
              0.3257943 = fieldWeight in 91, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=91)
          0.032488957 = weight(abstract_txt:been in 91) [ClassicSimilarity], result of:
            0.032488957 = score(doc=91,freq=2.0), product of:
              0.06773786 = queryWeight, product of:
                1.2235891 = boost
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.0153030455 = queryNorm
              0.47962773 = fieldWeight in 91, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.09375 = fieldNorm(doc=91)
          0.03396802 = weight(abstract_txt:documents in 91) [ClassicSimilarity], result of:
            0.03396802 = score(doc=91,freq=1.0), product of:
              0.087915294 = queryWeight, product of:
                1.3939656 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0153030455 = queryNorm
              0.38637212 = fieldWeight in 91, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.09375 = fieldNorm(doc=91)
          0.048133604 = weight(abstract_txt:text in 91) [ClassicSimilarity], result of:
            0.048133604 = score(doc=91,freq=1.0), product of:
              0.12696391 = queryWeight, product of:
                2.051661 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0153030455 = queryNorm
              0.37911248 = fieldWeight in 91, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.09375 = fieldNorm(doc=91)
          0.15891549 = weight(abstract_txt:judgments in 91) [ClassicSimilarity], result of:
            0.15891549 = score(doc=91,freq=1.0), product of:
              0.24592072 = queryWeight, product of:
                2.3314033 = boost
                6.892866 = idf(docFreq=121, maxDocs=44218)
                0.0153030455 = queryNorm
              0.6462062 = fieldWeight in 91, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.892866 = idf(docFreq=121, maxDocs=44218)
                0.09375 = fieldNorm(doc=91)
          0.17141895 = weight(abstract_txt:mining in 91) [ClassicSimilarity], result of:
            0.17141895 = score(doc=91,freq=1.0), product of:
              0.29608786 = queryWeight, product of:
                3.133111 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.0153030455 = queryNorm
              0.57894623 = fieldWeight in 91, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.09375 = fieldNorm(doc=91)
          0.3591642 = weight(abstract_txt:legal in 91) [ClassicSimilarity], result of:
            0.3591642 = score(doc=91,freq=1.0), product of:
              0.6108296 = queryWeight, product of:
                6.3641515 = boost
                6.2719374 = idf(docFreq=226, maxDocs=44218)
                0.0153030455 = queryNorm
              0.5879941 = fieldWeight in 91, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2719374 = idf(docFreq=226, maxDocs=44218)
                0.09375 = fieldNorm(doc=91)
        0.28 = coord(7/25)
    
  2. Turle, H.: Text retrieval in the legal world (1995) 0.20
    0.19964783 = sum of:
      0.19964783 = product of:
        0.99823916 = sum of:
          0.036079466 = weight(abstract_txt:research in 4484) [ClassicSimilarity], result of:
            0.036079466 = score(doc=4484,freq=4.0), product of:
              0.05202433 = queryWeight, product of:
                1.0723168 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.0153030455 = queryNorm
              0.6935114 = fieldWeight in 4484, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.109375 = fieldNorm(doc=4484)
          0.04115198 = weight(abstract_txt:retrieval in 4484) [ClassicSimilarity], result of:
            0.04115198 = score(doc=4484,freq=3.0), product of:
              0.06250861 = queryWeight, product of:
                1.1754111 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0153030455 = queryNorm
              0.658341 = fieldWeight in 4484, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.109375 = fieldNorm(doc=4484)
          0.026802024 = weight(abstract_txt:been in 4484) [ClassicSimilarity], result of:
            0.026802024 = score(doc=4484,freq=1.0), product of:
              0.06773786 = queryWeight, product of:
                1.2235891 = boost
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.0153030455 = queryNorm
              0.3956727 = fieldWeight in 4484, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.109375 = fieldNorm(doc=4484)
          0.05615587 = weight(abstract_txt:text in 4484) [ClassicSimilarity], result of:
            0.05615587 = score(doc=4484,freq=1.0), product of:
              0.12696391 = queryWeight, product of:
                2.051661 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0153030455 = queryNorm
              0.4422979 = fieldWeight in 4484, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.109375 = fieldNorm(doc=4484)
          0.8380498 = weight(abstract_txt:legal in 4484) [ClassicSimilarity], result of:
            0.8380498 = score(doc=4484,freq=4.0), product of:
              0.6108296 = queryWeight, product of:
                6.3641515 = boost
                6.2719374 = idf(docFreq=226, maxDocs=44218)
                0.0153030455 = queryNorm
              1.3719863 = fieldWeight in 4484, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.2719374 = idf(docFreq=226, maxDocs=44218)
                0.109375 = fieldNorm(doc=4484)
        0.2 = coord(5/25)
    
  3. Uyttendaele, C.; Moens, M.-F.; Dumortier, J.: SALOMON: automatic abstracting of legal cases for effective access to court decisions (1998) 0.16
    0.16419382 = sum of:
      0.16419382 = product of:
        0.8209691 = sum of:
          0.050161906 = weight(abstract_txt:effectively in 495) [ClassicSimilarity], result of:
            0.050161906 = score(doc=495,freq=1.0), product of:
              0.09048786 = queryWeight, product of:
                5.913062 = idf(docFreq=324, maxDocs=44218)
                0.0153030455 = queryNorm
              0.55434954 = fieldWeight in 495, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.913062 = idf(docFreq=324, maxDocs=44218)
                0.09375 = fieldNorm(doc=495)
          0.03208907 = weight(abstract_txt:terms in 495) [ClassicSimilarity], result of:
            0.03208907 = score(doc=495,freq=1.0), product of:
              0.08464261 = queryWeight, product of:
                1.3677741 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0153030455 = queryNorm
              0.37911248 = fieldWeight in 495, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.09375 = fieldNorm(doc=495)
          0.048133604 = weight(abstract_txt:text in 495) [ClassicSimilarity], result of:
            0.048133604 = score(doc=495,freq=1.0), product of:
              0.12696391 = queryWeight, product of:
                2.051661 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0153030455 = queryNorm
              0.37911248 = fieldWeight in 495, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.09375 = fieldNorm(doc=495)
          0.33142027 = weight(abstract_txt:criminal in 495) [ClassicSimilarity], result of:
            0.33142027 = score(doc=495,freq=1.0), product of:
              0.4014243 = queryWeight, product of:
                2.978665 = boost
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.0153030455 = queryNorm
              0.8256109 = fieldWeight in 495, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.806516 = idf(docFreq=17, maxDocs=44218)
                0.09375 = fieldNorm(doc=495)
          0.3591642 = weight(abstract_txt:legal in 495) [ClassicSimilarity], result of:
            0.3591642 = score(doc=495,freq=1.0), product of:
              0.6108296 = queryWeight, product of:
                6.3641515 = boost
                6.2719374 = idf(docFreq=226, maxDocs=44218)
                0.0153030455 = queryNorm
              0.5879941 = fieldWeight in 495, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2719374 = idf(docFreq=226, maxDocs=44218)
                0.09375 = fieldNorm(doc=495)
        0.2 = coord(5/25)
    
  4. Cumyn, M.; Reiner, G.; Mas, S.; Lesieur, D.: Legal knowledge representation using a faceted scheme (2019) 0.16
    0.16078132 = sum of:
      0.16078132 = product of:
        1.0048833 = sum of:
          0.020616839 = weight(abstract_txt:research in 5788) [ClassicSimilarity], result of:
            0.020616839 = score(doc=5788,freq=1.0), product of:
              0.05202433 = queryWeight, product of:
                1.0723168 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.0153030455 = queryNorm
              0.39629224 = fieldWeight in 5788, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.125 = fieldNorm(doc=5788)
          0.06405071 = weight(abstract_txt:documents in 5788) [ClassicSimilarity], result of:
            0.06405071 = score(doc=5788,freq=2.0), product of:
              0.087915294 = queryWeight, product of:
                1.3939656 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0153030455 = queryNorm
              0.72855026 = fieldWeight in 5788, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.125 = fieldNorm(doc=5788)
          0.0907616 = weight(abstract_txt:text in 5788) [ClassicSimilarity], result of:
            0.0907616 = score(doc=5788,freq=2.0), product of:
              0.12696391 = queryWeight, product of:
                2.051661 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0153030455 = queryNorm
              0.7148614 = fieldWeight in 5788, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.125 = fieldNorm(doc=5788)
          0.8294542 = weight(abstract_txt:legal in 5788) [ClassicSimilarity], result of:
            0.8294542 = score(doc=5788,freq=3.0), product of:
              0.6108296 = queryWeight, product of:
                6.3641515 = boost
                6.2719374 = idf(docFreq=226, maxDocs=44218)
                0.0153030455 = queryNorm
              1.3579142 = fieldWeight in 5788, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.2719374 = idf(docFreq=226, maxDocs=44218)
                0.125 = fieldNorm(doc=5788)
        0.16 = coord(4/25)
    
  5. Russell-Rose, T.; Chamberlain, J.; Azzopardi, L.: Information retrieval in the workplace : a comparison of professional search practices (2018) 0.14
    0.14412253 = sum of:
      0.14412253 = product of:
        0.5147233 = sum of:
          0.009019867 = weight(abstract_txt:research in 5048) [ClassicSimilarity], result of:
            0.009019867 = score(doc=5048,freq=1.0), product of:
              0.05202433 = queryWeight, product of:
                1.0723168 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.0153030455 = queryNorm
              0.17337786 = fieldWeight in 5048, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5048)
          0.011879555 = weight(abstract_txt:retrieval in 5048) [ClassicSimilarity], result of:
            0.011879555 = score(doc=5048,freq=1.0), product of:
              0.06250861 = queryWeight, product of:
                1.1754111 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0153030455 = queryNorm
              0.19004668 = fieldWeight in 5048, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5048)
          0.013401012 = weight(abstract_txt:been in 5048) [ClassicSimilarity], result of:
            0.013401012 = score(doc=5048,freq=1.0), product of:
              0.06773786 = queryWeight, product of:
                1.2235891 = boost
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.0153030455 = queryNorm
              0.19783635 = fieldWeight in 5048, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.617579 = idf(docFreq=3226, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5048)
          0.019814678 = weight(abstract_txt:documents in 5048) [ClassicSimilarity], result of:
            0.019814678 = score(doc=5048,freq=1.0), product of:
              0.087915294 = queryWeight, product of:
                1.3939656 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0153030455 = queryNorm
              0.22538373 = fieldWeight in 5048, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5048)
          0.057138458 = weight(abstract_txt:professional in 5048) [ClassicSimilarity], result of:
            0.057138458 = score(doc=5048,freq=2.0), product of:
              0.14136724 = queryWeight, product of:
                1.7676417 = boost
                5.2260876 = idf(docFreq=645, maxDocs=44218)
                0.0153030455 = queryNorm
              0.40418458 = fieldWeight in 5048, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2260876 = idf(docFreq=645, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5048)
          0.040583473 = weight(abstract_txt:previous in 5048) [ClassicSimilarity], result of:
            0.040583473 = score(doc=5048,freq=1.0), product of:
              0.14178792 = queryWeight, product of:
                1.7702698 = boost
                5.2338576 = idf(docFreq=640, maxDocs=44218)
                0.0153030455 = queryNorm
              0.2862266 = fieldWeight in 5048, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2338576 = idf(docFreq=640, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5048)
          0.36288622 = weight(abstract_txt:legal in 5048) [ClassicSimilarity], result of:
            0.36288622 = score(doc=5048,freq=3.0), product of:
              0.6108296 = queryWeight, product of:
                6.3641515 = boost
                6.2719374 = idf(docFreq=226, maxDocs=44218)
                0.0153030455 = queryNorm
              0.5940875 = fieldWeight in 5048, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.2719374 = idf(docFreq=226, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5048)
        0.28 = coord(7/25)