Document (#43503)

Author
Goldberg, D.M.
Zaman, N.
Brahma, A.
Aloiso, M.
Title
Are mortgage loan closing delay risks predictable? : A predictive analysis using text mining on discussion threads
Source
Journal of the Association for Information Science and Technology. 73(2022) no.3, S.419-437
Year
2022
Abstract
Loan processors and underwriters at mortgage firms seek to gather substantial supporting documentation to properly understand and model loan risks. In doing so, loan originations become prone to closing delays, risking client dissatisfaction and consequent revenue losses. We collaborate with a large national mortgage firm to examine the extent to which these delays are predictable, using internal discussion threads to prioritize interventions for loans most at risk. Substantial work experience is required to predict delays, and we find that even highly trained employees have difficulty predicting delays by reviewing discussion threads. We develop an array of methods to predict loan delays. We apply four modern out-of-the-box sentiment analysis techniques, two dictionary-based and two rule-based, to predict delays. We contrast these approaches with domain-specific approaches, including firm-provided keyword searches and "smoke terms" derived using machine learning. Performance varies widely across sentiment approaches; while some sentiment approaches prioritize the top-ranking records well, performance quickly declines thereafter. The firm-provided keyword searches perform at the rate of random chance. We observe that the domain-specific smoke term approaches consistently outperform other approaches and offer better prediction than loan and borrower characteristics. We conclude that text mining solutions would greatly assist mortgage firms in delay prevention.
Content
Vgl.: https://asistdl.onlinelibrary.wiley.com/doi/10.1002/asi.24559.
Theme
Data Mining

Similar documents (author)

  1. Goldberg, M.: CD-ROM periodical indexes : better evaluation necessary (1992) 5.87
    5.871439 = sum of:
      5.871439 = weight(author_txt:goldberg in 8303) [ClassicSimilarity], result of:
        5.871439 = fieldWeight in 8303, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.394302 = idf(docFreq=9, maxDocs=44218)
          0.625 = fieldNorm(doc=8303)
    
  2. Goldberg, J.E.: Library of Congress Classification : shelving device for collections or organization of knowledge fields? (1996) 5.87
    5.871439 = sum of:
      5.871439 = weight(author_txt:goldberg in 4579) [ClassicSimilarity], result of:
        5.871439 = fieldWeight in 4579, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.394302 = idf(docFreq=9, maxDocs=44218)
          0.625 = fieldNorm(doc=4579)
    
  3. Goldberg, E.: ¬The retrieval problem in photography (1932) (1992) 5.87
    5.871439 = sum of:
      5.871439 = weight(author_txt:goldberg in 322) [ClassicSimilarity], result of:
        5.871439 = fieldWeight in 322, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.394302 = idf(docFreq=9, maxDocs=44218)
          0.625 = fieldNorm(doc=322)
    
  4. Goldberg, J.: Classification of religion in LCC (2000) 5.87
    5.871439 = sum of:
      5.871439 = weight(author_txt:goldberg in 5402) [ClassicSimilarity], result of:
        5.871439 = fieldWeight in 5402, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.394302 = idf(docFreq=9, maxDocs=44218)
          0.625 = fieldNorm(doc=5402)
    
  5. Goldberg, E.: ¬Die Regie im Gehrin : Wo wir Pläne schmieden und Entscheidungen treffen (2002) 5.87
    5.871439 = sum of:
      5.871439 = weight(author_txt:goldberg in 1343) [ClassicSimilarity], result of:
        5.871439 = fieldWeight in 1343, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.394302 = idf(docFreq=9, maxDocs=44218)
          0.625 = fieldNorm(doc=1343)
    

Similar documents (content)

  1. Taylor, N.J.; Dennis, A.R.; Cummings, J.W.: Situation normality and the shape of search : the effects of time delays and information presentation on search behavior (2013) 0.12
    0.12488166 = sum of:
      0.12488166 = product of:
        1.0406805 = sum of:
          0.021422131 = weight(abstract_txt:searches in 741) [ClassicSimilarity], result of:
            0.021422131 = score(doc=741,freq=1.0), product of:
              0.06556578 = queryWeight, product of:
                1.1872202 = boost
                5.227637 = idf(docFreq=644, maxDocs=44218)
                0.010564296 = queryNorm
              0.3267273 = fieldWeight in 741, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.227637 = idf(docFreq=644, maxDocs=44218)
                0.0625 = fieldNorm(doc=741)
          0.17331955 = weight(abstract_txt:delay in 741) [ClassicSimilarity], result of:
            0.17331955 = score(doc=741,freq=4.0), product of:
              0.16646151 = queryWeight, product of:
                1.8916883 = boost
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.010564296 = queryNorm
              1.041199 = fieldWeight in 741, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.0625 = fieldNorm(doc=741)
          0.8459388 = weight(abstract_txt:delays in 741) [ClassicSimilarity], result of:
            0.8459388 = score(doc=741,freq=7.0), product of:
              0.57323915 = queryWeight, product of:
                6.080247 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.010564296 = queryNorm
              1.4757171 = fieldWeight in 741, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.0625 = fieldNorm(doc=741)
        0.12 = coord(3/25)
    
  2. Zhang, C.; Zeng, D.; Li, J.; Wang, F.-Y.; Zuo, W.: Sentiment analysis of Chinese documents : from sentence to document level (2009) 0.09
    0.09270461 = sum of:
      0.09270461 = product of:
        0.46352303 = sum of:
          0.011677498 = weight(abstract_txt:using in 3296) [ClassicSimilarity], result of:
            0.011677498 = score(doc=3296,freq=1.0), product of:
              0.043161087 = queryWeight, product of:
                1.1797351 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.010564296 = queryNorm
              0.27055615 = fieldWeight in 3296, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.078125 = fieldNorm(doc=3296)
          0.04414254 = weight(abstract_txt:mining in 3296) [ClassicSimilarity], result of:
            0.04414254 = score(doc=3296,freq=1.0), product of:
              0.091495626 = queryWeight, product of:
                1.4024676 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.010564296 = queryNorm
              0.4824552 = fieldWeight in 3296, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.078125 = fieldNorm(doc=3296)
          0.08750688 = weight(abstract_txt:predict in 3296) [ClassicSimilarity], result of:
            0.08750688 = score(doc=3296,freq=1.0), product of:
              0.16528015 = queryWeight, product of:
                2.3085997 = boost
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.010564296 = queryNorm
              0.5294458 = fieldWeight in 3296, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.078125 = fieldNorm(doc=3296)
          0.2423628 = weight(abstract_txt:sentiment in 3296) [ClassicSimilarity], result of:
            0.2423628 = score(doc=3296,freq=4.0), product of:
              0.20534456 = queryWeight, product of:
                2.5732377 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.010564296 = queryNorm
              1.1802738 = fieldWeight in 3296, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.078125 = fieldNorm(doc=3296)
          0.077833295 = weight(abstract_txt:approaches in 3296) [ClassicSimilarity], result of:
            0.077833295 = score(doc=3296,freq=2.0), product of:
              0.15286316 = queryWeight, product of:
                3.1398196 = boost
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.010564296 = queryNorm
              0.50916976 = fieldWeight in 3296, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.078125 = fieldNorm(doc=3296)
        0.2 = coord(5/25)
    
  3. Thelwall, M.; Buckley, K.; Paltoglou, G.; Cai, D.; Kappas, A.: Sentiment strength detection in short informal text (2010) 0.07
    0.07415495 = sum of:
      0.07415495 = product of:
        0.37077475 = sum of:
          0.009341997 = weight(abstract_txt:using in 4200) [ClassicSimilarity], result of:
            0.009341997 = score(doc=4200,freq=1.0), product of:
              0.043161087 = queryWeight, product of:
                1.1797351 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.010564296 = queryNorm
              0.21644491 = fieldWeight in 4200, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.0625 = fieldNorm(doc=4200)
          0.030622218 = weight(abstract_txt:discussion in 4200) [ClassicSimilarity], result of:
            0.030622218 = score(doc=4200,freq=1.0), product of:
              0.095240936 = queryWeight, product of:
                1.7524681 = boost
                5.144379 = idf(docFreq=700, maxDocs=44218)
                0.010564296 = queryNorm
              0.3215237 = fieldWeight in 4200, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.144379 = idf(docFreq=700, maxDocs=44218)
                0.0625 = fieldNorm(doc=4200)
          0.07000551 = weight(abstract_txt:predict in 4200) [ClassicSimilarity], result of:
            0.07000551 = score(doc=4200,freq=1.0), product of:
              0.16528015 = queryWeight, product of:
                2.3085997 = boost
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.010564296 = queryNorm
              0.42355666 = fieldWeight in 4200, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.0625 = fieldNorm(doc=4200)
          0.21677588 = weight(abstract_txt:sentiment in 4200) [ClassicSimilarity], result of:
            0.21677588 = score(doc=4200,freq=5.0), product of:
              0.20534456 = queryWeight, product of:
                2.5732377 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.010564296 = queryNorm
              1.055669 = fieldWeight in 4200, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.0625 = fieldNorm(doc=4200)
          0.04402916 = weight(abstract_txt:approaches in 4200) [ClassicSimilarity], result of:
            0.04402916 = score(doc=4200,freq=1.0), product of:
              0.15286316 = queryWeight, product of:
                3.1398196 = boost
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.010564296 = queryNorm
              0.2880299 = fieldWeight in 4200, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.0625 = fieldNorm(doc=4200)
        0.2 = coord(5/25)
    
  4. Wei, W.; Liu, Y.-P.; Wei, L-R.: Feature-level sentiment analysis based on rules and fine-grained domain ontology (2020) 0.07
    0.06574619 = sum of:
      0.06574619 = product of:
        0.4109137 = sum of:
          0.03447782 = weight(abstract_txt:domain in 5876) [ClassicSimilarity], result of:
            0.03447782 = score(doc=5876,freq=3.0), product of:
              0.053804044 = queryWeight, product of:
                1.0754745 = boost
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.010564296 = queryNorm
              0.6408035 = fieldWeight in 5876, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7355914 = idf(docFreq=1054, maxDocs=44218)
                0.078125 = fieldNorm(doc=5876)
          0.011677498 = weight(abstract_txt:using in 5876) [ClassicSimilarity], result of:
            0.011677498 = score(doc=5876,freq=1.0), product of:
              0.043161087 = queryWeight, product of:
                1.1797351 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.010564296 = queryNorm
              0.27055615 = fieldWeight in 5876, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.078125 = fieldNorm(doc=5876)
          0.04414254 = weight(abstract_txt:mining in 5876) [ClassicSimilarity], result of:
            0.04414254 = score(doc=5876,freq=1.0), product of:
              0.091495626 = queryWeight, product of:
                1.4024676 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.010564296 = queryNorm
              0.4824552 = fieldWeight in 5876, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.078125 = fieldNorm(doc=5876)
          0.32061586 = weight(abstract_txt:sentiment in 5876) [ClassicSimilarity], result of:
            0.32061586 = score(doc=5876,freq=7.0), product of:
              0.20534456 = queryWeight, product of:
                2.5732377 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.010564296 = queryNorm
              1.5613555 = fieldWeight in 5876, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.078125 = fieldNorm(doc=5876)
        0.16 = coord(4/25)
    
  5. Pang, B.; Lee, L.: Opinion mining and sentiment analysis (2008) 0.06
    0.06481244 = sum of:
      0.06481244 = product of:
        0.32406217 = sum of:
          0.01554332 = weight(abstract_txt:provided in 1171) [ClassicSimilarity], result of:
            0.01554332 = score(doc=1171,freq=1.0), product of:
              0.057870775 = queryWeight, product of:
                1.1153786 = boost
                4.9112997 = idf(docFreq=884, maxDocs=44218)
                0.010564296 = queryNorm
              0.2685867 = fieldWeight in 1171, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9112997 = idf(docFreq=884, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1171)
          0.053519987 = weight(abstract_txt:mining in 1171) [ClassicSimilarity], result of:
            0.053519987 = score(doc=1171,freq=3.0), product of:
              0.091495626 = queryWeight, product of:
                1.4024676 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.010564296 = queryNorm
              0.58494586 = fieldWeight in 1171, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1171)
          0.026794441 = weight(abstract_txt:discussion in 1171) [ClassicSimilarity], result of:
            0.026794441 = score(doc=1171,freq=1.0), product of:
              0.095240936 = queryWeight, product of:
                1.7524681 = boost
                5.144379 = idf(docFreq=700, maxDocs=44218)
                0.010564296 = queryNorm
              0.28133324 = fieldWeight in 1171, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.144379 = idf(docFreq=700, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1171)
          0.18967889 = weight(abstract_txt:sentiment in 1171) [ClassicSimilarity], result of:
            0.18967889 = score(doc=1171,freq=5.0), product of:
              0.20534456 = queryWeight, product of:
                2.5732377 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.010564296 = queryNorm
              0.92371035 = fieldWeight in 1171, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1171)
          0.038525518 = weight(abstract_txt:approaches in 1171) [ClassicSimilarity], result of:
            0.038525518 = score(doc=1171,freq=1.0), product of:
              0.15286316 = queryWeight, product of:
                3.1398196 = boost
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.010564296 = queryNorm
              0.25202617 = fieldWeight in 1171, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1171)
        0.2 = coord(5/25)