{"id":1360,"date":"2020-12-15T11:16:33","date_gmt":"2020-12-15T10:16:33","guid":{"rendered":"https:\/\/eaa-online.org\/arc\/blog\/blog\/why-consider-using-quantile-regression-your-research\/"},"modified":"2020-12-15T11:16:33","modified_gmt":"2020-12-15T10:16:33","slug":"why-consider-using-quantile-regression-your-research","status":"publish","type":"post","link":"https:\/\/eaa-online.org\/arc\/blog\/2020\/12\/15\/why-consider-using-quantile-regression-your-research\/","title":{"rendered":"Why consider using quantile regression in your research"},"content":{"rendered":"<p><u>Why consider using quantile regression in your research<\/u><\/p>\n<p>In graduate school, we were taught about the nice properties of the coefficients estimated with the ordinary least square (OLS) regression model: They are BLUE (best linear unbiased estimators). (Note: &nbsp;The &lsquo;best&rsquo; in BLUE refers to the sampling distribution having the minimum variance, i.e., most efficient.) If normality is additionally assumed on the disturbance term, then the estimated coefficients are also normally distributed, allowing hypotheses on them to be tested with t- and F-tests. What might not have been emphasized are the <strong>consequences when the normality assumption is violated<\/strong>.<\/p>\n<p>Profitability (as scaled earnings) is among the most important inputs used for valuation. In a profitability <strong>forecasting setting<\/strong>, my co-authors and I show that median regression, as a special case of quantile regression (with &#x1D70F; = 0.5), produces more accurate forecasts than OLS regression does. Simulation and archival-data analyses indicate that the incremental forecasting accuracy is related to the <strong>tail-heaviness <\/strong>of the earnings distribution. As an external validation, the distributional shape analysis is applied to cash flow forecasting and yields the same conclusion (<a href=\"https:\/\/dx.doi.org\/10.2139\/ssrn.3008666\" title=\"This is a link to the cited document.\">Tian, Yim, and Newton [2020]<\/a>: Tail-Heaviness, Asymmetry, and Profitability Forecasting by Quantile Regression, <a href=\"https:\/\/doi.org\/10.1287\/mnsc.2020.3694\" title=\"This is a link to the cited document.\">forthcoming in <em>Management Science<\/em><\/a>). Recognizing quantile regression&rsquo;s advantage, other accounting researchers have also used this estimation approach in their research (e.g., <a href=\"http:\/\/dx.doi.org\/10.2139\/ssrn.3040354\" title=\"This is a link to the cited document.\">Easton et al 2020<\/a>). &nbsp;&nbsp;<\/p>\n<p>Even in an <strong>inference setting<\/strong>, disregarding the violation of the normality assumption can critically bias the conclusion of statistical testing because the t- and F-tests are not accurate when the OLS coefficients are no longer normally distributed. <a href=\"https:\/\/lambdaclass.com\/data_etudes\/central_limit_theorem_misuse\/\" title=\"This is a link to the cited document.\">A large sample size might not be an effective fix<\/a>. &nbsp;&nbsp;<\/p>\n<p>In practice, one often needs to deal with the violation of the normality assumption arising from heavy tails (see the discussion in this blog post: <a href=\"https:\/\/www.sr-sv.com\/the-dangerous-disregard-of-fat-tails-in-quantitative-finance\/\" title=\"This is a link to the cited document.\">The dangerous disregard for fat tails in quantitative finance<\/a>). A frequently&nbsp;forgotten truth is that with heavy tails, <a href=\"https:\/\/www.johndcook.com\/blog\/2009\/03\/06\/student-t-distribution-mean-median\/\" title=\"This is a link to the cited document.\">the sample median can be better<\/a> than the sample mean as an estimator for the population <u>mean<\/u> &nbsp;(see <a href=\"https:\/\/www.jstor.org\/stable\/2684695\" title=\"This is a link to the cited document.\">this publication in <em>The American Statistician<\/em><\/a> for a more systematic comparison.) &nbsp;<\/p>\n<p><!--break--> <\/p>\n<p>Prior research has proposed median regression as an alternative to OLS to avoid misleading conclusions that might result from the latter when the distribution of the error term has heavy tails (<a href=\"https:\/\/www.jstor.org\/stable\/41575832\" title=\"This is a link to the cited document.\">Harden and Desmarais 2011<\/a>). For large samples, OLS and median regression estimates are often quite similar (see the example on <a href=\"https:\/\/fmwww.bc.edu\/EC-C\/S2013\/823\/EC823.S2013.nn04.slides.pdf\" title=\"This is a link to the cited document.\">p. 16 of these slides<\/a>). If the normality assumption is met, the cost of using median regression is that the estimate being not as efficient is likely to set a higher hurdle for rejecting a null hypothesis. However, if the error term has heavy tails, median regression has the benefits of producing more efficient estimates and more robust conclusions less affected by outliers (see the discussions <a href=\"https:\/\/stats.stackexchange.com\/a\/251562\/301558\" title=\"This is a link to the cited document.\">here<\/a> and <a href=\"https:\/\/stats.stackexchange.com\/a\/49091\/301558\" title=\"This is a link to the cited document.\">here<\/a>).<\/p>\n<p><a href=\"http:\/\/dx.doi.org\/10.2139\/ssrn.3008666\" title=\"This is a link to the cited document.\">Tian, Yim, and Newton (2020)<\/a> have only scratched the surface of quantile regression&rsquo;s usefulness by focusing on the median regression as its special case. Quantile regression in general can produce optimal estimates\/forecasts for asymmetric loss functions (when &#x1D70F; &ne; 0.5). Prior research has argued that financial analysts have an asymmetric loss function (<a href=\"http:\/\/www.bristol.ac.uk\/efm\/people\/mark-a-clatworthy\" title=\"This is a link to the author.\">Clatworthy<\/a>, <a href=\"https:\/\/www.lancaster.ac.uk\/lums\/people\/david-peel\" title=\"This is a link to the author.\">Peel<\/a>, and <a href=\"http:\/\/faculty.unibocconi.eu\/peterfrancispope\" title=\"This is a link to the author.\">Pope<\/a> [2012]: <a href=\"https:\/\/onlinelibrary.wiley.com\/doi\/abs\/10.1002\/for.1253\" title=\"This is a link to the cited document.\">Are Analysts&#8217; Loss Functions Asymmetric?, <em>Journal of Forecasting<\/em><\/a>). If they do, would they find formulating their forecasts based on quantile regression with &#x1D70F; &ne; 0.5 more aligned with their forecasting objective? What is the implied &#x1D70F; that can be inferred from analyst earnings forecasts? Are the implied &#x1D70F;&rsquo;s similar across different types of analyst forecasts (cash flow forecasts, revenue forecasts, etc)? These are interesting questions left for future research to answer.&nbsp;&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Why consider using quantile regression in your research In graduate school, we were taught about the nice properties of the coefficients estimated with the ordinary least square (OLS) regression model: They are BLUE (best linear unbiased estimators). (Note: &nbsp;The &lsquo;best&rsquo; in BLUE refers to the sampling distribution having the minimum variance, i.e., most efficient.) If [&hellip;]<\/p>\n","protected":false},"author":87,"featured_media":1361,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"ngg_post_thumbnail":0},"categories":[1],"tags":[],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v20.12 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Why consider using quantile regression in your research - ARC<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/eaa-online.org\/arc\/blog\/2020\/12\/15\/why-consider-using-quantile-regression-your-research\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Why consider using quantile regression in your research - ARC\" \/>\n<meta property=\"og:description\" content=\"Why consider using quantile regression in your research In graduate school, we were taught about the nice properties of the coefficients estimated with the ordinary least square (OLS) regression model: They are BLUE (best linear unbiased estimators). (Note: &nbsp;The &lsquo;best&rsquo; in BLUE refers to the sampling distribution having the minimum variance, i.e., most efficient.) If [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/eaa-online.org\/arc\/blog\/2020\/12\/15\/why-consider-using-quantile-regression-your-research\/\" \/>\n<meta property=\"og:site_name\" content=\"ARC\" \/>\n<meta property=\"article:published_time\" content=\"2020-12-15T10:16:33+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/eaa-online.org\/app\/uploads\/sites\/3\/2020\/12\/1_gda8iv1r7t1upstaz4pa5w_1.png\" \/>\n\t<meta property=\"og:image:width\" content=\"640\" \/>\n\t<meta property=\"og:image:height\" content=\"351\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Andrew Yim\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Andrew Yim\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/eaa-online.org\/arc\/blog\/2020\/12\/15\/why-consider-using-quantile-regression-your-research\/\",\"url\":\"https:\/\/eaa-online.org\/arc\/blog\/2020\/12\/15\/why-consider-using-quantile-regression-your-research\/\",\"name\":\"Why consider using quantile regression in your research - ARC\",\"isPartOf\":{\"@id\":\"https:\/\/eaa-online.org\/arc\/#website\"},\"datePublished\":\"2020-12-15T10:16:33+00:00\",\"dateModified\":\"2020-12-15T10:16:33+00:00\",\"author\":{\"@id\":\"https:\/\/eaa-online.org\/arc\/#\/schema\/person\/a90e8889ee2abad035780fd36788d55f\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/eaa-online.org\/arc\/blog\/2020\/12\/15\/why-consider-using-quantile-regression-your-research\/\"]}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/eaa-online.org\/arc\/#website\",\"url\":\"https:\/\/eaa-online.org\/arc\/\",\"name\":\"ARC\",\"description\":\"Advanced Resources Center\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/eaa-online.org\/arc\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/eaa-online.org\/arc\/#\/schema\/person\/a90e8889ee2abad035780fd36788d55f\",\"name\":\"Andrew Yim\",\"sameAs\":[\"https:\/\/papers.ssrn.com\/sol3\/cf_dev\/AbsByAuth.cfm?per_id=28023\"],\"url\":\"https:\/\/eaa-online.org\/arc\/blog\/members\/87\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Why consider using quantile regression in your research - ARC","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/eaa-online.org\/arc\/blog\/2020\/12\/15\/why-consider-using-quantile-regression-your-research\/","og_locale":"en_US","og_type":"article","og_title":"Why consider using quantile regression in your research - ARC","og_description":"Why consider using quantile regression in your research In graduate school, we were taught about the nice properties of the coefficients estimated with the ordinary least square (OLS) regression model: They are BLUE (best linear unbiased estimators). (Note: &nbsp;The &lsquo;best&rsquo; in BLUE refers to the sampling distribution having the minimum variance, i.e., most efficient.) If [&hellip;]","og_url":"https:\/\/eaa-online.org\/arc\/blog\/2020\/12\/15\/why-consider-using-quantile-regression-your-research\/","og_site_name":"ARC","article_published_time":"2020-12-15T10:16:33+00:00","og_image":[{"width":640,"height":351,"url":"https:\/\/eaa-online.org\/app\/uploads\/sites\/3\/2020\/12\/1_gda8iv1r7t1upstaz4pa5w_1.png","type":"image\/png"}],"author":"Andrew Yim","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Andrew Yim","Est. reading time":"3 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/eaa-online.org\/arc\/blog\/2020\/12\/15\/why-consider-using-quantile-regression-your-research\/","url":"https:\/\/eaa-online.org\/arc\/blog\/2020\/12\/15\/why-consider-using-quantile-regression-your-research\/","name":"Why consider using quantile regression in your research - ARC","isPartOf":{"@id":"https:\/\/eaa-online.org\/arc\/#website"},"datePublished":"2020-12-15T10:16:33+00:00","dateModified":"2020-12-15T10:16:33+00:00","author":{"@id":"https:\/\/eaa-online.org\/arc\/#\/schema\/person\/a90e8889ee2abad035780fd36788d55f"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/eaa-online.org\/arc\/blog\/2020\/12\/15\/why-consider-using-quantile-regression-your-research\/"]}]},{"@type":"WebSite","@id":"https:\/\/eaa-online.org\/arc\/#website","url":"https:\/\/eaa-online.org\/arc\/","name":"ARC","description":"Advanced Resources Center","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/eaa-online.org\/arc\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/eaa-online.org\/arc\/#\/schema\/person\/a90e8889ee2abad035780fd36788d55f","name":"Andrew Yim","sameAs":["https:\/\/papers.ssrn.com\/sol3\/cf_dev\/AbsByAuth.cfm?per_id=28023"],"url":"https:\/\/eaa-online.org\/arc\/blog\/members\/87\/"}]}},"jetpack_featured_media_url":"https:\/\/eaa-online.org\/app\/uploads\/sites\/3\/2020\/12\/1_gda8iv1r7t1upstaz4pa5w_1.png","_links":{"self":[{"href":"https:\/\/eaa-online.org\/arc\/wp-json\/wp\/v2\/posts\/1360"}],"collection":[{"href":"https:\/\/eaa-online.org\/arc\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/eaa-online.org\/arc\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/eaa-online.org\/arc\/wp-json\/wp\/v2\/users\/87"}],"replies":[{"embeddable":true,"href":"https:\/\/eaa-online.org\/arc\/wp-json\/wp\/v2\/comments?post=1360"}],"version-history":[{"count":0,"href":"https:\/\/eaa-online.org\/arc\/wp-json\/wp\/v2\/posts\/1360\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/eaa-online.org\/arc\/wp-json\/wp\/v2\/media\/1361"}],"wp:attachment":[{"href":"https:\/\/eaa-online.org\/arc\/wp-json\/wp\/v2\/media?parent=1360"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/eaa-online.org\/arc\/wp-json\/wp\/v2\/categories?post=1360"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/eaa-online.org\/arc\/wp-json\/wp\/v2\/tags?post=1360"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}