Before Gross Profitability: The Tangled History of Quality Measurement
For decades, quantitative researchers struggled with a deceptively simple question: how should you measure the quality of a company? By the early 2000s, the academic landscape was littered with competing approaches, each capturing a different slice of what it meant to be a "good" firm -- and each carrying its own baggage of accounting noise and definitional ambiguity.
Piotroski (2000) introduced the F-score, a composite of nine binary accounting signals spanning profitability, leverage, and operating efficiency. The F-score was effective at separating winners from losers among value stocks, but it was a blunt instrument -- a checklist of pass/fail criteria rather than a continuous measure of economic strength. Other researchers favored return on equity (ROE), return on assets (ROA), or net profit margins as their preferred quality metrics. Institutional practitioners at firms like MSCI built quality indices around ROE, earnings variability, and debt-to-equity ratios.
The problem with all of these approaches was that they measured profitability at the wrong level of the income statement. Net income, operating income, and ROE are all contaminated by management's discretionary accounting decisions: depreciation schedules, amortization of intangibles, restructuring charges, tax strategies, and capital structure choices. Two firms with identical economic productivity could report wildly different bottom-line numbers depending on their CFOs' preferences. The signal-to-noise ratio in these profitability measures was poor, and researchers knew it.
It was into this context that Robert Novy-Marx published "The Other Side of Value: The Gross Profitability Premium" in the Journal of Financial Economics in 2013. His central insight was disarmingly simple: to find the cleanest measure of a firm's economic output, stop reading the income statement before management gets a chance to distort it. Measure profitability at the gross profit line -- revenue minus cost of goods sold -- and scale it by total assets. This single ratio, Novy-Marx (2013) argued, captures a firm's core economic engine with minimal accounting contamination, and it predicts stock returns as powerfully as the value factor itself.
The Construction: Why Gross Profit Divided by Total Assets
The specific construction of the gross profitability measure warrants careful examination, because the choice of both numerator and denominator is central to the paper's contribution.
The numerator: gross profit. Gross profit is defined as total revenue minus cost of goods sold (COGS). It sits at the very top of the income statement, above selling, general, and administrative expenses (SG&A), research and development costs, depreciation and amortization, interest expense, and taxes. By stopping at this line, the measure captures what a firm earns from its core production and sales activities before any discretionary spending decisions are applied.
Novy-Marx argued that this placement on the income statement is not arbitrary -- it reflects a deliberate choice to measure economic productivity rather than accounting profitability. Consider two pharmaceutical companies with identical drug portfolios and identical revenues. Company A capitalizes its R&D spending aggressively, Company B expenses it immediately. Company A has high SG&A in one year due to a restructuring charge. At the net income or operating income level, these firms look very different. At the gross profit level, they look the same -- because gross profit captures the fundamental economics of their business before management's financial engineering takes effect.
The denominator: total assets. Rather than scaling by book equity (as in ROE) or by market capitalization (as in earnings yield), Novy-Marx chose total assets. This choice avoids two problems. First, book equity is itself subject to accounting distortions -- share buybacks, accumulated other comprehensive income, and goodwill impairments can all make book equity a noisy denominator. Second, scaling by market capitalization would mechanically create correlation with the value factor, since low market cap relative to fundamentals is precisely the definition of value. By using total assets, the measure captures how efficiently a firm converts its asset base into gross profit, independent of how the market prices the firm.
| Profitability Measure | Numerator | Denominator | Key Weaknesses |
|---|---|---|---|
| ROE | Net income | Book equity | Distorted by leverage, buybacks, one-time charges |
| ROA | Net income | Total assets | Contaminated by below-the-line items |
| Operating margin | Operating income | Revenue | Affected by SG&A allocation, restructuring |
| Net margin | Net income | Revenue | Most distorted; includes taxes, interest, special items |
| Gross profitability | Gross profit | Total assets | Least distorted; closest to economic productivity |
This table illustrates the core argument: as you move down the income statement, each successive profitability measure incorporates more managerial discretion and accounting noise. Gross profitability, by stopping at the top, preserves the purest signal of a firm's economic engine.
Empirical Findings: As Powerful as Book-to-Market
The empirical results in Novy-Marx (2013) were striking in both magnitude and robustness. Using CRSP and Compustat data covering U.S. equities from 1963 to 2010, Novy-Marx sorted stocks into quintiles based on gross profitability (gross profit divided by total assets) and examined subsequent returns.
Stocks in the top quintile of gross profitability outperformed stocks in the bottom quintile by approximately 0.31% per month -- roughly 3.7% annualized -- on a raw return basis. After adjusting for the Fama-French three-factor model (market, size, and value), the spread actually increased, because gross profitability is negatively correlated with value: profitable firms tend to be growth firms, meaning the three-factor model's value loading works against them. The three-factor alpha of the long-short gross profitability strategy was approximately 0.52% per month (about 6.4% annualized), with a t-statistic exceeding 4.0 -- well beyond conventional thresholds for statistical significance.
Novy-Marx then directly compared gross profitability to other profitability measures in a horse race of predictive power:
| Profitability Measure | Monthly Long-Short Return | Three-Factor Alpha | t-Statistic |
|---|---|---|---|
| Gross profitability (GP/AT) | 0.31% | 0.52% | >4.0 |
| Operating profitability | Weaker | Lower | Lower |
| Net income / assets | Weaker | Lower | Lower |
| Free cash flow / assets | Weakest | Lowest | Insignificant in some specs |
The pattern was consistent: the higher up the income statement you measured profitability, the stronger the return predictability. Gross profitability dominated all other measures. This finding was counterintuitive to many practitioners who assumed that bottom-line profitability -- the number most directly tied to shareholder value -- should be the most informative signal. Novy-Marx demonstrated the opposite: the accounting noise introduced below the gross profit line destroys more information than it adds.
A critical robustness test involved controlling for value (book-to-market). In standard Fama-MacBeth cross-sectional regressions, gross profitability and book-to-market had similar predictive coefficients for future returns, and both remained significant when included simultaneously. The gross profitability premium did not subsume the value premium, nor was it subsumed by it. The two were genuinely independent dimensions of expected returns.
The Other Side of Value: Why Profitability and Cheapness Are Complements
The paper's title -- "The Other Side of Value" -- captures perhaps its most important practical insight. Novy-Marx showed that profitable firms and cheap firms are largely different sets of companies. High gross profitability is associated with growth characteristics: these firms tend to have high market valuations, strong recent performance, and above-average analyst expectations. Value firms, by contrast, tend to be distressed, out-of-favor companies with weak recent performance and low market expectations.
This negative correlation between profitability and value creates a powerful diversification opportunity. A portfolio that combines a quality tilt (overweighting high gross profitability firms) with a value tilt (overweighting high book-to-market firms) captures two nearly independent sources of risk premium. The combined strategy substantially outperforms either factor in isolation, with a Sharpe ratio materially higher than either component.
Novy-Marx formalized this insight by showing that adding a gross profitability factor to the Fama-French three-factor model significantly improved the model's explanatory power. The three-factor model had long been known to struggle with certain anomalies -- most notably, it could not explain why highly profitable growth firms earned strong returns despite having low value exposure. Adding gross profitability as a fourth factor largely resolved this shortcoming.
This complementarity between profitability and value had direct implications for portfolio construction. Traditional value strategies often inadvertently loaded on low-quality firms -- companies that were cheap for good reason. By screening for both value and gross profitability, investors could avoid these value traps while capturing both premia. As Novy-Marx demonstrated, the intersection of high quality and low price is where the most attractive risk-adjusted returns reside.
Why Gross Profit? The Economic Intuition
Novy-Marx provided a theoretical framework rooted in the dividend discount model (DDM) to explain why profitability should predict returns. The Gordon Growth Model implies:
Expected Return = Earnings Yield + Growth Rate
Holding price constant, firms with higher current profitability must have either higher expected returns or lower expected growth. Since highly profitable firms tend to have higher growth expectations (not lower), the DDM logic implies that profitability should be positively associated with expected returns -- conditional on price.
But why gross profitability specifically, rather than any other profitability measure? Novy-Marx offered two arguments.
First, gross profit is the most persistent component of profitability. A firm's gross margin reflects its fundamental competitive position -- pricing power, cost structure, supply chain efficiency -- which tends to be durable over time. Items below the gross profit line (SG&A, R&D, restructuring charges, interest expense) are more volatile and subject to managerial discretion. The persistence of gross profitability means it is a better proxy for the firm's long-run economic engine, which is what the DDM requires.
Second, the noisier profitability measures introduce systematic biases. Firms that engage in aggressive investment (high R&D, high capital expenditure) will have lower net income and operating income, ceteris paribus. But aggressive investment is also associated with growth, which complicates the relationship between measured profitability and expected returns. By measuring above the line where investment spending hits, gross profitability avoids this confound entirely.
Ball, Gerakos, Linnainmaa, and Nikolaev (2015) subsequently extended this logic, showing that an even simpler measure -- gross profit scaled by market equity rather than total assets -- had incremental predictive power. Their work confirmed Novy-Marx's core insight while suggesting that the optimal denominator choice depends on the specific application.
Impact on Asset Pricing: From Three Factors to Five
The gross profitability premium played a pivotal role in motivating the expansion of the Fama-French factor model. When Fama and French (2015) published their five-factor model, they added two new factors: RMW (Robust Minus Weak profitability) and CMA (Conservative Minus Aggressive investment). The RMW factor was directly inspired by Novy-Marx's finding that profitability predicts cross-sectional returns.
However, Fama and French chose a different profitability measure for RMW: operating profitability (revenue minus COGS, minus SG&A, minus interest expense, divided by book equity). This choice was deliberate -- Fama and French were constructing a factor model rather than identifying the single best predictor, and they preferred a measure that captured a broader set of income statement information. Novy-Marx's own research suggested this was a suboptimal choice: gross profitability, precisely because it is simpler and less contaminated, should have been the preferred measure for a pricing factor.
This methodological disagreement highlights an important nuance. The gross profitability premium is not merely an academic curiosity -- it directly shaped the dominant asset pricing framework used in empirical finance today. The question of which profitability measure belongs in the canonical factor model remains actively debated, with implications for how researchers and practitioners measure alpha, evaluate fund performance, and construct factor portfolios.
Hou, Xue, and Zhang (2015) proposed an alternative q-factor model that also included a profitability factor, using ROE rather than gross profitability or operating profitability. Their model and the Fama-French five-factor model competed to explain the same set of anomalies, further demonstrating that profitability -- in some form -- is essential to modern asset pricing.
Practical Implications for Factor Portfolio Construction
For practitioners building factor investing strategies, the Novy-Marx paper offers several actionable lessons.
Signal construction. Gross profitability (gross profit / total assets) should be a core input in any quality or profitability screen. Its simplicity is a feature, not a bug: the measure is available for virtually all publicly traded firms, requires no estimation or subjective inputs, and is robust to the accounting manipulations that plague more complex measures. Practitioners who rely exclusively on ROE, net margins, or composite quality scores are leaving predictive power on the table.
Factor combination. The negative correlation between gross profitability and book-to-market makes them natural complements in multi-factor portfolios. Constructing a combined quality-value strategy -- overweighting stocks that score highly on both gross profitability and book-to-market -- produces a portfolio with a markedly higher risk-adjusted return and Sharpe ratio than either factor alone. This synergy is not merely theoretical; it has been confirmed in live factor portfolios and is widely exploited by quantitative asset managers.
Avoiding value traps. One of the most common failure modes in value investing is buying stocks that are cheap because they deserve to be -- firms with deteriorating fundamentals, competitive disadvantage, or structural decline. Gross profitability serves as a natural screen against value traps: firms with high gross profitability have strong core economics, reducing the probability that their cheapness reflects genuine fundamental impairment rather than temporary market pessimism.
Sector considerations. Gross profitability varies substantially across sectors. Technology and healthcare firms tend to have high gross margins; utilities, financials, and commodity producers tend to have lower gross margins. A naive gross profitability sort will therefore generate sector concentration. Practitioners often implement sector-neutral versions of the strategy, ranking firms by gross profitability within each sector rather than across the full universe. Novy-Marx showed that the premium persists even within sectors, though the sector-neutral version sacrifices some magnitude for better diversification.
Limitations and Ongoing Debates
The gross profitability premium, while robust, is not without limitations and open questions.
Financial firms. Gross profit is not well-defined for banks, insurance companies, and other financial institutions, whose revenue structures differ fundamentally from industrial and service firms. The Novy-Marx framework applies most cleanly to non-financial firms, and practitioners must use alternative profitability measures (such as ROA or net interest margin) for the financial sector.
International evidence. While the original paper focused on U.S. equities, subsequent research by Fama and French (2017) and others has generally confirmed the profitability premium in international markets, though the specific magnitude and the optimal profitability measure vary across countries and accounting regimes.
Temporal stability. Like all factor premia, the gross profitability premium is not constant over time. There have been periods when it weakened or was overshadowed by other factors. McLean and Pontiff (2016) documented that factor premia tend to decay post-publication as arbitrage capital flows toward known anomalies. However, profitability-related factors have been among the most resilient to post-publication decay, likely because the premium is grounded in fundamental valuation logic (the DDM) rather than pure statistical patterns.
The denominator debate. Whether to scale gross profit by total assets (Novy-Marx's choice), by market equity (Ball, Gerakos, Linnainmaa, and Nikolaev, 2015), or by book equity remains an open question. Each choice produces slightly different factor portfolios with different risk-return characteristics. The asset-scaled version has the advantage of independence from market pricing, while the market-scaled version incorporates current market information.
The Paper's Lasting Contribution
Novy-Marx's 2013 paper reshaped how researchers and practitioners think about profitability as a factor. Before this paper, quality was a vague, multidimensional concept that different researchers operationalized in incompatible ways. After it, the field had a specific, empirically validated measure -- gross profitability -- that was simple to compute, economically intuitive, and as powerful as the canonical value factor.
The paper's influence extends beyond the specific measure it proposed. It established a methodological principle: when constructing accounting-based factors, simpler and higher-on-the-income-statement is better. The accounting noise that accumulates as you move from gross profit to operating income to net income destroys more predictive information than it adds. This insight has informed subsequent work on factor construction and has pushed practitioners to question the complexity of their quality models.
For investors constructing multi-factor portfolios today, the Novy-Marx finding remains directly actionable. Gross profitability serves as a clean, standalone quality signal with return predictive power comparable to book-to-market. Combined with value screens, it produces a strategy that captures two complementary sources of alpha while naturally avoiding value traps. The simplicity and transparency of the measure make it accessible to investors of all sizes, from individual quantitative traders to the largest institutional asset managers.
References
-
Ball, R., Gerakos, J., Linnainmaa, J. T., & Nikolaev, V. (2015). "Deflating profitability." Journal of Financial Economics, 117(2), 225-248. https://doi.org/10.1016/j.jfineco.2015.05.002
-
Fama, E. F., & French, K. R. (2015). "A Five-Factor Model of Expected Stock Returns." Journal of Financial Economics, 116(1), 1-22. https://doi.org/10.1016/j.jfineco.2014.10.010
-
Fama, E. F., & French, K. R. (2017). "International tests of a five-factor asset pricing model." Journal of Financial Economics, 123(3), 441-463. https://doi.org/10.1016/j.jfineco.2016.11.004
-
Hou, K., Xue, C., & Zhang, L. (2015). "Digesting Anomalies: An Investment Approach." The Review of Financial Studies, 28(3), 650-705. https://doi.org/10.1093/rfs/hhu068
-
McLean, R. D., & Pontiff, J. (2016). "Does Academic Research Destroy Stock Return Predictability?" The Journal of Finance, 71(1), 5-32. https://doi.org/10.1111/jofi.12365
-
Novy-Marx, R. (2013). "The Other Side of Value: The Gross Profitability Premium." Journal of Financial Economics, 108(1), 1-28. https://doi.org/10.1016/j.jfineco.2013.04.003
-
Piotroski, J. D. (2000). "Value Investing: The Use of Historical Financial Statement Information to Separate Winners from Losers." Journal of Accounting Research, 38, 1-41. https://doi.org/10.2307/2672906