Forecasting enrollment and program viability: where prediction meets cabinet-level decisions

Different question, different model family. A guide to time-series forecasting, yield prediction, and program-margin analysis for IR leadership.

CRT

Clema Research Team

June 5, 2026

10 mins read

Table of Contents

Introduction

Student-level prediction (who will return next term, who will graduate within six years) gets most of the attention in the predictive analytics conversation. It is also the part of the conversation furthest from where Provosts, CFOs, and trustees actually spend their time. The questions on a cabinet agenda are different in scope and different in shape: how many students will enroll next fall, which programs are quietly losing money, which of our peers are likely to consolidate or close.

This post is the guide to that second cluster of questions. The model families are different (mostly time-series, not classification). The data is different (institution-level rather than student-level). The honest limits are different too. For the student-level use cases that anchor most predictive work, the retention deep-dive and the graduation deep-dive are the companion pieces.

The three questions on every cabinet agenda

Almost every leadership-level predictive question at a higher-education institution sorts into one of three buckets. Each one calls for a different model family and a different evaluation standard.

How many students will enroll

Enrollment forecasting

For the next term, the next year, and (with steeply rising uncertainty) the next several years. This is the line item every other budget number ultimately depends on, and it is the question CFOs ask first.

Which programs are paying for themselves

Program margin and viability

Per-program revenue vs cost of delivery. The conversation that determines which majors get expanded, which get restructured, and which get retired. Often the most politically sensitive analysis on the docket.

What are our peers and competitors doing

Sector-level prediction

Which neighboring institutions are likely to consolidate or close. Where our students are migrating from and to. What demographic and policy shifts are about to land. The conversation that shapes multi-year strategy.

Enrollment forecasting

Enrollment forecasting is the most mature predictive use case in higher education and the one where IR teams have the most institutional muscle memory. The question is also deceptively simple: given a long history of headcount, plus what we know about admitted-student pipelines and demographic trends, how many students will be on the roster next fall.

Mature does not mean universal. In the IR maturity model from our whitepaper, forecasting is effectively impossible at Level 1, becomes historical trend analysis at Level 2, and only turns predictive at Level 3, once request intake and self-service are under control. Where an institution sits on that ladder matters more than which algorithm it picks.

The model families that fit the question are time-series methods, not classification. Three are worth knowing.

ARIMA (AutoRegressive Integrated Moving Average) is the workhorse. It models how a number evolves over time based on its own history and is the right starting point when you have many years of relatively stable data. ARIMA is well-understood, well-tooled, and forgiving of small datasets. It is the model to beat, not necessarily the model to settle on.

Prophet is Facebook's open-source forecasting library, designed specifically for the kind of time series that has trend changes, holidays, and seasonality. It handles missing data gracefully and produces forecasts with intuitive confidence intervals. It is a strong default when ARIMA feels too rigid.

LSTM (Long Short-Term Memory) is a deep-learning model that fits complex sequence data with multiple interacting variables. It is the right tool when enrollment is driven by many simultaneous factors (county HS graduation rates, competitor pricing, financial-aid changes, demographic shifts) and the simpler models are leaving signal on the table. LSTM is also overkill for most use cases, and the engineering cost is real.

The right answer is usually to start with ARIMA, benchmark Prophet on the same data, and only move to LSTM if the gap between the simpler models and the actual outcomes justifies it.

Question	Data needed	Suitable model family
Total fall enrollment forecasting	Historical headcount by term, IPEDS migration tables, county HS graduation rates, demographic projections	ARIMA, Prophet, LSTM for complex trends
College vs university enrollment shift	Historical headcount by institution type, competitor enrollment trends, IPEDS migration tables	ARIMA, Prophet
Transfer and admissions yield	NSC enrollment data, admitted-student records, competitor list, aid offers, distance, HS GPA, deposit timing	Logistic Regression, XGBoost (classification, not time-series)
Out-of-state enrollment under cap policy	Residency flags, tuition revenue per student, GPA, retention, aid by residency, policy-change dates, application volumes by state	Difference-in-differences for policy impact; regression for scenario forecasting
Program margin and viability	Enrollment per program, cost to run, tuition revenue, graduation numbers	Regression, time-series (ARIMA, Prophet)

Yield prediction

Yield prediction is the bridge between admissions and enrollment forecasting. The question is not "how many students will enroll" but "of the students we admitted, which ones will actually deposit and arrive."

The model family for this is classification, not time-series. Each admitted student is a row, the outcome is enrolled or not, and the features include the obvious admissions data (HS GPA, application timing, aid offer, distance from campus) plus competitor enrollment data from the National Student Clearinghouse where it can be pulled.

The same gradient-boosted approach that works for retention works well here, and for the same reasons: structured tabular data, hundreds of features, class balance that varies but is generally tractable. The XGBoost vs LLM post covers why this model family fits.

Yield prediction is also where data sharing across institutions creates real value, and also where partner-only model designs hit their hardest limit. A model trained only on one institution's admitted students may have less signal than a model trained on a regional consortium. The trade-off is straightforward and worth explicit conversation in any procurement.

Program margin and viability

Program-level margin and viability is the analysis Provosts and CFOs ask for, do not fully trust, and end up running anyway. The mechanics are not hard. Per-program revenue (tuition plus aid attributable to majors in the program) minus per-program cost (faculty load, dedicated facilities, program-specific support). The output is a margin per program, ideally rolled up over multiple years to smooth one-time effects.

The predictive question on top of the descriptive one is: which programs are likely to continue, improve, or deteriorate in margin, given current enrollment trends and labor-market signals. That is a regression or time-series problem on a small number of rows per program, and the modeling has to be honest about the noise.

Two failure modes to watch for. First, treating program margin as a pure financial calculation ignores cross-subsidy: a humanities program with a thin margin might be the prerequisite path for a much larger pre-professional pipeline. The model output is the start of the conversation, not the decision. Second, forecasting program-level revenue with only a handful of years of data is risky. Small programs are noisy by construction, and tight confidence intervals on small-program forecasts should be treated with suspicion.

Public-data prediction with IPEDS

Beyond institution-internal data, the same toolkit applies to public IPEDS data for sector-level questions. Two come up often.

Enrollment migration patterns between institution types (college-to-university, university-to-community-college, public-to-private) are largely time-series problems on the IPEDS migration tables. ARIMA and Prophet both handle this well, and the data is fully public.

Closure and accreditation risk screening is a classification problem on a combination of IPEDS financial indicators, enrollment trends, and demographic projections. A model trained on the history of closed and accredited-loss institutions can flag the institutions whose profile most resembles them. The output is not a verdict (closure is driven by board decisions and political factors a model cannot see), but it is a useful screening signal for boards, consultants, and rating agencies.

For IR leaders, the IPEDS-based work is also valuable as a content and credibility asset. The audiences your institution wants to be in front of (other IR teams, conference programs, sector publications) work in IPEDS daily, and original analysis on public data tends to land well.

What "good enough" looks like for each

Different questions justify different levels of model investment. A rough guide.

For total fall enrollment forecasting, a well-tuned ARIMA or Prophet model that comes within roughly 2 to 3% of actuals on a one-year horizon is operationally useful. Beyond that horizon, confidence bands widen quickly and the right move is to report ranges, not point estimates.

For yield prediction, a model that ranks admitted students by enrollment probability with an AUC in the 0.80 to 0.85 range is genuinely useful for tailoring outreach, aid offers, and yield events. Going higher requires either more features or cross-institutional data sharing, and the marginal value drops fast.

For program-margin forecasting, "good enough" is mostly about uncertainty management. A model that produces honest confidence bands (often quite wide for small programs) is more useful than one that produces tight intervals the audience will overweight.

For IPEDS-based closure-risk screening, the goal is rank-ordering, not point prediction. A model that puts the genuinely at-risk institutions in the top decile is doing its job, even if it cannot tell you which specific institutions will close in which specific year.

The honest limits

Four limits are worth surfacing before any of this work reaches a board or cabinet meeting.

Historical bias

The model learns from the past

A model trained on past enrollment learns the patterns of past enrollment, including the demographic, economic, and policy conditions that produced those patterns. If those conditions are changing (and for higher ed in the 2020s, most of them are), the model's confidence on future predictions overstates the actual reliability. Retrain frequently and re-evaluate against actuals.

Sufficient data

Small programs are noisy

Forecasting an enrollment of 12 for a program that has fluctuated between 8 and 18 over the past six years is technically a forecast. It is not a useful one. The model can return a number; the right answer is often "we do not have enough history to forecast this reliably." Honest pipelines surface that.

Policy change

Models do not see legislation

A retention model, an enrollment forecast, or a program-margin projection cannot anticipate a state cap on out-of-state enrollment, a federal Title IV change, or a demographic-cliff acceleration. When a policy shift is on the horizon, the model needs to be paired with explicit scenario analysis, not run in isolation.

Life-event unpredictability

The longer the horizon, the more is unseen

The further out a prediction reaches, the more of the eventual outcome is determined by things the model cannot see (financial events, health events, personal decisions). Predictive validity decays with horizon. The right response is honest confidence bands and frequent retraining, not pretending the long-horizon prediction is as reliable as the short-horizon one.

Where Clema fits in

Clema's work on institution-level prediction (enrollment, yield, program viability, IPEDS-based screening) follows the same operating principles as the student-level models: open-source tooling, honest confidence bands, partner-only training, and an explicit owner for the output. The model selection follows the question (time-series for forecasting, classification for yield, regression for program margin), not the other way around.

For the broader strategic case, the reactive-to-proactive playbook is the companion piece. For the cost and vendor picture (which matters as much for institution-level analytics as it does for student-level), the cost and vendor reality post covers the math.

See enrollment, yield, or program-margin forecasts on your data

Walk through what a working enrollment forecast or program-viability analysis looks like with your institution's historical data and IPEDS context.

Book a Clema demo

Sources

NCES IPEDS, use the data. Facebook Prophet, forecasting at scale. statsmodels, ARIMA documentation. National Student Clearinghouse Research Center. WICHE, Knocking at the College Door demographic projections. Higher Education Demand Index, Carleton College.

CRT

Written by

Clema Research Team

The Clema research team publishes original analysis and practical guides for institutional research and institutional effectiveness professionals.

Frequently asked questions

Why do enrollment and program-viability questions use a different model family than retention?

Because the question shape is different. Student-level prediction (who returns, who graduates) is classification on student-level data. Cabinet-level questions are mostly time-series on institution-level data: how many students will enroll, which programs are losing money, which peers may close. The model selection should follow the question, time-series for forecasting, classification for yield, regression for program margin, not the other way around.

Which forecasting model should we start with for enrollment?

Start with ARIMA, the workhorse for many years of relatively stable headcount data; it is well-understood, well-tooled, and forgiving of small datasets. Benchmark Prophet on the same data when ARIMA feels too rigid, since it handles trend changes and seasonality with intuitive confidence intervals. Only move to LSTM if the gap between the simpler models and actual outcomes justifies the real engineering cost.

Why is yield prediction the hardest case for a partner-only model?

Yield prediction is classification: each admitted student is a row, the outcome is enrolled or not. It is where cross-institutional data sharing creates the most value, because a model trained only on one institution's admitted students may have less signal than one trained on a regional consortium. That trade-off between data-sharing value and partner-only privacy is worth an explicit conversation in any procurement.

What does "good enough" look like for each cabinet-level forecast?

For total fall enrollment, a model within roughly 2 to 3 percent of actuals on a one-year horizon is useful; beyond that, report ranges, not point estimates. For yield, an AUC of 0.80 to 0.85 is genuinely useful for tailoring outreach and aid. For program margin, honest confidence bands matter more than tight ones. For IPEDS closure-risk screening, rank-ordering into a top decile is the goal, not point prediction.

What are the honest limits of this institution-level forecasting?

Four. Historical bias: the model learns past conditions that may be changing. Insufficient data: small programs are noisy, and the right answer is often "we cannot forecast this reliably." Policy change: models do not see legislation like an out-of-state cap or a Title IV change, so pair them with scenario analysis. And life-event unpredictability: validity decays with horizon, so use honest confidence bands and frequent retraining.

From reactive reporting to proactive prediction: a playbook for IR teams

IR teams already sit on years of attendance, GPA, LMS, and aid data. A practical guide to turning that record into forward-looking predictions that change what advisors, deans, and provosts can do this term.

Why gradient boosting beats LLMs for student retention prediction

A plain-English guide for IR leadership on why classical ML (specifically XGBoost) outperforms LLMs on student-level retention and graduation prediction, what the benchmark literature actually shows, and how to read a per-student risk score.

Predictive analytics in higher ed: the $40-a-month stack vs the $200K platform

A straight comparison between the open-source predictive-analytics stack ($40 to $95 per month) and the major incumbent platforms ($30K to $200K per year). Covers infrastructure cost, licensing, where incumbents are strong and where they leave gaps, and a build-vs-buy framework.

Ready to get started?

Reclaim Your Team's Capacity

See how Clema can help your IR team handle routine requests automatically

Try for Free Book a Demo

Forecasting enrollment and program viability: where prediction meets cabinet-level decisions

Introduction

The three questions on every cabinet agenda

How many students will enroll

Which programs are paying for themselves

What are our peers and competitors doing

Enrollment forecasting

Yield prediction

Program margin and viability

Public-data prediction with IPEDS

What "good enough" looks like for each

The honest limits

Historical bias

Sufficient data

Policy change

Life-event unpredictability

Where Clema fits in

See enrollment, yield, or program-margin forecasts on your data

Sources

Frequently asked questions

Why do enrollment and program-viability questions use a different model family than retention?

Which forecasting model should we start with for enrollment?

Why is yield prediction the hardest case for a partner-only model?

What does "good enough" look like for each cabinet-level forecast?

What are the honest limits of this institution-level forecasting?

Related articles

From reactive reporting to proactive prediction: a playbook for IR teams

Why gradient boosting beats LLMs for student retention prediction

Predictive analytics in higher ed: the $40-a-month stack vs the $200K platform

Ready to get started?

Reclaim Your Team's Capacity