CRT

Clema Research Team

Research

The Clema research team publishes original analysis and practical guides for institutional research and institutional effectiveness professionals.

Articles by Clema Research Team

Data Definitions

Why 'enrolled student' has five definitions

In a study of 20 IR and IE offices, one term routinely carried several valid definitions at once. Here is why that happens, the eight layers it happens at, and why it breaks reporting even when the data is clean.

Jun 26, 202611 mins read
Data Definitions

Is your institutional knowledge one resignation away from gone?

A diagnostic for institutional research teams: the Institutional Intelligence Gap, the three tiers (55% Large, 40% Moderate, 5% Small), a five-factor self-assessment, and an illustrative cost model for key-person dependency in higher ed.

Jun 26, 202612 mins read
Data Definitions

From person-dependent to system-supported: a framework for governing data definitions

Creating data definitions is achievable. Maintenance is where most efforts collapse. Here is a six-step framework, an AI-readiness argument, and tier-by-tier next steps for IR/IE teams.

Jun 26, 202612 mins read
Professional Development

How to learn institutional research: certificates, courses, and free training

A directory of 25 ways to learn institutional research: graduate certificates, AIR professional development, free IPEDS and PDP training, executive programs, and self-paced courses, with formats, costs, and links.

Jun 25, 202612 mins read
Predictive Analytics

From reactive reporting to proactive prediction: a playbook for IR teams

IR teams already sit on years of attendance, GPA, LMS, and aid data. A practical guide to turning that record into forward-looking predictions that change what advisors, deans, and provosts can do this term.

Jun 5, 202610 mins read
Predictive Analytics

Why gradient boosting beats LLMs for student retention prediction

A plain-English guide for IR leadership on why classical ML (specifically XGBoost) outperforms LLMs on student-level retention and graduation prediction, what the benchmark literature actually shows, and how to read a per-student risk score.

Jun 5, 202611 mins read
Predictive Analytics

Predictive analytics in higher ed: the $40-a-month stack vs the $200K platform

A straight comparison between the open-source predictive-analytics stack ($40 to $95 per month) and the major incumbent platforms ($30K to $200K per year). Covers infrastructure cost, licensing, where incumbents are strong and where they leave gaps, and a build-vs-buy framework.

Jun 5, 202610 mins read
Predictive Analytics

Predicting term-to-term retention: what a working model actually looks at

A practical look at the features, engineering choices, and timing decisions that drive a real term-to-term retention model. Written for IR leadership who want to understand what the model sees before they hand its output to advisors.

Jun 5, 202611 mins read
Predictive Analytics

Predicting graduation within 150% time: course combinations, confidence, and committee-ready explanations

The graduation use case in detail. Features, course-combination effects, plain-English metric reading, the small-group floor that prevents false certainty, and the drift management every long-horizon prediction needs.

Jun 5, 202611 mins read
Predictive Analytics

Forecasting enrollment and program viability: where prediction meets cabinet-level decisions

A practical guide to the predictive questions Provosts and CFOs own. Enrollment forecasting (ARIMA, Prophet, LSTM), yield prediction, program margin and viability, IPEDS-based screening, and the honest limits of each.

Jun 5, 202610 mins read
Data Request Management

Six files, 209,321 programs, three join keys: turning PPD:2026 into answers your provost actually asks

PPD:2026 looks like a single dataset. On disk it's six Excel files joined on opeid6 + credlev + cip4, with mixed-vintage dollars, statistical-noise privacy suppression, and a CIP-4 key that doesn't match the CIP-6 the STATS test will use. Here's where the analyst hours actually go, and how to skip them.

Jun 2, 202610 mins read
Data Request Management

Is Your University's Data Request Form Actually Optimized? How Better Design Can Reduce 77% of Clarification Time

35% of universities have no formal intake process, and 77% of IR offices require extensive clarification before analysis begins. Analysis of 18 university data request forms reveals what the best systems do differently.

Feb 25, 20268 mins read
Reporting & Analytics

Who's Growing Through Dual Enrollment? A Side-by-Side Benchmark of Community Colleges and Four-Year Institutions

IPEDS 2023–2024 data reveals four-year institutions growing 3x faster in dual enrollment than community colleges, while community colleges carry 32.4% structural reliance. A full side-by-side benchmark.

Feb 25, 20268 mins read
Reporting & Analytics

What IR Teams Really Benchmark: Evidence from 50 Interviews

Discover what IR teams actually benchmark most often, from student outcomes and faculty metrics to aspirational peer lists, based on evidence from 50 institutional research interviews.

Feb 25, 202610 mins read
Data Request Management

What Is the Relationship Between Data Request Intake Quality, Request Intelligence, and Workload in Institutional Research Given That 78% of Requests Require 10+ Follow-Up Questions?

Research shows 80% of IR data requests arrive missing critical elements. Learn how Request Intelligence (the quality of requests at intake) determines whether your team spends time analyzing or clarifying.

Feb 12, 202610 mins read
Data Request Management

Best Practices for Institutional Researchers to Optimize Data Requests and Reclaim the 60% of Team's Capacity

Discover proven strategies from IR leaders to streamline data requests, implement self-service dashboards effectively, and reclaim up to 60% of your team's capacity for strategic work.

Jan 12, 202612 mins read