Big Data // Big Data Analytics
News
2/8/2013
11:40 AM
Connect Directly
Facebook
Google+
RSS
E-Mail
50%
50%

University Data Sharing Project Takes Big Step Forward

Predictive Analytics Reporting (PAR) Framework initiative publishes its data definitions, clearing way for researchers to glean information from anonymized student data that might result in ways to improve graduation rates.

 Big Data Analytics Masters Degrees: 20 Top Programs
Big Data Analytics Masters Degrees: 20 Top Programs
(click image for larger view and for slideshow)
The Predictive Analytics Reporting (PAR) Framework, a nearly two-year-old project that has been aggregating student data from two-year and four-year institutions, passed an important milestone this week, releasing the common data definitions for all the variables in its database.

That database, compiled from from PAR's member institutions, now includes more than 1.7 million anonymized and institutionally de-identified student records and 8.1 million course-level records.

Launched in May 2011 by the WICHE Cooperative for Educational Technologies (WCET), the PAR Framework started with six institutions sharing data. Its goal was to identify variables that influence student retention and progression, and guide decision making that improves postsecondary student completion in the U.S. The project has since grown to 16 participating institutions, and has received $3.56 million from The Bill & Melinda Gates Foundation to date.

[ Big data has value that's often not reflected in the books. Read What's Your Big Data Worth? ]

"Interest in analytics across the board, and learning analytics in particular, have taken higher education by storm," WCET Executive Director Ellen Wagner told InformationWeek.

Attention to the topic of educational outcomes in postsecondary education have increased as graduation rates in the U.S. have declined. According to the Department of Education, of those who enroll in a higher education program, only about 55% graduate within six years.

Although the PAR Framework isn't the only effort investigating educational outcomes, it might be the most granular. That's because its data set, with some 60 variables, includes not only whether a particular student passed a course, achieved a major or dropped out, but it has down- to-the-course-level details from student records.

Capturing this structured data hasn't been easy. Even the seemingly routine "grade point average" is handled differently at different institutions, Wagner said.

In the future, the database might expand to include semantic data.

The data definitions were released under a Creative Commons license to "to encourage distribution of the definitions into the higher education research community," WCET said in a statement. Until now, the PAR's data fields and definitions have been available only to the organization's 16 institutional partners.

The PAR Framework published the data definitions using the Data Cookbook, a collaborative data dictionary and data management tool built for higher education by IData.

Wagner hopes the release of the data definitions will spur interest and participation in the data-sharing project. "I would like to double our number [of participant institutions] this year," she said. "Another goal for 2013," Wagner said, would be the release of an "intervention taxonomy," which will help benchmark the efficacy of different types of educational interventions for at-risk students.

The PAR Framework has already published some research based on its preliminary data set from the proof of concept. Last summer, it published research in Journal of Asynchronous Learning showing that the higher the number of consecutive credit hours a college student takes, the greater risk of dropping out.

Institutions interested in learning more about becoming members of the PAR Framework are invited to review the requirements for PAR participation and submit an interest form.

Can data analysis keep students on track and improve college retention rates? Also in the premiere all-digital Analytics' Big Test issue of InformationWeek Education: Higher education is just as prone to tech-based disruption as other industries. (Free with registration.)

Comment  | 
Print  | 
More Insights
Comments
Oldest First  |  Newest First  |  Threaded View
PJS880
50%
50%
PJS880,
User Rank: Apprentice
2/15/2013 | 6:14:03 PM
re: University Data Sharing Project Takes Big Step Forward
At this point in time one would think that the GPA of Universities at the very least all have the same system they use when ranking their students. The article did not mention it but are they seeking to establish GnormsG across the board for future Universities that will join in? Another question has the desired result been achieved or are they just collecting data for the future?

Paul Sprague
InformationWeek Contributor
6 Tools to Protect Big Data
6 Tools to Protect Big Data
Most IT teams have their conventional databases covered in terms of security and business continuity. But as we enter the era of big data, Hadoop, and NoSQL, protection schemes need to evolve. In fact, big data could drive the next big security strategy shift.
Register for InformationWeek Newsletters
White Papers
Current Issue
InformationWeek Must Reads Oct. 21, 2014
InformationWeek's new Must Reads is a compendium of our best recent coverage of digital strategy. Learn why you should learn to embrace DevOps, how to avoid roadblocks for digital projects, what the five steps to API management are, and more.
Video
Slideshows
Twitter Feed
InformationWeek Radio
Archived InformationWeek Radio
A roundup of the top stories and trends on InformationWeek.com
Sponsored Live Streaming Video
Everything You've Been Told About Mobility Is Wrong
Attend this video symposium with Sean Wisdom, Global Director of Mobility Solutions, and learn about how you can harness powerful new products to mobilize your business potential.